Crawling

April 26, 2019

Niels Stuck CEO & Founder

ÜBER DEN AUTOR

SEO expert mit über 10 Jahren Erfahrung. Ich helfe Unternehmen, online sichtbar zu werden.

Learn more →

« Back to Glossary Index

What is crawling?

Crawling (or spidering) is when Google or another Search Engine sends a bot to a website or web post and "reads" the page. Do not confuse this with the indexing of this page. Crawling is the first part where a Search Engine recognizes your page and displays it in the search results. However, having your page crawled does not necessarily mean that your page has been indexed and will be found.

Pages are crawled for a variety of reasons, including:

An XMLSitemap with the concerned URL, die an Google übermittelt wurde.
Internal links pointing to the page.
Have external links pointing to the page.
Bring an increase in traffic to the site

To ensure that your page is crawled, you should create an XMLSitemap in die Google Search Console (früher Google Webmaster Tools) upload to give Google the roadmap for all your new content.

In Google Search Console you can see what has been submitted and what is indexed was.

What crawling means is that Google looks at the page. Depending on whether Google thinks that the content is "new" or whether it otherwise has something to "give to the internet", it may schedule an indexing, which means that it has the opportunity to create a Ranking to perform.

Also, when Google crawls a page, it looks at the links on that page and schedules the Google Bot to check out those pages as well. The exception is when the link is followed by a nofollow-tag was added.

How can I improve the crawlability of my website?

Think of Google's crawlers as little spider robots that "crawl" into your website and look around. The easier you make it for these little guys, the better for you. Here are a few tips to make your website as inviting as possible for the Google spiders.

1. xml sitemap

First, you need a good map of your home, that is, your website. An XMLSitemap Google shows exactly which pages you have and how they are connected. It's like giving your visitor a map of your giant maze so they don't get lost.

2. flat hierarchy

Try to keep the structure of your website as simple as possible. No deeply nested pages that take seven clicks to reach. That would be like sending someone through seven different doors just to find the bathroom.

3. fast loading times

Nobody likes to wait. Not even Google. Optimized images, minimized CSS and JavaScript can speed up the loading times of your website. It's like a good restaurant where the service is super snappy. You don't want guests to leave because the coffee takes too long.

4. responsive design

Your site should look good on all devices and be easy to use. The Google spiders also check how mobile-friendly your site is. Imagine your house also has a miniature version for little guests. They should feel just as comfortable as the "normal-sized" visitors.

5. internal linking

Make sure you have clever internal links that guide the crawlers through your website. But beware of endless loops or "Broken Links". It would be like a door in your house leading nowhere or going round in circles.

6. robots.txt

With a Robots.txt file, you can tell the crawlers which areas they should avoid. It's like a "Do not disturb" sign on the door of your private room.

7. avoid error pages

404-Fehlerseiten sind für Crawler wie Sackgassen. Versuch also, diese Fehler zu minimieren oder durch 301-Weiterleitungen zu ersetzen. Das ist, als würdest du eine Umleitung einrichten, wenn die Hauptstraße gesperrt ist.

Typical mistakes you should avoid:

Before you roll out the red carpet for Google crawlers, let's talk briefly about the stumbling blocks you're better off avoiding.

1. clogged Robots.txt

Your Robots.txt is like the bouncer of your club. If it's too strict, no one will get in. So, check your Robots.txt and make sure you're not accidentally blocking important areas of your site.

2. too many 404 errors

Imagine inviting guests to your home and half the doors are locked or lead to nowhere. Not cool, right? Too many 404 errors can irritate crawlers and waste your crawl budget.

3. poor internal linking

It's like you're stuck in a maze with no signposts. The crawlers need a clear structure to find their way around. So link relevant pages to each other, but don't overdo it.

4. slow loading times

Slow websites are like restaurants where the Food takes forever. In the end, you lose patience. And so does Google. So, optimize loading times wherever possible.

5. endless URLs and parameters

Imagine a URL wie eine Straßenadresse vor. Wenn die Adresse ewig lang und voller komischer Zeichen ist, wird es schwierig, sie zu finden. Halte die URLs sauber und einfach.

6. cloaking

Cloaking is like a trap - you show one content to crawlers and another to users. Google hates that and you could be penalized for it. So, just don't do it.

7. duplicate content

It's like hanging the same painting in every room of your house. Google won't know which one is the "original" and could devalue all versions.

8. poor mobile optimization

If your site looks bad or loads slowly on mobile, it's like a restaurant that doesn't have room for strollers. Google wants the best experience for all users, so optimize for mobile.

Conclusion:

Crawlability is like hospitality in a hotel. You want your guests (and Google crawlers) to feel comfortable, find their way around, and be happy to come back. Make it easy for them, and you'll be rewarded in search results.

« Back to Glossary Index

FAQ

What is SEO crawling?

SEO crawling is the process of crawling and capturing websites with the help of search engine crawlers. The crawler reads the content of a website and analyzes it to help search engines understand it better. SEO crawling is an essential part of search engine optimization, as it allows search engines to identify and index the relevant content on a website.

Why should you do SEO crawling?

SEO crawling is an essential part of search engine optimization. Search engine crawlers help index websites and identify their relevant content. This enables websites to achieve better search engine rankings, generate more visibility and traffic, and generate more sales. In addition, SEO crawling can also help identify potentially problematic elements on a website, such as duplicate content or technical issues.

What are the different types of crawlers?

There are basically two types of crawlers: search engine crawlers and user crawlers. Search engine crawlers are specifically designed to crawl websites and identify relevant content. User crawlers are often used to collect data for web analytics and reporting.

How does SEO crawling work?

SEO crawling starts with retrieving a URL from a website, which is then analyzed. The crawler follows links on the page and makes further requests for each URL it finds. The crawler then saves the content it captures and sends it to the search engine to be indexed.

How long does it take for a crawler to crawl a website?

The duration of the crawl process depends on the size of the website and the number of links the crawler needs to retrieve. A small website with few links can be crawled within a few minutes, while a larger website with many links can take several hours.

How to create your own crawler?

There are several ways to create your own crawler. Some websites offer crawler tools that allow users to easily create their own crawlers. There are also many open source tools that allow you to create a crawler on your own. However, it is advisable to hire an expert to create a professional crawler, as proper programming of a crawler requires specialized knowledge.

What are their advantages of SEO crawling?

SEO crawling is an essential component of search engine optimization. It enables search engines to identify and index the relevant content of a website. It helps websites achieve better search engine rankings, generate more visibility and traffic, and generate more sales. In addition, crawling can also reveal potentially problematic elements on a website, such as duplicate content or technical issues.

Are there any disadvantages of SEO crawling?

Some disadvantages of SEO crawling are that it is a slow process that takes a lot of time. In addition, the cost of creating and developing a professional crawler is often high. SEO crawling can also lead to higher server load, as the crawler may send many requests to the server.

Can SEO crawlers damage websites?

No, SEO crawlers do not damage websites. SEO crawlers are specifically designed to crawl websites and capture relevant content. Since they do not make any changes to the website, they do not damage the website.

What tools can be used to perform SEO crawling?

There are many different tools that can be used to perform SEO crawling. Some of the most popular tools are Google Search Console, Screaming Frog SEO Spider, DeepCrawl, Serpstat and Majestic SEO. These tools provide users with various features that can make the SEO crawling process easier.

With top positions to the new sales channel.

Let Google work for you, because visitors become customers.

About the author

Niels Stuck

Niels Stuck has 10 years of SEO experience and is the founder of the SEO agency "WOLF OF SEO". He gained practical experience by building 20+ affiliate sites alongside his marketing studies. Finally, he wrote his bachelor thesis about the influence of SEO on Google rankings, traffic and sales development in the form of a case study. Today, he specializes in e-commerce SEO and helps more than 80 companies build a sustainable organic revenue channel through SEO. Niels advises startups, established brands and corporations in search engine optimization of their online stores and primarily focuses on data-based content strategies and link building. He shares his knowledge about SEO and online marketing in this blog, as a speaker at conferences, in podcasts and as a guest author for OMT, Forbes, Starting Up and many more platforms.

All contributions

Social Media & Links:

Crawling

ÜBER DEN AUTOR

What is crawling?

Pages are crawled for a variety of reasons, including:

How can I improve the crawlability of my website?

1. xml sitemap

2. flat hierarchy

3. fast loading times

4. responsive design

5. internal linking

6. robots.txt

7. avoid error pages

Typical mistakes you should avoid:

1. clogged Robots.txt

2. too many 404 errors

3. poor internal linking

4. slow loading times

5. endless URLs and parameters

6. cloaking

7. duplicate content

8. poor mobile optimization

Conclusion:

Content

FAQ

With top positions to the new sales channel.

About the author

Niels Stuck

Arrange free SEO initial consultation

Open questions? Shoot!

Our services

Overview

SEO top posts

Top ratings

Gifts

SEO Scaling Framework

Request video + PDF now!

SEO Funnel Breakdowns

Jetzt Videos anfordern!