Google New Update Specifies 15MB Page Size for Indexing

Google New Update Specifies 15MB Page Size for Indexing

SUPERCHARGE YOUR ONLINE VISIBILITY! CONTACT US AND LET’S ACHIEVE EXCELLENCE TOGETHER!

    Page Performance and Page SIze have always been primary KPIs while monitoring and optimizing the technical aspects of SEO.

    Recently, Google released a document in Search Central, about this and specified the following about Page size limits.

    15mb page size indexing

    “Namely, we’ve documented that Googlebot only ever “sees” the first 15 megabytes (MB) when fetching certain file types.“

    However, the biggest question in the SEO community is revolving around whether the page size limit includes images, videos and other resources as well.

    Understanding Google’s Indexing Process

    Understanding Google’s indexing process is crucial for anyone seeking to maximize their online presence and ensure that their web pages are effectively discovered and ranked by the world’s most popular search engine. At its core, Google’s indexing process involves two main stages: crawling and discovering web content, and indexing and storing web pages in Google’s vast database.

    A. How Google Indexes Web Pages

    Crawling and Discovering Web Content:

    Google employs automated programs known as “spiders” or “crawlers” to systematically browse the internet and discover new web pages. These crawlers follow links from one page to another, continually traversing the interconnected network of the World Wide Web. When a crawler encounters a new web page, it analyzes the page’s content, structure, and metadata to determine its relevance and importance.

    Key Points:

    • Crawlers use algorithms to prioritize which pages to visit based on factors like page rank, recency of content updates, and authority of the website.
    • XML sitemaps and internal linking structures play a crucial role in guiding crawlers to important pages within a website.

    Indexing and Storing Web Pages in Google’s Database:

    Once a crawler accesses a web page, it extracts and processes the page’s content, including text, images, and other multimedia elements. This information is then added to Google’s index, a massive database containing information about billions of web pages. Google’s index serves as the foundation for its search results, enabling users to quickly find relevant information in response to their queries.

    Key Points:

    • Google’s index is continuously updated as crawlers discover new pages, revisit existing ones, and detect changes to page content.
    • Pages that meet Google’s quality guidelines and provide valuable, relevant content are more likely to be indexed and ranked prominently in search results.

    B. Importance of Page Size in Indexing

    Impact on Crawlability and Indexing Efficiency:

    The size of a web page can significantly influence its crawlability and indexing efficiency. Large pages with extensive content, complex structures, or numerous multimedia elements may take longer for crawlers to process and index. As a result, excessively large pages may experience delays in being fully indexed or may be indexed incompletely.

    Key Points:

    • Google recommends keeping web page sizes reasonable to ensure efficient crawling and indexing.
    • Large pages should be optimized to prioritize the most important content and reduce unnecessary bloat, such as oversized images or excessive JavaScript code.

    Factors Influencing Google’s Indexing Decisions:

    Google’s indexing decisions are influenced by various factors, including page size, relevance, authority, and user experience. While page size alone does not determine whether a page will be indexed, excessively large pages may be deprioritized in favor of smaller, more streamlined alternatives.

    Key Points:

    • Google aims to provide users with a diverse range of high-quality content in its search results.
    • Pages that offer valuable, well-organized content in a user-friendly format are more likely to be indexed and ranked favorably by Google.

    Understanding Google’s indexing process and the importance of page size therein is essential for website owners, marketers, and content creators alike. By optimizing web pages for efficient crawling, indexing, and user experience, businesses can increase their visibility in search results and attract more organic traffic to their websites.

    What does the Size Limit Correspond to?

    Here’s what John Muller had to say about it. 

    “It’s specific to the HTML file itself like it’s written.Embedded resources/content pulled in with IMG tags is not a part of the HTML file.”

    Rest assured there is no cause for concern. 15 MB is a reasonably large file size the recommended file size for HTML pages is not more than 100KB. 

    Whenever we open a new website, our browser first downloads the HTML page as it is. Based on the HTML contents it makes additional requests for other resources. So rest assured all Images and videos that are referenced by the img and video tags are indexed separately.

    For example: <img src=”https://example.com/images/puppy.jpg” alt=”cute puppy looking very disappointed” />

    So if you are being a good SEO and following all of its best practices, you shouldn’t be having a problem. There are a few pages on the internet of that size, and you may never encounter one unless you want to fit an entire book within a single page.

    What are its SEO Implications?

    This update however does not change the Status Quo of the SEO Best Practices. The best practice still dictates the recommended page size to be 100KB. The 15 MB threshold was present previously, only that it was never officially announced. The recent article changes that.

    Read full article > https://developers.google.com/search/blog/2022/06/googlebot-15mb

    However, if your page is media-heavy and contains a lot of resources, it probably would be wise to perform certain tasks.

    The following steps might help in Indexing a page near to or exceeding 15MB including resources:

    • The HTML code must be structured in a way that all SEO Relevant content stays on the upper side of the page.
    • The images and videos should be compressed and not directly related to the Web Pages.

    Although putting a limit to the page size may sound worrisome, however, 15MB is a reasonably high size for a page.

    If you Ever have Pages with more size than 15MB you probably would have other underlying issues. 

    Future Trends and Considerations

    Google’s page size limit involves examining its potential impact on web development practices, anticipating the evolution of Google’s indexing algorithms and criteria, and exploring emerging technologies and approaches for optimizing page size and performance.

    A. Potential Impact of the Page Size Limit on Web Development Practices

    Google’s implementation of a page size limit for indexing is likely to influence how websites are developed and optimized in the future. Web developers will need to prioritize page size optimization as a fundamental aspect of website design and development. This shift in focus may lead to changes in coding practices, resource management, and content delivery strategies.

    One potential impact is a greater emphasis on modular and lightweight web components. Developers may adopt techniques such as code splitting, lazy loading, and resource prioritization to reduce the initial page payload and improve loading times. Additionally, optimizing images, scripts, and other assets will become standard practice to ensure compliance with Google’s page size guidelines.

    The page size limit may prompt the adoption of alternative technologies and frameworks that prioritize performance and efficiency. Lightweight frameworks, serverless architectures, and content delivery networks (CDNs) may see increased adoption as developers seek to optimize page size and improve website performance.

    B. Evolution of Google’s Indexing Algorithms and Criteria

    Google’s indexing algorithms and criteria are continually evolving to reflect changes in user behavior, technological advancements, and the evolving landscape of the web. The introduction of a page size limit is just one example of how Google adapts its algorithms to prioritize user experience and relevance.

    In the future, we can expect Google to place even greater emphasis on page speed, mobile-friendliness, and overall user experience as ranking factors. This may lead to further refinements in how Google evaluates and indexes web pages based on their performance metrics, including page size, loading speed, and interactivity.

    Google may introduce new tools and resources to help webmasters monitor and optimize page size and performance effectively. This could include enhancements to Google Search Console, PageSpeed Insights, and other developer tools to provide actionable insights and recommendations for improvement.

    C. Emerging Technologies and Approaches for Optimizing Page Size and Performance

    As web development practices continue to evolve, new technologies and approaches for optimizing page size and performance are emerging. These innovations aim to address the challenges posed by increasing page complexity, multimedia content, and user expectations for fast and responsive web experiences.

    One emerging technology is WebAssembly, which allows developers to run high-performance, low-level code directly in the browser. WebAssembly enables more efficient execution of computationally intensive tasks, reducing the need for large JavaScript bundles and improving overall page performance.

    Advances in image and video compression algorithms are enabling more efficient delivery of multimedia content while maintaining visual quality. Technologies such as AVIF (AV1 Image File Format) and HEIC (High Efficiency Image Format) offer superior compression compared to traditional image formats like JPEG and PNG, reducing file sizes and improving page load times.

    Serverless architectures and edge computing are gaining traction as scalable and efficient solutions for delivering dynamic content and applications. By leveraging serverless platforms and CDNs, developers can optimize content delivery and reduce latency, resulting in faster page load times and improved user experiences.

    The future of web development will be shaped by the need to optimize page size and performance in accordance with Google’s indexing guidelines and evolving user expectations. By embracing emerging technologies and best practices for optimization, developers can create faster, more efficient, and more engaging web experiences for users worldwide.

    How to Measure Page SIze and perform Technical Analysis

    There are many tools that can measure page size. Any one of them can be referred to while finding page size.

    Tools to measure Page Size

    • Google Page Speed Insights.
    • GTMetrix
    • Screaming Frog(Best option)

    You can also check the Page Size using Chrome Developer Tools:

    However, analyzing technical issues with your website might require a full-scale crawl. 

    Tools to Perform Technical Analysis

    • SERanking
    • SEOOptimer
    • SEMRush
    • Ahrefs
    • SEOSiteCheckUp