Course → Module 0: The Visibility Lie
Session 3 of 5

Open your website in a browser. You see colors, fonts, images, navigation, and carefully written paragraphs. That is the human view. Now right-click and select "View Page Source." What you see is what Google sees: raw HTML, script tags, meta elements, and (usually) very little structured information about who you are.

Googlebot does not render your page the way Chrome does. It fetches the HTML, processes JavaScript in a rendering queue, and extracts signals. The signals it cares about most are not your prose. They are your markup, your metadata, your structured data declarations, and your link relationships.

The Googlebot Pipeline

When Googlebot visits your website, it follows a specific sequence. Understanding this sequence reveals where most websites fail.

graph LR A["Discover URL"] --> B["Fetch HTML"] B --> C["Queue for Rendering"] C --> D["Render JavaScript"] D --> E["Extract Signals"] E --> F["Index or Discard"] style A fill:#2a2a28,stroke:#c8a882,color:#ede9e3 style B fill:#2a2a28,stroke:#c8a882,color:#ede9e3 style C fill:#2a2a28,stroke:#8a8478,color:#ede9e3 style D fill:#2a2a28,stroke:#8a8478,color:#ede9e3 style E fill:#2a2a28,stroke:#6b8f71,color:#ede9e3 style F fill:#2a2a28,stroke:#c47a5a,color:#ede9e3

At the "Extract Signals" stage, Google looks for specific data points. Here is what it prioritizes and what most business websites actually provide:

Signal CategoryWhat Google ExtractsTypical Business Site
Page identityTitle tag, meta description, H1Usually present
ContentBody text, headings, images with alt textUsually present
Structured dataJSON-LD schema blocksAbsent on 70%+ of SMB sites
Entity declarationsOrganization, Person, LocalBusiness schemaRare
Link signalsInternal links, external links, anchor textRandom or minimal
External corroborationLinks from directories, citations, mentionsInconsistent or absent
Technical healthLoad speed, mobile-friendliness, HTTPSUsually acceptable

Most business websites pass on page identity, content, and technical health. They fail on structured data, entity declarations, link signals, and external corroboration. That means they provide Google with text to read but no identity to attach it to.

Google can read your content without knowing who you are. But it ranks content from known entities higher than content from unknown sources.

The Source Code Test

You can perform a quick diagnosis of your own website right now. View your homepage source code and search for these strings:

If you find none of these, your website is structurally anonymous. Google sees text on a domain but has no machine-readable way to connect that domain to a specific business entity.

What Your Competitors' Source Code Reveals

Run the same test on a competitor who ranks well. You will likely find JSON-LD blocks declaring their organization name, address, logo, founding date, social profile URLs, and contact information. Their pages may also include Article or Product schema, BreadcrumbList markup, and FAQ schema.

Each of these schema blocks is a direct communication to Google: "Here is who we are. Here is what this page contains. Here is how this page relates to our entity." Without these declarations, Google must infer everything from context. Inference is slower, less reliable, and less confident than explicit declaration.

The Rendering Gap

One additional complication: Google renders JavaScript in a separate queue from its initial HTML fetch. If your website relies on JavaScript frameworks to load content (React, Vue, Angular without server-side rendering), Googlebot may initially see an empty page. The content only appears after rendering, which can be delayed by hours or days.

This rendering gap means JavaScript-heavy sites face a double penalty: delayed content discovery and (usually) no structured data, since many SPA frameworks do not include JSON-LD in their server-rendered output.

Static HTML sites or server-side rendered sites avoid this problem. Google sees the full content and any structured data on the first fetch. For entity infrastructure purposes, simpler technical architectures are better.

Further Reading

Assignment

Right-click on your homepage, select "View Page Source." Search for "schema" or "application/ld+json." If you find nothing, your website has zero structured identity. Write down exactly what you found (or did not find). Then do the same for a competitor who ranks well in your industry. Document the difference.