📊 My Projects

Welcome Back

Sign in to access your saved projects and analyses

or use password
Don't have an account? Sign up

Analyze how AI sees your website. Get expert recommendations to improve your visibility in ChatGPT, Perplexity, and other AI assistants.

https://
🎯

AI Visibility Score

See how often AI mentions your brand

🏆

Competitor Analysis

Know who's beating you in AI answers

📊

Expert Insights

SEO, Marketing, AI & PR expertise

Actionable Fixes

Prioritized recommendations

example.com
Filter by AI:
What's New
Summarized just now

Analyzing your brand's AI visibility...

Share of Voice ?
--
Calculating...
Average Position ?
--
Calculating...
Visibility Trend ?
--
Calculating...

💡 Executive Summary

Loading analysis...

⚡ Quick Wins ?
High-impact, low-effort improvements
🎯 Brand Associations ?
How well your content aligns with key concepts
💡
What is GEO Score?
Generative Engine Optimization (GEO) measures how likely AI models are to cite your content. Research shows 72.4% of ChatGPT-cited posts include answer capsules (120-150 char summaries). This score analyzes your content structure, data density, and technical markup to predict citability.
📊 GEO Content Scorer ?
Content structure analysis for AI citation optimization
-- GEO Score
-
--
Answer Capsules
Direct answer summaries
--
Section Structure
120-180 words optimal
--
Original Data
Stats & research density
--
Technical SEO
Schema, lists, tables
🎯 Top GEO Improvements
Highest-impact fixes to increase AI citation probability
📄 Page-by-Page Analysis
Click any page to see GEO breakdown and get generated answer capsules
🔍
What is Citation Source Mapping?
When AI answers questions, it pulls information from specific sources. This analysis shows which third-party sites AI is citing in your industry — and crucially, where your competitors appear but you don't. These gaps represent PR and content opportunities.

🛤️ Citation Decision Journey

How AI citations guide users through their buying journey

Your Brand
Competitors
Other Sources
0/4
Stage Coverage
--
Strongest Stage
--
Gap Opportunity
0
Total Citations
🔗 Citation Source Mapping ?
Aggregated citation analysis across all AI prompts tested
--
Total Citations Found
--
Your Domain Cited
--
Citation Gaps
Competitor-only sources
--
Authority Sources
News, .edu, .gov
⚠️ Citation Gaps
High-value sources that cite competitors but not you - your biggest opportunities
🏆 Top Citation Sources
Most frequently cited sources in your category
📊 Competitor Citation Frequency
How often competitors appear in AI responses
🗺️ Vector Embedding Map
Visual representation of your brand's semantic position
📊 Legend
Page Scores
Strong (≥50%)
Medium (≥35%)
Weak (<35%)
Click to Explore
Topic circles → breakdown
Page dots → details
ℹ️ First 10 pages get deep analysis
🎯 Content Themes & Gaps ?
📄 Selected Page ?
Select a page
CLUSTER AFFINITIES ?
⚔️ Competitor Brands ?
📚 Top Cited Sources ?
🔀 Query Fanout Analysis ?
How user prompts expand into multiple search intents
🔴 Critical Issues ?
Problems affecting your AI visibility
🧠

Strategy Hub

AI-powered insights combining your data with cutting-edge marketing intelligence

🔬 Data-Backed 0 data points
⏱️ Knowledge Real-time 2026
🎯 Confidence
📊 Sources Platform + AI
Top Priority Insight Highest Impact
Analyzing your data...
👁️
AI Visibility
0
📝
Content
0
🏆
Authority
0
⚔️
Competitive
0
📈
Growth
0
Generating strategic insights...

🏆 Competitive Intelligence

How you compare in the AI landscape

🗺️ Implementation Roadmap

Recommended sequence based on impact and dependencies

📋 Strategy Brief

🔍 Keyword Research & Search Trends ?
Research keywords, analyze page content, and track search trends

🔎 Keyword Explorer

💡 Suggestions will appear here
🔗 Related Searches
Search for a keyword above

📈 Search Trend

Research a keyword to see trends
Avg Interest
--
Trend
--
Results
--

❓ People Also Ask

Research a keyword to discover related questions

📄 Page Keyword Performance

Extract keywords from your crawled pages and check their organic rankings
--
Pages Analyzed
--
Keywords Found
--
Ranking Top 10
--
Ranking Top 20
--
Not Ranking
🎯 Keyword Opportunities
📊
Click "Analyze Page Keywords" to discover opportunities
Requires a website analysis first • Extracts keywords from your crawled pages and checks their Google rankings

🎯 Generate AI Prompts from Keywords

Test how AI responds to prompts based on your keywords • See if your brand appears in AI answers
Select Keywords to Test:
Run a website analysis first to see extracted keywords here
Consumer Intent Types:

🔥 Trending Searches

📅 Daily Trending
Click "Get Trends" to see what's trending
📈 Rising Searches
Enter a category to see rising searches
🛒 Shopping AI Intelligence ?
Rufus, Alibaba & multi-platform shopping query visibility

🎯 Configure Monitor

💡 Comma-separated list

📝 Query Templates

"Best [category] for [use case]"
"[brand] vs [competitor] [product]"
"[category] under $[price]"
"Should I buy [A] or [B]?"
to
Site Traffic
Monitor human visitors and AI systems crawling your website
🚀
Set Up AI Bot Tracking
Track how often AI systems like GPTBot, ClaudeBot, and PerplexityBot crawl your website. This reveals how visible your content is to AI training and search systems.
👥 AI-Cited Influencers
Discover influential voices that LLMs cite in your industry
🎯
How Influencer Discovery Works
We analyze citations from AI responses to identify individual content creators, thought leaders, and industry experts whose content LLMs trust and cite. These influencers have high potential for brand partnerships, guest content, or outreach as their content directly influences AI recommendations.
0
Influencers Found
0
High Impact
0
Topic Aligned
0
Avg Score
👥
Discovering Creators & Thought Leaders
We scan citations for individual creators on platforms like YouTube, Substack, Medium, and personal blogs.
📄 Pages & Analysis Mode
0 pages tracked for analysis
Analysis Mode
Current: When you click "Update", these exact 0 pages will be re-analyzed for consistent comparison over time.
💡
Managing Your Tracked Pages
Add or remove pages below. Click "Save Changes" to persist updates, then "Update" in the header to re-analyze.
🎭 Synthetic Focus Group ?
Data-grounded consumer personas for concept testing and message validation

🎯 Recommended Tests Based on your data

Loading recommendations...

👥 Brand Personas

🎭
Loading personas...

Create New Focus Group Test

📋 Recent Sessions

No sessions yet. Create your first focus group test above!

Try a specific query to see how AI responds and whether your brand appears.

🏆 Competitor Comparison
⚡ Content Optimizations
📖 How Lighthouse Works

🎯 What is Generative Engine Optimization (GEO)?

When users ask AI assistants like ChatGPT, Perplexity, or Google Gemini questions, these AI models don't search the web like traditional search engines. Instead, they understand the meaning of content through something called vector embeddings — mathematical representations of concepts and ideas.

GEO is the practice of optimizing your content so AI models understand, recommend, and cite your brand when users ask relevant questions. It's the next evolution of SEO.

The Science Behind AI Understanding

Large Language Models (LLMs) like GPT-4 process text through a transformer architecture that converts words into high-dimensional vectors. These vectors capture semantic meaning — words with similar meanings have similar vectors.

When you ask ChatGPT "What's the best electric shaver?", it doesn't keyword-match. It converts your question into a vector and finds content whose vectors are semantically close. This is fundamentally different from traditional SEO where keywords and backlinks dominate.

Key difference: Google ranks pages. AI understands concepts. A page can rank #1 on Google but be invisible to AI if its semantic embeddings don't align with user intent.

🔬 Our Analysis Process

1 Website Crawling

We scan your website's pages, extracting titles, headings, meta descriptions, and body content. This is the raw material AI models use to understand what your site is about.

Crawling Implementation

We use fetch() with custom headers to request your pages, then parse HTML using Cheerio (a Node.js HTML parser). We extract:

  • <title> tag — primary page identifier
  • <h1> through <h3> — content structure
  • <meta name="description"> — summary content
  • Body text — full content stripped of HTML/scripts
  • Schema.org JSON-LD — structured data

We limit to ~20 pages per analysis to balance depth with API costs. Internal links are followed to discover connected pages.

// Simplified crawl logic
const response = await fetch(url, { headers: { 'User-Agent': 'Lighthouse-GEO/1.0' }});
const html = await response.text();
const $ = cheerio.load(html);
const title = $('title').text();
const bodyText = $('body').text().substring(0, 30000);
2 Vector Embedding Generation

We convert your page content into vector embeddings using the same AI technology (OpenAI) that powers ChatGPT. Each page becomes a point in "meaning space" — similar concepts cluster together, different concepts are far apart.

How Embeddings Work

We use OpenAI's text-embedding-3-large model to convert text into 3,072-dimensional vectors. Each dimension captures a different aspect of meaning.

Example: The words "shaver" and "razor" will have vectors pointing in similar directions because they're semantically related. "Shaver" and "pizza" will point in very different directions.

// Embedding generation
const response = await openai.embeddings.create({
  model: 'text-embedding-3-large',
  input: pageContent.substring(0, 30000),
  dimensions: 3072
});
const embedding = response.data[0].embedding; // [0.023, -0.041, 0.089, ...]

Why this matters: This is the same technology ChatGPT uses to understand content. By analyzing your embeddings, we see your content the way AI sees it.

3 Topic Alignment Analysis

We generate topic clusters relevant to your industry and measure how well each of your pages aligns with those topics using cosine similarity — a mathematical measure of how close two concepts are in meaning.

Cosine Similarity Explained

Cosine similarity measures the angle between two vectors. If two vectors point in exactly the same direction, similarity = 1.0 (100%). If perpendicular, similarity = 0. If opposite, similarity = -1.0.

// Cosine similarity calculation
function cosineSim(a, b) {
  let dot = 0, normA = 0, normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

Topic clusters are generated dynamically based on your industry using GPT-4o-mini. We then embed each topic cluster and compare it against each page's embedding.

Interpretation: A page with 70% similarity to "sensitive skin shaving" is semantically well-aligned with that topic. AI is more likely to cite it when users ask related questions.

⚡ Performance Note: To balance analysis depth with API costs, the first 10 pages receive full deep semantic analysis (concept extraction, semantic signals, detailed affinities). Remaining pages receive embedding alignment scores using the same cosine similarity method.

4 AI Visibility Testing

We ask AI models real questions your customers might ask — both with and without your brand name. We track whether AI mentions your brand, which competitors it recommends, and what sources it cites.

Prompt Testing Methodology

We use the OpenAI Responses API with gpt-4o and force the web_search tool to ensure real-time, cited results.

Prompt types:

  • Unbranded: "best electric shaver for sensitive skin" — tests if AI discovers you organically
  • Branded: "Philips shaver reviews" — tests brand recognition
// AI visibility test
const response = await openai.responses.create({
  model: 'gpt-4o',
  tools: [{ type: 'web_search' }],
  tool_choice: { type: 'web_search' }, // Force live search
  input: "What's the best electric shaver for sensitive skin?"
});
// Parse response for brand mentions, competitors, citations

Citation extraction: We parse the response's annotations array to extract URLs the AI cited as sources.

5 Competitive Intelligence

We identify which competitors AI recommends most often and which sources (citations) AI uses to back up its answers. This reveals who's winning the AI visibility battle and why.

Competitor & Citation Extraction

We use regex patterns and NLP heuristics to identify company names mentioned in AI responses. We filter out:

  • Generic terms (e.g., "Premium Choice", "Best Value")
  • Product attributes (e.g., "Sensitive Skin", "Long Battery")
  • The brand being analyzed (to focus on competitors)

We maintain a whitelist of known brands (Braun, Panasonic, Remington, etc.) for accurate detection and aggregate mentions across all prompts.

Citations are extracted from the API's url_citation annotations, which tell us exactly which sources the AI used to formulate its answer.

6 GEO Content Scoring (LLM Citability)

We analyze your page structure for factors that make AI more likely to cite you. Research shows 72.4% of ChatGPT-cited posts include specific content patterns we detect.

GEO Score Components (0-100)

Each page gets four sub-scores (0-25 each):

  • Answer Capsules (0-25): Detects 120-150 character answer summaries after headings. AI loves citing these concise answers.
  • Section Structure (0-25): Optimal paragraph length is 120-180 words. Too short = lacks depth. Too long = loses focus.
  • Original Data (0-25): Counts statistics (%), research citations, years, specific numbers. Data-rich content is more authoritative.
  • Technical SEO (0-25): Schema markup (especially FAQPage/HowTo), lists, and tables. These make content machine-readable.
// GEO Score calculation
const geoScore = answerCapsuleScore + sectionStructureScore 
               + originalDataScore + technicalSEOScore;
// citabilityRating = geoScore >= 70 ? "High" : geoScore >= 45 ? "Medium" : "Low"
7 Citation Source Mapping

We aggregate all citations across all prompts to show which third-party sources AI trusts in your industry. More importantly, we identify citation gaps — sources that cite competitors but not you.

Citation Gap Analysis

We track every URL cited by AI and cross-reference with competitor mentions:

  • Citation Gap: A source that appeared with competitor mentions but not your brand. These are PR opportunities.
  • Authority Sources: High-trust domains (.edu, .gov, news sites) that carry more weight.
  • Category Breakdown: News, reviews, educational, forums — each has different outreach strategies.
// Citation gap detection
const citationGaps = topSources.filter(source => 
  source.competitorsMentioned.length > 0 && 
  !source.includesYourBrand
);

Action: Target citation gap sources for guest posts, press releases, or product reviews. Getting featured there means AI will cite you too.

8 Answer Capsule Generator

For pages missing good answer capsules, we use AI to generate 120-150 character summaries optimized for citation. Copy-paste these after your H2 headings.

Why 120-150 Characters?

Research on ChatGPT citation patterns found:

  • 72.4% of cited posts have concise answer summaries immediately after headings
  • Optimal length: 120-150 characters — long enough for detail, short enough to cite verbatim
  • Direct answers (not questions or lead-ins) perform best

We use GPT-4o-mini to analyze your page sections and generate capsules that match your content while following citation best practices.

9 Multi-LLM Cross-Platform Testing

We test your brand visibility across 7 different AI platforms simultaneously: ChatGPT, Claude, Gemini, DeepSeek, Qwen, Perplexity, and Google AI Overview. Different AIs recommend different brands!

Why Multi-LLM Matters

Each AI has different training data, biases, and search capabilities:

  • 🟢 ChatGPT: OpenAI's GPT-4o with live web search — largest market share
  • 🟣 Claude: Anthropic's AI — known for accuracy and safety focus
  • 🔵 Gemini: Google's AI — integrated with Google Search data
  • 🟤 DeepSeek: Chinese AI gaining global traction — different training corpus
  • 🔶 Qwen: Alibaba's multilingual AI — strong in Chinese, English, and other languages
  • 🟠 Perplexity: Search-first AI with real-time web access — heavily citation-focused
  • 🔴 Google AI Overview: Appears in Google Search results — massive visibility impact

We aggregate visibility across all platforms to give you a true cross-platform AI visibility score.

// Parallel testing across all LLMs
const results = await Promise.allSettled([
  searchChatGPT(prompt),  // Real-time search
  searchClaude(prompt),    // Training data
  searchGemini(prompt),    // Google ecosystem
  searchDeepSeek(prompt),  // Asian market focus
  searchQwen(prompt),      // Alibaba multilingual
  searchPerplexity(prompt), // Search-first
  searchGoogleAIOverview(prompt) // via SERPapi
]);
10 AI Commerce Intelligence 🛒

Track how AI systems recommend your products in shopping-intent queries. Measure your AI Share of Shelf, recommendation position, merchant citations, and sentiment.

Shopping Query Monitor

We generate shopping-intent prompts tailored to your industry:

  • General: "best [category] for [use case]", "top rated [category] 2026"
  • Comparison: "[brand] vs [competitor] [product]"
  • Price-based: "[category] under $[price]"
  • Head-to-head: "should I buy [product A] or [product B]?"

Key Metrics:

  • AI Share of Shelf: % of shopping queries where your product appears vs. competitors
  • Recommendation Position: Average rank in AI's product recommendations
  • Sentiment Score: How positively AI describes your products
  • Merchant Distribution: Which retailers AI recommends buying from
// Commerce query execution
const shoppingQueries = generateShoppingQueries(config);
for (const query of shoppingQueries) {
  const results = await searchAllLLMs(query);
  // Parse: product mentions, prices, merchants, sentiment
  // Track: brand position, competitor mentions
}

Competitive Battlecards: We run head-to-head comparison queries and analyze which product AI prefers, along with pros/cons mentioned for each.

11 Site Traffic

Monitor which AI crawlers are visiting your website. See visits from GPTBot (OpenAI), ClaudeBot (Anthropic), PerplexityBot, QwenBot (Alibaba), Google-Extended, and more. Compare human vs. bot traffic side-by-side.

How Bot Tracking Works

We provide a lightweight tracking pixel you add to your website. It works in two ways:

  • Human Tracking: A 1x1 transparent pixel fires on page load, recording visits, referrers, and device info
  • Bot Detection: Server-side user-agent analysis identifies AI crawlers by their signature patterns
// AI bots we detect
const AI_BOTS = [
  { pattern: /GPTBot/i, provider: 'openai', type: 'training' },
  { pattern: /ChatGPT-User/i, provider: 'openai', type: 'browsing' },
  { pattern: /ClaudeBot/i, provider: 'anthropic', type: 'training' },
  { pattern: /PerplexityBot/i, provider: 'perplexity', type: 'search' },
  { pattern: /Google-Extended/i, provider: 'google', type: 'training' },
  // ... and more
];

Key Metrics:

  • Human Pageviews: Total human visits tracked via pixel
  • Bot Visits: AI crawler visits by provider (OpenAI, Anthropic, etc.)
  • Bot Traffic %: Ratio of AI bots to human visitors
  • Top Crawled Pages: Which pages AI is most interested in
  • Top Referrers: Where your human traffic comes from

Why it matters: If AI bots are crawling your site frequently, your content is being ingested into AI training data or real-time search indexes. More crawls = higher likelihood of being cited.

📊 Understanding Your Metrics

GEO Score (0-100)

Your content's citability score — how likely AI is to quote your content directly. Based on answer capsules, section structure, data density, and technical markup. Aim for 60+ (Grade B).

GEO Score = Answer Capsules (0-25) + Section Structure (0-25) + Original Data (0-25) + Technical SEO (0-25)

Grade interpretation:
• A (80-100) = Highly citable, well-structured content
• B (60-79) = Good structure, minor improvements needed
• C (40-59) = Moderate citability, needs optimization
• D/F (<40) = Low citability, significant restructuring needed

Topic Alignment Score

How semantically close your page content is to topics customers search for. Higher = AI more likely to cite you for that topic. Aim for 60%+ on priority topics.

Calculated as cosineSimilarity(pageEmbedding, topicEmbedding) * 100. We generate embeddings for both your page content and a rich description of each topic, then measure their vector similarity.

Score interpretation:
• 60%+ = Strong alignment (green) — likely to be cited
• 35-60% = Moderate (yellow) — may be cited, room for improvement
• <35% = Weak (red) — unlikely to be cited for this topic

Brand Visibility

How often AI mentions your brand when answering relevant questions. We test both branded (users ask about you by name) and unbranded (users search the category) prompts.

Calculated as (promptsWithBrandMention / totalPrompts) * 100. We check if your brand name (extracted from your domain) appears anywhere in the AI's response text.

Unbranded visibility is the critical metric — it shows whether AI recommends you when users don't specifically ask for your brand. This is the GEO equivalent of ranking for non-branded keywords.

Semantic Signals

Quality indicators AI looks for: Expertise (depth of knowledge), Trustworthiness (credible sources), Specificity (detailed content), Actionability (practical advice).

These signals are estimated using GPT-4o-mini analysis of your page content. The model evaluates each dimension on a 0-1 scale based on:

  • Expertise: Technical depth, industry knowledge, original insights
  • Trustworthiness: Citations, data, balanced perspective, authoritative tone
  • Specificity: Concrete details vs generic claims, specific examples
  • Actionability: Clear steps, practical advice, implementable recommendations
Multi-LLM Visibility (%)

The percentage of AI platforms that mention your brand. We test 7 different AIs — each has different training data and biases. A brand visible in ChatGPT may be invisible in Gemini!

Calculated as (llmsWithBrandMention / totalLLMsTested) * 100. For each prompt, we test across:

  • ChatGPT (OpenAI): ~70% market share, live web search
  • Claude (Anthropic): Growing enterprise adoption
  • Gemini (Google): Integrated with Google ecosystem
  • DeepSeek: Dominant in Asian markets
  • Perplexity: Search-first AI with citations
  • Google AI Overview: Appears in Google Search results

Why it matters: Your competitors may be crushing you on one platform but losing on another. Multi-LLM testing reveals your true cross-platform visibility.

💡 Why This Matters for Your Business

📱 AI is becoming the new search. Millions of users now ask ChatGPT, Perplexity, and Google Gemini for recommendations instead of Googling.

🎯 AI gives ONE answer, not ten links. Unlike Google's 10 blue links, AI gives a single recommendation. If you're not in that answer, you're invisible.

📈 Early movers win. Brands optimizing for AI visibility now will dominate their categories as AI adoption accelerates.

🚀 What Should You Do?

  1. Fix weak pages first. Pages with low alignment scores (red dots) are invisible to AI. Add more depth, specificity, and relevant concepts.
  2. Target missing concepts. If AI expects certain topics in your category but you don't cover them, create content for those gaps.
  3. Build semantic authority. Create comprehensive, expert content that demonstrates E-E-A-T (Experience, Expertise, Authoritativeness, Trust).
  4. Monitor competitors. Track which competitors AI recommends and study what content makes them visible.
  5. Earn citations. Get your content cited by authoritative sources — AI trusts content that other trusted sources reference.

🛠️ Technical Stack

Embeddings: OpenAI text-embedding-3-large (3072 dims)
AI Testing: GPT-4o with web_search tool
Analysis: GPT-4o-mini for concept extraction
Similarity: Cosine similarity on embedding vectors
Backend: Node.js + Express on Railway
Crawling: Cheerio HTML parser + ScraperAPI

Built with full transparency. All methods described above are exactly how this platform operates.
Data reflects real-time AI model behavior as of your analysis date.

— checks API keys
🔧 LLM Connection Test
Testing all LLM connections...
📄 Page Embedding Analysis

👑 Admin Dashboard Admin

📊 Recent Analyses
Loading...
📊 Add Site Traffic Tracking
📊 What You'll Track
See when AI systems crawl your website:
🤖 OpenAI GPTBot
🧠 Anthropic ClaudeBot
🔍 Perplexity PerplexityBot
🔎 Google Google-Extended
👤 Meta AI Crawler
🍎 Apple Applebot
⚡ One Simple Step
Add this script to every page of your website. Works with Google Tag Manager, WordPress, Shopify, or any HTML site.
<script src="[YOUR_DOMAIN]/t/tracker.js"></script>
📦 Installation Options
Google Tag Manager
1. Go to GTM → Tags → New
2. Choose "Custom HTML"
3. Paste the script above
4. Trigger: All Pages
5. Save and Publish
WordPress
1. Go to Appearance → Theme Editor
2. Open footer.php
3. Paste the script before </body>
4. Save
Or use a plugin like "Insert Headers and Footers"
Shopify
1. Go to Online Store → Themes
2. Click Actions → Edit code
3. Open theme.liquid
4. Paste the script before </body>
5. Save
Any HTML Site
Add the script tag before </body> on every page, or in your site's footer template/include file.
✨ What happens next?
Once installed, we'll start tracking AI bot visits automatically. Data will appear in this dashboard within 24-48 hours as bots crawl your site.
!
🎯
New to Lighthouse? Take a quick tour to learn how to get the most from your analysis.
Step 1 of 8 Welcome
Tour content goes here
Aurora
Ready to help
👋 Welcome

I'm your GEO Expert Agent

I can help you understand your AI visibility analysis and provide expert recommendations. Try asking: