
seo(robots): disallow tag-filtered blog URLs #4247

Merged
emir-karabeg merged 1 commit into staging from robots-blog-tag on Apr 21, 2026

Conversation

@emir-karabeg
Collaborator

Summary

Adds Disallow: /blog*tag= to the generated robots.txt to prevent crawlers from indexing tag-filtered blog views (both /blog?tag=X and paginated /blog?page=N&tag=X). Per SEO team request to reduce duplicate content surfaced from tag filters.
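For illustration, a minimal sketch of what the change might look like in the touched file (apps/sim/app/robots.ts). This is not the actual file contents: the sibling disallow entries are taken from the review flowchart, the sitemap URL is a placeholder, and a local `Robots` type stands in for `MetadataRoute.Robots` from `next` so the sketch is self-contained.

```typescript
// Stand-in for Next.js's MetadataRoute.Robots type (imported from 'next'
// in the real file); defined locally here so the sketch runs on its own.
type Robots = {
  rules: { userAgent: string; disallow: string[] }
  sitemap?: string
}

// Sketch of apps/sim/app/robots.ts after this PR.
export default function robots(): Robots {
  return {
    rules: {
      userAgent: '*',
      disallow: [
        '/api/',       // illustrative existing entry (from the review flowchart)
        '/workspace/', // illustrative existing entry (from the review flowchart)
        '/blog*tag=',  // new: blocks tag-filtered blog views, incl. paginated ones
      ],
    },
    sitemap: 'https://example.com/sitemap.xml', // placeholder URL
  }
}
```

Next.js writes each `disallow` string verbatim into the generated /robots.txt, so the `*` wildcard survives into the output file.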

Type of Change

  • Other: SEO / robots.txt

Testing

Visit /robots.txt after deploy and confirm the new Disallow: /blog*tag= line is present.

Checklist

  • Code follows project style guidelines
  • Self-reviewed my changes
  • Tests added/updated and passing
  • No new warnings introduced
  • I confirm that I have read and agree to the terms outlined in the Contributor License Agreement (CLA)

@vercel

vercel Bot commented Apr 21, 2026

The latest updates on your projects. Learn more about Vercel for GitHub.

1 Skipped Deployment
Project Deployment Actions Updated (UTC)
docs Skipped Skipped Apr 21, 2026 10:08pm


@cursor

cursor Bot commented Apr 21, 2026

PR Summary

Low Risk
SEO-only change that adjusts robots.txt crawling rules; it could affect blog discoverability if the pattern is overly broad.

Overview
Updates the generated robots.txt to disallow crawling of tag-filtered blog URLs by adding '/blog*tag=' to the disallow list, preventing indexing of /blog?tag=... (including paginated variants).

Reviewed by Cursor Bugbot for commit eb3281b.

@greptile-apps
Contributor

greptile-apps Bot commented Apr 21, 2026

Greptile Summary

Adds Disallow: /blog*tag= to the generated robots.txt via Next.js MetadataRoute.Robots, preventing crawlers from indexing tag-filtered blog pages (e.g. /blog?tag=X, /blog?page=N&tag=X). The wildcard pattern is a widely-supported Google/Bing extension and Next.js renders it verbatim in the output file.

Confidence Score: 5/5

Safe to merge — single-line addition to an SEO config file with no logic changes.

The change is minimal and correct: the wildcard pattern /blog*tag= is a widely-supported robots.txt extension that matches all tag-filtered blog URLs as intended. Next.js renders disallow strings verbatim so the * is preserved. No logic, auth, or data paths are affected.
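To make the wildcard semantics concrete, here is a small sketch of how a crawler evaluates such a rule, assuming the Google/Bing extension where `*` matches any run of characters and a rule applies when the URL's path-plus-query begins with the pattern (the `$` end anchor is omitted for brevity). The `robotsMatch` helper is hypothetical, not part of any library.

```typescript
// Simplified robots.txt rule matching: '*' matches any run of characters,
// and the pattern is anchored at the start of the URL path + query string.
function robotsMatch(pattern: string, url: string): boolean {
  // Escape regex metacharacters (except '*'), then turn '*' into '.*'.
  const escaped = pattern
    .replace(/[.+?^${}()|[\]\\]/g, '\\$&')
    .replace(/\*/g, '.*')
  return new RegExp('^' + escaped).test(url)
}

// '/blog*tag=' blocks tag-filtered views, including paginated variants:
robotsMatch('/blog*tag=', '/blog?tag=ai')        // true
robotsMatch('/blog*tag=', '/blog?page=2&tag=ai') // true
robotsMatch('/blog*tag=', '/blog')               // false
robotsMatch('/blog*tag=', '/blog/my-post')       // false
```

One caveat worth noting: because the rule is a prefix-with-wildcard, it would also cover any path that merely starts with `/blog` and contains `tag=` later (e.g. a hypothetical `/blog-archive?tag=x`), which is the kind of breadth the risk note above alludes to.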

No files require special attention.

Important Files Changed

Filename: apps/sim/app/robots.ts
Overview: Adds /blog*tag= to the disallow list to prevent crawlers from indexing tag-filtered blog URLs; single-line, low-risk SEO change.

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A[Crawler requests /robots.txt] --> B[Next.js robots function]
    B --> C{URL pattern match}
    C -->|matches /blog*tag=| D[Disallowed - not crawled]
    C -->|matches /api/ or /workspace/ etc| D
    C -->|no disallow match| E[Allowed - crawled and indexed]
    D --> F[Crawler skips URL]
    E --> G[Crawler indexes page]

Reviews (1): Last reviewed commit: "seo(robots): disallow tag-filtered blog ..."

@emir-karabeg emir-karabeg merged commit 3a0e7b8 into staging Apr 21, 2026
10 checks passed
@emir-karabeg emir-karabeg deleted the robots-blog-tag branch April 21, 2026 22:13
