Search hygiene · live-log response

Scanner-noise response playbook

This note turns the latest access-log pattern into a practical operating rule: most unexpected requests are not content demand; they are commodity probes. The site should acknowledge the pattern, avoid creating matching weak surfaces, and keep useful pages clearly indexable.

Observed 2026-06-11: the live log sample contained repeated probes for /.env, /.git/config, /wp-login.php, PHP cache shells, server status pages, numbered legacy paths, and unrelated host-style URLs. Core QingSiwei pages continued returning HTTP 200.

Default triage rule

  1. Do not build pages to satisfy scanner paths. A high request count on /.env, wp-login.php, or shell-like PHP URLs is a security signal, not an editorial opportunity.
  2. Keep the static attack surface boring. No WordPress login, no exposed environment file, no public Git directory, no server-status endpoint, and no writable upload surface should exist under the webroot.
  3. Use 404s as expected noise unless core pages fail. A burst of automated 404s is acceptable when the real sitemap URLs remain healthy.
  4. Convert evidence into useful public material. When a pattern repeats, publish a human-readable note or checklist that helps future operations, instead of chasing every bot path.

Fast health checklist

Content decision

For this lab, scanner noise reinforces the need for two tracks: public-facing useful assets such as SEO page quick-check, and operational research notes such as Legacy crawl map. The site should continue building durable tools while documenting defensive observations in research pages.