Skip to content

Commit

Permalink
Update crawlers info
Browse files Browse the repository at this point in the history
  • Loading branch information
ikedas committed Jan 30, 2025
1 parent c003ea0 commit 8a6b41b
Showing 1 changed file with 16 additions and 3 deletions.
19 changes: 16 additions & 3 deletions src/lib/Sympa/WWW/Crawlers.pm
Original file line number Diff line number Diff line change
Expand Up @@ -97,7 +97,7 @@ use constant crawler => qr{
| findthatfile
| europarchive\.org
| NerdByNature\.Bot
| sistrix[ ]crawler
| (sistrix|SISTRIX)[ ][cC]rawler
| Ahrefs(Bot|SiteAudit)
| fuelbot
| CrunchBot
Expand Down Expand Up @@ -381,11 +381,15 @@ use constant crawler => qr{
| Amazon[ ]CloudFront
| Pulsepoint
| CloudFlare-AlwaysOnline
| Cloudflare-Healthchecks
| Cloudflare-Traffic-Manager
| CloudFlare-Prefetch
| Cloudflare-SSLDetector
| https:\/\/developers\.cloudflare\.com\/security-center\/
| Google-Structured-Data-Testing-Tool
| WordupInfoSearch
| WebDataStats
| HttpUrlConnection
| Seekport[ ]Crawler
| ZoomBot
| VelenPublicWebCrawler
| MoodleBot
Expand Down Expand Up @@ -511,7 +515,7 @@ use constant crawler => qr{
| seostar\.co
| IonCrawl
| Uptime-Kuma
| SeekportBot
| Seekport
| FreshpingBot
| Feedbin
| CriteoBot
Expand All @@ -523,6 +527,7 @@ use constant crawler => qr{
| OdklBot\/
| GPTBot
| ChatGPT-User
| OAI-SearchBot
| YandexRenderResourcesBot\/
| LightspeedSystemsCrawler
| ev-crawler\/
Expand Down Expand Up @@ -586,6 +591,14 @@ use constant crawler => qr{
| [cC]laude[bB]ot
| Monsidobot
| GroupMeBot
| Vercelbot
| vercel-screenshot
| facebookcatalog\/
| meta-externalagent\/
| meta-externalfetcher\/
| AcademicBotRTU
| KeybaseBot
| CookieHubScan
)
}x;

Expand Down

0 comments on commit 8a6b41b

Please sign in to comment.