trafilatura

What is trafilatura?

About

trafilatura is an uncategorized agent. If you think this is incorrect or can provide additional detail about its purpose, please contact us. You can see how often trafilatura visits your website by setting up Dark Visitors agent analytics.

Expected Behavior

Behavior will vary depending on whether this agent is a search engine crawler, data scraper, archiver, one-off fetcher, etc.

Type

Uncategorized
Not currently assigned a type

Detail

Last Updated 7 minutes ago

Insights

Top Website Robots.txts

0%
0% of top websites are blocking trafilatura
Learn How →

Country of Origin

United States
trafilatura normally visits from the United States

Global Traffic

The percentage of all internet traffic coming from Uncategorized Agents

Top Visited Website Categories

News
Hobbies and Leisure
Food and Drink
Arts and Entertainment
Computers and Electronics
Get These Insights for Your Website
Use the WordPress plugin, Node.js package, or API to get started in seconds.

Robots.txt

Should I Block trafilatura?

It's difficult to say without a type. Its purposes could either be good or bad for your website, depending on what it is.

How Do I Block trafilatura?

⚠️ Manual Robots.txt Edits Are Not Scalable
New agents are created every day. Instead, serve a continuously updating robots.txt that blocks new agents automatically.

You can block trafilatura or limit its access by setting user agent token rules in your website's robots.txt. Set up Dark Visitors agent analytics to check whether it's actually following them.

User Agent String trafilatura/1.9.0 (+https://github.com/adbar/trafilatura)
# robots.txt
# This should block trafilatura

User-agent: trafilatura
Disallow: /