Transparency notice
Why you may see LUPPI on your website
LUPPI ingests a limited amount of publicly available web content to improve character knowledge, safety guardrails, and content attribution. We built this page so site owners can understand how our crawler behaves, how to opt out, and who to contact with questions.
User-Agent header
SolarisScraper/1.0 (+https://luppi.ai/web-scraping-info)
If you see this signature in your server logs, the request originated from our automated web-scraping workflow.
What we collect
- Publicly available text content that helps characters reference verified facts.
- Structured data (titles, headings, metadata) used for attribution and source linking.
- Only pages allowed by
robots.txtand standard rate limits.
What we never collect
- Content behind authentication, paywalls, or that requires manual interaction.
- Personal data, private API endpoints, or pages blocked by your access controls.
- Information from sites that ask us to stop via robots exclusions or a direct request.
How to control or opt out
We honor standard exclusion mechanisms and provide multiple ways to reach us:
- Add
User-agent: SolarisScraperrules to yourrobots.txtfile to block us. - Throttle or deny requests from the exact header shown above; we do not rotate the signature.
- Email us at support@luppi.ai with your domain and we will manually suppress future crawls.
Responsible use commitments
- Requests are rate limited and monitored to avoid degrading your infrastructure.
- We attribute sources whenever generated content references your material.
- Compliance and safety teams regularly audit the collected data for misuse.
Have more questions? Reach out and we will respond within two business days.
Last updated: October 22, 2025
