Transparency notice

Why you may see LUPPI on your website

LUPPI ingests a limited amount of publicly available web content to improve character knowledge, safety guardrails, and content attribution. We built this page so site owners can understand how our crawler behaves, how to opt out, and who to contact with questions.

User-Agent header

SolarisScraper/1.0 (+https://luppi.ai/web-scraping-info)

If you see this signature in your server logs, the request originated from our automated web-scraping workflow.
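
As a rough illustration, a site owner could count requests carrying this signature with a few lines of Python. This is a minimal sketch, not a tool we ship: the function name `count_crawler_hits` and the sample log lines are hypothetical, and it assumes your server writes the User-Agent string into each log line, as the common combined log format does.

```python
# Signature copied from the User-Agent header shown above.
SIGNATURE = "SolarisScraper/1.0 (+https://luppi.ai/web-scraping-info)"

def count_crawler_hits(log_lines):
    """Count log lines that contain the crawler's exact User-Agent string."""
    return sum(1 for line in log_lines if SIGNATURE in line)

# Hypothetical sample lines in combined log format, for illustration only.
sample_lines = [
    '203.0.113.7 - - [22/Oct/2025:10:00:00 +0000] "GET / HTTP/1.1" 200 512 '
    '"-" "SolarisScraper/1.0 (+https://luppi.ai/web-scraping-info)"',
    '198.51.100.4 - - [22/Oct/2025:10:00:05 +0000] "GET /about HTTP/1.1" 200 1024 '
    '"-" "Mozilla/5.0"',
]

hits = count_crawler_hits(sample_lines)
print(hits)
```

To scan a real log, pass an open file object instead of the sample list; the path to that file depends on your server setup.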

What we collect

Publicly available text content that helps characters reference verified facts.
Structured data (titles, headings, metadata) used for attribution and source linking.
Only pages that robots.txt allows, fetched within standard rate limits.

What we never collect

Content behind authentication or paywalls, or content that requires manual interaction.
Personal data, private API endpoints, or pages blocked by your access controls.
Information from sites that ask us to stop via robots exclusions or a direct request.

How to control or opt out

We honor standard exclusion mechanisms and provide multiple ways to reach us:

Add User-agent: SolarisScraper rules to your robots.txt file to block us.
Throttle or deny requests that carry the exact User-Agent header shown above; we do not rotate the signature.
Email us at support@luppi.ai with your domain and we will manually suppress future crawls.
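
Before deploying a robots.txt rule, you may want to check that it blocks the crawler and nothing else. The sketch below uses Python's standard `urllib.robotparser` for that. The robots.txt content, the helper name `is_allowed`, the `example.com` URL, and the `OtherBot` agent are all illustrative assumptions, and the result reflects how Python's parser reads the rules rather than a guarantee about any particular crawler.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block SolarisScraper everywhere,
# leave every other crawler unrestricted.
ROBOTS_TXT = """\
User-agent: SolarisScraper
Disallow: /

User-agent: *
Disallow:
"""

# Signature copied from the User-Agent header section of this page.
CRAWLER_UA = "SolarisScraper/1.0 (+https://luppi.ai/web-scraping-info)"

def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given rules permit user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)

crawler_ok = is_allowed(ROBOTS_TXT, CRAWLER_UA, "https://example.com/page")
other_ok = is_allowed(ROBOTS_TXT, "OtherBot/2.0", "https://example.com/page")
print("SolarisScraper allowed:", crawler_ok)
print("OtherBot allowed:", other_ok)
```

To block only part of a site, replace `Disallow: /` with the path prefix you want to exclude and rerun the check against URLs under that prefix.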

Responsible use commitments

Requests are rate limited and monitored to avoid degrading your infrastructure.
We attribute sources whenever generated content references your material.
Compliance and safety teams regularly audit the collected data for misuse.

Have more questions? Reach out and we will respond within two business days.

Last updated: October 22, 2025