Structured Data
Introduction
By default, Olostep supports returning content in various formats including:
- HTML
- Text
- Raw PDF
- Markdown
For some use cases you might not need the entire content but only specified, structured and clean data in JSON format.
To achieve this, you may use:
- Parsers: Ideal for high-volume, consistent, recurring website scraping
- LLM Extraction: Ideal for flexible extraction needs or websites with changing structures
Both methods provide clean, structured JSON data that can be immediately used in your applications without additional processing.