Overview

The HTML Reader API simplifies data extraction from HTML files - delivering structured, accurate outputs from even the most complex web structures. It captures text, tables, images, and metadata with precision.

  • Extracts content from raw HTML files
  • Supports complex nested structures
  • Maintains original formatting and hierarchy
  • High accuracy and fast response time

Great for

  • Web Scraping - Extract data from scraped HTML content
  • Email Processing - Extract content from HTML emails
  • SEO Analysis - Analyze HTML structure and content
  • Data Mining - Process HTML files for business intelligence

Get Your API Key

How to generate API Key

Quick guide to generate your API key

Using the API

Once you have your API key, you can start extracting data from HTML documents.

Check our API Reference for more details

Endpoint

POST
https://html-reader-api.starkai.io/html_reader

Request Example (using CURL)

curl 
  --request POST 
  --url https://html-reader-api.starkai.io/html_reader 
  --header 'Content-Type: application/json' 
  --header 'X-Api-Key: <api-key>' 
  --data '{
            "html_content": "<!DOCTYPE html><html><head><title>Sample</title></head><body><h1>Hello World</h1><p>This is a paragraph.</p></body></html>"
          }'