ingestion.url_ingestor¶
chunktuner.ingestion.url_ingestor
¶
Fetch remote URLs into Document objects.
URLIngestor
¶
Fetch HTTP(S) resources into a single Document (HTML preprocessed to text).
ingest_url
¶
GET url and map response body to text / markdown / html content.