Documentation Index
Fetch the complete documentation index at: https://docs.upsonic.ai/llms.txt
Use this file to discover all available pages before exploring further.
Overview
XML loader processes XML files using XPath expressions to split documents and extract content. Supports namespace handling, attribute extraction, and flexible content synthesis modes. Loader Class:XMLLoader
Config Class: XMLLoaderConfig
Install
Install the XML loader optional dependency group:
Examples
Parameters
| Parameter | Type | Description | Default | Source |
|---|---|---|---|---|
encoding | str | None | File encoding (auto-detected if None) | None | Base |
error_handling | "ignore" | "warn" | "raise" | How to handle loading errors | ”warn” | Base |
include_metadata | bool | Whether to include file metadata | True | Base |
custom_metadata | dict | Additional metadata to include | Base | |
max_file_size | int | None | Maximum file size in bytes | None | Base |
skip_empty_content | bool | Skip documents with empty content | True | Base |
split_by_xpath | str | XPath expression to identify document elements | "//*[not(*)] | //item | //product | //book" | Specific |
content_xpath | str | None | Relative XPath to select content | None | Specific |
content_synthesis_mode | "smart_text" | "xml_snippet" | Content format | ”smart_text” | Specific |
include_attributes | bool | Include element attributes in metadata | True | Specific |
metadata_xpaths | dict[str, str] | None | Map metadata keys to XPath expressions | None | Specific |
strip_namespaces | bool | Remove XML namespaces | True | Specific |
recover_mode | bool | Parse malformed XML | False | Specific |

