Class DOCXTextExtractor
- Namespace
- FoundationaLLM.Vectorization.DataFormats.Office
- Assembly
- FoundationaLLM.Vectorization.Engine.dll
Extracts text from DOCX files.
public class DOCXTextExtractor
- Inheritance
-
DOCXTextExtractor
- Inherited Members
- Extension Methods
Methods
GetText(BinaryData)
Extracts the text content from a DOCX document.
public static string GetText(BinaryData binaryContent)
Parameters
binaryContent
BinaryDataThe binary content of the DOCX document.
Returns
- string
The text content of the DOCX document.