Table of Contents

Class DOCXTextExtractor

Namespace
FoundationaLLM.Vectorization.DataFormats.Office
Assembly
FoundationaLLM.Vectorization.Engine.dll

Extracts text from DOCX files.

public class DOCXTextExtractor
Inheritance
DOCXTextExtractor
Inherited Members
Extension Methods

Methods

GetText(BinaryData)

Extracts the text content from a DOCX document.

public static string GetText(BinaryData binaryContent)

Parameters

binaryContent BinaryData

The binary content of the DOCX document.

Returns

string

The text content of the DOCX document.