Table of Contents

Class XLSXTextExtractor

Namespace
FoundationaLLM.Vectorization.DataFormats.Office
Assembly
FoundationaLLM.Vectorization.Engine.dll

Extracts text from XLSX files.

public class XLSXTextExtractor
Inheritance
XLSXTextExtractor
Inherited Members
Extension Methods

Constructors

XLSXTextExtractor(bool, bool, bool, string?, string?, string?, string?, string?)

Constructor for XLSXTextExtractor.

public XLSXTextExtractor(bool withWorksheetNumber = true, bool withEndOfWorksheetMarker = false, bool withQuotes = true, string? worksheetNumberTemplate = null, string? endOfWorksheetMarkerTemplate = null, string? rowPrefix = null, string? columnSeparator = null, string? rowSuffix = null)

Parameters

withWorksheetNumber bool
withEndOfWorksheetMarker bool
withQuotes bool
worksheetNumberTemplate string
endOfWorksheetMarkerTemplate string
rowPrefix string
columnSeparator string
rowSuffix string

Methods

GetText(BinaryData)

Extracts the text content from a PPTX document.

public string GetText(BinaryData binaryContent)

Parameters

binaryContent BinaryData

The binary content of the PPTX document.

Returns

string

The text content of the PPTX document.