Class XLSXTextExtractor
- Namespace
- FoundationaLLM.Vectorization.DataFormats.Office
- Assembly
- FoundationaLLM.Vectorization.Engine.dll
Extracts text from XLSX files.
public class XLSXTextExtractor
- Inheritance
- object → XLSXTextExtractor
Constructors
XLSXTextExtractor(bool, bool, bool, string?, string?, string?, string?, string?)
Constructor for XLSXTextExtractor.
public XLSXTextExtractor(bool withWorksheetNumber = true, bool withEndOfWorksheetMarker = false, bool withQuotes = true, string? worksheetNumberTemplate = null, string? endOfWorksheetMarkerTemplate = null, string? rowPrefix = null, string? columnSeparator = null, string? rowSuffix = null)
Parameters
withWorksheetNumber
bool
withEndOfWorksheetMarker
bool
withQuotes
bool
worksheetNumberTemplate
string
endOfWorksheetMarkerTemplate
string
rowPrefix
string
columnSeparator
string
rowSuffix
string
Methods
GetText(BinaryData)
Extracts the text content from an XLSX document.
public string GetText(BinaryData binaryContent)
Parameters
binaryContent
BinaryData
The binary content of the XLSX document.
Returns
- string
The text content of the XLSX document.
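Examples
The following is a minimal usage sketch rather than an official sample. It relies only on the constructor and GetText signatures listed above. The file path workbook.xlsx and the ", " separator value are hypothetical, and the effect of each option is inferred from its parameter name, since this page does not describe it.

```csharp
using System;
using System.IO;
using FoundationaLLM.Vectorization.DataFormats.Office;

// Hypothetical input path; replace with a real workbook.
byte[] bytes = File.ReadAllBytes("workbook.xlsx");

// Named arguments override only the defaults of interest.
// Assumption: what each option does is inferred from its name,
// not confirmed by this reference page.
var extractor = new XLSXTextExtractor(
    withWorksheetNumber: true,
    withEndOfWorksheetMarker: false,
    columnSeparator: ", ");

// BinaryData.FromBytes wraps the raw bytes in the type GetText expects.
string text = extractor.GetText(BinaryData.FromBytes(bytes));
Console.WriteLine(text);
```

Passing options as named arguments avoids having to supply all eight positional parameters when only one or two defaults need to change.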