Class DOCXContentTextExtractionPlugin
- Assembly
- FoundationaLLM.DataPipelinePlugins.dll
Implements the DOCX Content Text Extraction Plugin.
public class DOCXContentTextExtractionPlugin : PluginBase, IContentTextExtractionPlugin
- Inheritance
-
DOCXContentTextExtractionPlugin
- Implements
- Inherited Members
- Extension Methods
Constructors
DOCXContentTextExtractionPlugin(Dictionary<string, object>, IPluginPackageManager, IPluginPackageManagerResolver, IServiceProvider)
Implements the DOCX Content Text Extraction Plugin.
public DOCXContentTextExtractionPlugin(Dictionary<string, object> pluginParameters, IPluginPackageManager packageManager, IPluginPackageManagerResolver packageManagerResolver, IServiceProvider serviceProvider)
Parameters
pluginParameters
Dictionary<string, object>The dictionary containing the plugin parameters.
packageManager
IPluginPackageManagerThe package manager for the plugin.
packageManagerResolver
IPluginPackageManagerResolverThe package manager resolver for the plugin.
serviceProvider
IServiceProviderThe service provider of the dependency injection container.
Properties
Name
protected override string Name { get; }
Property Value
Methods
ExtractText(BinaryData)
Extracts text from the provided raw content.
public Task<PluginResult<string>> ExtractText(BinaryData rawContent)
Parameters
rawContent
BinaryDataThe binary content to extract text from.
Returns
- Task<PluginResult<string>>
A PluginResult<T> object with the extracted text.