Table of Contents

Class ContentIdentifier

Namespace
FoundationaLLM.Common.Models.DataPipelines
Assembly
FoundationaLLM.Common.dll

Represents the identifier of a content item processed by data pipelines.

public class ContentIdentifier
Inheritance
ContentIdentifier
Inherited Members
Extension Methods

Properties

CanonicalId

The canonical identifier of the content item.

[JsonPropertyOrder(2)]
[JsonPropertyName("canonical_id")]
public required string CanonicalId { get; set; }

Property Value

string

Remarks

Data pipeline state services use it to derive the location of the state in the underlying storage.

FileName

The file name part of the content identifier.

[JsonIgnore]
public string FileName { get; }

Property Value

string

this[int]

The indexer allowing to access the components of the multipart identifier using [] notation.

[JsonIgnore]
public string this[int i] { get; }

Parameters

i int

The index of component being retrieved.

Property Value

string

MultipartId

The multipart unique identifier of the the content item (i.e. document).

[JsonInclude]
[JsonPropertyOrder(1)]
[JsonPropertyName("multipart_id")]
public required List<string> MultipartId { get; set; }

Property Value

List<string>

UniqueId

The unique identifier of the content (i.e., document) being vectorized. The identifier is determined by concatenating the parts from MultipartId.

[JsonIgnore]
public string UniqueId { get; }

Property Value

string

Methods

ValidateMultipartId(int)

Validates a multipart unique content identifier.

public void ValidateMultipartId(int expectedPartsCount)

Parameters

expectedPartsCount int

The expected number of parts in the multipart identifier.

Exceptions

ContentIdentifierException