aito.schema.AitoLanguageAnalyzerSchema

class aito.schema.AitoLanguageAnalyzerSchema(language: str, use_default_stop_words: bool = None, custom_stop_words: List[str] = None, custom_key_words: List[str] = None)

Aito LanguageAnalyzer schema

Parameters
  • language (str) – the name or the ISO code of the language

  • use_default_stop_words (bool, defaults to False) – whether to filter out the language's default stop words

  • custom_stop_words (List[str], defaults to []) – words that will be filtered

  • custom_key_words (List[str], defaults to []) – words that will not be featurized
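
The signature declares `None` for the three optional arguments, while the parameter list documents effective defaults of `False` and `[]`. One common way to reconcile the two is to treat `None` as "not given" and substitute the documented default. The sketch below shows that pattern with a hypothetical helper, `resolve_analyzer_args`, which is not part of aito; it only illustrates the idea and makes no claim about how the library resolves these values internally.

```python
from typing import Any, Dict, List, Optional


def resolve_analyzer_args(
    language: str,
    use_default_stop_words: Optional[bool] = None,
    custom_stop_words: Optional[List[str]] = None,
    custom_key_words: Optional[List[str]] = None,
) -> Dict[str, Any]:
    """Hypothetical helper (not an aito function): map None to the documented defaults."""
    return {
        "language": language,
        # None means "not given", so fall back to the documented default of False.
        "use_default_stop_words": (
            False if use_default_stop_words is None else use_default_stop_words
        ),
        # Copy the lists so that callers never share a mutable default.
        "custom_stop_words": list(custom_stop_words) if custom_stop_words is not None else [],
        "custom_key_words": list(custom_key_words) if custom_key_words is not None else [],
    }


resolved = resolve_analyzer_args("english", custom_stop_words=["foo"])
print(resolved)
```

Using `None` as the sentinel avoids the classic pitfall of a mutable `[]` default being shared across calls.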

Methods

from_deserialized_object(obj)

Create an AitoSchema object from a JSON-deserialized object

from_json_string(json_string, **kwargs)

Create an AitoSchema object from a JSON string

infer_from_samples(samples[, max_sample_size])

Infer an analyzer from the given samples

to_json_serializable()

Convert the AitoSchema object to a JSON-serializable object (a dict, in most cases)

to_json_string(**kwargs)

Return the AitoSchema object as a JSON string
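
The serialization methods pair up: `to_json_serializable()` yields a plain object that `json.dumps` can handle, and `from_deserialized_object(obj)` accepts the kind of value `json.loads` returns. The sketch below walks through that round trip using only the standard library. The dictionary layout, including the camelCase keys and the `"language"` type value, is an assumption made for illustration and should be checked against the Aito schema documentation. Forwarding `**kwargs` to the JSON encoder is likewise assumed, not confirmed here.

```python
import json

# Assumed shape of a serialized language analyzer (key names are illustrative).
serializable = {
    "type": "language",
    "language": "english",
    "useDefaultStopWords": False,
    "customStopWords": ["foo"],
    "customKeyWords": [],
}

# Serialize: roughly what a to_json_string(**kwargs) call would produce if the
# keyword arguments (here sort_keys) were passed on to json.dumps.
json_string = json.dumps(serializable, sort_keys=True)

# Deserialize: the resulting dict is the kind of object that
# from_deserialized_object(obj) is documented to accept.
deserialized = json.loads(json_string)

# The round trip should preserve every field.
print(deserialized == serializable)
```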

Attributes

analyzer_type

comparison_properties

iterable of the properties that will be used for comparison with another schema object of the same type

custom_key_words

list of words that will not be featurized

custom_stop_words

list of words that will be filtered

language

the language of the analyzer

type

the type of the schema component

use_default_stop_words

whether the language's default stop words are filtered out
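
The `comparison_properties` attribute names the properties used when comparing two schema objects of the same type. A minimal sketch of that pattern follows, using a hypothetical stand-in class, `AnalyzerSketch`, rather than the aito class. Which properties the library actually compares is not stated on this page, so the list below is an assumption.

```python
from typing import Iterable, List


class AnalyzerSketch:
    """Hypothetical stand-in for illustration; not the aito implementation."""

    def __init__(
        self,
        language: str,
        use_default_stop_words: bool = False,
        custom_stop_words: Iterable[str] = (),
        custom_key_words: Iterable[str] = (),
    ):
        self.language = language
        self.use_default_stop_words = use_default_stop_words
        self.custom_stop_words: List[str] = list(custom_stop_words)
        self.custom_key_words: List[str] = list(custom_key_words)

    @property
    def comparison_properties(self) -> Iterable[str]:
        # Assumed list: names of the attributes that take part in equality checks.
        return [
            "language",
            "use_default_stop_words",
            "custom_stop_words",
            "custom_key_words",
        ]

    def __eq__(self, other: object) -> bool:
        # Only objects of the same type are comparable property by property.
        if type(other) is not type(self):
            return NotImplemented
        return all(
            getattr(self, name) == getattr(other, name)
            for name in self.comparison_properties
        )


a = AnalyzerSketch("english", custom_stop_words=["foo"])
b = AnalyzerSketch("english", custom_stop_words=["foo"])
c = AnalyzerSketch("finnish")
print(a == b, a == c)
```

Driving equality from a single list keeps the comparison logic in one place: adding a property to the list is enough to include it in every comparison.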