medcat.components.addons.meta_cat.mctokenizers.tokenizers
Classes:
Functions:
Attributes:
FAKE_TOKENIZER_PATH
module-attribute
FAKE_TOKENIZER_PATH = '#\n/fake-path-not-exist#/'
TokenizerWrapperBase
TokenizerWrapperBase(hf_tokenizer: Optional[Tokenizer] = None)
Bases: ABC
Methods:
-
ensure_tokenizer– -
get_pad_id– -
get_size– -
load– -
save– -
token_to_id–
Attributes:
-
hf_tokenizers– -
name(str) –
Source code in medcat-v2/medcat/components/addons/meta_cat/mctokenizers/tokenizers.py
15 16 | |
hf_tokenizers
instance-attribute
hf_tokenizers = hf_tokenizer
ensure_tokenizer
ensure_tokenizer() -> Tokenizer
Source code in medcat-v2/medcat/components/addons/meta_cat/mctokenizers/tokenizers.py
45 46 47 48 | |
get_pad_id
abstractmethod
Source code in medcat-v2/medcat/components/addons/meta_cat/mctokenizers/tokenizers.py
42 43 | |
get_size
abstractmethod
get_size() -> int
Source code in medcat-v2/medcat/components/addons/meta_cat/mctokenizers/tokenizers.py
36 37 | |
load
abstractmethod
classmethod
Source code in medcat-v2/medcat/components/addons/meta_cat/mctokenizers/tokenizers.py
31 32 33 34 | |
save
abstractmethod
save(dir_path: str) -> None
Source code in medcat-v2/medcat/components/addons/meta_cat/mctokenizers/tokenizers.py
28 29 | |
init_tokenizer
init_tokenizer(cnf: ConfigMetaCAT) -> Optional[TokenizerWrapperBase]
Source code in medcat-v2/medcat/components/addons/meta_cat/mctokenizers/tokenizers.py
51 52 53 54 55 56 57 58 59 60 61 | |
load_tokenizer
load_tokenizer(config: ConfigMetaCAT, tokenizer_folder: str) -> Optional[TokenizerWrapperBase]
Source code in medcat-v2/medcat/components/addons/meta_cat/mctokenizers/tokenizers.py
64 65 66 67 68 69 70 71 72 73 74 75 76 | |