medcat.components.addons.relation_extraction.base_component
Classes:
Attributes:
-
logger–
RelExtrBaseComponent
RelExtrBaseComponent(tokenizer: BaseTokenizerWrapper = BaseTokenizerWrapper(), model: RelExtrBaseModel = None, model_config: RelExtrBaseConfig = None, config: ConfigRelCAT = ConfigRelCAT(), task: str = 'train', init_model: bool = False)
Component that holds the model and everything for RelCAT.
Parameters:
-
(tokenizerBaseTokenizerWrapper, default:BaseTokenizerWrapper()) –The base tokenizer for RelCAT.
-
(modelRelExtrBaseModel, default:None) –The model wrapper.
-
(model_configRelExtrBaseConfig, default:None) –The model-specific config.
-
(configConfigRelCAT, default:ConfigRelCAT()) –The RelCAT config.
-
(taskstr, default:'train') –The task - used for checkpointing.
-
(init_modelbool, default:False) –Loads default BERT base model, tokenizer, model config. Defaults to False.
Methods:
-
from_relcat_config– -
load–Args:
-
save–Saves model and its dependencies to specified save_path folder.
Attributes:
-
best_f1(float) – -
epoch(int) – -
model(RelExtrBaseModel) – -
model_config(RelExtrBaseConfig) – -
name– -
optimizer(AdamW) – -
pad_id– -
padding_seq– -
relcat_config(ConfigRelCAT) – -
scheduler(MultiStepLR) – -
task(str) – -
tokenizer(BaseTokenizerWrapper) –
Source code in medcat-v2/medcat/components/addons/relation_extraction/base_component.py
27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 | |
name
class-attribute
instance-attribute
name = 'base_component_rel'
optimizer
instance-attribute
optimizer: AdamW = None
pad_id
instance-attribute
pad_id = padding_idx
padding_seq
instance-attribute
padding_seq = Pad_Sequence(seq_pad_value=pad_id, label_pad_value=pad_id)
scheduler
instance-attribute
scheduler: MultiStepLR = None
from_relcat_config
classmethod
from_relcat_config(relcat_config: ConfigRelCAT, pretrained_model_name_or_path: str = './') -> RelExtrBaseComponent
Source code in medcat-v2/medcat/components/addons/relation_extraction/base_component.py
151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 | |
load
classmethod
load(pretrained_model_name_or_path: str = './') -> RelExtrBaseComponent
Parameters:
-
(pretrained_model_name_or_pathstr, default:'./') –Path to RelCAT model. Defaults to "./".
Returns:
-
RelExtrBaseComponent(RelExtrBaseComponent) –component.
Source code in medcat-v2/medcat/components/addons/relation_extraction/base_component.py
134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 | |
save
Saves model and its dependencies to specified save_path folder. The CDB is obviously not saved, it is however necessary to save the tokenizer used.
Parameters:
-
(save_pathstr) –folder path in which to save the model & deps.
Source code in medcat-v2/medcat/components/addons/relation_extraction/base_component.py
101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 | |