Initial data format should be as follows:

id > item > type, which includes the following types:

onom :: onomastic data (names) onto :: toponimic onomastic data (toponymic names) topo :: toponymic data (toponyms)

date should be replace with the following: yebi :: year of birth yede :: year of death (this one will help to count individuals, as there are often many dates in a biography) yeag :: age in years yemi :: years of other kind (not categorized)

Enriching Layers

In R, the initial data is to be split into subvariables () and expanded with enriching layers that will provide additional analytical data, such as transliteration, translation, categorization (metacategories, synonyms, etc), coordinates, etc.