Textos Andinos - Building The Database


MetaData

Manuel Medrano’s Database extracted from the book Textos Andinos, consists of 72 xml files. I have converted his original metadata and augmented it and put it into a CSV file:

Code
csv_dir = f"{kq.project_directory()}/notebook/textos_andinos/data/CSV"
metadata_df = pd.read_csv(f"{csv_dir}/TA_metadata.csv")
metadata_df.columns
metadata_df
Index(['memoria', 'name', 'title', 'volume', 'chapter', 'attribution',
       'line_ start', 'line_ end', 'year', 'lat', 'lon', 'dist_cuzco (km)',
       'altitude (m)', 'reference location', 'type', 'pre_post_conquest',
       'language', 'int_ unint', 'pop_s_m_l', 'num_speakers_sng_mult',
       'currency', 'lat_zone'],
      dtype='object')
memoria name title volume chapter attribution line_ start line_ end year lat ... altitude (m) reference location type pre_post_conquest language int_ unint pop_s_m_l num_speakers_sng_mult currency lat_zone
0 1 m_01 Memoria de los nietos de Topa Inga Yupangui 1 1c Capac Ayllu 304 315 1532 -13.530 ... 3337 Cuzco pnar pre quechua unint l mult no central
1 2 m_02 Memoria de las provinçias que conquistó Topa Ynga Yupangui padre de Guaina Capac Ynga con sus hermanos 1 1e Capac Ayllu 383 481 1532 -13.530 ... 3337 Cuzco pnar pre quechua unint l mult no central
2 3 m_03 Relación de las guacas del Cuzco 1 2 Bernabé Cobo 159 1327 1532 -13.530 ... 3337 Cuzco pnar pre quechua unint l sng no central
3 4 m_04 Relación de lo que los Chupachos daban al Inga 1 3b Paucarguaman, Querin, Julcacondor 59 127 1532 -9.940 ... 1925 Conchumayo pnum pre quechua unint l mult no north
4 5 m_05 Relación de lo que los Chupachos dan a Gómez Arias de Ábila 1 3c Paucarguaman, Querin, Julcacondor 135 224 1549 -9.940 ... 1925 Conchumayo pnum post quechua unint l mult no north
... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ...
67 68 m_68 La tasa de Pedro de la Gasca del repartimiento de Machaca que tubo en encomienda don Gerónimo de Soria 2 40A Gerónimo de Loaysa, Fernando de Santillán y Domingo de Santo Thomas 18 92 1550 -16.960 ... 3933 Machaca la Grande pnum post aymara unint m sng yes south
68 69 m_69 Relación de lo que valió la mita primera del año de 1552 d’estas cuentas que la cobró Juan López de Alcalá de los yndios de Machaca 2 40D Bartolomé de Otaçu y Francisco de Tapia 631 666 1552 -16.956 ... 3933 Machaca la Grande pnum post aymara unint m mult yes south
69 70 m_70 Ymformaçión hecha por mandado de su excelençia sobre los daños que se han recreçido a los yndios del balle de Xauxa en los Pleytos que han tinido asý en los bienes de la comunidad como en los de particulares 2 41 Caciques de Lurin Huanca y Hanan Huanca 55 172 1570 -11.920 ... 3264 Mantaro Valley pnum post quechua unint l sng yes central
70 71 m_71 Ymformaçión hecha por mandado de su excelençia sobre los daños que se han recreçido a los yndios del balle de Xauxa en los Pleytos que han tinido asý en los bienes de la comunidad como en los de particulares 2 41 Caciques de Lurin Huanca y Hanan Huanca 211 231 1570 -11.920 ... 3264 Mantaro Valley pnum post quechua unint l sng yes central
71 72 m_72 Ymformaçión hecha por mandado de su excelençia sobre los daños que se han recreçido a los yndios del balle de Xauxa en los Pleytos que han tinido asý en los bienes de la comunidad como en los de particulares 2 41 Caciques de Lurin Huanca y Hanan Huanca 252 359 1570 -11.920 ... 3264 Mantaro Valley pnum post quechua unint l sng no central

Words

For each “memoria”, there is a separate XML file consisting of the words in the file. Reading and parsing these files is the work of separate Python class - Textos_Andinos. Part of it’s job is to concatenate those XML files in one giant CSV as a dataframe source:

Code
word_df = pd.read_csv(f"{csv_dir}/TA_words.csv")
word_df
Unnamed: 0 name word_index text lemma pos
0 0 m_01 0 Don don PROPN
1 1 m_01 1 Andrés andrés PROPN
2 2 m_01 2 Topa topa PROPN
3 3 m_01 3 Yupanqui yupanqui PROPN
4 4 m_01 4 . . PUNCT
... ... ... ... ... ... ...
75595 75595 m_72 567 de de ADP
75596 75596 m_72 568 don don PROPN
75597 75597 m_72 569 Lorenzo lorenzo PROPN
75598 75598 m_72 570 Pomalaxa pomalaxa PROPN
75599 75599 m_72 571 . . PUNCT

Measured Nouns

I am borrowing/adapting the linguistic term “measured noun” (ie. measure words in Chinese) to mean any noun mentioned in the khipu recitations, that has a number preceeding it - for example 100 pesos. We can do a crude search for nouns preceeded by a number as a first search for measured nouns. Again, this is done by the Textos_Andinos class.

Code
nouns_df = pd.read_csv(f"{csv_dir}/TA_measured_nouns.csv")
nouns_df
Unnamed: 0 name noun_index count noun
0 1616 m_02 0 3 hermano
1 1247 m_03 0 9 ceque
2 1248 m_03 1 85 huaca
3 1249 m_03 2 5 huaca
4 1250 m_03 3 8 huaca
... ... ... ... ... ...
4033 809 m_72 48 3 partida
4034 810 m_72 49 11 cesto
4035 811 m_72 50 368 cesto
4036 812 m_72 51 37 pieza
4037 813 m_72 52 187 carnero

Summarizing

With that, we are ready to start collecting sentences and documents. We break words on sentence boundaries, create spanish text, segment the spanish text into a few thousand character packets, and send those packets slowly, to Google, to translate into English. Add on counts of measured nouns, sentences, etc. and we have a summary table:

Code
summary_df = pd.read_csv(f"{csv_dir}/TA_summary.csv")
summary_df.columns
summary_df.head(1)
Index(['Unnamed: 0.1', 'Unnamed: 0', 'memoria', 'name', 'title', 'volume',
       'chapter', 'attribution', 'line_ start', 'line_ end', 'year', 'lat',
       'lon', 'dist_cuzco (km)', 'altitude (m)', 'reference location', 'type',
       'pre_post_conquest', 'language', 'int_ unint', 'pop_s_m_l',
       'num_speakers_sng_mult', 'currency', 'lat_zone', 'spanish_text',
       'english_text', 'num_words', 'num_sentences', 'num_measured_nouns',
       'num_measures', 'lda_topic_num'],
      dtype='object')
Unnamed: 0.1 Unnamed: 0 memoria name title volume chapter attribution line_ start line_ end ... num_speakers_sng_mult currency lat_zone spanish_text english_text num_words num_sentences num_measured_nouns num_measures lda_topic_num
0 0 0 1 m_01 Memoria de los nietos de Topa Inga Yupangui 1 1c Capac Ayllu 304 315 ... mult no central Don Andrés Topa Yupanqui.Don García Pilco Topa.Don Juan Puro Topa.Don Alonso Guaipar Topa.Don Felipe Topa Yupanqui.Don Martín Topa.Gerónimo Tito.Alonso Topa Yupanqui.Don García Pilco Topa.Alonso Ayacho.Don Gonzalo Amaro.Domingo Topa.Don García Limache Yupanqui.El ayllu Cayao.Don Hernando Topa Manco.Don García Cayo Topa.Pedro Quespi Poro.Francisco Topa.Ayllu Payán.Francisco Chonco Topa.Don Cristóbal Pisac Topa.Francisco Chui Topa.Don García Ayacho.Felipe Topa Yupanqui. Don Andrés Topa Yupanqui.Don García Pilco Topa.Don Juan Puro Topa.Don Alonso Guaipar Topa.Don Felipe Topa Yupanqui.Don Martín Topa.Gerono Tito.Alonso Topa Yupanqui.Don García Pilco Topa.Alonso Ayacho.Don Gonzalo Amaro.Domingo Topa.Don García Limache Yupanqui.The Ayllu Cayao.Don Hernando Topa Manco.Don García Cayo Topa.Pedro Quesspi poro.Franciso Topa.Ayllu Payán.Francisco Chonco Topa.Don Cristóbal Pisac Topa.Francisco Chui Topa.Don García Ayacho.Felipe Topa Yupanqui. 101 24 0 0 1

You may notice the last column in the summary table is lda_topic_num. This will be described and created in a few sections ahead. Meanwhile, let’s review some of the documents.

Review of Documents

The Largest Document

The largest document is a veritable Inka Baedeker of ceques (roads), and huacas (temples) and their associated value to the Inkas (ie. sacrifices - including children). I include a bit of the rough English text here, because it is fascinating reading:

Code
# The largest document:
largest_doc_df = summary_df.sort_values(by=['num_words'],ascending=False)
largest_doc_name = largest_doc_df['name'].tolist()[0]
largest_text = largest_doc_df['english_text'].tolist()[0]
print(f"Document {largest_doc_name} has {largest_doc_df['num_words'].tolist()[0]} words. Text is:\n\n{largest_text[:15000]}...")
Document m_03 has 12001 words. Text is:

Beginning then on the path of Chinchaysuyu, which comes out through the neighborhood of Carmenga, there was 9 cosques, in which 85 huacas were comprehensive by this order.The first CEQUE was said, Cayao, was in charge of partiality, and Ayllu, from Goacaytaqui, and had the next 5 huacas.The first was called, Michosamaro, was close to the skirt of El Cerro de Totocache, and they said to be one of those who fought to have come out with the first Inca Mancocapac of the cave of Pacaritampu.To which they refer that a woman from which they left with them from said cave killed him by certain contempt that with her tube, and turned stone.And that his soul appeared in this same place, and he ordered that he sacrificed him there;And so was the sacrifice of this very old huaca;which was always made of gold, clothing, seashells, and other things;And he used to be done by good temporary.The second guide of this ceque was said, Patallacta;It was a house that Diputó Inca Yupanqui for the sacrifices of him, and he died in her, and the Incas who later succeeded him made here ordinary sacrifice.He was generally offered all the things that they consumed in sacrifice, for the health, and prosperity of the Inca.The third huaca was called, pilcopuquio;It is a source next to the house overdicted, from which a ditch comes out: and tell the Indians that have made Inca Yupanqui that house for sacrifices;He ordered that he left that water and then decreed that he was became ordinary sacrifice.The fourth huaca was said, Cirocaya;It is a stone cave, from which they had believed to leave the hail: so at the time they feared from him, they were all going to sacrifice it because it did not leave, and destroyed them the sow.La Quinta, and Postrera Huaca of this ceque was named, Sononcay;It is a hill where it was very old to offer sacrifices for the health of the Inca.The second check of this same Chinchaysuyo road was said, Payan;In which there were 8 Huacas de El Ayllu, and Family of Vicaquirao.The first huaca was said, Guaracince, which was in the Plaza de El Temple of El Sol named Chuquipampa Sound Llano de Oro was a plain piezuel that was there;In which they said that the earth tremor was formed.They did sacrifices in it so that it does not tremble, and they were very solemn;Because when the earth trembled, children killed and ordinarily burned, and clothes, and buried gold, and silver.The second guide was called, Racramirpay;This was a stone that had a window that was a little more under where it is now the convent of San Agustín;whose story they refer in this way: that in a certain battle that he gave Inca Yupanqui to the enemies of him, an Indian appeared in the air, and helped him overcome the.And after the Vitoria, came to El Cuzco with the Inca said, and sitting in that window became stone;Which from that time worshiped, and made him ordinary sacrifice.And particularly he was solemn when the Inca personally went to war asking him to help the king as he had helped Inca Yupanqui in that war.The third huaca was a massive gold idol called Inti Apa, which means thunder from the sun;Which was put on rich golden goals.He did inca Yupanqui, and took it by guide or brother.He had a house in the neighborhood of Totocache, and made him great veneration and in the same house, or temple was the body of said Inca Yupanqui.They made this idol very ordinary sacrifice of children, and of everything else, begging the forces of the Inca, and their empire does not diminish.The fourth huaca was said, Viroypacha.It is a pork of reasonable water, which statanyed by Inca Yupanqui Huaca, was begged for the stillness of the Inca, the Quinta Huaca was a Llano called, Chuquibamba, which is next to the fortress;They sacrificed him as the others.The sixth huaca was said, Macasayba;It was a large stone that Inca Yupanqui put next to the plain of Chuquibamba and sent him veneration, and sacrifices for the health of the king.The Séntima Huaca was a quarry called, Guayrangalle, which is above the fortress, in which they made sacrifices for various respects.The eighth and latest huaca of this ceque was said, Guayllaurcaja, is a portezuelo that is done in the middle of a hill, where Viracocha Inca sat many times to rest on the said hill: and from that time, and by his command was heldBy adoratory.The third cEque of this road was said, Collana;It had 10 huacas, and the first was called, Nina, which was a brazier made of a stone, where the fire was lit for the sacrifices, and could not take the other part: it was next to the temple the sun, andHe had great veneration, and solemn sacrifices were made.The second guide was said, Canchaapacha, it was a source that was on the street of Diego Maldonado, which made sacrifice for certain stories that the Indians tell.The third huaca was a source called, Ticicocha, which was inside the house of El Said Diego Maldonado: It was this source of Coya, or Queen Mamaocllo, in which they became very large, and ordinary sacrifices, especially when they wanted to ask for somethingTo said Mamaocllo, who was the most revered woman who had between these Indians.The fourth huaca was said, condorcacha, and it was the house in which Inca Yupanqui lived.The Fifth Huaca was a happiness, Pomacorco, and they do not give another reason to sacrifice it more than having been Guaynacapac.The sixth huaca was called, molliagrace, it was a certain stone that was in the middle of a flash, callaver;which he ordered put there, and have the per hives, Inca Yupanqui.The Séntima Huaca was the house that was from King Tupa Inca called Calcopuquioguaci, in which sacrifices were offered to said Inca Tupa.The eighth huaca was a source that was said, calplack, which was down the said house of Tupa Inca, and they were going to wash in it all those who were made in the party of El Raymi.Water is swallowed from this source for the Inca with many songs made for only this, and they were maids that bring it.The ninth huaca was called, cugiguaman, was a hawklike stone, said Inca Yupanqui had appeared at a quarry, and sent him to put themselves in this ceque, and sacrifices were made.The tenth huaca of this ceque was a Fontezuela said quinapuquy, which pointed out by Inca Yupanqui loving, were offered sacrifices for the health of the Inca.At the fourth ceque they called, Payao, had 8 huacas;Of which the first was said, Araytampu, was a large stone with another 4 small, which were next to the house that was Benito de la Peña, and they were from the pururaucas.The second guide was called, Puñi, I was on a small plain next to the house of Diego Maldonado: it was very solemn widow because it was because of the dream: they offered him all kinds of sacrifices, and came to him for 2 demands, theOne to beg for those who could not sleep, and the other that did not die sleeping.The third huaca was called, Curiocllo, was a womans curiollo house that was from Amaro Topa Inca, which was in Colcapata, and also worshiped a fountain that was next to her.The fourth guide was called, Colcapata, and it was the house of Paullu Inca, where there was a stone per idol, who adored Ayllu de Andasaya: And the origin he had was he had commanded adoring Pachacutic Inca, because he said that a certain sir had beenConverted into said stone.The Quinta Huaca was said, Guamacancha, which was the fortress in a crill of this name;It was a fence within which there were 2 small owls deputies to fast when they were made of orejones.The sixth Huaca was a large stone called, Collaconcho, which was in the fortress, which claims to bring it for that building fell 3 times, and killed some Indians: and the sorcerers in questions they asked, said they answered that ifThey were in complaint to put in the building, all there would be an evil end, beyond that they would not be part for it.And since that time was held by general huaca;which offered by the forces of the Inca.The Sétima Huaca was said, Chachacomaja, were certain trees on hand, next to which was a stone who made sacrifice because the Inca did not have anger.The eighth and last huaca of this ceque was a high hill called, Chuquipalta, which is next to the fortress, in which 3 stones were on behalf of the Pachaychachic, Inti Illapa, and Punchau: and in this hill universal sacrifice was madeof children, and girls;and small figures of the same made of gold;And clothes were burned;And rams, because it was he had a very solemn width.The fifth ceque of this same path, and direction of Chichaysuyu was said, Cayao;It contained 10 huacas: the first call, Cuscricancha, was the place where Inca Yupanqui border of the Temple of Coricancha;And for this reason they offered those of the Ayllu Inacapanaca.The second huaca was a temple called, pucamarca, which was in the houses that were from the graduate of the range in which there was an idol of the thunder said Chucuylla.The third huaca was said, Cuzcocalla, was on the street that goes to the square by going through this line, or CEque;And it was a good number of stones that claimed to be of the pururaucas.The fourth huaca was the main square called, Aucaypata, which is also here;In it the universal sacrifice was made for the sun, and the other huacas;And it was distributed, and took the other parts of the kingdom, and it was very revered.The fifth was a bueno called, coracure, in which I slept Inca Yupanqui, which is where the houses of Cabildo are now: he ordered the Inca saying to worship that place, and burn on clothes, and rams, and so it was done.The sixth huaca was called, Sabbacurinca, was a wellcarved seat, where the Incas sat;which was very venerated, and it was solemnly sacrifices;And out of respect this seat was adored all the fortress: that I should be inside, or next to it.The Séntima Huaca was called, Chacaguanacauri;which is a cerrillo that is way of yucay;Where the mancebnes that were arid for a certain straw that were wearing in the spears.The Eighth Huaca was a small burial call, Guamangguachca, of a brother of Guaynacapac, which was from the other part of the fortress;He made a worship for having died small the brother of the Inca, saying that by the veneration they gave him would not die more than that age.The Ninth Huaca was a hill that is on the way to the Yucay Valley called, Cinca, in which there was a stone that worshiped the Indians of Ayamarca, having for opinion that came from it.The tenth huaca was a cumsy, or spring called, Corcorpuquiu;in which children and everything else were offered.The sixth ceque was said, Colana, like the third and had 11 huacas: the first was said, Catonge, and it was a stone of the pururaucas, which was in a window next to the temple of El Sol.The second guide was called, Pucamarca, was a deputy house or temple for the sacrifices of El Pachayachachic, in which children were sacrificed, and everything else.The third huaca was said, ñan, which means road, was in the square where the way was taken for Chinchaysuyu: Universal sacrifice was made by the walkers, and because that way was always whole, and do not collapse, andCayese.The fourth huaca was by name, Guayra, and was at the door of Cajana;In it was made sacrifice to the wind so that it did not hurt: and was made a hole, in which sacrifices were buried.The Quinta Huaca was the Huaynacapac Palace called Cajana;Within which there was a lagoon named Ticcicocha, which was a major worship, and where great sacrifices were made.The sixth huaca was a source called, Capipacchan that was in Capi, in which the Inca used to bathe;sacrifices were made, and prayed that the water did not take the strength, nor did it hurt him.The Séntima Huaca was said, Capi, which means root;It was a very large root of quinoa, which said the sorcerers who was the root of where the Cuzco came from, and that by it was kept: it made sacrifices for the conservation of the city.The eighth was called, Quisco, was on top of Capis hill, where universal sacrifice was made for the same reason as the root overdicted.The Ninth Huaca was a hill called, Quiangalla, who is on the road to Yucay, where there were 2 cars, or pillars that had as a sign that the sun was the beginning of the summer there.The tenth was a Fontezuela who was saying, Guragaillapuquiu, and is next to this hill, in which it took the dust that was plenty of the sacrifices of the huacas of this ceque.The eleventh, and Sun Huaca was said, Ilacamarca, was in a fortress that had built on a rock of Yucay.And in it the huacas of this ceque ended.The seventh ceque was said Cayao, and was from Ayllu de Capailu, had the 8 huacas that are followed: The first was called Omanamaro, and it was a long stone, which said to be of the Pururaucas, which was at the door of theHouse that was Figueroa.Universal sacrifice was made for the health of the Inca.The second guide was 2 small owls called Sancacancha, the one;and the other, hurrish;Where they had a lot of lions, tigers, snakes, and all the bad savings that could be.In these owls they put the prisoners who brought the war;and the one who died that night, ate the said beasts;And he was alive, they took him out;And this had as a sign that he had a good heart, and purpose of serving the Inca.The third huaca was said, Marcatampu, were a round stones, which were in Carmenga, where now is the parish of Santa Ana;which indicated by Inca Yupanqui main hidore: children were offered for health, and conservation of the Inca.The fourth was called, Taxanamaro;They were 5 round stones that Viracocha Inca sent to put the toxan hill, which is on top of Carmenga.The offering given to him was only of shells;He begged this guide by the Vitoria of the Inca.To the fifth huaca of this ceque they called, Vrcoslla Amaro, were many stones together on a crill that is on top of Carmenga;It was made sacrifices for the health of the Inca.The sixth was said, Callancapuquiu, is the ticutic spring, which offered shells because always manase.The Séntima Huaca was said, Churuncana, is a round hill that is on top of Carmega, where the real road of Chinchero de El de Yucay is sides.From this hill the sacrifices were made to Ticciviracocha, asking him to expire the Inca throughout the earth to the confines of the sea: they offered him of all things, especially children.The eighth, and last huaca of this ceque was a source called, Mujayllapuquiu, which is close to Guarguaylla;They offered itching shells for certain effects.The eighth ceque of this road was said, Payan, like the second, and there was 13 Huacas.The first was a small house next to the Temple of El Sol called, Ilanguarque, in which certain weapons were kept that they claimed to have given the Sun to Inca Yupanqui, with which he beat his enemies: he was made to this universal sacrifice.The second was said, mancochuqui, was a huanacauri punch;And what was caught from her sacrificed him.The third huaca was a source called, Aucaypata, which was next to where it is now the Casa de Cabildo, in which the priests of Chucuilla said that b...

Document with The Most Measured Nouns (equivalent to saying which khipu has the most cords)

Code
## The Document with The most Measured Nouns (equivalent to saying which khipu has the most cords)
# The largest document:
sorted_measure_nouns_df = summary_df.sort_values(by=['num_measured_nouns'],ascending=False)
largest_mnouns_name = sorted_measure_nouns_df['name'].tolist()[0]
english_text = ku.multiline(sorted_measure_nouns_df.head(1)['english_text'].tolist()[0], line_length=160, continuation_char="\n\t")
print(f"Document {largest_mnouns_name=} has {sorted_measure_nouns_df['num_measured_nouns'].tolist()[0]} measured nouns. Text is:\n\n{english_text[:4000]}...")
Document largest_mnouns_name='m_11' has 762 measured nouns. Text is:

And they took him from relief to the said Marquis with his Indians who were.261 Indians loaded with corn and potatoes and quinoa.and 301 pesos in gold in
    chipans and coconuts and other jewels and 301 pesos in silver and also in Mates and Chipans and Topos.And other things that were worth it and 20 land rams and
    40 pieces of cumi clothes very rich 60 Indians for their service from the said Marquis.And his soldiers.and also the sayings 271.Indians who were charged with
    said relief remained in the said Cajamarca Valley by Yanaconas from the soldiers of said Marqués that none of them returned to the said distribution of
    atunxauxa.They also led the said Marquis 2 Frames from the New Earth.Also 77 and average corn fanegas for your food.Also 20 fanegas of potatoes.Also 20 pairs of
    floors.All of which led the said Marquis to Cajamarca.and they received it by friend.He also took Susichac cacique of the saying the father of the said Don
    Francisco Cusichac 452 Indians to Bonbón where he found the said Marquis to be served and carried the loads and favor the said Marquis and his people to move
    forward.Also since Bonbon came the captain Soto.He went through the doctors.From this distribution of Xauxa where he took 20 Indies for his service.And he took
    them to Angoyaco and from there returned the Soto.To the sayings of this distribution of Xauxa where he was back to said Marquis Don Francisco Pizarro who was
    in the sayings.And they gave the said Marquis for their service and maintenance the following things: First they gave them for their service so that they bring
    wooden grass trabits and the other things needed 589 Indians.men of whom none of them remained in this distribution because they took them with them the
    Spaniards for their service.They also gave the said caciques.To the said Marquis for his 437 Indian service, which all took with them.And there was none.They
    also gave him 1275.Sheep the said caciques to the said Marqués for himself and his soldiers while they were in the sayings.He also gave the said Marquis and his
    soldiers 12045 Ranges on earth.For your meal.They also ran the said soldiers of said Marquis Don Francisco Pizarro 29231 sheep and rams and bales in the said
    distribution.They also gave the said Marquis.19 Dresses of Cumbi wool of man and woman.They also gave the said Marquis 2 Breaked.They also ran the soldiers 2216
    dresses of man and woman from Cumbi.They also gave the said Marquis the time it was in the sayings.12406 corn fanegas for your food.They also gave the said
    Marquis and their soldiers 124 Fanegas de quinoa for their food.They also gave the said Marquis 1249 fanegas of potatoes for their food.They also ran the said
    soldiers 21563 corn fanegas.They also ran the said soldiers 2336 Fanegas of quinoa.They also ran the said soldiers 2414 faengas of potatoes.They also gave the
    said Marquis.The said caciques.185 pairs of cabuya floors.They also ran the said soldiers 3659 pairs.of Sheets of Cabuya.They also gave the said caciques to
    said Marquis 55 mud pots for their service.They also gave the said Marquis and soldiers 1316 Chamelic and porongos for their service.They also ran the soldiers
    of the saying.Marqués 1231 fists and pitchers and tanjas.They also ran the soldiers.11285 Chamelics and porongos.They also gave him the saying.marquis.1240
    partridges for your food.They also gave him the sayings.caciquesTo said Marquis 200071 Razada wood loads.They also gave the said Marquis 30404 small wood
    loads.They also gave the said Marquis.48989.Grass loads for horses.They also gave him the saying.marquis.321354 Straw charges for soldiers and stable beds.They
    also gave the said Marquis.1322 large water pitchers.For your drinking and soldiers.They also gave the said Marquis 12404.Fish baskets.That each basket has a
    pound of fish.They also gave the said caciques.To the said Marquis at the time he wanted to go from the said tambros for Cuzco 311.Indians for
    their.Charges.They also raned.The said soldi...
Code
# What are the measured nouns categories?
largest_mnouns_df = nouns_df[nouns_df['name'] == largest_mnouns_name]
largest_mnouns = largest_mnouns_df['noun'].value_counts().index.tolist()
noun_text = ku.multiline(largest_mnouns, line_length=160, continuation_char="\n\t")
print(f"There are {len(largest_mnouns)} measured nouns categories in document {largest_mnouns_name}:\n\t{noun_text}")
There are 66 measured nouns categories in document m_11:
    ['fanega', 'carga', 'indio', 'par', 'oveja', 'libra', 'olla', 'cestillo', 'cántaro', 'chamelico', 'perdiz', 'gallina', 'cordero', 'huevo', 'frazada',
    'sobrecarga', 'vestido', 'soldado', 'pieza', 'ysanga', 'puerco', 'pica', 'almud', 'yçanga', 'cabestro', 'carnero', 'español', 'aderezo', 'estaca', 'palo',
    'peso', 'silla', 'vellón', 'pescuezo', 'batea', 'banco', 'casa', 'costal', 'plato', 'alforja', 'cuero', 'putre', 'día', 'mecha', 'batela', 'toldo', 'petaca',
    'cabra', 'lechón', 'jáquima', 'mesa', 'carpintero', 'ycanca', 'depósito', 'alpargate', 'cargado', 'tabla', 'pico', 'jarro', 'suelta', 'mes', 'pan',
    'sobrecargo', 'hombre', 'puño', 'ycanga']
Code
# What are the top 10 measured noun khipu texts?
print(f"Top 10 Measured Noun Khipu Texts are {sorted_measure_nouns_df['name'].tolist()[:10]}")
Top 10 Measured Noun Khipu Texts are ['m_11', 'm_48', 'm_07', 'm_60', 'm_21', 'm_15', 'm_19', 'm_14', 'm_16', 'm_63']