FactGrid:Career Statements: Difference between revisions

From FactGrid
Jump to navigation Jump to search
 
(41 intermediate revisions by the same user not shown)
Line 1: Line 1:
* https://docs.google.com/spreadsheets/d/1MUjQnxstxUwwiIGTRebYS6bfpmVeGsqHvYrbcbN7XxM/edit#gid=0
[[File:Buchhändler.jpg|thumb|right|500px|Kupferstich "Der Buchhändler" aus: ''Abbildung der gemein-nützlichen Haupt-Stände von Christoph Weigel'' (Regensburg, 1698).]]


== The present mess ==
== Problem and Solution ==
Historical sources are rich with statements about the social status, the personal situations or the occupations of people they are mentioning. Subscription lists, address books, tax lists — they all will offer names, localisations and career statements from "merchant's widow", to "colonel" of the local regiment.


Our present way to deal with career statements is still messy. The reason for this is historical. We started with an input from German address books - loading here everything from "baker" to "widow of a merchant" and "Doctor of medicine and senator" on a single Property [[Property:P165]] - a property which eventually needed that extra broad label "career statement".
The information is often merely given to identify a particular person. Digest it and it will shed light on the social composition of an audience or organisation you are studying. The problem is that you will need a good deal of background information on these statements before they begin to make sense in greater masses. If you have 1,000 names and 300 different career statements you will need background information on all these career statements to get a fast categorisation and a first impression of the numbers (and people) under these useful headers.


The second messy step came with the quickly reduce the risk of a language fork - the solution was here the Deepl-translation of the entire set of some 3,000 statements into English and French. It worked well with simple trades like "Baker" but created a mess with all the rarer names of historical trades.
The database should know the words, it should know variants and abbreviations of these words in order to understand them and it should have a system or various competing systems of categorisations to unite your people in different groups under under various questions. Once you have this background information the machine can put the 300 statements into ten or twenty groups and count the 1,000 people which you will now fin in these groups for statistics or for further research int interesting groups that are now more visible.


A third insecurity came into the field with the neighbouring [[Property:P164]] for "offices held" - here we began to link to very specific offices like [[Item:Q43367|Pastorate Altenbergen]]. Some of these specific terms also received the P2-statement "career statement".
The solution is a database that knows the words and that breaks them down in ontologies. This is basically what we are trying to provide with the "career statements" on FactGrid. If you have a set of people use the [[Property:P165]] to connect them to "career statements" and the you can run searches and sort results with the ontological information that is on these terms.


On top of that we introduced a qualifier for specific positions [[Property:P166]] since positions such as the Pastorate Altenbergen could be held by "Parish substitutes", "Pastors" or "Parish vicars" - which had to be qualified.
== FactGrid career statements — three granularities ==


The various problems we created will not be solved that easily on the present number of roughly 4,600 items. We have:
FactGrid is presently offering some 8,800 "Career statements" - this is the list in English and in alphabetical order - the translations are not yet revised and there are presently some blank items in this list that will be removed in the next steps.


* unnecessary variations: Laquai, Lakei
* [https://database.factgrid.de/query/#SELECT%20%3FCareer_statement%20%3FCareer_statementLabel%20%3FCareer_statementAltLabel%20%3FOhdAB_category%20%3FOhdAB_categoryLabel%20%3FOhdAB_ID%20WHERE%20%7B%0A%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22en%22.%20%7D%0A%20%20%3FCareer_statement%20%28wdt%3AP2%2F%28wdt%3AP3%2a%29%29%20wd%3AQ37073.%0A%20%20OPTIONAL%20%7B%20%3FCareer_statement%20wdt%3AP1007%20%3FOhdAB_category.%20%7D%0A%20%20OPTIONAL%20%7B%20%3FCareer_statement%20wdt%3AP904%20%3FOhdAB_ID.%20%7D%0A%7D%0AORDER%20BY%20%3FCareer_statementLabel All FactGrid Career statements]
* compound statements of the sort "Carpenter's widow" (to be resolved as ''widow'', Qualifier: ''status of deceased husband: Carpenter'')
* compound statements of the sort "Retired carpenter"  (to be resolved as ''Pensioner'', Qualifier: ''status of former occupation: Carpenter'')
* compound statements where people had different positions and occupations
* academic titles which we could just state under their own property
* honorary titles such as "Senator", or "Privy councillor" (de: Geheimrat) which should perhaps rather be seen as awards under that property
* German labels where French and English words should appear


== Eliminating unnecessary (?) variations: Laquai, Lakai ==
Some of these items are as broad as "pensioner", some are succinct in various degrees from "baker" to "master baker" to "court master baker" and some of them create singular positions like: "rector of the University of Erfurt". These are the three categories that should be in the game of the respective P2/P3 statements (is/ subcategory of):


This is an easy case where we have just spelling variants such as in ''Laquai'' and ''Lakei'' but a complex problem where we have alternatives like ''Laquai'' and ''Valet de Chambre'' or ''Kammer-Diener''. The early modern ''Kammer-Diener'' could be a high ranking title of honour.
* [[Item:Q37073]] Career statement (baker, master baker, court baker, university rector, widow, pensioner)
* [[Item:Q37131]] Career statement with historical or geographical specifications
* [[Item:Q257052]] Career statement that captures a sequence of incumbents "rector/president of the University of Erfurt"


The standard procedure is here the merging of items.
The P3* switch allows you to search for all university rectors even though some have precise statements on them — see line 3 of this sample query:


== Dividing compound statements ==
* [https://database.factgrid.de/query/#SELECT%20%3FUniversity_president%20%3FUniversity_presidentLabel%20WHERE%20%7B%0A%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22%5BAUTO_LANGUAGE%5D%2Cen%22.%20%7D%0A%20%20%3FUniversity_president%20wdt%3AP165%2Fwdt%3AP3%2a%20wd%3AQ39675.%0A%7D%0A sample search: All university rectors/ presidents on FactGrid]


We should resolve most of the compound statements.
== Connected to the OhdAB ontology ==


* "Carpenter's widow" should become [[Property:P165]] ''Career statement'' "Widow" + Qualifier [[Property:P614]] ''Status of the (deceased) husband'' "Carpenter"
* [[FactGrid:OhdAB-Datenbank]] OhdAB project space
* "Retired Carpenter" should become [[Property:P165]] ''Career statement'' [[Item:Q37181]] "Retired" (or [[Item:Q37178]] Pensioner) + Qualifier [[Property:P211]] ''Previous Professional status'' "Carpenter". In addition we should set a P165 first level statement on "Carpenter" to note that former position.
* "Inn keeper and master cooper" should become two statements, with the respective dates to state the simultaneity.


The complexity is here that we might actually preserve some of the compounds in the last case as they stand for a specific design of the production, e.g. in a factory that produces these two products.
All our "career statements" are connected to Katrin Moellers ontology of career statements OhdAB. The OhdAB comprises some 45,000 items from no-statement to different jobs in a circus all in a hierarchy of differentiations. The following links give the original German version and an English version that has automated translations in it.


The standard procedure is the division into regular statements and the deletion of the compounds.
* https://kurzelinks.de/OhdAB German (original version)
* http://tinyurl.com/ylkfblre English (complex translation to be used with a grain of salt)


== No academic titles on P165 statements? ==
All the OhdAB items begin with the OhdAB number code and form a sphere of their own. Contact Katrin Moeller and her team if you want to offer more than translations. Our working vocabulary is connected to the OhdAB ontology through the [[Property:P1007]] statements on each "career statement". With the help of the graph database you now use the OhdAB information behind the P1007 link as in the following search:


The first input brought academic titles on [[Property:P165]] ''Career statements''. We already have a [[Property:P170]] for academic titles.
* [https://database.factgrid.de/query/embed.html#%23defaultView%3ABubbleChart%0ASELECT%20%3FOhdABLabel%20%28count%28distinct%28%3FA%29%29%20as%20%3Fcount%29%20WHERE%20%7B%0A%20%20%0A%20%20SELECT%20%3FA%20%3FALabel%20%3FADescription%20%3Ffamily_nameLabel%20%3FEntry%20%3FBnF_ID%20%3FDate_of_birth%20%3FOhdAB%20%3FOhdABLabel%20WHERE%20%7B%0A%20%20%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22de%22.%20%7D%0A%20%20%20%20%3FA%20wdt%3AP91%20wd%3AQ11161.%0A%20%20%20%20%3FA%20wdt%3AP165%20%3FOccupation.%20%0A%20%20%20%20%3FOccupation%20wdt%3AP1007%2a%20%3FOhdAB.%0A%20%20%20%20%20%20%20%20%20%20%20%20%20%3FOhdAB%20wdt%3AP2%20wd%3AQ651501.%20%20%20%20%20%20%0A%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%7D%0A%20%20%0A%20%20%7D%20group%20by%20%3FOhdABLabel The members of Frankfurt's Unions-Lodge, occupations bubble chart] Statistical analysis on the OhdAB-01 level.  


Academic titles on career statements make, nonetheless, sense as in the case of Dr. med., in German the usual statement for an active medical practitioner.
The OhdAB matching is not yet finished, but we will get this done in the first weeks of 2024. The OhdAB translation is another project. The German labelling should always be the source here.


== Should Honorary titles ("Privy councillor"/"Geheimrat") become "public awards"? ==
== Statistical breakdowns - options ==


Adam Weishaupt receives such a title in 1786 - which brings the professor of law into a new position protected by Gotha's duke.
We can run several ontologies and various statistical breakdowns side by side. The simple way is to have a property and the respective ontology on that property.


In a way these titles are awards which should appear under our [[Property:P171]] ''public award'', but in many lists e.g. of address books they will fill the statement of occupation. We have therefore left these statements on the [[Property:P165]] for the time being.
=== FactGrid (P626) Economic Sector Statistics ===


== From Deepl to authentic translations ==
*  [https://database.factgrid.de/query/embed.html#%23defaultView%3ABubbleChart%0ASELECT%20%3FSectorLabel%20(count(distinct(%3FA))%20as%20%3Fcount)%20WHERE%20%7B%0A%20%20%0A%20%20SELECT%20%3FA%20%3FALabel%20%3FADescription%20%3Ffamily_nameLabel%20%3FEntry%20%3FBnF_ID%20%3FDate_of_birth%20%3FSector%20%3FSectorLabel%20WHERE%20%7B%0A%20%20%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22de%22.%20%7D%0A%20%20%20%20%3FA%20wdt%3AP91%20wd%3AQ11161.%0A%20%20%20%20%3FA%20wdt%3AP165%20%3FOccupation.%20%0A%20%20%20%20%3FOccupation%20wdt%3AP626%20%3FSector.%0A%7D%0A%20%20%0A%20%20%7D%20group%20by%20%3FSectorLabel The members of Frankfurt's Unions-Lodge, occupations bubble chart the older FactGrid breakdown of Property:P626 statements]
:* [https://database.factgrid.de/query/index.html#%23defaultView%3ABubbleChart%0ASELECT%20%3FOhdABLabel%20%28count%28distinct%28%3FA%29%29%20as%20%3Fcount%29%20WHERE%20%7B%0A%20%20%0A%20%20SELECT%20%3FA%20%3FALabel%20%3FADescription%20%3Ffamily_nameLabel%20%3FEntry%20%3FBnF_ID%20%3FDate_of_birth%20%3FOhdAB%20%3FOhdABLabel%20WHERE%20%7B%0A%20%20%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22de%22.%20%7D%0A%20%20%20%20%3FA%20wdt%3AP91%20wd%3AQ11161.%0A%20%20%20%20%3FA%20wdt%3AP165%20%3FOccupation.%20%0A%20%20%20%20%3FOccupation%20wdt%3AP1007%20%3Fkey.%0A%20%20%20%20%20%20%20%20%20%20%20%20%20%3Fkey%20wdt%3AP911%20%3FOhdAB.%20%20%20%20%20%20%0A%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%7D%0A%20%20%0A%20%20%7D%20group%20by%20%3FOhdABLabel The members of Frankfurt's Unions-Lodge, by complexity of their jobs]
:* [https://database.factgrid.de/query/embed.html#%23defaultView%3ABubbleChart%0ASELECT%20%3FOhdABLabel%20%28count%28distinct%28%3FA%29%29%20as%20%3Fcount%29%20WHERE%20%7B%0A%20%20%0A%20%20SELECT%20%3FA%20%3FALabel%20%3FADescription%20%3Ffamily_nameLabel%20%3FEntry%20%3FBnF_ID%20%3FDate_of_birth%20%3FOhdAB%20%3FOhdABLabel%20WHERE%20%7B%0A%20%20%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22de%22.%20%7D%0A%20%20%20%20%3FA%20wdt%3AP91%20wd%3AQ11161.%0A%20%20%20%20%3FA%20wdt%3AP165%20%3FOccupation.%20%0A%20%20%20%20%3FOccupation%20wdt%3AP1007%2a%20%3FOhdAB.%0A%20%20%20%20%20%20%20%20%20%20%20%20%20%3FOhdAB%20wdt%3AP2%20wd%3AQ651501.%20%20%20%20%20%20%0A%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%7D%0A%20%20%0A%20%20%7D%20group%20by%20%3FOhdABLabel The members of Frankfurt's Unions-Lodge, occupations - statistical analysis on the OhdAB-01 level.]
::* [https://database.factgrid.de/query/index.html#%23defaultView%3ABubbleChart%0ASELECT%20%3FShort%20%28count%28distinct%28%3FA%29%29%20as%20%3Fcount%29%20WHERE%20%7B%0A%20%20%0A%20%20SELECT%20%3FA%20%3FALabel%20%3FADescription%20%3Ffamily_nameLabel%20%3FEntry%20%3FBnF_ID%20%3FDate_of_birth%20%3FOhdAB%20%3FOhdABLabel%20%3FShort%20WHERE%20%7B%0A%20%20%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22de%22.%20%7D%0A%20%20%20%20%3FA%20wdt%3AP91%20wd%3AQ11161.%0A%20%20%20%20%3FA%20wdt%3AP165%20%3FOccupation.%20%0A%20%20%20%20%3FOccupation%20wdt%3AP1007%2a%20%3FOhdAB.%0A%20%20%20%20%20%20%20%20%20%20%20%20%20%3FOhdAB%20wdt%3AP2%20wd%3AQ651501.%0A%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%3FOhdAB%20wdt%3AP808%20%3FShort.%0A%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%7D%0A%20%20%0A%20%20%7D%20group%20by%20%3FShort The same query on short labels, which are nicer on statistical bubbles]


The best solution is a translation with the look into contemporary dictionaries (see our collection historical dictionaries that went online at [[FactGrid:Authentic translation help]]).
The OhdAB ontology is just one way to arrange the various career statements. We can run several such systems side by side (they usually do not need more than one or two properties to generate a hierarchy). We are here just at the beginning. What we need is the individual breakdown l like a division of


It is the best solution as it allows the automatic matching of titles found in documents in different various languages.
* '''career requirements''' — from vocational training to doctoral degree in medicine
* '''admission procedures''' —  how do you get a certain position? do you inherit the position, are you elected into it, do you buy the charge?
* '''social power''' — how many people will you have under your command?
* '''hierarchy level''' — what next position(s) can you reach from the present position in your career?
* '''organisational power''' how is your group or profession organised? By a guild, a trade union, an umbrella organisation?


The biggest problem is here that the different languages did not show the same differentiations. There are fields that received various specific terms in some languages (reflecting here the definitions defended by the old guilds), whilst other languages have just one word for the job that needed to be done.
Careers are far more than just occupations that will be filled with work - they give access to networks, they provide power (or make you poor and helpless), they provide status and prestige, and here we need help to put the various statements on the individual statements which have created and will continue to create. The more we know about the career statements, the easier it is to isolate professions that require the same answers on questions like the ones asked so far.


The problem increases with the historical developments we are trying to grasp. Here we have terms that remain stable whilst the jobs were changing - and again these changes were not always noted in the various languages simultaneously and coherently. Some changed words and some did not.
Contact us if you feel you could make sense of particular profession you are handling in your research, help us to create the properties and answers you will need in order to make use of the work we have done so far.


== Problem of the three properties: P165: Career Statements, P164: Positions, P166: specific positions ==
== If you want to create career statements ==
Wikidata has basically two properties: ''Occupation'' ([https://www.wikidata.org/wiki/Property:P106  P106]) and ''Position held'' ([https://www.wikidata.org/wiki/Property:P39 P39]]).
Use the [https://database.factgrid.de/query/#SELECT%20%3FCareer_statement%20%3FCareer_statementLabel%20%3FCareer_statementAltLabel%20%3FOhdAB_category%20%3FOhdAB_categoryLabel%20%3FOhdAB_ID%20WHERE%20%7B%0A%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22de%22.%20%7D%0A%20%20%3FCareer_statement%20%28wdt%3AP2%2F%28wdt%3AP3%2a%29%29%20wd%3AQ37073.%0A%20%20OPTIONAL%20%7B%20%3FCareer_statement%20wdt%3AP1007%20%3FOhdAB_category.%20%7D%0A%20%20OPTIONAL%20%7B%20%3FCareer_statement%20wdt%3AP904%20%3FOhdAB_ID.%20%7D%0A%7D career statements we already have] wherever applicable.


The FactGrid equivalents are ''Career statement'' [[Property:P165]] and ''Office held'' [[Property:P164]]. Our "Career statement" property is more inclusive, allowing all the statements one might find on a personal record (such as retired, pensioner, widow, candidate of theology).
if you need new statements: Create them with together with a P2 statement of [[Item:Q37073]] / [[Item:Q37131]] or [[Item:Q257052]] and connect them to the OhdAB ontology with a [[Property:P1007]] statement.


The additional [[Property:P166]] came into use in order to state specific job descriptions on items such as [[Item:Q43367|Pastorate Altenbergen]]. It might make sense to fuse this Property into [[Property:P165]].
If you do not find the proper OhdAB match - contact the OhdAB team to create the perfect match of a pragmatic regular statement and a OhdAB systematic identification.


Could we go a whole step further and reduce everything to ''Career statement'' [[Property:P165]] statements? Not that easily if we still want to be able to run a simple count on all the shoemakers of a town. We might have to cleanse our career statements for that purpose of all statements that refer to specific positions in regiments etc.
== Use cases ==


== Problem of historical changes and stable Q-Items ==
...this is an immense challenge. It needs specialists for this field.


== Statistics ==
[[Category:Data modelling]]
It is not yet possible to run statistics on our career statements. At the moment we will get widely scattered fields of often extremely specific statements. Pies or bubble graphs will need succinct reductions to work with. A first pattern could be:
[[Category:Career statements]]
 
# Church
# Military
# Law, government and administration
# Crafts
# Commerce
# Agriculture
# Education and academia
# Artists and writers
# Servants
# Prostitutes and mistresses
# Landed property
# Unemployed / sick people / students/
 
We can run several patterns side by side - all they need is Properties and the respective statements on our career statements.
 
Wives are a difficult category - in a way they are a group of their own, in a way they belong to the trade of their husbands up to thee "professor's wife" who is eager to be noted in this condition.

Latest revision as of 09:04, 1 February 2024

Kupferstich "Der Buchhändler" aus: Abbildung der gemein-nützlichen Haupt-Stände von Christoph Weigel (Regensburg, 1698).

Problem and Solution

Historical sources are rich with statements about the social status, the personal situations or the occupations of people they are mentioning. Subscription lists, address books, tax lists — they all will offer names, localisations and career statements from "merchant's widow", to "colonel" of the local regiment.

The information is often merely given to identify a particular person. Digest it and it will shed light on the social composition of an audience or organisation you are studying. The problem is that you will need a good deal of background information on these statements before they begin to make sense in greater masses. If you have 1,000 names and 300 different career statements you will need background information on all these career statements to get a fast categorisation and a first impression of the numbers (and people) under these useful headers.

The database should know the words, it should know variants and abbreviations of these words in order to understand them and it should have a system or various competing systems of categorisations to unite your people in different groups under under various questions. Once you have this background information the machine can put the 300 statements into ten or twenty groups and count the 1,000 people which you will now fin in these groups for statistics or for further research int interesting groups that are now more visible.

The solution is a database that knows the words and that breaks them down in ontologies. This is basically what we are trying to provide with the "career statements" on FactGrid. If you have a set of people use the Property:P165 to connect them to "career statements" and the you can run searches and sort results with the ontological information that is on these terms.

FactGrid career statements — three granularities

FactGrid is presently offering some 8,800 "Career statements" - this is the list in English and in alphabetical order - the translations are not yet revised and there are presently some blank items in this list that will be removed in the next steps.

Some of these items are as broad as "pensioner", some are succinct in various degrees from "baker" to "master baker" to "court master baker" and some of them create singular positions like: "rector of the University of Erfurt". These are the three categories that should be in the game of the respective P2/P3 statements (is/ subcategory of):

  • Item:Q37073 Career statement (baker, master baker, court baker, university rector, widow, pensioner)
  • Item:Q37131 Career statement with historical or geographical specifications
  • Item:Q257052 Career statement that captures a sequence of incumbents "rector/president of the University of Erfurt"

The P3* switch allows you to search for all university rectors even though some have precise statements on them — see line 3 of this sample query:

Connected to the OhdAB ontology

All our "career statements" are connected to Katrin Moellers ontology of career statements OhdAB. The OhdAB comprises some 45,000 items from no-statement to different jobs in a circus all in a hierarchy of differentiations. The following links give the original German version and an English version that has automated translations in it.

All the OhdAB items begin with the OhdAB number code and form a sphere of their own. Contact Katrin Moeller and her team if you want to offer more than translations. Our working vocabulary is connected to the OhdAB ontology through the Property:P1007 statements on each "career statement". With the help of the graph database you now use the OhdAB information behind the P1007 link as in the following search:

The OhdAB matching is not yet finished, but we will get this done in the first weeks of 2024. The OhdAB translation is another project. The German labelling should always be the source here.

Statistical breakdowns - options

We can run several ontologies and various statistical breakdowns side by side. The simple way is to have a property and the respective ontology on that property.

FactGrid (P626) Economic Sector Statistics

The OhdAB ontology is just one way to arrange the various career statements. We can run several such systems side by side (they usually do not need more than one or two properties to generate a hierarchy). We are here just at the beginning. What we need is the individual breakdown l like a division of

  • career requirements — from vocational training to doctoral degree in medicine
  • admission procedures — how do you get a certain position? do you inherit the position, are you elected into it, do you buy the charge?
  • social power — how many people will you have under your command?
  • hierarchy level — what next position(s) can you reach from the present position in your career?
  • organisational power how is your group or profession organised? By a guild, a trade union, an umbrella organisation?

Careers are far more than just occupations that will be filled with work - they give access to networks, they provide power (or make you poor and helpless), they provide status and prestige, and here we need help to put the various statements on the individual statements which have created and will continue to create. The more we know about the career statements, the easier it is to isolate professions that require the same answers on questions like the ones asked so far.

Contact us if you feel you could make sense of particular profession you are handling in your research, help us to create the properties and answers you will need in order to make use of the work we have done so far.

If you want to create career statements

Use the career statements we already have wherever applicable.

if you need new statements: Create them with together with a P2 statement of Item:Q37073 / Item:Q37131 or Item:Q257052 and connect them to the OhdAB ontology with a Property:P1007 statement.

If you do not find the proper OhdAB match - contact the OhdAB team to create the perfect match of a pragmatic regular statement and a OhdAB systematic identification.

Use cases