PropertyValue
?:abstract
  • "Occupation coding, an important task in official statistics, refers to coding a respondent's text answer into one of many hundreds of occupation codes. To date, occupation coding is still at least partially conducted manually at great expense. We propose two new methods for automatic coding: a hybrid method that combines a rule-based approach based on duplicates with a statistical learning algorithm, and a modified nearest neighbor approach. Using data from the German General Social Survey (ALLBUS), we show that both methods improve on both the coding accuracy of the underlying statistical learning algorithm and the coding accuracy of duplicates where duplicates exist. We also find that statistical learning is improved by combining separate models for the detailed occupation codes and for aggregate occupation codes. Further, we find defining duplicates based on n-gram variables (a concept from text mining) is preferable to one based on exact string matches." Es werden Daten aus dem ALLBUS 2006 verwendet. (xsd:string)
?:author
?:comment
  • (ALLBUS) (xsd:string)
?:dataSource
  • ALLBUS-Bibliography (xsd:string)
?:dateCreated
  • Aufgenommen: 31. Fassung, März 2017 (xsd:gyear)
?:dateModified
  • 2016 (xsd:gyear)
?:datePublished
  • 2016 (xsd:gyear)
?:duplicate
is ?:hasPart of
?:inLanguage
  • english (xsd:string)
is ?:mainEntity of
?:name
  • Improving the Accuracy of Automated Occupation Coding at Any Production Rate (xsd:string)
?:publicationType
  • techreport (xsd:string)
?:reference
?:sourceInfo
  • Bibsonomy (xsd:string)
?:studyGroup
  • ALLBUS (xsd:string)
?:tags
  • 2016 (xsd:string)
  • ALLBUS (xsd:string)
  • ALLBUS2006 (xsd:string)
  • ALLBUS_input2016 (xsd:string)
  • ALLBUS_pro (xsd:string)
  • ALLBUS_version31 (xsd:string)
  • FDZ_ALLBUS (xsd:string)
  • checked (xsd:string)
  • english (xsd:string)
  • techreport (xsd:string)
rdf:type
?:uploadDate
  • 10.08.2016 (xsd:gyear)
?:url