Ask HN: Built a DB of over 50M+ Org Names for API use. Should it be made public?

westurner a year ago

Yeah, how do you indicate uncertainty in the aigen estimated correspondences? W3C CSVW supports dataset, column, and cell -level metadata. E.g. opencog atomspace hypergraph supports an Attention Value and a Truth Value.

Are there surprising regional and temporal trends in the names?

RDFS specifies a standard vocabulary for classes and subclasses, and properties and sub properties; rdfs:Class , rdfs:Property .

There are schema.org properties on the schema:LocalBusiness class for various business identifiers and other attributes ;

https://schema.org/url : domain

https://schema.org/identifier and subproperties : https://schema.org/duns , https://schema.org/taxID ,

https://schema.org/areaServed

https://schema.org/brand r: https://schema.org/Brand , https://schema.org/Organization

Maybe, a https://schema.org/Dataset :isPartOf a https://schema.org/ScholarlyArticle

https://schema.org/isPartOf