Data Download Tables

Table NameDescription# of RowsOrigin
Inventor Gender23.8 MBGender assignment of disambiguated inventor. Methods Report3,772,032disamb
application64.302 MBInformation on the applications for granted patent6,819,362raw
assignee13.829 MBDisambiguated assignee data506,284disamb
botanic487.309 KBBotanic information for plant patents14,464raw
brf_sum_text13.829 GBBrief summary text6,225,977raw
claim11.755 GBFull text of patent claims, including dependency and sequence96,694,250raw
cpc_current1.019 GBCurrent CPC classification data for all patents (applied retrospectively to all patents)36,846,878raw (from separate classification files)
cpc_group20.933 KBLookup table of current CPC groups662raw (from separate classification files)
cpc_subgroup5.376 MBLookup table of current CPC subgroups260,008raw (from separate classification files)
cpc_subsection3.435 KBLookup table of current CPC subsections128raw (from separate classification files)
detail_desc_text39.40 GB
*provided upon request to
Detailed patent description text6,260,847raw
draw_desc_text4.083 GBDrawing description text68,027,122raw
foreign_priority104.935 MBForeign priority data3,191,918raw
figures141.372 MBNumber of figures and sheets6,313,313raw
foreigncitation814.565 MBCitations made to foreign patents by US patents25,374,575raw
government_interest4.212 MBRaw government interest statements on all patents (where available)137,738raw
government_organization5.019 KBOrganization names and related agency hierarchy parsed from the government interest statements on all patents (where available)247processed
inventor40.278 MBDisambiguated inventor data3,772,041disamb
ipcr424.343 MBInternational Patent Classification data for all patents (as of publication date)13,854,255raw
lawyer5.083 MBDisambiguated lawyer data166,251disamb
location3.84 MBDisambiguated location data, including latitude and longitude141,189disamb
location_assignee13.988 MBMetadata table for many-to-many relationships506,284disamb (linking table)
location_inventor172.881 MBMetadata table for many-to-many relationships3,771,982disamb (linking table)
mainclass2.534 KBLookup table of original USPC main classes (as of patent publication date)1,237raw
mainclass_current7.515 KBLookup table of current USPC main technology classes (applied retrospectively to all patents)510raw (from separate classification files)
nber110.807 MBNBER classification data for all patents up to May 20155,105,937raw (from separate classification files)
nber_category456 bytesLookup table for NBER categories6raw (from separate classification files)
nber_subcategory871 bytesLookup table for NBER subcategories37raw (from separate classification files)
non_inventor_applicant178.309 MBNon-inventor applicant information3,546,627raw
otherreference2.902 GBNon-patent citations mentioned in patents (e.g. articles, papers, etc.)36,101,604raw
patent1.324 GBData on granted patents6,819,362raw
patent_assignee98.657 MBMetadata table for many-to-many relationships6,070,101disamb (linking table)
patent_contractawardnumber915.027 KBContract or award numbers parsed from the government interest statements on all patents (where available)116,771processed
patent_govintorg526.598 KBMetadata table with patent-to-organization relationships linked to the government_organization table161,866processed
patent_inventor127.39 MBMetadata table for many-to-many relationships16,237,888disamb (linking table)
patent_lawyer144.39 MBMetadata table for many-to-many relationships7,739,192disamb (linking table)
pct_data43.463 MBPCT data1,264,191raw
persistent_inventor_disambig469.96 MBPersistant Inventor Disambiguation16,237,888raw
rawassignee393.606 MBRaw assignee information as it appears in the source text and XML files6,070,101raw
rawexaminer290.439 MBRaw examiner information9,344,749raw
rawinventor866.405 MBRaw inventor information as it appears in the source text and XML files16,237,888raw
rawlawyer390.825 MBRaw lawyer information as it appears in the source text and XML files7,739,192raw
rawlocation840.103 MBRaw location data for inventors and assignees, as it appears in xml and text source files25,845,682raw
rel_app_text157.284 MBRelated applications text1,617,202raw
subclass591.386 KBLookup table of original USPC subclasses (as of patent publication date)272,425raw
subclass_current2.017 MBLookup table of current USPC subclasses (applied retrospectively to all patents)171,053raw (from separate classification files)
us_term_of_grant76.21 MBU.S. term of grant data3,215,961raw
usapplicationcitation1.259 GBCitations made to US patent applications by US patents32,145,240raw
uspatentcitation3.505 GBCitations made to US granted patents by US patents98,207,057raw
uspc468.504 MBUSPC classification data for all patents18,040,076raw
uspc_current592.773 MBCurrent USPC classification data for all patents up to May 201522,885,509raw (from separate classification files)
usreldoc299.022 MBU.S. related documents (post-2005 patents only)9,222,837raw
wipo21.605 MBWIPO technology fields for all patents8,565,628raw (from separate classification files)
wipo_field1.731 KBLookup table of WIPO technology fields70raw (from separate classification files)

The PatentsView database is sourced from USPTO-provided text and XML data on published patent applications (2001-most recent update) and granted patents (1976-most recent update). The current PatentsView database MySQL dump is available for download, upon request. The patent applications database, which contains all granted and non-granted applications, is also available upon request. After March, 2016, the applications database will not contain the same inventor IDs as the PatentsView database. Only inventors on granted applications can be matched between the PatentsView and applications databases via a granted application ID.

This work was created through a government contract funded by the Office of Chief Economist in the US Patent and Trademark Office. Users are free to use, share, or adapt the material for any purpose, subject to the standards of the Creative Commons Attribution 4.0 International License (

Attribution should be given to PatentsView ( for use, distribution, or derivative works.

From the PatentsView database, simple assignee and lawyer disambiguations are performed, and the patents are geocoded with a location-based disambiguation. Data are then fed into the inventor disambiguation algorithm in order to identify clusters of inventor names that are determined to be the same individual. Because the disambiguation of inventor identities is an ongoing effort, there are likely to be errors observable in the PatentsView data tables. The team welcomes feedback as we continue to improve our disambiguation methodology.

For more information, visit the Methods and Sources section of the website.