Data Download Tables

Table NameDescription# of RowsOrigin
application62.622 MBInformation on the applications for granted patent6,647,699raw
assignee12.269 MBDisambiguated assignee data389,246disamb
botanic466.84 KBBotanic information for plant patents13,888raw
brf_sum_text13.512 GBBrief summary text6,069,808raw
claim11.457 GBFull text of patent claims, including dependency and sequence94,162,123raw
cpc_current855.203 MBCurrent CPC classification data for all patents (applied retrospectively to all patents)35,208,720raw (from separate classification files)
cpc_group20.654 KBLookup table of current CPC groups662raw (from separate classification files)
cpc_subgroup4.501 MBLookup table of current CPC subgroups227,206raw (from separate classification files)
cpc_subsection3.233 KBLookup table of current CPC subsections128raw (from separate classification files)
detail_desc_text39.40 GB
*provided upon request to
Detailed patent description text6,260,847raw
draw_desc_text3.946 GBDrawing description text65,837,233raw
foreign_priority102.376 MBForeign priority data3,116,493raw
figures137.6 MBNumber of figures and sheets6,147,287raw
foreigncitation775.606 MBCitations made to foreign patents by US patents24,173,810raw
government_interest4.078 MBRaw government interest statements on all patents (where available)133,617raw
government_organization4.534 KBOrganization names and related agency hierarchy parsed from the government interest statements on all patents (where available)234processed
inventor40.826 MBDisambiguated inventor data3,822,573disamb
ipcr398.117 MBInternational Patent Classification data for all patents (as of publication date)13,051,297raw
lawyer5.012 MBDisambiguated lawyer data163,905disamb
location3.607 MBDisambiguated location data, including latitude and longitude128,947disamb
location_assignee14.485 MBMetadata table for many-to-many relationships389,247disamb (linking table)
location_inventor166.395 MBMetadata table for many-to-many relationships3,821,365disamb (linking table)
mainclass2.408 KBLookup table of original USPC main classes (as of patent publication date)1,237raw
mainclass_current7.362 KBLookup table of current USPC main technology classes (applied retrospectively to all patents)510raw (from separate classification files)
nber110.807 MBNBER classification data for all patents up to May 20155,105,937raw (from separate classification files)
nber_category306 bytesLookup table for NBER categories6raw (from separate classification files)
nber_subcategory697 bytesLookup table for NBER subcategories37raw (from separate classification files)
non_inventor_applicant167.488 MBNon-inventor applicant information3,345,047raw
otherreference2.761 GBNon-patent citations mentioned in patents (e.g. articles, papers, etc.)34,354,742raw
patent1.294 GBData on granted patents6,647,699raw
patent_assignee80.812 MBMetadata table for many-to-many relationships5,902,217disamb (linking table)
patent_contractawardnumber878.523 KBContract or award numbers parsed from the government interest statements on all patents (where available)112,136processed
patent_govintorg512.696 KBMetadata table with patent-to-organization relationships linked to the government_organization table156,733processed
patent_inventor123.015 MBMetadata table for many-to-many relationships15,752,163disamb (linking table)
patent_lawyer98.167 MBMetadata table for many-to-many relationships7,544,959disamb (linking table)
pct_data41.192 MBPCT data1,201,531raw
persistent_inventor_disambig304.942 MBPersistant Inventor Disambiguation15,752,164raw
rawassignee393.575 MBRaw assignee information as it appears in the source text and XML files5,903,411raw
rawexaminer283.393 MBRaw examiner information9,129,949raw
rawinventor836.302 MBRaw inventor information as it appears in the source text and XML files15,752,110raw
rawlawyer380.681 MBRaw lawyer information as it appears in the source text and XML files7,544,984raw
rawlocation815.29 MBRaw location data for inventors and assignees, as it appears in xml and text source files24,991,549raw
rel_app_text150.405 MBRelated applications text1,552,104raw
subclass591.155 KBLookup table of original USPC subclasses (as of patent publication date)272,394raw
subclass_current2.017 MBLookup table of current USPC subclasses (applied retrospectively to all patents)171,053raw (from separate classification files)
us_term_of_grant73.375 MBU.S. term of grant data3,104,452raw
usapplicationcitation1.143 GBCitations made to US patent applications by US patents29,512,965raw
uspatentcitation3.374 GBCitations made to US granted patents by US patents94,726,690raw
uspc468.38 MBUSPC classification data for all patents18,035,443raw
uspc_current521.786 MBCurrent USPC classification data for all patents up to May 201522,880,877raw (from separate classification files)
usreldoc282.787 MBU.S. related documents (post-2005 patents only)8,766,072raw
wipo20.957 MBWIPO technology fields for all patents8,327,407raw (from separate classification files)
wipo_field1.537 KBLookup table of WIPO technology fields70raw (from separate classification files)

The PatentsView database is sourced from USPTO-provided text and XML data on published patent applications (2001-most recent update) and granted patents (1976-most recent update). The current PatentsView database MySQL dump is available for download, upon request. The patent applications database, which contains all granted and non-granted applications, is also available upon request. After March, 2016, the applications database will not contain the same inventor IDs as the PatentsView database. Only inventors on granted applications can be matched between the PatentsView and applications databases via a granted application ID.

This work was created through a government contract funded by the Office of Chief Economist in the US Patent and Trademark Office. Users are free to use, share, or adapt the material for any purpose, subject to the standards of the Creative Commons Attribution 4.0 International License (

Attribution should be given to PatentsView ( for use, distribution, or derivative works.

From the PatentsView database, simple assignee and lawyer disambiguations are performed, and the patents are geocoded with a location-based disambiguation. Data are then fed into the inventor disambiguation algorithm in order to identify clusters of inventor names that are determined to be the same individual. Because the disambiguation of inventor identities is an ongoing effort, there are likely to be errors observable in the PatentsView data tables. The team welcomes feedback as we continue to improve our disambiguation methodology.

For more information, visit the Methods and Sources section of the website.