Data Download Tables

Table NameDescription# of RowsOrigin
application61.248 MBInformation on the applications for granted patent6,502,933raw
assignee12.071 MBDisambiguated assignee data383,348disamb
botanic447.573 KBBotanic information for plant patents13,366raw
brf_sum_text13.045 GBBrief summary text5,937,953raw
claim11.369 GBFull text of patent claims, including dependency and sequence91,920,308raw
cpc_current824.459 MBCurrent CPC classification data for all patents (applied retrospectively to all patents)33,931,998raw (from separate classification files)
cpc_group19.322 KBLookup table of current CPC groups656raw (from separate classification files)
cpc_subgroup5.2 MBLookup table of current CPC subgroups259,048raw (from separate classification files)
cpc_subsection2.837 KBLookup table of current CPC subsections127raw (from separate classification files)
detail_desc_text39.40 GB
*provided upon request to
Detailed patent description text6,260,847raw
draw_desc_text2.76 GBDrawing description text61,431,696raw
foreign_priority100.076 MBForeign priority data3,051,958raw
figures134.423 MBNumber of figures and sheets6,007,407raw
foreigncitation735.246 MBCitations made to foreign patents by US patents22,943,438raw
government_interest3.715 MBRaw government interest statements on all patents (where available)130,312raw
government_organization4.086 KBOrganization names and related agency hierarchy parsed from the government interest statements on all patents (where available)212processed
inventor39.16 MBDisambiguated inventor data3,663,964disamb
ipcr367.787 MBInternational Patent Classification data for all patents (as of publication date)12,171,206raw
lawyer5.181 MBDisambiguated lawyer data167,828disamb
location3.803 MBDisambiguated location data, including latitude and longitude134,160disamb
location_assignee14.458 MBMetadata table for many-to-many relationships382,997disamb (linking table)
location_inventor163.121 MBMetadata table for many-to-many relationships3,660,865disamb (linking table)
mainclass2.349 KBLookup table of original USPC main classes (as of patent publication date)1,237raw
mainclass_current7.312 KBLookup table of current USPC main technology classes (applied retrospectively to all patents)511raw (from separate classification files)
nber110.807 MBNBER classification data for all patents up to May 20155,105,937raw (from separate classification files)
nber_category248 bytesLookup table for NBER categories6raw (from separate classification files)
nber_subcategory639 bytesLookup table for NBER subcategories37raw (from separate classification files)
non_inventor_applicant35.622 MBNon-inventor applicant information708,100raw
otherreference2.645 GBNon-patent citations mentioned in patents (e.g. articles, papers, etc.)32,647,910raw
patent1.262 GBData on granted patents6,502,933raw
patent_assignee78.908 MBMetadata table for many-to-many relationships5,760,479disamb (linking table)
patent_contractawardnumber810.604 KBContract or award numbers parsed from the government interest statements on all patents (where available)102,537processed
patent_govintorg489.071 KBMetadata table with patent-to-organization relationships linked to the government_organization table150,299processed
patent_inventor81.272 MBMetadata table for many-to-many relationships15,334,607disamb (linking table)
patent_lawyer139.57 MBMetadata table for many-to-many relationships7,380,033disamb (linking table)
pct_data39.006 MBPCT data1,146,846raw
persistent_inventor_disambig367.678 MBPersistant Inventor Disambiguation15,334,570raw
rawassignee383.284 MBRaw assignee information as it appears in the source text and XML files5,761,505raw
rawexaminer275.752 MBRaw examiner information8,901,186raw
rawinventor815.382 MBRaw inventor information as it appears in the source text and XML files15,334,570raw
rawlawyer378.455 MBRaw lawyer information as it appears in the source text and XML files7,380,035raw
rawlocation732.86 MBRaw location data for inventors and assignees, as it appears in xml and text source files21,334,737raw
rel_app_text142.001 MBRelated applications text1,489,780raw
subclass574.245 KBLookup table of original USPC subclasses (as of patent publication date)264,505raw
subclass_current2.017 MBLookup table of current USPC subclasses (applied retrospectively to all patents)171,053raw (from separate classification files)
us_term_of_grant72.77 MBU.S. term of grant data3,011,193raw
usapplicationcitation1.068 GBCitations made to US patent applications by US patents27,355,088raw
uspatentcitation3.273 GBCitations made to US granted patents by US patents91,867,780raw
uspc468.177 MBUSPC classification data for all patents18,027,864raw
uspc_current515.564 MBCurrent USPC classification data for all patents up to May 201522,590,827raw (from separate classification files)
usreldoc271.9 MBU.S. related documents (post-2005 patents only)8,371,462raw
wipo23.13 MBWIPO technology fields for all patents8,661,433raw (from separate classification files)
wipo_field1.48 KBLookup table of WIPO technology fields70raw (from separate classification files)

The PatentsView database is sourced from USPTO-provided text and XML data on published patent applications (2001-most recent update) and granted patents (1976-most recent update). The current PatentsView database MySQL dump is available for download, upon request. The patent applications database, which contains all granted and non-granted applications, is also available upon request. After March, 2016, the applications database will not contain the same inventor IDs as the PatentsView database. Only inventors on granted applications can be matched between the PatentsView and applications databases via a granted application ID.

This work was created through a government contract funded by the Office of Chief Economist in the US Patent and Trademark Office. Users are free to use, share, or adapt the material for any purpose, subject to the standards of the Creative Commons Attribution 4.0 International License (

Attribution should be given to PatentsView ( for use, distribution, or derivative works.

From the PatentsView database, simple assignee and lawyer disambiguations are performed, and the patents are geocoded with a location-based disambiguation. Data are then fed into the inventor disambiguation algorithm in order to identify clusters of inventor names that are determined to be the same individual. Because the disambiguation of inventor identities is an ongoing effort, there are likely to be errors observable in the PatentsView data tables. The team welcomes feedback as we continue to improve our disambiguation methodology.

For more information, visit the Methods and Sources section of the website.