Data Download Tables

Table Name | Size | Description | # of Rows | Origin
application | 0.047 GB | Information on the applications for granted patent | 6,366,664 | raw
assignee | 0.011 GB | Disambiguated assignee data | 376,913 | disamb
botanic | 0.0 GB | Botanic information for plant patents | 12,805 | raw
brf_sum_text | 12.176 GB | Brief summary text | 5,813,766 | raw
claim | 10.658 GB | Full text of patent claims, including dependency and sequence | 89,944,541 | raw
cpc_current | 0.86 GB | Current CPC classification data for all patents (applied retrospectively to all patents) | 32,874,742 | raw (from separate classification files)
cpc_group | 0.018 MB | Lookup table of current CPC groups | 656 | raw (from separate classification files)
cpc_subgroup | 5.099 MB | Lookup table of current CPC subgroups | 259,048 | raw (from separate classification files)
cpc_subsection | 0.003 MB | Lookup table of current CPC subsections | 127 | raw (from separate classification files)
draw_desc_text | 2.655 GB | Drawing description text | 61,307,429 | raw
foreign_priority | 0.092 GB | Foreign priority data | 2,992,770 | raw
figures | 0.124 GB | Number of figures and sheets | 5,875,847 | raw
foreigncitation | 0.65 GB | Citations made to foreign patents by US patents | 22,020,427 | raw
government_interest | 0.003 GB | Raw government interest statements on all patents (where available) | 127,367 | raw
government_organization | 0.003 MB | Organization names and related agency hierarchy parsed from the government interest statements on all patents (where available) | 187 | processed
inventor | 0.034 GB | Disambiguated inventor data | 3,482,305 | disamb
ipcr | 0.318 GB | International Patent Classification data for all patents (as of publication date) | 11,557,735 | raw
lawyer | 4.964 MB | Disambiguated lawyer data | 165,025 | disamb
location | 3.483 MB | Disambiguated location data, including latitude and longitude | 129,303 | disamb
location_assignee | 13.393 MB | Metadata table for many-to-many relationships | 526,332 | disamb (linking table)
location_inventor | 52.102 MB | Metadata table for many-to-many relationships | 4,799,016 | disamb (linking table)
mainclass | 0.002 MB | Lookup table of original USPC main classes (as of patent publication date) | 1,237 | raw
mainclass_current | 0.007 MB | Lookup table of current USPC main technology classes (applied retrospectively to all patents) | 511 | raw (from separate classification files)
nber | 0.105 GB | NBER classification data for all patents up to May 2015 | 5,105,937 | raw (from separate classification files)
nber_category | 0.0 MB | Lookup table for NBER categories | 6 | raw (from separate classification files)
nber_subcategory | 0.001 MB | Lookup table for NBER subcategories | 37 | raw (from separate classification files)
non_inventor_applicant | 0.028 GB | Non-inventor applicant information | 598,855 | raw
otherreference | 2.5 GB | Non-patent citations mentioned in patents (e.g. articles, papers, etc.) | 31,414,354 | raw
patent | 1.165 GB | Data on granted patents | 6,366,664 | raw
patent_assignee | 0.031 GB | Metadata table for many-to-many relationships | 5,629,558 | disamb (linking table)
patent_contractawardnumber | 0.911 MB | Contract or award numbers parsed from the government interest statements on all patents (where available) | 138,988 | processed
patent_govintorg | 0.329 MB | Metadata table with patent-to-organization relationships linked to the government_organization table | 146,371 | processed
patent_inventor | 0.11 GB | Metadata table for many-to-many relationships | 14,953,518 | disamb (linking table)
patent_lawyer | 0.033 GB | Metadata table for many-to-many relationships | 7,226,556 | disamb (linking table)
pct_data | 0.034 GB | PCT data | 1,100,050 | raw
persistent_inventor_disamb | 0.283 GB | Crosswalk between rawinventor IDs and disambiguated IDs by the date of database update | 14,959,652 | disamb
rawassignee | 0.353 GB | Raw assignee information as it appears in the source text and XML files | 5,629,565 | raw
rawexaminer | 0.254 GB | Raw examiner information | 8,727,147 | raw
rawinventor | 0.759 GB | Raw inventor information as it appears in the source text and XML files | 14,959,652 | raw
rawlawyer | 0.352 GB | Raw lawyer information as it appears in the source text and XML files | 7,226,654 | raw
rawlocation | 0.675 GB | Raw location data for inventors and assignees, as it appears in xml and text source files | 20,705,708 | raw
rel_app_text | 0.132 GB | Related applications text | 1,438,216 | raw
subclass | 0.378 MB | Lookup table of original USPC subclasses (as of patent publication date) | 264,386 | raw
subclass_current | 1.924 MB | Lookup table of current USPC subclasses (applied retrospectively to all patents) | 171,053 | raw (from separate classification files)
us_term_of_grant | 0.065 GB | U.S. term of grant data | 2,923,111 | raw
usapplicationcitation | 0.921 GB | Citations made to US patent applications by US patents | 25,343,373 | raw
uspatentcitation | 3.027 GB | Citations made to US granted patents by US patents | 89,122,312 | raw
uspc | 0.438 GB | USPC classification data for all patents | 18,024,187 | raw
uspc_current | 0.537 GB | Current USPC classification data for all patents up to May 2015 | 22,576,756 | raw (from separate classification files)
usreldoc | 0.237 GB | U.S. related documents (post-2005 patents only) | 8,022,497 | raw
wipo | 16.071 MB | WIPO technology fields for all patents | 8,474,756 | raw (from separate classification files)
wipo_field | 0.001 MB | Lookup table of WIPO technology fields | 70 | raw (from separate classification files)
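
Several entries in the list are linking tables for many-to-many relationships (for example, patent_inventor sits between patent and inventor). The sketch below illustrates how such a linking table is typically resolved with two joins. It uses an in-memory SQLite database with toy rows rather than the real MySQL dump, and the column names (id, title, name, patent_id, inventor_id) are assumptions made for illustration, not the documented PatentsView schema.

```python
import sqlite3

# Toy in-memory stand-ins for three of the listed tables.
# Column names and all rows are hypothetical, chosen only for illustration.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE patent (id TEXT PRIMARY KEY, title TEXT);
CREATE TABLE inventor (id TEXT PRIMARY KEY, name TEXT);
CREATE TABLE patent_inventor (patent_id TEXT, inventor_id TEXT);
INSERT INTO patent VALUES ('P1', 'Widget'), ('P2', 'Gadget');
INSERT INTO inventor VALUES ('I1', 'Ada Example'), ('I2', 'Bo Sample');
INSERT INTO patent_inventor VALUES ('P1', 'I1'), ('P1', 'I2'), ('P2', 'I1');
""")


def inventors_per_patent(connection):
    """Return (patent id, inventor name) pairs resolved through the linking table."""
    query = """
        SELECT p.id, i.name
        FROM patent AS p
        JOIN patent_inventor AS pi ON pi.patent_id = p.id
        JOIN inventor AS i ON i.id = pi.inventor_id
        ORDER BY p.id, i.name
    """
    return connection.execute(query).fetchall()


rows = inventors_per_patent(conn)
```

The same two-join pattern would apply to the other linking tables (patent_assignee, patent_lawyer, location_assignee, location_inventor), with the actual key columns taken from the database documentation.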

The PatentsView database is sourced from USPTO-provided text and XML data on published patent applications (2001 to the most recent update) and granted patents (1976 to the most recent update). A MySQL dump of the current PatentsView database is available for download upon request. The patent applications database, which contains all granted and non-granted applications, is also available upon request. After March 2016, the applications database will not contain the same inventor IDs as the PatentsView database. Only inventors on granted applications can be matched between the PatentsView and applications databases, using the granted application ID.

This work was created through a government contract funded by the Office of the Chief Economist in the US Patent and Trademark Office. Users are free to use, share, or adapt the material for any purpose, subject to the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/).

Attribution should be given to PatentsView (www.patentsview.org) for use, distribution, or derivative works.

Starting from the PatentsView database, simple assignee and lawyer disambiguations are performed, and the patents are geocoded using a location-based disambiguation. The data are then fed into the inventor disambiguation algorithm, which identifies clusters of inventor names that are determined to be the same individual. Because the disambiguation of inventor identities is an ongoing effort, errors are likely to be observable in the PatentsView data tables. The team welcomes feedback as we continue to improve our disambiguation methodology.
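
To make the idea of a "cluster of inventor names" concrete, the sketch below groups raw name variants by the disambiguated ID they were assigned. It is not the PatentsView disambiguation algorithm; it only shows one way the output of such an algorithm, a crosswalk from raw inventor records to disambiguated IDs, could be read back as clusters and spot-checked. The row layout, IDs, and names are invented for illustration.

```python
from collections import defaultdict

# Toy crosswalk rows: (raw inventor record ID, disambiguated inventor ID, raw name).
# The layout and every value here are hypothetical.
crosswalk = [
    ("raw-1", "inv-A", "J. Smith"),
    ("raw-2", "inv-A", "John Smith"),
    ("raw-3", "inv-B", "Jane Smith"),
    ("raw-4", "inv-A", "Smith, John"),
]


def cluster_names(rows):
    """Group raw name variants under the disambiguated ID assigned to them."""
    clusters = defaultdict(list)
    for _raw_id, disamb_id, raw_name in rows:
        clusters[disamb_id].append(raw_name)
    return dict(clusters)


clusters = cluster_names(crosswalk)

# Clusters with more than one name variant are natural candidates for review.
multi_variant = {k: v for k, v in clusters.items() if len(v) > 1}
```

Reviewing clusters that merge several spellings is one simple way to find the kind of disambiguation errors on which feedback is requested.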

For more information, visit the Methods and Sources section of the website.