Data Download Tables

Table Name Description # of Rows Origin Data Last Updated
applicationzip: 73.8 MiB, tsv: 402.5 MiB Information on the applications for granted patent 7,526,704 raw December 10, 2020
assigneezip: 14.4 MiB, tsv: 31.7 MiB Disambiguated assignee data 512,152 disamb December 10, 2020
botaniczip: 589.7 KiB, tsv: 1.2 MiB Botanic information for plant patents 16,796 raw December 10, 2020
brf_sum_text Brief summary text raw
claim Full text of patent claims, including dependency and sequence raw
cpc_currentzip: 1.3 GiB, tsv: 3.7 GiB Current CPC classification data for all patents (applied retrospectively to all patents) 41,413,742 raw (from separate classification files) December 10, 2020
cpc_groupzip: 21.5 KiB, tsv: 67.7 KiB Lookup table of current CPC groups 672 raw (from separate classification files) December 10, 2020
cpc_subgroupzip: 5.4 MiB, tsv: 60.5 MiB Lookup table of current CPC subgroups 258,827 raw (from separate classification files) December 10, 2020
cpc_subsectionzip: 3.2 KiB, tsv: 7.9 KiB Lookup table of current CPC subsections 136 raw (from separate classification files) December 10, 2020
detail_desc_text Detailed patent description text raw
draw_desc_text Drawing description text raw
foreign_priorityzip: 120.5 MiB, tsv: 287.8 MiB Foreign priority data 3,503,110 raw December 10, 2020
figureszip: 161.1 MiB, tsv: 285.5 MiB Number of figures and sheets 7,000,232 raw December 10, 2020
foreigncitationzip: 1017.0 MiB, tsv: 2.5 GiB Citations made to foreign patents by US patents 30,397,946 raw December 10, 2020
government_interestzip: 4.7 MiB, tsv: 33.5 MiB Raw government interest statements on all patents (where available) 151,438 raw December 10, 2020
government_organizationzip: 5.9 KiB, tsv: 34.1 KiB Organization names and related agency hierarchy parsed from the government interest statements on all patents (where available) 297 processed December 10, 2020
inventorzip: 52.2 MiB, tsv: 141.8 MiB Disambiguated inventor data 4,576,927 disamb December 10, 2020
inventor_genderzip: 21.4 MiB, tsv: 110.3 MiB Gender assignment of disambiguated inventor. Methods Report 4,111,891 processed December 21, 2020
ipcrzip: 568.7 MiB, tsv: 1.7 GiB International Patent Classification data for all patents (as of publication date) 17,371,183 raw December 10, 2020
lawyerzip: 5.6 MiB, tsv: 12.1 MiB Disambiguated lawyer data 174,992 disamb December 10, 2020
locationzip: 5.9 MiB, tsv: 12.1 MiB Disambiguated location data, including latitude and longitude 144,673 disamb December 10, 2020
mainclasszip: 2.4 KiB, tsv: 7.1 KiB Lookup table of original USPC main classes (as of patent publication date) 1,239 raw December 10, 2020
mainclass_currentzip: 7.5 KiB, tsv: 21.5 KiB Lookup table of current USPC main technology classes (applied retrospectively to all patents) 510 raw (from separate classification files) December 10, 2020
nberzip: 115.3 MiB, tsv: 228.9 MiB NBER classification data for all patents up to May 2015 5,105,937 raw (from separate classification files) December 10, 2020
nber_categoryzip: 208.0 B, tsv: 92.0 B Lookup table for NBER categories 6 raw (from separate classification files) December 10, 2020
nber_subcategoryzip: 611.0 B, tsv: 906.0 B Lookup table for NBER subcategories 37 raw (from separate classification files) December 10, 2020
non_inventor_applicantzip: 229.4 MiB, tsv: 488.2 MiB Non-inventor applicant information 4,342,949 raw December 10, 2020
otherreferencezip: 3.5 GiB, tsv: 7.4 GiB Non-patent citations mentioned in patents (e.g. articles, papers, etc.) 43,638,777 raw December 10, 2020
patentzip: 1.5 GiB, tsv: 5.5 GiB Data on granted patents 7,528,963 raw December 10, 2020
patent_assigneezip: 207.6 MiB, tsv: 499.5 MiB Metadata table for many-to-many relationships 6,884,971 disamb (linking table) December 10, 2020
patent_contractawardnumberzip: 1.4 MiB, tsv: 4.4 MiB Contract or award numbers parsed from the government interest statements on all patents (where available) 180,751 processed December 10, 2020
patent_govintorgzip: 613.2 KiB, tsv: 2.3 MiB Metadata table with patent-to-organization relationships linked to the government_organization table 183,473 processed December 10, 2020
patent_inventorzip: 445.7 MiB, tsv: 1.0 GiB Metadata table for many-to-many relationships 18,276,455 disamb (linking table) December 10, 2020
patent_lawyerzip: 117.4 MiB, tsv: 367.4 MiB Metadata table for many-to-many relationships 8,540,953 disamb (linking table) December 10, 2020
pct_datazip: 53.6 MiB, tsv: 151.9 MiB PCT data 1,525,368 raw December 10, 2020
persistent_assignee_disambigzip: 734.2 MiB, tsv: 1.3 GiB Persistant Assignee Disambiguation 6,787,574 raw December 10, 2020
persistent_inventor_disambigzip: 527.8 MiB, tsv: 2.5 GiB Persistant Inventor Disambiguation 17,987,290 raw December 10, 2020
rawassigneezip: 461.7 MiB, tsv: 880.1 MiB Raw assignee information as it appears in the source text and XML files 6,884,971 raw December 10, 2020
rawexaminerzip: 335.1 MiB, tsv: 720.8 MiB Raw examiner information 10,206,461 raw December 10, 2020
rawinventorzip: 1017.6 MiB, tsv: 2.0 GiB Raw inventor information as it appears in the source text and XML files 18,276,455 raw December 10, 2020
rawlawyerzip: 444.9 MiB, tsv: 901.2 MiB Raw lawyer information as it appears in the source text and XML files 8,540,953 raw December 10, 2020
rawlocationzip: 1.3 GiB, tsv: 2.9 GiB Raw location data for inventors and assignees, as it appears in xml and text source files 29,527,583 raw December 10, 2020
rel_app_textzip: 205.8 MiB, tsv: 874.3 MiB Related applications text 1,952,099 raw December 10, 2020
subclasszip: 599.4 KiB, tsv: 2.6 MiB Lookup table of original USPC subclasses (as of patent publication date) 272,516 raw December 10, 2020
subclass_currentzip: 2.1 MiB, tsv: 7.3 MiB Lookup table of current USPC subclasses (applied retrospectively to all patents) 168,048 raw (from separate classification files) December 10, 2020
us_term_of_grantzip: 89.0 MiB, tsv: 209.7 MiB U.S. term of grant data 3,678,459 raw December 10, 2020
usapplicationcitationzip: 1.8 GiB, tsv: 5.2 GiB Citations made to US patent applications by US patents 43,956,647 raw December 10, 2020
uspatentcitationzip: 4.2 GiB, tsv: 10.8 GiB Citations made to US granted patents by US patents 113,129,077 raw December 10, 2020
uspczip: 490.3 MiB, tsv: 963.2 MiB USPC classification data for all patents 18,053,119 raw December 10, 2020
uspc_currentzip: 619.0 MiB, tsv: 1.2 GiB Current USPC classification data for all patents up to May 2015 22,852,958 raw (from separate classification files) December 10, 2020
usreldoczip: 373.2 MiB, tsv: 1.1 GiB U.S. related documents (post-2005 patents only) 11,179,485 raw December 10, 2020
wipozip: 25.6 MiB, tsv: 157.6 MiB WIPO technology fields for all patents 9,887,621 raw (from separate classification files) December 10, 2020
wipo_fieldzip: 1.5 KiB, tsv: 3.7 KiB500 bytes Lookup table of WIPO technology fields 70 raw (from separate classification files) December 10, 2020

The PatentsView database is sourced from USPTO-provided text and XML data on published patent applications (2001-most recent update) and granted patents (1976-most recent update). The current PatentsView database MySQL dump is available for download, upon request. The patent applications database, currently only in beta format, contains all granted and non-granted applications, is also available upon request. The database currently does not contain all years of data or any of the disambiguated elements.

This work was created through a government contract funded by the Office of Chief Economist in the US Patent and Trademark Office. Users are free to use, share, or adapt the material for any purpose, subject to the standards of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/).

Attribution should be given to PatentsView (www.patentsview.org) for use, distribution, or derivative works.

From the PatentsView database, simple assignee and lawyer disambiguations are performed, and the patents are geocoded with a location-based disambiguation. Data are then fed into the inventor disambiguation algorithm in order to identify clusters of inventor names that are determined to be the same individual. Because the disambiguation of inventor identities is an ongoing effort, there are likely to be errors observable in the PatentsView data tables. The team welcomes feedback as we continue to improve our disambiguation methodology.

For more information,click the "Methods and Sources" link in the footer below.