Data Download Tables
Table Name | Description | # of Rows | Origin | Data Last Updated |
---|---|---|---|---|
application (zip: 73.8 MiB, tsv: 402.5 MiB) | Information on the applications for granted patents | 7,526,704 | raw | December 10, 2020 |
assignee (zip: 14.4 MiB, tsv: 31.7 MiB) | Disambiguated assignee data | 512,152 | disamb | December 10, 2020 |
botanic (zip: 589.7 KiB, tsv: 1.2 MiB) | Botanic information for plant patents | 16,796 | raw | December 10, 2020 |
brf_sum_text | Brief summary text | | raw | |
claim | Full text of patent claims, including dependency and sequence | | raw | |
cpc_current (zip: 1.3 GiB, tsv: 3.7 GiB) | Current CPC classification data for all patents (applied retrospectively to all patents) | 41,413,742 | raw (from separate classification files) | December 10, 2020 |
cpc_group (zip: 21.5 KiB, tsv: 67.7 KiB) | Lookup table of current CPC groups | 672 | raw (from separate classification files) | December 10, 2020 |
cpc_subgroup (zip: 5.4 MiB, tsv: 60.5 MiB) | Lookup table of current CPC subgroups | 258,827 | raw (from separate classification files) | December 10, 2020 |
cpc_subsection (zip: 3.2 KiB, tsv: 7.9 KiB) | Lookup table of current CPC subsections | 136 | raw (from separate classification files) | December 10, 2020 |
detail_desc_text | Detailed patent description text | | raw | |
draw_desc_text | Drawing description text | | raw | |
foreign_priority (zip: 120.5 MiB, tsv: 287.8 MiB) | Foreign priority data | 3,503,110 | raw | December 10, 2020 |
figures (zip: 161.1 MiB, tsv: 285.5 MiB) | Number of figures and sheets | 7,000,232 | raw | December 10, 2020 |
foreigncitation (zip: 1017.0 MiB, tsv: 2.5 GiB) | Citations made to foreign patents by US patents | 30,397,946 | raw | December 10, 2020 |
government_interest (zip: 4.7 MiB, tsv: 33.5 MiB) | Raw government interest statements on all patents (where available) | 151,438 | raw | December 10, 2020 |
government_organization (zip: 5.9 KiB, tsv: 34.1 KiB) | Organization names and related agency hierarchy parsed from the government interest statements on all patents (where available) | 297 | processed | December 10, 2020 |
inventor (zip: 52.2 MiB, tsv: 141.8 MiB) | Disambiguated inventor data | 4,576,927 | disamb | December 10, 2020 |
inventor_gender (zip: 21.4 MiB, tsv: 110.3 MiB) | Gender assignment of disambiguated inventors (see the Methods Report) | 4,111,891 | processed | December 21, 2020 |
ipcr (zip: 568.7 MiB, tsv: 1.7 GiB) | International Patent Classification data for all patents (as of publication date) | 17,371,183 | raw | December 10, 2020 |
lawyer (zip: 5.6 MiB, tsv: 12.1 MiB) | Disambiguated lawyer data | 174,992 | disamb | December 10, 2020 |
location (zip: 5.9 MiB, tsv: 12.1 MiB) | Disambiguated location data, including latitude and longitude | 144,673 | disamb | December 10, 2020 |
mainclass (zip: 2.4 KiB, tsv: 7.1 KiB) | Lookup table of original USPC main classes (as of patent publication date) | 1,239 | raw | December 10, 2020 |
mainclass_current (zip: 7.5 KiB, tsv: 21.5 KiB) | Lookup table of current USPC main technology classes (applied retrospectively to all patents) | 510 | raw (from separate classification files) | December 10, 2020 |
nber (zip: 115.3 MiB, tsv: 228.9 MiB) | NBER classification data for all patents up to May 2015 | 5,105,937 | raw (from separate classification files) | December 10, 2020 |
nber_category (zip: 208.0 B, tsv: 92.0 B) | Lookup table for NBER categories | 6 | raw (from separate classification files) | December 10, 2020 |
nber_subcategory (zip: 611.0 B, tsv: 906.0 B) | Lookup table for NBER subcategories | 37 | raw (from separate classification files) | December 10, 2020 |
non_inventor_applicant (zip: 229.4 MiB, tsv: 488.2 MiB) | Non-inventor applicant information | 4,342,949 | raw | December 10, 2020 |
otherreference (zip: 3.5 GiB, tsv: 7.4 GiB) | Non-patent citations mentioned in patents (e.g., articles and papers) | 43,638,777 | raw | December 10, 2020 |
patent (zip: 1.5 GiB, tsv: 5.5 GiB) | Data on granted patents | 7,528,963 | raw | December 10, 2020 |
patent_assignee (zip: 207.6 MiB, tsv: 499.5 MiB) | Metadata table for many-to-many relationships | 6,884,971 | disamb (linking table) | December 10, 2020 |
patent_contractawardnumber (zip: 1.4 MiB, tsv: 4.4 MiB) | Contract or award numbers parsed from the government interest statements on all patents (where available) | 180,751 | processed | December 10, 2020 |
patent_govintorg (zip: 613.2 KiB, tsv: 2.3 MiB) | Metadata table with patent-to-organization relationships linked to the government_organization table | 183,473 | processed | December 10, 2020 |
patent_inventor (zip: 445.7 MiB, tsv: 1.0 GiB) | Metadata table for many-to-many relationships | 18,276,455 | disamb (linking table) | December 10, 2020 |
patent_lawyer (zip: 117.4 MiB, tsv: 367.4 MiB) | Metadata table for many-to-many relationships | 8,540,953 | disamb (linking table) | December 10, 2020 |
pct_data (zip: 53.6 MiB, tsv: 151.9 MiB) | PCT data | 1,525,368 | raw | December 10, 2020 |
persistent_assignee_disambig (zip: 734.2 MiB, tsv: 1.3 GiB) | Persistent assignee disambiguation | 6,787,574 | raw | December 10, 2020 |
persistent_inventor_disambig (zip: 527.8 MiB, tsv: 2.5 GiB) | Persistent inventor disambiguation | 17,987,290 | raw | December 10, 2020 |
rawassignee (zip: 461.7 MiB, tsv: 880.1 MiB) | Raw assignee information as it appears in the source text and XML files | 6,884,971 | raw | December 10, 2020 |
rawexaminer (zip: 335.1 MiB, tsv: 720.8 MiB) | Raw examiner information | 10,206,461 | raw | December 10, 2020 |
rawinventor (zip: 1017.6 MiB, tsv: 2.0 GiB) | Raw inventor information as it appears in the source text and XML files | 18,276,455 | raw | December 10, 2020 |
rawlawyer (zip: 444.9 MiB, tsv: 901.2 MiB) | Raw lawyer information as it appears in the source text and XML files | 8,540,953 | raw | December 10, 2020 |
rawlocation (zip: 1.3 GiB, tsv: 2.9 GiB) | Raw location data for inventors and assignees as it appears in the source text and XML files | 29,527,583 | raw | December 10, 2020 |
rel_app_text (zip: 205.8 MiB, tsv: 874.3 MiB) | Related applications text | 1,952,099 | raw | December 10, 2020 |
subclass (zip: 599.4 KiB, tsv: 2.6 MiB) | Lookup table of original USPC subclasses (as of patent publication date) | 272,516 | raw | December 10, 2020 |
subclass_current (zip: 2.1 MiB, tsv: 7.3 MiB) | Lookup table of current USPC subclasses (applied retrospectively to all patents) | 168,048 | raw (from separate classification files) | December 10, 2020 |
us_term_of_grant (zip: 89.0 MiB, tsv: 209.7 MiB) | U.S. term of grant data | 3,678,459 | raw | December 10, 2020 |
usapplicationcitation (zip: 1.8 GiB, tsv: 5.2 GiB) | Citations made to US patent applications by US patents | 43,956,647 | raw | December 10, 2020 |
uspatentcitation (zip: 4.2 GiB, tsv: 10.8 GiB) | Citations made to US granted patents by US patents | 113,129,077 | raw | December 10, 2020 |
uspc (zip: 490.3 MiB, tsv: 963.2 MiB) | USPC classification data for all patents | 18,053,119 | raw | December 10, 2020 |
uspc_current (zip: 619.0 MiB, tsv: 1.2 GiB) | Current USPC classification data for all patents up to May 2015 | 22,852,958 | raw (from separate classification files) | December 10, 2020 |
usreldoc (zip: 373.2 MiB, tsv: 1.1 GiB) | U.S. related documents (post-2005 patents only) | 11,179,485 | raw | December 10, 2020 |
wipo (zip: 25.6 MiB, tsv: 157.6 MiB) | WIPO technology fields for all patents | 9,887,621 | raw (from separate classification files) | December 10, 2020 |
wipo_field (zip: 1.5 KiB, tsv: 3.7 KiB) | Lookup table of WIPO technology fields | 70 | raw (from separate classification files) | December 10, 2020 |
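Several of the files listed in the table are multiple GiB once unzipped, so it can be convenient to stream rows directly out of the zip archive rather than extracting it first. The following is a minimal sketch using only the Python standard library. It assumes that each archive holds a single tab-delimited file with a header row and unquoted fields, which is not confirmed here; the function name `count_rows` and the demo file are hypothetical. It counts data rows so the result can be compared against the # of Rows column.

```python
import csv
import io
import zipfile


def count_rows(zip_source, member=None):
    """Count data rows (excluding the header) in a TSV stored inside a zip.

    zip_source may be a path or a binary file-like object. Assumes one
    tab-delimited file with a header row and no field quoting; adjust the
    csv options if the real files differ.
    """
    with zipfile.ZipFile(zip_source) as archive:
        name = member or archive.namelist()[0]
        with archive.open(name) as raw:
            # Decode lazily so the whole file is never held in memory.
            text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
            reader = csv.reader(text, delimiter="\t", quoting=csv.QUOTE_NONE)
            next(reader, None)  # skip the header row
            return sum(1 for _ in reader)


# Demo with a tiny in-memory archive standing in for a real download.
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as demo:
    demo.writestr("demo.tsv", "id\ttitle\n1\tExample A\n2\tExample B\n")
buffer.seek(0)
row_count = count_rows(buffer)
print(row_count)
```

For a real download, passing the path of the zip file in place of `buffer` should work the same way, provided the assumptions above hold for that file.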
The PatentsView database is sourced from USPTO-provided text and XML data on published patent applications (2001 to the most recent update) and granted patents (1976 to the most recent update). A MySQL dump of the current PatentsView database is available for download upon request. The patent applications database, which is currently in beta and contains all granted and non-granted applications, is also available upon request. The applications database does not currently contain all years of data or any of the disambiguated elements.
This work was created through a government contract funded by the Office of the Chief Economist in the US Patent and Trademark Office. Users are free to use, share, or adapt the material for any purpose, subject to the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/).
Attribution should be given to PatentsView (www.patentsview.org) for use, distribution, or derivative works.
From the PatentsView database, simple assignee and lawyer disambiguations are performed, and the patents are geocoded with a location-based disambiguation. Data are then fed into the inventor disambiguation algorithm in order to identify clusters of inventor names that are determined to be the same individual. Because the disambiguation of inventor identities is an ongoing effort, there are likely to be errors observable in the PatentsView data tables. The team welcomes feedback as we continue to improve our disambiguation methodology.
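Because each disambiguated inventor is a cluster of raw name records, one way to spot-check the output is to group raw names by the inventor ID they were assigned. The sketch below does only that; it is not the PatentsView disambiguation algorithm. The column names (`inventor_id`, `name_first`, `name_last`), the function name, and the sample rows are hypothetical placeholders, not a confirmed schema.

```python
import csv
import io
from collections import defaultdict


def name_variants_by_inventor(tsv_file):
    """Map each disambiguated inventor ID to the set of raw names in its cluster.

    Column names are assumed for illustration; check the real file header.
    """
    clusters = defaultdict(set)
    reader = csv.DictReader(tsv_file, delimiter="\t")
    for row in reader:
        raw_name = f"{row['name_first']} {row['name_last']}".strip()
        clusters[row["inventor_id"]].add(raw_name)
    return dict(clusters)


# Made-up rows standing in for a slice of a raw inventor file.
sample = io.StringIO(
    "patent_id\tinventor_id\tname_first\tname_last\n"
    "P1\tinv-1\tJane\tDoe\n"
    "P2\tinv-1\tJane A.\tDoe\n"
    "P3\tinv-2\tJohn\tRoe\n"
)
clusters = name_variants_by_inventor(sample)

# Clusters with more than one spelling are the ones worth reviewing by hand.
multi_name = {k: sorted(v) for k, v in clusters.items() if len(v) > 1}
print(multi_name)
```

Clusters that merge clearly different people, or a single person split across several IDs, are the kinds of errors the feedback request above is meant to surface.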
For more information, click the "Methods and Sources" link in the footer below.