Data Download Tables

Table NameDescription# of RowsOrigin
applicationzip: 67.5 MiB, tsv: 300.0 MiBInformation on the applications for granted patent7,144,426raw
assigneezip: 13.5 MiB, tsv: 29.0 MiBDisambiguated assignee data486,382disamb
botaniczip: 522.3 KiB, tsv: 981.4 KiBBotanic information for plant patents15,470raw
brf_sum_textzip: 13.8 GiB, tsv: 53.8 GiBBrief summary text6,398,758raw
claimzip: 12.3 GiB, tsv: 39.2 GiBFull text of patent claims, including dependency and sequence101,535,737raw
cpc_currentzip: 1.1 GiB, tsv: 2.6 GiBCurrent CPC classification data for all patents (applied retrospectively to all patents)39,915,465raw (from separate classification files)
cpc_groupzip: 21.2 KiB, tsv: 64.8 KiBLookup table of current CPC groups672raw (from separate classification files)
cpc_subgroupzip: 5.4 MiB, tsv: 59.9 MiBLookup table of current CPC subgroups259,864raw (from separate classification files)
cpc_subsectionzip: 3.2 KiB, tsv: 7.4 KiBLookup table of current CPC subsections137raw (from separate classification files)
detail_desc_text39.40 GB
*provided upon request to contact@patentsview.org
Detailed patent description text6,260,847raw
draw_desc_textzip: 4.4 GiB, tsv: 11.5 GiBDrawing description text75,960,407raw
foreign_priorityzip: 110.6 MiB, tsv: 234.7 MiBForeign priority data3,335,155raw
figureszip: 148.6 MiB, tsv: 244.7 MiBNumber of figures and sheets6,628,620raw
foreigncitationzip: 891.4 MiB, tsv: 2.0 GiBCitations made to foreign patents by US patents27,707,775raw
government_interestzip: 4.4 MiB, tsv: 31.3 MiBRaw government interest statements on all patents (where available)144,275raw
government_organizationzip: 5.4 KiB, tsv: 29.9 KiBOrganization names and related agency hierarchy parsed from the government interest statements on all patents (where available)276processed
inventorzip: 41.4 MiB, tsv: 97.3 MiBDisambiguated inventor data3,857,229disamb
inventor_genderzip: 15.1 MiB, tsv: 104.2 MiBGender assignment of disambiguated inventor. Methods Report3,388,166raw
ipcrzip: 472.5 MiB, tsv: 1.2 GiBInternational Patent Classification data for all patents (as of publication date)15,467,479raw
lawyerzip: 5.4 MiB, tsv: 11.7 MiBDisambiguated lawyer data170,421disamb
locationzip: 3.8 MiB, tsv: 8.5 MiBDisambiguated location data, including latitude and longitude142,189disamb
location_assigneezip: 11.4 MiB, tsv: 22.2 MiBMetadata table for many-to-many relationships611,040disamb (linking table)
location_inventorzip: 55.3 MiB, tsv: 376.8 MiBMetadata table for many-to-many relationships17,165,195disamb (linking table)
mainclasszip: 2.3 KiB, tsv: 4.7 KiBLookup table of original USPC main classes (as of patent publication date)1,239raw
mainclass_currentzip: 7.3 KiB, tsv: 19.5 KiBLookup table of current USPC main technology classes (applied retrospectively to all patents)511raw (from separate classification files)
nberzip: 110.8 MiB, tsv: 189.9 MiBNBER classification data for all patents up to May 20155,105,938raw (from separate classification files)
nber_categoryzip: 198.0 B, tsv: 64.0 BLookup table for NBER categories7raw (from separate classification files)
nber_subcategoryzip: 587.0 B, tsv: 754.0 BLookup table for NBER subcategories38raw (from separate classification files)
non_inventor_applicantzip: 199.3 MiB, tsv: 398.0 MiBNon-inventor applicant information3,916,816raw
otherreferencezip: 3.2 GiB, tsv: 6.4 GiBNon-patent citations mentioned in patents (e.g. articles, papers, etc.)39,565,626raw
patentzip: 1.4 GiB, tsv: 5.1 GiBData on granted patents7,144,426raw
patent_assigneezip: 75.6 MiB, tsv: 201.4 MiBMetadata table for many-to-many relationships6,383,646disamb (linking table)
patent_contractawardnumberzip: 995.6 KiB, tsv: 2.7 MiBContract or award numbers parsed from the government interest statements on all patents (where available)127,579processed
patent_govintorgzip: 563.4 KiB, tsv: 1.8 MiBMetadata table with patent-to-organization relationships linked to the government_organization table173,138processed
patent_inventorzip: 91.6 MiB, tsv: 296.1 MiBMetadata table for many-to-many relationships17,160,503disamb (linking table)
patent_lawyerzip: 105.2 MiB, tsv: 317.5 MiBMetadata table for many-to-many relationships8,106,798disamb (linking table)
pct_datazip: 47.3 MiB, tsv: 133.2 MiBPCT data1,386,105raw
persistent_assignee_disambigzip: 466.3 MiB, tsv: 958.8 MiBPersistant Assignee Disambiguation17,165,520raw
persistent_inventor_disambigzip: 523.1 MiB, tsv: 2.1 GiBPersistant Inventor Disambiguation17,165,520raw
rawassigneezip: 415.5 MiB, tsv: 764.9 MiBRaw assignee information as it appears in the source text and XML files6,387,374raw
rawexaminerzip: 304.0 MiB, tsv: 589.8 MiBRaw examiner information9,744,574raw
rawinventorzip: 916.2 MiB, tsv: 1.5 GiBRaw inventor information as it appears in the source text and XML files17,165,605raw
rawlawyerzip: 412.3 MiB, tsv: 826.1 MiBRaw lawyer information as it appears in the source text and XML files8,107,397raw
rawlocationzip: 891.8 MiB, tsv: 1.9 GiBRaw location data for inventors and assignees, as it appears in xml and text source files27,460,929raw
rel_app_textzip: 178.2 MiB, tsv: 751.9 MiBRelated applications text1,769,653raw
subclasszip: 589.6 KiB, tsv: 2.1 MiBLookup table of original USPC subclasses (as of patent publication date)272,471raw
subclass_currentzip: 2.0 MiB, tsv: 6.7 MiBLookup table of current USPC subclasses (applied retrospectively to all patents)171,054raw (from separate classification files)
us_term_of_grantzip: 82.3 MiB, tsv: 219.8 MiBU.S. term of grant data3,428,319raw
usapplicationcitationzip: 1.5 GiB, tsv: 3.9 GiBCitations made to US patent applications by US patents37,447,155raw
uspatentcitationzip: 3.8 GiB, tsv: 8.7 GiBCitations made to US granted patents by US patents105,027,310raw
uspczip: 469.2 MiB, tsv: 825.2 MiBUSPC classification data for all patents18,048,429raw
uspc_currentzip: 592.5 MiB, tsv: 1.0 GiBCurrent USPC classification data for all patents up to May 201522,852,959raw (from separate classification files)
usreldoczip: 326.8 MiB, tsv: 933.5 MiBU.S. related documents (post-2005 patents only)10,107,282raw
wipozip: 22.8 MiB, tsv: 109.3 MiBWIPO technology fields for all patents9,053,080raw (from separate classification files)
wipo_fieldzip: 1.4 KiB, tsv: 3.3 KiB500 bytesLookup table of WIPO technology fields71raw (from separate classification files)

The PatentsView database is sourced from USPTO-provided text and XML data on published patent applications (2001-most recent update) and granted patents (1976-most recent update). The current PatentsView database MySQL dump is available for download, upon request. The patent applications database, which contains all granted and non-granted applications, is also available upon request. After March, 2016, the applications database will not contain the same inventor IDs as the PatentsView database. Only inventors on granted applications can be matched between the PatentsView and applications databases via a granted application ID.

This work was created through a government contract funded by the Office of Chief Economist in the US Patent and Trademark Office. Users are free to use, share, or adapt the material for any purpose, subject to the standards of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/).

Attribution should be given to PatentsView (www.patentsview.org) for use, distribution, or derivative works.

From the PatentsView database, simple assignee and lawyer disambiguations are performed, and the patents are geocoded with a location-based disambiguation. Data are then fed into the inventor disambiguation algorithm in order to identify clusters of inventor names that are determined to be the same individual. Because the disambiguation of inventor identities is an ongoing effort, there are likely to be errors observable in the PatentsView data tables. The team welcomes feedback as we continue to improve our disambiguation methodology.

For more information, visit the Methods and Sources section of the website.