Data Download Tables

Table Name | File Size | Description | # of Rows | Origin
inventor_gender | zip: 24.6 MiB, tsv: 120.4 MiB | Gender assignment of disambiguated inventor (see Methods Report) | 3,388,166 | raw
application | zip: 68.3 MiB, tsv: 304.0 MiB | Information on the applications for granted patents | 7,236,658 | raw
assignee | zip: 13.7 MiB, tsv: 29.3 MiB | Disambiguated assignee data | 491,558 | disamb
botanic | zip: 535.9 KiB, tsv: 1006.0 KiB | Botanic information for plant patents | 15,851 | raw
brf_sum_text | zip: 14.0 GiB, tsv: 58.7 GiB | Brief summary text | 6,398,758 | raw
claim | zip: 12.3 GiB, tsv: 39.2 GiB | Full text of patent claims, including dependency and sequence | 101,535,737 | raw
cpc_current | zip: 1.1 GiB, tsv: 2.7 GiB | Current CPC classification data for all patents (applied retrospectively to all patents) | 41,272,258 | raw (from separate classification files)
cpc_group | zip: 21.3 KiB, tsv: 64.9 KiB | Lookup table of current CPC groups | 673 | raw (from separate classification files)
cpc_subgroup | zip: 5.4 MiB, tsv: 59.8 MiB | Lookup table of current CPC subgroups | 259,967 | raw (from separate classification files)
cpc_subsection | zip: 3.2 KiB, tsv: 7.4 KiB | Lookup table of current CPC subsections | 137 | raw (from separate classification files)
detail_desc_text | 39.40 GB (provided upon request) | Detailed patent description text | 6,260,847 | raw
draw_desc_text | zip: 4.6 GiB, tsv: 13.1 GiB | Drawing description text | 75,960,407 | raw
foreign_priority | zip: 112.0 MiB, tsv: 237.6 MiB | Foreign priority data | 3,375,360 | raw
figures | zip: 150.6 MiB, tsv: 248.1 MiB | Number of figures and sheets | 6,718,174 | raw
foreigncitation | zip: 912.8 MiB, tsv: 2.1 GiB | Citations made to foreign patents by US patents | 28,363,458 | raw
government_interest | zip: 4.5 MiB, tsv: 31.7 MiB | Raw government interest statements on all patents (where available) | 146,031 | raw
government_organization | zip: 5.5 KiB, tsv: 30.5 KiB | Organization names and related agency hierarchy parsed from the government interest statements on all patents (where available) | 281 | processed
inventor | zip: 41.8 MiB, tsv: 98.3 MiB | Disambiguated inventor data | 3,896,191 | disamb
ipcr | zip: 487.0 MiB, tsv: 1.3 GiB | International Patent Classification data for all patents (as of publication date) | 15,916,642 | raw
lawyer | zip: 5.4 MiB, tsv: 11.7 MiB | Disambiguated lawyer data | 171,525 | disamb
location | zip: 3.8 MiB, tsv: 8.6 MiB | Disambiguated location data, including latitude and longitude | 143,507 | disamb
location_assignee | zip: 11.6 MiB, tsv: 22.7 MiB | Metadata table for many-to-many relationships | 625,765 | disamb (linking table)
location_inventor | zip: 44.3 MiB, tsv: 119.9 MiB | Metadata table for many-to-many relationships | 5,453,518 | disamb (linking table)
mainclass | zip: 2.3 KiB, tsv: 4.7 KiB | Lookup table of original USPC main classes (as of patent publication date) | 1,239 | raw
mainclass_current | zip: 7.3 KiB, tsv: 19.5 KiB | Lookup table of current USPC main technology classes (applied retrospectively to all patents) | 511 | raw (from separate classification files)
nber | zip: 110.8 MiB, tsv: 189.9 MiB | NBER classification data for all patents up to May 2015 | 5,105,938 | raw (from separate classification files)
nber_category | zip: 198.0 B, tsv: 64.0 B | Lookup table for NBER categories | 7 | raw (from separate classification files)
nber_subcategory | zip: 587.0 B, tsv: 754.0 B | Lookup table for NBER subcategories | 38 | raw (from separate classification files)
non_inventor_applicant | zip: 204.8 MiB, tsv: 409.3 MiB | Non-inventor applicant information | 4,019,652 | raw
otherreference | zip: 3.3 GiB, tsv: 6.6 GiB | Non-patent citations mentioned in patents (e.g., articles and papers) | 40,503,553 | raw
patent | zip: 1.4 GiB, tsv: 5.2 GiB | Data on granted patents | 7,236,658 | raw
patent_assignee | zip: 76.7 MiB, tsv: 204.3 MiB | Metadata table for many-to-many relationships | 6,473,230 | disamb (linking table)
patent_contractawardnumber | zip: 1013.8 KiB, tsv: 2.7 MiB | Contract or award numbers parsed from the government interest statements on all patents (where available) | 130,012 | processed
patent_govintorg | zip: 571.8 KiB, tsv: 1.9 MiB | Metadata table with patent-to-organization relationships linked to the government_organization table | 175,694 | processed
patent_inventor | zip: 93.0 MiB, tsv: 300.9 MiB | Metadata table for many-to-many relationships | 17,423,524 | disamb (linking table)
patent_lawyer | zip: 106.5 MiB, tsv: 321.6 MiB | Metadata table for many-to-many relationships | 8,210,578 | disamb (linking table)
pct_data | zip: 48.5 MiB, tsv: 136.2 MiB | PCT data | 1,419,725 | raw
persistent_assignee_disambig | zip: 560.7 MiB, tsv: 1.1 GiB | Persistent assignee disambiguation | 6,476,973 | raw
persistent_inventor_disambig | zip: 532.9 MiB, tsv: 2.2 GiB | Persistent inventor disambiguation | 17,428,665 | raw
rawassignee | zip: 421.6 MiB, tsv: 775.7 MiB | Raw assignee information as it appears in the source text and XML files | 6,477,010 | raw
rawexaminer | zip: 307.7 MiB, tsv: 596.6 MiB | Raw examiner information | 9,856,999 | raw
rawinventor | zip: 932.5 MiB, tsv: 1.6 GiB | Raw inventor information as it appears in the source text and XML files | 17,428,665 | raw
rawlawyer | zip: 417.5 MiB, tsv: 836.7 MiB | Raw lawyer information as it appears in the source text and XML files | 8,210,643 | raw
rawlocation | zip: 905.7 MiB, tsv: 1.9 GiB | Raw location data for inventors and assignees, as it appears in the source text and XML files | 27,916,460 | raw
rel_app_text | zip: 184.4 MiB, tsv: 777.7 MiB | Related applications text | 1,812,700 | raw
subclass | zip: 589.6 KiB, tsv: 2.1 MiB | Lookup table of original USPC subclasses (as of patent publication date) | 272,479 | raw
subclass_current | zip: 2.0 MiB, tsv: 6.7 MiB | Lookup table of current USPC subclasses (applied retrospectively to all patents) | 171,054 | raw (from separate classification files)
us_term_of_grant | zip: 83.7 MiB, tsv: 224.0 MiB | U.S. term of grant data | 3,488,171 | raw
usapplicationcitation | zip: 1.5 GiB, tsv: 4.1 GiB | Citations made to US patent applications by US patents | 38,982,468 | raw
uspatentcitation | zip: 3.8 GiB, tsv: 8.8 GiB | Citations made to US granted patents by US patents | 106,958,247 | raw
uspc | zip: 469.3 MiB, tsv: 825.4 MiB | USPC classification data for all patents | 18,051,093 | raw
uspc_current | zip: 592.5 MiB, tsv: 1.0 GiB | Current USPC classification data for all patents up to May 2015 | 22,852,959 | raw (from separate classification files)
usreldoc | zip: 334.8 MiB, tsv: 956.9 MiB | U.S. related documents (post-2005 patents only) | 10,354,774 | raw
wipo | zip: 23.2 MiB, tsv: 111.4 MiB | WIPO technology fields for all patents | 9,211,248 | raw (from separate classification files)
wipo_field | zip: 1.4 KiB, tsv: 3.3 KiB | Lookup table of WIPO technology fields | 71 | raw (from separate classification files)
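
Several of the tables above are tens of gigabytes uncompressed, so it may be more practical to stream a download row by row than to load it whole. The following is a minimal sketch using only the Python standard library. It assumes each zip archive holds a single tab-delimited file whose first line is a header row; the function name `iter_tsv_rows` and the file name in the usage comment are hypothetical, and the quoting convention and column names of the real files should be checked before relying on it.

```python
import csv
import io
import zipfile
from typing import Dict, Iterator


def iter_tsv_rows(zip_path: str) -> Iterator[Dict[str, str]]:
    """Yield rows of the first .tsv member of a zip archive as dicts.

    Assumption: the archive contains one tab-delimited file with a header
    row. Rows are streamed, so the whole table is never held in memory.
    """
    with zipfile.ZipFile(zip_path) as archive:
        tsv_names = [n for n in archive.namelist() if n.endswith(".tsv")]
        if not tsv_names:
            raise ValueError(f"no .tsv member found in {zip_path}")
        with archive.open(tsv_names[0]) as raw:
            # Decode the compressed byte stream as text without unzipping to disk.
            text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
            for row in csv.DictReader(text, delimiter="\t"):
                yield row


# Hypothetical usage, assuming a local copy of a small lookup table:
# for row in iter_tsv_rows("cpc_group.zip"):
#     print(row)
```

For the long free-text tables, individual fields might exceed the `csv` module's default field size limit; `csv.field_size_limit` can raise it if that turns out to be the case.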

The PatentsView database is sourced from USPTO-provided text and XML data on published patent applications (2001 to the most recent update) and granted patents (1976 to the most recent update). A MySQL dump of the current PatentsView database is available for download upon request. The patent applications database, which contains all granted and non-granted applications, is also available upon request. After March 2016, the applications database will not contain the same inventor IDs as the PatentsView database. Only inventors on granted applications can be matched between the PatentsView and applications databases, and only via a granted application ID.
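
As a rough illustration of that matching constraint, the sketch below pairs inventor IDs from the two databases through shared application IDs. It is an assumption-laden toy, not the PatentsView procedure: the function names and the `(application_id, inventor_id)` record layout are hypothetical, and the result is a match at the application level only, since an application can list several inventors.

```python
from collections import defaultdict
from typing import Dict, Iterable, Set, Tuple

Record = Tuple[str, str]  # hypothetical layout: (application_id, inventor_id)


def inventors_by_application(records: Iterable[Record]) -> Dict[str, Set[str]]:
    """Group inventor IDs under the application ID they appear with."""
    grouped: Dict[str, Set[str]] = defaultdict(set)
    for application_id, inventor_id in records:
        grouped[application_id].add(inventor_id)
    return dict(grouped)


def match_on_granted_applications(
    patentsview_records: Iterable[Record],
    applications_records: Iterable[Record],
) -> Dict[str, Tuple[Set[str], Set[str]]]:
    """Pair both databases' inventor ID sets for application IDs found in both.

    Applications that were never granted have no PatentsView record, so
    their inventors drop out of the result.
    """
    granted = inventors_by_application(patentsview_records)
    applied = inventors_by_application(applications_records)
    shared = granted.keys() & applied.keys()
    return {app_id: (granted[app_id], applied[app_id]) for app_id in shared}
```

Pairing individual inventors within one application would need an additional key, such as name or inventor sequence, which this sketch does not attempt.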

This work was created through a government contract funded by the Office of the Chief Economist in the US Patent and Trademark Office. Users are free to use, share, or adapt the material for any purpose, subject to the standards of the Creative Commons Attribution 4.0 International License.

Attribution should be given to PatentsView for use, distribution, or derivative works.

From the PatentsView database, simple assignee and lawyer disambiguations are performed, and the patents are geocoded with a location-based disambiguation. Data are then fed into the inventor disambiguation algorithm in order to identify clusters of inventor names that are determined to be the same individual. Because the disambiguation of inventor identities is an ongoing effort, there are likely to be errors observable in the PatentsView data tables. The team welcomes feedback as we continue to improve our disambiguation methodology.
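
One way to inspect the resulting inventor clusters is to collect the raw name spellings that share a disambiguated inventor ID. The sketch below does this for rows already loaded as dictionaries. The function name and the default field names (`inventor_id`, `name_first`, `name_last`) are assumptions for illustration and should be checked against the actual table columns; the names in the usage example are made up.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Mapping


def name_variants_by_inventor(
    raw_rows: Iterable[Mapping[str, str]],
    id_field: str = "inventor_id",    # assumed column name
    first_field: str = "name_first",  # assumed column name
    last_field: str = "name_last",    # assumed column name
) -> Dict[str, List[str]]:
    """Collect distinct raw name spellings under each disambiguated inventor ID."""
    variants: Dict[str, List[str]] = defaultdict(list)
    for row in raw_rows:
        name = f"{row[first_field]} {row[last_field]}".strip()
        # Keep first-seen order and skip spellings already recorded.
        if name not in variants[row[id_field]]:
            variants[row[id_field]].append(name)
    return dict(variants)


# Made-up example rows, not real PatentsView data.
rows = [
    {"inventor_id": "inv-1", "name_first": "Jane", "name_last": "Doe"},
    {"inventor_id": "inv-1", "name_first": "J.", "name_last": "Doe"},
    {"inventor_id": "inv-2", "name_first": "John", "name_last": "Roe"},
]
clusters = name_variants_by_inventor(rows)
```

Clusters with many unrelated-looking spellings, or identical spellings split across IDs, are the kind of observation that could be reported back as feedback on the disambiguation.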

For more information, visit the Methods and Sources section of the website.