Data Download Tables

| Table Name | Size | Description | # of Rows | Origin |
|---|---|---|---|---|
| Inventor Gender | 23.8 MB | Gender assignment of disambiguated inventor (see Methods Report) | 3,772,032 | disamb |
| application | 290.292 MB | Information on the applications for granted patents | 6,915,760 | raw |
| assignee | 29.924 MB | Disambiguated assignee data | 504,294 | disamb |
| botanic | 940.945 KB | Botanic information for plant patents | 14,833 | raw |
| brf_sum_text | 54.248 GB | Brief summary text | 6,313,928 | raw |
| claim | 37.641 GB | Full text of patent claims, including dependency and sequence | 97,498,661 | raw |
| cpc_current | 2.481 GB | Current CPC classification data for all patents (applied retrospectively to all patents) | 37,863,439 | raw (from separate classification files) |
| cpc_group | 64.408 KB | Lookup table of current CPC groups | 665 | raw (from separate classification files) |
| cpc_subgroup | 59.946 MB | Lookup table of current CPC subgroups | 260,057 | raw (from separate classification files) |
| cpc_subsection | 6.867 KB | Lookup table of current CPC subsections | 129 | raw (from separate classification files) |
| detail_desc_text | 39.40 GB (provided upon request to contact@patentsview.org) | Detailed patent description text | 6,260,847 | raw |
| draw_desc_text | 2.76 GB | Drawing description text | 61,431,696 | raw |
| foreign_priority | 227.439 MB | Foreign priority data | 3,234,467 | raw |
| figures | 236.301 MB | Number of figures and sheets | 6,406,737 | raw |
| foreigncitation | 1.877 GB | Citations made to foreign patents by US patents | 26,045,053 | raw |
| government_interest | 30.148 MB | Raw government interest statements on all patents (where available) | 139,777 | raw |
| government_organization | 27.364 KB | Organization names and related agency hierarchy parsed from the government interest statements on all patents (where available) | 258 | processed |
| inventor | 95.564 MB | Disambiguated inventor data | 3,790,243 | disamb |
| ipcr | 1.146 GB | International Patent Classification data for all patents (as of publication date) | 14,325,389 | raw |
| lawyer | 9.973 MB | Disambiguated lawyer data | 167,500 | disamb |
| location | 6.998 MB | Disambiguated location data, including latitude and longitude | 141,189 | disamb |
| location_assignee | 23.606 MB | Metadata table for many-to-many relationships | 648,515 | disamb (linking table) |
| location_inventor | 362.341 MB | Metadata table for many-to-many relationships | 16,512,464 | disamb (linking table) |
| mainclass | 4.705 KB | Lookup table of original USPC main classes (as of patent publication date) | 1,238 | raw |
| mainclass_current | 18.969 KB | Lookup table of current USPC main technology classes (applied retrospectively to all patents) | 511 | raw (from separate classification files) |
| nber | 189.907 MB | NBER classification data for all patents up to May 2015 | 5,105,938 | raw (from separate classification files) |
| nber_category | 64 bytes | Lookup table for NBER categories | 7 | raw (from separate classification files) |
| nber_subcategory | 754 bytes | Lookup table for NBER subcategories | 38 | raw (from separate classification files) |
| non_inventor_applicant | 353.43 MB | Non-inventor applicant information | 3,657,702 | raw |
| otherreference | 6.02 GB | Non-patent citations mentioned in patents (e.g. articles, papers, etc.) | 37,113,971 | raw |
| patent | 4.966 GB | Data on granted patents | 6,915,760 | raw |
| patent_assignee | 194.68 MB | Metadata table for many-to-many relationships | 6,163,971 | disamb (linking table) |
| patent_contractawardnumber | 2.522 MB | Contract or award numbers parsed from the government interest statements on all patents (where available) | 120,256 | processed |
| patent_govintorg | 1.749 MB | Metadata table with patent-to-organization relationships linked to the government_organization table | 165,345 | processed |
| patent_inventor | 284.348 MB | Metadata table for many-to-many relationships | 16,515,751 | disamb (linking table) |
| patent_lawyer | 307.134 MB | Metadata table for many-to-many relationships | 7,848,683 | disamb (linking table) |
| pct_data | 123.132 MB | PCT data | 1,300,180 | raw |
| persistent_assignee_disambig | 567.031 MB | Persistent assignee disambiguation | 6,163,970 | raw |
| persistent_inventor_disambig | 1.696 GB | Persistent inventor disambiguation | 16,512,690 | raw |
| rawassignee | 691.561 MB | Raw assignee information as it appears in the source text and XML files | 6,163,971 | raw |
| rawexaminer | 572.755 MB | Raw examiner information | 9,463,550 | raw |
| rawinventor | 1.493 GB | Raw inventor information as it appears in the source text and XML files | 16,512,751 | raw |
| rawlawyer | 759.245 MB | Raw lawyer information as it appears in the source text and XML files | 7,848,683 | raw |
| rawlocation | 1.811 GB | Raw location data for inventors and assignees, as it appears in the source text and XML files | 26,325,559 | raw |
| rel_app_text | 687.805 MB | Related applications text | 1,662,713 | raw |
| subclass | 2.104 MB | Lookup table of original USPC subclasses (as of patent publication date) | 272,445 | raw |
| subclass_current | 6.745 MB | Lookup table of current USPC subclasses (applied retrospectively to all patents) | 171,054 | raw (from separate classification files) |
| us_term_of_grant | 176.98 MB | U.S. term of grant data | 3,278,095 | raw |
| usapplicationcitation | 3.531 GB | Citations made to US patent applications by US patents | 33,675,470 | raw |
| uspatentcitation | 8.303 GB | Citations made to US granted patents by US patents | 100,141,759 | raw |
| uspc | 824.977 MB | USPC classification data for all patents | 18,042,464 | raw |
| uspc_current | 1.049 GB | Current USPC classification data for all patents up to May 2015 | 22,852,959 | raw (from separate classification files) |
| usreldoc | 873.618 MB | U.S. related documents (post-2005 patents only) | 9,485,757 | raw |
| wipo | 104.846 MB | WIPO technology fields for all patents | 8,704,281 | raw (from separate classification files) |
| wipo_field | 3.331 KB | Lookup table of WIPO technology fields | 71 | raw (from separate classification files) |
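
Several of the tables listed above (patent_inventor, patent_assignee, patent_lawyer, location_inventor, location_assignee) are linking tables for many-to-many relationships. As a rough illustration of how such a table is typically used, the sketch below joins toy versions of patent, patent_inventor, and inventor in an in-memory SQLite database. The column names (`id`, `title`, `name_last`, `patent_id`, `inventor_id`) and all rows are assumptions made up for this example; check the actual schema in the download before relying on them.

```python
import sqlite3

# Toy stand-ins for three of the listed tables. Column names and rows are
# hypothetical; the real PatentsView schema may differ.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE patent (id TEXT PRIMARY KEY, title TEXT);
CREATE TABLE inventor (id TEXT PRIMARY KEY, name_last TEXT);
CREATE TABLE patent_inventor (patent_id TEXT, inventor_id TEXT);
INSERT INTO patent VALUES ('P1', 'Widget'), ('P2', 'Gadget');
INSERT INTO inventor VALUES ('I1', 'Smith'), ('I2', 'Jones');
INSERT INTO patent_inventor VALUES ('P1', 'I1'), ('P1', 'I2'), ('P2', 'I1');
""")


def inventors_per_patent(connection):
    """Return {patent id: [inventor last names]} by joining through the linking table."""
    rows = connection.execute("""
        SELECT p.id, i.name_last
        FROM patent AS p
        JOIN patent_inventor AS link ON link.patent_id = p.id
        JOIN inventor AS i ON i.id = link.inventor_id
        ORDER BY p.id, i.name_last
    """).fetchall()
    grouped = {}
    for patent_id, name_last in rows:
        grouped.setdefault(patent_id, []).append(name_last)
    return grouped


pairs = inventors_per_patent(conn)
print(pairs)
```

The same two-step join pattern would apply to the other linking tables, with the entity tables swapped accordingly.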

The PatentsView database is sourced from USPTO-provided text and XML data on published patent applications (2001 to the most recent update) and granted patents (1976 to the most recent update). The current PatentsView database is available for download as a MySQL dump upon request. The patent applications database, which contains all granted and non-granted applications, is also available upon request. After March 2016, the applications database will not contain the same inventor IDs as the PatentsView database. Only inventors on granted applications can be matched between the PatentsView and applications databases, via the granted application ID.

This work was created through a government contract funded by the Office of the Chief Economist in the US Patent and Trademark Office. Users are free to use, share, or adapt the material for any purpose, subject to the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/).

Attribution should be given to PatentsView (www.patentsview.org) for use, distribution, or derivative works.

Starting from the PatentsView database, simple assignee and lawyer disambiguations are performed, and the patents are geocoded with a location-based disambiguation. The data are then fed into the inventor disambiguation algorithm, which identifies clusters of inventor names that are determined to be the same individual. Because the disambiguation of inventor identities is an ongoing effort, the PatentsView data tables are likely to contain some observable errors. The team welcomes feedback as we continue to improve our disambiguation methodology.
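
To make the idea of a "cluster of inventor names" concrete, the sketch below groups raw name variants under the disambiguated ID they were assigned. It only illustrates what the result of disambiguation might look like in the data; it is not the PatentsView disambiguation algorithm, which is not described here. The row layout (an inventor ID, a patent ID, and a raw name) and every value are hypothetical.

```python
from collections import defaultdict

# Made-up rows: (disambiguated inventor id, patent id, raw name as printed).
# Field layout and values are assumptions for illustration only.
raw_rows = [
    ("inv-1", "P1", "J. Smith"),
    ("inv-1", "P2", "John Smith"),
    ("inv-1", "P3", "John A. Smith"),
    ("inv-2", "P4", "John Smith"),
]


def name_variants_by_inventor(rows):
    """Collect the distinct raw name spellings attached to each inventor id."""
    variants = defaultdict(set)
    for inventor_id, _patent_id, raw_name in rows:
        variants[inventor_id].add(raw_name)
    # Sort each set so the output is stable across runs.
    return {inventor_id: sorted(names) for inventor_id, names in variants.items()}


clusters = name_variants_by_inventor(raw_rows)
for inventor_id, names in clusters.items():
    print(inventor_id, names)
```

Note that the same raw spelling ("John Smith") appears under two different IDs in this toy data. Cases like that are where disambiguation errors would be easiest to spot when reviewing the tables.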

For more information, visit the Methods and Sources section of the website.