If you are citizen of an European Union member nation, you may not use this service unless you are at least 16 years old.
You already know Dokkio is an AI-powered assistant to organize & manage your digital files & messages. Very soon, Dokkio will support Outlook as well as One Drive. Check it out today!

USPTO NBER Extension Dataset

Page history last edited by Robert McNamee 12 years, 9 months ago

Overview | Key Projects: Collaboration Continuum

Data Overview

This dataset is designed to complement and enhance the traditional NBER patent dataset used in a great deal of management and international business research. It uses the original list of patents from the NBER dataset and extracts all data available on the USPTO website for each of these patents. This adds a range of useful data such as all classifications (not only primary class), search classifications, and un-truncated original text for fields like inventor locations.

The main contribution of this dataset is that it adds the full text of patent abstracts, all patent claims, and complete descriptions of patents. This is an incredibly rich dataset that can be used to bring a level of qualitative and quantitative rigor to patent data analysis not possible with currently available patent datasets. For example, patent classifications are one of the most commonly analyzed items of data in the NBER patent dataset. Each patent classification reflects the technology described in one of the patent's claims. However, each classification can only be assigned a single time to a patent (i.e., if 20 claims on a patent relate to a certain class/subclass and a last claim relates to a different class/subclass, the patent will be classified into these 2 class/subclass once each). Thus classification data is a limited and possibly biased representation of the actual unique technology represented by a patent and described in its claims.

The dataset is described more below:

NBER to Extension Match Percentages