Derwent SequenceBase Databases

Search and explore global patent sequence data.

Request a demo Login to Derwent SequenceBase

GENESEQ

GENESEQ™ is a proprietary database used to easily search and identify biological sequences (DNA, RNA and protein sequences) covered in patents from 56 issuing authorities. Manually annotated to highlight IP context and biological significance,  GENESEQ™ allows IP professionals and biologic scientists to spend less time searching and understanding sequence data and easily assess patentability, identify potential infringing patents and track competitor activities.

USGENE

Unravel the complex strands of biological sequences

USGENE® is the unparalleled resource in which to perform nucleic acid and protein U.S. patent sequence searching. The database has an archive which dates back to 1981, and unrivaled sequence data timeliness of within 24 hours of United States Patent & Trademark Office (USPTO) publication.

With over 191 million biological sequences from more than 232,000 unique U.S. patents and published patent applications, it contains a wealth of essential sequence information not available anywhere else.

The database is continually growing with hundreds of documents containing biological sequences added twice per week. This means biological scientists can spend less time searching for sequence data and more time on research and development, and IP professionals can be sure they don’t miss any potential infringements or competitor activities.

Complete, timely, annotated data with thorough coverage

The database provides thorough coverage of all biological sequence information from U.S. patents and published patent applications including:

  • Nucleic acid sequences 10 or more bases in length
  • Amino acid sequences 4 or more residues in length
  • All PCR primers and probes of any length
  • Sequence information, including organism name, molecule type, and modifications

Database annotation includes:

  • Patent title, applicant abstract and claims text
  • Sequence identity number (SEQ ID NO) within the sequence listing
  • Patent sequence location (PSL) within the indexed document
  • Original source organism
  • Standardized, full bibliographic data

Comprehensive access and release options

USGENE, the only dedicated, unified source for all available USPTO sequence data, is available through major sequence search and data management solutions including:

How USGENE aids IP professionals

The amount of legal and scientific data recorded in patent sequence datasets has grown exponentially. Scientists and legal professionals carrying out essential IP research are seeking the best and simplest way to discover IP sequence information across disparate resources.

In the past, their alternatives relied on outsourcing or stepping carefully through multiple, databases — workflows which often left them with major difficulties to overcome:

  • Is this search really complete?
  • The volume of results is overwhelming!
  • Time taken for results analysis inhibits the IP sequence search process
  • Sifting through duplicated results from multiple databases is slow and inefficient
  • Outsourcing is expensive and hard to integrate into workflows
  • It’s hard to share and report results in a simple accessible way

To overcome these challenges, IP specialists are increasingly turning to USGENE via the SequenceBase Research Portal, the online IP research solution with easy-to-use, readily accessible content, search, analysis, and reporting tools.

How USGENE aids biological scientists

USGENE aggregates, normalizes, curates and updates U.S. patents sequence data from all relevant global datasets, including USPTO, PSIPS, NCBI and EMBL. Over 232,000 U.S. patents and published patent applications — including sequence data and annotations – are archived and updated twice a week, every Tuesday and Thursday. USGENE is simply the most comprehensive and timely U.S. patent sequence database available.

The database enables you to understand the context of each sequence in the patent, quickly identifying key biological and bibliographic associations and sequence modifications. Its standardized structure makes it easy to map into your in-house bioinformatics systems.

WOGENE

Unravel the complex strands of biological sequences

WOGENE is a comprehensive resource in which to perform international nucleic acid and protein patent sequence searching. The database has an archive which dates back to 1979, and unrivalled sequence data timeliness of within 24 hours of  World Intellectual Property Organization (WIPO) publication.

With over 56 million biological sequences from more than 182,000 unique international published patent applications, it contains a wealth of essential sequence information not available anywhere else.

The database is continually growing with hundreds of documents containing biological sequences added twice per week. This means biological scientists can spend less time searching for sequence data and more time on research and development, and IP professionals can be sure they don’t miss any potential infringements or competitor activities.

Complete, timely, annotated data with thorough coverage

The database provides thorough coverage of all biological sequence information from international published patent applications including:

  • Nucleic acid sequences 10 or more bases in length
  • Amino acid sequences 4 or more residues in length
  • All PCR primers and probes of any length
  • Sequence information, including organism name, molecule type, and modifications

Database annotation includes:

  • Patent title
  • Sequence identity number (SEQ ID NO) within the sequence listing
  • Original source organism
  • Standardized bibliographic data

Comprehensive access and release options

WOGENE, a comprehensive, dedicated, unified source for all available international patent sequence data, is available through major sequence search and data management solutions including:

How WOGENE aids IP professionals

The amount of legal and scientific data recorded in patent sequence datasets has grown exponentially. Scientists and legal professionals carrying out essential IP research are seeking the best and simplest way to discover IP sequence information across disparate resources.

In the past, their alternatives relied on outsourcing or stepping carefully through multiple, databases — workflows which often left them with major difficulties to overcome:

  • Is this search really complete?
  • The volume of results is overwhelming!
  • Time taken for results analysis inhibits the IP sequence search process
  • Sifting through duplicated results from multiple databases is slow and inefficient
  • Outsourcing is expensive and hard to integrate into workflows
  • It’s hard to share and report results in a simple accessible way

To overcome these challenges, IP specialists are increasingly turning to WOGENE via the SequenceBase Research Portal, the online IP research solution with easy-to-use, readily accessible content, search, analysis, and reporting tools.

How WOGENE aids biological scientists

WOGENE aggregates, normalizes, curates and updates international patent sequence data from all relevant global datasets, including WIPO, EPO, NCBI and EMBL. Over 182,000 international published patent applications — including sequence data and annotations – are archived and updated each week.

The database enables you to understand the context of each sequence in the patent, quickly identifying key biological and bibliographic associations and sequence modifications. Its standardized structure makes it easy to map into your in-house bioinformatics systems.

GENBANK

Unravel the complex strands of biological sequences

GenBank ® is the NIH genetic sequence database, an annotated collection of all publicly available DNA sequences.  GenBank is part of the International Nucleotide Sequence Database Collaboration, which comprises the DNA DataBank of Japan (DDBJ), the European Nucleotide Archive (ENA), and GenBank at NCBI. These three organizations exchange data on a daily basis.

From its inception in 1982, GenBank is constantly growing at an exponential rate, doubling every 18 months. With over 228 billion nucleotide bases from more than 199 million (on Feb 2017) individual sequences GenBank provides access within the scientific community to the most up-to-date and wide-ranging DNA sequence information.

Complete, timely, annotated data with thorough coverage

Each GenBank entry includes:

  • a concise description of the sequence
  • the scientific name and taxonomy of the source organism
  • bibliographic references
  • table of features listing areas of biological significance, such as coding regions and their protein translations, transcription units, repeat regions and sites of mutations or modifications

A GenBank release occurs every two months and is available from the ftp site. The release notes for the current version of GenBank provide detailed information about the release and notifications of upcoming changes to GenBank. GenBank growth statistics are available here.

Comprehensive access and release options

GenBank is available through major sequence search and data management solutions including:

How GENBANK aids IP professionals

IP specialists are increasingly turning to GenBank via the  SequenceBase Research Portal, the online IP research solution with easy-to-use, readily accessible content, search, analysis, and reporting tools.

The GenBank database is designed to provide and encourage access within the scientific community to the most up to date and comprehensive DNA sequence information. Therefore, NCBI places no restrictions on the use or distribution of the GenBank data. However, some submitters may claim patent, copyright, or other intellectual property rights in all or a portion of the data they have submitted. NCBI is not in a position to assess the validity of such claims, and therefore cannot provide comment or unrestricted permission concerning the use, copying, or distribution of the information contained in GenBank.

To learn more about NCBI’s Policies, please visit  NCBI Data Usage Policies and Disclaimers page.

How GENBANK aids biological scientists

The GenBank database enables you to understand the context of each sequence in the patent, quickly identifying key biological and bibliographic associations and sequence modifications. Its standardized structure makes it easy to map into your in-house bioinformatics systems.

Many GenBank DNA sequence records are annotated with coding region proteins. For convenience, NCBI provides access to this protein data alongside the GenBank release every two months. A description of the coverage of the latest protein data release is available via the NCBI ftp site.  GenBank protein data are available and fully searchable via the  SequenceBase Research Portal.

Innovate with intelligence

Talk to us today

Get in touch