Ten Years at GSC
- Opening Discussion
- Ten years at GSC
- Organizational transitions
- Results and Future Plans
- Awards
- Data / Documents
How to Participate and Get Registered
Call for GSComplex members, for collecting information for exchange, joint research, if your cooperation be much appreciated.
To registration, please contact the Executive Office.From sequencing to the elucidation of life phenomena, to the full picture of life strategies

We have succeeded in deepening the understanding of various life processes by viewing an organism from a molecular biology viewpoint as a "dynamic molecular device". However, the rapid development of genomic science in the 1990s changed the style of biological investigation, and generated a new trend of research. In other words, a paradigm shift has occurred away from the inductive approach, which tries to conjecture the whole system based on the understanding of individual life processes, toward the deductive approach, which understands individual phenomenon by observing the whole configuration that provides the basic design of life, such as a genome. GSC was established in 1998 as the implementing center for this type of genome science research. Since then, we have made numerous accomplishments, known internationally, that have become milestones in life science research, such as complete sequencing of the human genome, completing the mouse gene encyclopedia project, discovering the RNA continent, and completing the Protein 3000 Project. We have also conducted systematic development of the mutant resources of experimental organisms such as the mouse (Mus musculus) and Arabidopsis thaliana. Moreover, we have continued the challenge of working toward establishing a technology base for computational biology, which is important for understanding life as a system.
Click on each year to display its details.
- 1998
- October
- GSC was started
- 1999
- January
- Partial base sequence data of 20,000 mouse gene were made publically available
- April
- Mechanisms of a base-specific protein binding to a single-stranded RNA unveiled
- July
- Mouse gene data (180,000 pieces) were disclosed in collaboration with the National Institute of Genetics
- October
- Completion of the large-capacity DNA sequencer, the RIKEN Integrated Sequence Analyzer (RISA) Construction of the high-speed large capacity sequencer RISA was completed. This analyzer has a processing capacity four times that of a conventional high-speed large-capacity sequencer, making it possible to analyze approximately 600,000 base pairs of DNA in 8 hours.

High-speed large-capacity sequencer "RISA"
- 2000
- May
- Complete sequencing of human chromosome 21 sets a precedent for the world The international team, mainly consisting of GSC, completed the sequencing of human chromosome 21 as part of the International Genome Project. GSC was responsible for 50% of approximately 34,000,000 bases.

The article detailing the complete sequencing of human chromosome 21 was published in Nature journal
In addition to the discovery of 98 new genes, characteristic repeat configurations in the adjacent centromere and telomeres were elucidated for the very first time.
- June
- The International Human Genome Sequence Consortium, including GSC, declared the completion of the draft sequencing of the human genome
- July
- The human genome draft sequence integrated database was jointly created and made publicly available in public by GSC and the Human Genome Center, Institute of Medical Science, University of Tokyo
- September
- All genomes of the symbiotic microbe "Buchnera" were sequenced as a world-first
- 2001
- February
- Disclosure of mouse cDNA annotation information

An article with functional annotations of mouse full-length cDNA was published by Nature journal -
Publication of an article about the human genome draft sequence analysis Publication of an article on the human genome draft sequence analysis by the "International Human Genome Sequence Determination Consortium", comprised of the 6 countries of Japan, the United States, the UK, France, China, and Germany, and 20 research centers. GSC acquired and analyzed about 203 Mb of the sequence data.

Press release of the human genome draft sequence - Elucidation of the part of the Mechanism about the Internalization of Extracellular Molecules into Cells
- March
- Started the development of a large-scale high-throughput discovery system for the N-ethyl-N-nitrosourea (ENU) induced mutation
- April
- Established the International Structural Genomics Organization (ISGO)
- 2002
- January
-
Publication of the world's first human/chimpanzee genome comparison map Terminal sequences of approximately 64,000 kinds of DNA fragment of the chimpanzee genome were determined and the world's first genome map comparing the human and chimpanzee was accomplished. Results of the analysis clearly showed that the difference in genomes was only 1.23% (approximately 37,000,000 bases).
Total imaging of the chimpanzee genome fragment,
arranged on the human chromosome
- March
- Identified approximately 14,600 kinds of A. thaliana full-length cDNA Developed the "RIKEN Arabidopsis Genome Encyclopedia", we analyzed approximately 14,600 A. thaliana full-length cDNAs, leading the way for this kind of analysis, and attached functional explanatory notes. Confirmed approximately 2,400 new genes and clarified control region information for 14,000 genes.

A. thaliana
- April
- Started the National Project on Protein Structural and Functional Analysis (Protein 3000) and the National BioResource ProjectNational BioResource Project
We have been collecting, maintaining, and offering the biological resources of 24 species under the "TNational BioResource Project"T, a five-year national project that started in 2002. Toshihiko Shiroishi, Project Director of the Functional Genomics Research Group which is the core agency for "Tmouse mutagenesis"T, was in charge of developing the mouse mutant strains. We have developed various mutants that can be used as models of human disease including adult diseases, and are making them available to the global research community. These mutants were developed by our unique methodology, the systematic phenotype screening of mouse mutants induced by chemical mutagen, ENU.
- August
- DNA repair revealed by the crystal structure of the human recombination protein Rad52
- September
- Revealed novel mechanism of signal transduction from extracellular signal
- November
- Constructed a new system for comprehensive study of protein structure prediction
- December
- Published the world's largest scale gene information demonstrating that the large amount of non-protein code region is transcribed, a finding that may dramatically advance the functional analysis of genes
- 2003
- February
- Developed MDGRAPE-2A
- April

Dr. Sakaki, Director of GSC, presents 24 CD-ROMs containing the complete sequence data of the human genome to Prime Minister Koizumi.Declaration of the completion of sequencing the human genomeDeclaration of the completion of sequencing of the human genomeIn April 2003, the International Human Genome Sequencing Consortium (IHGSC), comprising 6 countries (Japan, the US, the UK, France, Germany, and China) with 18 research centers including GSC, jointly declared with the leaders of the 6 countries, the successful completion of the sequencing of the human genome with more than 99.99% accuracy. Sequencing was started in 1991, and continued after the completion of the sequencing of chromosome 22 (December 1999) and chromosome 21 (May 2000); and thereafter, in February 2001, the IHGSC and Celera Genomics Co., Ltd., respectively, published the draft sequence (accuracy: 99.9% or more; non-sequenced sequence: 10% or more) of each chromosome. The characteristics of the human genome, one of which the total number of human genes for coding protein was relatively small at approximately 22,000 were discovered.
- April
- Announcement regarding the trial version of the DNA Book "The DNA Book", printed cDNA by the original technology, was completed. Approximately 60,000 cDNA clones can be easily distributed as books and used by researchers worldwide.

Trial Version of the DNA Book
- July
- Analysis of rice full-length cDNA

- Ultra-high sensitivity NMR data measurement of protein was realized by the world's highest magnetic field (920 MHz) NMR system

- 2004
- February
-
Succeeded in developing the next-generation cell simulator on the grid Developed the next-generation cell simulator "OBIYagns" (Yet Another Gene Network Simulator) that uses grid technology. In addition to reducing computing time, simultaneous use of multiple users became possible through the internet

National simultaneous use of OBIYagns
- April
- Started the Genome Network Project
- May
- Structure of the human DNA recombinase revealed a double ring

- Succeeded in sequencing chimpanzee chromosome 22

- July
- Succeeded in understanding the activity of the essential protein "cohesin" (chromosome adhesion factor) in cell division

- August
- Released an analysis system for disease/phenotype mapping

- Succeeded in developing the fastest high-speed LSI in the world

- September
- Molecular mechanisms of the tumor necrosis factor signaling revealed by NMR

- Discovered a chemical compound, a candidate for an anti-SARS corona virus drug

- The International Human Genome Sequence Consortium, which includes GSC, published the results of the high-precision analysis of the human genome

- 2005
- January
- Established the RTK Consortium
- The atomic structure determination of the enzyme that can produce a new protein including a heavy atom

- June
- Released a search system for the DNA Book database

- Estimated unknown gene structures using gene expression information (tiling array)

- September
-
Published an article detailing the discovery of a "new RNA continent" following large-scale Transcriptome analysisThe New RNA Continent
Through the comprehensive collection of mouse full-length cDNAs, Yoshihide Hayashizaki, Project Director of the Genome Exploration Research Group, studied the role of those RNAs in protein synthesis. The results showed, in fact, that 53% or more of the RNA were transcribed as ncRNA, which is not usually used for protein synthesis. The ncRNAs identified during the study are drawing attention as a new continent of genomics due to its various functions, such as those associated with cell differentiation by taking the role of regulating gene expression.
Discovered the existence of a large number of RNAs that are not used for protein synthesis (ncRNA)
New RNA Continent
Further suggested that such ncRNAs have various functions including regulation of gene expression, and thus this was a great discovery disproving an established theory.
- Determination of protein structure constituting the genetic code of life

- October
- Completion of the Rice Full-Length cDNA Encyclopedia DNA Book containing 30,000 completely sequenced genes of the rice genome

- 2006
- January
- The article announced the sequencing of the base sequence of the chimpanzee Y-chromosome and comparative analysis results of the DNA sequences of human and chimpanzee. Succeeded in sequencing the chimpanzee Y-chromosome. Showed that the rate of evolution was faster in the chimpanzee Y-chromosome than in the human Y-chromosome.

Gon, the chimpanzee - Public release of the protein structure database against new type of influenza in the world for drug discovery acceleration

- April
- Captured the moment of RNA unwinding of RNA by a protein

- Discovered the regulatory region of "broad type" promoters for a new mechanism of gene regulation

- June
-

MDGRAPE-3 Board
The MDGRAPE-3 system comprises
138 servers in a lineBuilt an exclusive-use high-speed computer system that was the first to break the Petaflop barrierDeveloped "MDGRAPE-3", a computer exclusively used for molecular dynamics simulation. The theoretical peak performance is 1 Petaflop, or 1 quadrillion operations per second.MDGRAPE-3In 2006, Makoto Taiji, Team Leader of the Computational and Experimental Systems Biology Group, completed MDGRAPE-3, the fastest exclusive-use computer system in the world. MDGRAPE-3 is a special-purpose, highly sophisticated computer with 1 Petaflop peak performance specializing in the simulation of molecular dynamics. The completion of MDGRAPE-3 made possible visualization of the movement of molecules in vivo with great accuracy, and it has become feasible to study protein performance and/or the cause of disease at the atomic level. This system is also expected to contribute to new drug development since it can simulate how drug candidates will act when combined with a protein.

- August
-


Unnatural Base Pair "Ds-Pa"Creation of a novel unnatural base pair system for the expansion of the genetic alphabet toward future biotechnologySucceeded in creating the unnatural base pair "Ds-Pa", which is compatible with the natural A-T and G-C base pairs in replication. First in the world to realize unnatural base pair, that can function in replication and transcription.Artificial Base PairsIchiro Hirao, Team Leader of the Protein Research Group, and his colleagues developed two kinds of unnatural bases called "TDs"T and "TPa"T. These bases specifically form a pair with each other, like the natural A-T and G-C pairs in DNA. Although the Ds-Pa pair has no hydrogen bonds, found in the natural base pairs, each unnatural base has a specific shape complementation. DNA molecules containing the Ds-Pa pair can be faithfully amplified by PCR and used as templates for making RNA molecules containing extra components by transcription. Thus, the unnatural base pair could open the door to create novel nucleic acids and proteins useful for diagnostics and therapeutics.

- September
- Elucidation of the inhibition mechanism of an antibiotics kasgamycin on protein synthesis
- Open the modeling database of three-dimensional protein structures to the public via the web on a worldwide level

- October
- Open access to the search software "OmicBrowse" for the Genome Database Developed and released "OmicBrowse" software, which can search and visualize simultaneously various database groups related to annotation information of genes existing on the genome

"OmicBrowse" Top Page - Discovered the smallest ever genome in the living world from the commensal bacterium Carsonella ruddii

Commensal Bacterium, C. ruddii
- November
- Succeeded in high throughput peptide cohesion simulation.
Peptide cohesion simulation wins the Gordon Bell Prize in 2006. - Signed an agreement on collaboration with Edinburgh University
- December
- Elucidated the precise control of gene expression of anticancer drug target molecules
- Protein cooperates with RNA to form a pocket that conforms to substrate
- 2007
- January
- Discovery of a protein that converts the enzyme peroxisome into a mature form
- February
- Clarification of the structure and function of an oncoprotein involved in lung cancer

-
Developed the new gene diagnostic technology, the "SMAP method". Succeeded in developing the rapid isothermal DNA amplification method, "SMAP method" Clinical research was also started in Japan and overseas to investigate the potential technology for determining drug efficacy and for diagnosing side effects within 30 minutes using a drop of blood.

Reagents used in the SMAP method
- March
- Structural Basis for interactions of the Ribosome with the Switch Regions of GTP-bound Elongation Factors
- Structure of an archaeal protein referred to as a living fossil in the evolutionary history of the "genetic code dictionary"
- A snapshot of a supramolecular complex at the first step of protein biosynthesis
-

Figure shows the structure of proteins.
The left panel shows a glycosylated protein, human galectin 4, and the right panel shows human ZFAT which is involved in the regulation of autoimmune thyroid desease.Completed the Protein 3000 Project
Finally determined the structure of 2675 proteinsNational Project on Protein Structural and Functional Analysis(Protein 3000)The national "TProtein 3000 Project"T led by the Ministry of Education, Culture, Sports, Science and Technology, which targeted the analysis of the 3000 basic structure of proteins within 5 years, was completed in March 2007. Shigeyuki Yokoyama, Director of the project and his colleagues at RIKEN, accomplished 2675 structure determinations and led the project to success. They achieved automation of high-quality sample preparation in large quantities, and were able to improve preparation speed markedly. Their achievement had huge impact on global research and contributed greatly to the development of structural proteomics.
- Completion of the National Bio Resource Project Creation of a total of 399 mutant mice
- May
-
Succeeded in developing a mouse model of schizophrenia/depression Developed two mouse lineages having variations in amino acid replacement in the Disc1 gene, which is said to contribute to schizophrenia. It was clarified that this lineage can serve as a mouse model for both depression and schizophrenia.

A mouse presenting with schizophrenia (left) or depression symptoms (right) showed 13% and 6% decrease in brain volume, respectively. - Elucidated an endocytosis mechanism, the process in which extra-cellular substances are internalized through the cellular membrane