Business Wire

A research paper from COGNANO with co-authors from Google on a large-scale dataset of antigen-antibody interactions was accepted by NeurIPS 2023

Share

Antibodies are the most important therapeutic modality in drug discovery because there is no substance that binds to target molecules (antigens) as precisely and strongly as antibodies. Living organisms can produce a huge variety of antibodies, derived from genes, in almost unlimited quantities - so much so that it is theoretically possible to search for effective antibodies based on huge collections of antibody-encoding genes in vivo. However, it is not easy to decipher them, and there are limits for data accumulation due to the complicated genes.

This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20230927038103/en/

To view this piece of content from mms.businesswire.com, please give your consent at the top of this page.

Website for downloading released dataset (https://avida-hil6.cognanous.com/) (Graphic: Business Wire)

By immunizing alpacas, which possess a simple antibody-encoding gene meaning they can produce a wide array of antibodies, COGNANO acquired a digital ‘library’ of antibody sequences and their binding activity to different antigens. Generally, the binding between an antibody and an antigen is one-to-one correspondence, and there is only one binding site (known as the epitope). We demonstrate with this dataset that the artificial intelligence has the potential to predict the binding ability of previously unknown antibodies. We are making this dataset available for the research community as the world's largest and most precise antigen/antibody dataset, in the hope that it accelerates progress in AI enabled drug discovery.

We hope that future work explores the possibility of not only predicting binding, but also identifying epitopes and the responsible amino acid sequences in both antigens and antibodies. We believe that this is an important step forward in automatic drug discovery. COGNANO will present this achievement at NeurIPS 2023 in collaboration with the Google team.

Title: AVIDa-hIL6: A Large-Scale VHH Dataset Produced from an Immunized Alpaca for Predicting Antigen-Antibody Interactions

Authors: Hirofumi Tsuruta, Hiroyuki Yamazaki, Ryota Maeda, Ryotaro Tamura, Jennifer N. Wei, Zelda Mariet, Poomarin Phloyphisut, Hidetoshi Shimokawa, Joseph R. Ledsam, Lucy Colwell, Akihiro Imura

URL: https://arxiv.org/abs/2306.03329

1. Background

Antibodies are proteins that play an essential role in the immune system. Antibodies have become an important class of therapeutic agents to treat human diseases because of their high target specificity and binding affinity. To accelerate therapeutic antibody discovery, computational methods, especially machine learning, have attracted considerable interest for predicting specific interactions between antibody candidates and target antigens such as viruses and bacteria. However, progress in therapeutic antibody discovery has lagged behind progress in other areas of drug discovery because of the lack of availability of high-quality, large-scale datasets of antigen-antibody interactions. In particular, the publicly available datasets in existing studies have notable limitations, such as small sizes and the lack of non-binding samples and exact amino acid sequences. Therefore, large-scale datasets that overcome the limitations of existing datasets are essential to further accelerate AI drug discovery.

2. Research contributions

  • We release AVIDa-hIL6, which is the largest existing dataset for predicting antigen-antibody interactions (10 times larger than any other public dataset) and contains amino acid sequences of antigens and antibodies and binary labels for binding and non-binding pairs.
  • We have designed a novel data generation method by using the immune system of a live alpaca. Because our data generation method is applicable to any target antigen, it can be a fundamental technology for establishing a more comprehensive database of antigen-antibody interactions. In fact, we used the same approach to generate a dataset for SARS-CoV-2 variants and successfully found effective antibodies.

Reference paper: https://www.nature.com/articles/s42003-022-03630-3

  • We report experimental benchmark results on AVIDa-hIL6 by using machine learning models. These results confirm that AVIDa-hIL6 provides valuable benchmarks for machine learning research in the growing field of predicting antigen-antibody interactions.

3. Released dataset (AVIDa-hIL6)

AVIDa-hIL6 is available on the website ( https://avida-hil6.cognanous.com ) under a CC BY-NC 4.0 license. AVIDa-hIL6 contains amino acid sequences of the human interleukin-6 (IL-6) protein used as the antigen and antibodies and binary labels for binding and non-binding pairs.

Furthermore, AVIDa-hIL6 contains information on the interaction of diverse antibodies with 30 different mutants produced by artificial point mutations, in addition to the wild-type IL-6 protein. This assumes that antigen mutants emerge one after another to evade the immune system, as in the COVID-19 pandemic. Notably, AVIDa-hIL6 contains many sensitive cases in which point mutations in the IL-6 protein enhance or inhibit antibody binding, thus providing researchers with valuable insights into the effects of antigen mutations on antibody binding.

4. Perspectives

The major limitation of AVIDa-hIL6 is the lack of antigen diversity: specifically, AVIDa-hIL6 only has the IL-6 protein as an antigen. This limitation leads to the narrow applicability of a model trained on AVIDa-hIL6. In fact, it is difficult for a machine learning model trained using only AVIDa-hIL6 to predict antibodies that are effective against antigens other than IL-6 protein. However, in drug discovery applications, there is a need to find effective antibodies against new emerging antigens.

An essential approach to overcome this limitation will be to accumulate labeled data for a wider variety of antigens and their mutants. Our data generation method has the advantage of being applicable to any target antigen. In the future, we plan to generate and release datasets for various antigens, which should be more practical for building models to predict antigen-antibody interactions.

To view this piece of content from cts.businesswire.com, please give your consent at the top of this page.

Contact information

COGNANO, Inc.
Akihiro Imura, +81-75-741-6962
cognano@cognano.co.jp

About Business Wire

For more than 50 years, Business Wire has been the global leader in press release distribution and regulatory disclosure.

Subscribe to releases from Business Wire

Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.

Latest releases from Business Wire

Ant International Partners with Barclays on Global Treasury Management with Proprietary AI-Powered FX Model9.5.2025 16:57:00 EEST | Press release

Ant International has entered a partnership with leading UK bank Barclays to enhance efficiency and resilience in global treasury management for businesses. Under the partnership, the two sides will combine innovative solutions, including Ant’s proprietary Time-Series Transformer (TST) AI FX Model, to help businesses reduce FX-related costs and risks against global volatilities. At the initial stage of the collaboration, Ant International has successfully completed the first batch of its intra-group FX transactions with Barclays. Ant International’s TST Model is a transformer architecture-based big data model with close to 2 billion parameters. By integrating the latest time series forecasting algorithms, the TST Model predicts patterns over time. Ant also created new pre-training and Supervised Fine-Tuning (SFT) frameworks to train the model and improve its predictions over time. The TST Model now forecasts the company's cashflow and FX exposure on an hourly, daily and weekly basis, w

Monument Re Transfers €1.4bn Greycastle Portfolio to RGA and Strengthens European Life Insurance Consolidation Platform9.5.2025 15:00:00 EEST | Press release

Monument Re Limited (“Monument”) announces today that it has transferred a legacy €1.4bn reinsurance portfolio, comprising annuity and other life insurance liabilities acquired as part of the 2020 Greycastle transaction, to RGA Americas Reinsurance Company, Ltd. (“RGA”). This transaction releases capital resources that Monument will redirect to its core strategy of consolidation in European life insurance markets. The transaction completed on 2May 2025 following approval by the Boards of Directors of both Monument and RGA and non-objection from the Bermuda Monetary Authority. Monument has taken significant steps in recent months to strengthen its business operations by consolidating its European group support functions in Dublin and by aligning with the recently strengthened regulatory regime in Bermuda. With its strong financial position and best in class capabilities in the Group, Monument remains ideally positioned to build on its success to date and grow its European footprint, del

IFF Completes Divestiture of Nitrocellulose Business9.5.2025 15:00:00 EEST | Press release

IFF (NYSE: IFF) today announced that it has completed the divestiture of its nitrocellulose business, including Walsrode Industrial Park in lower Saxony, Germany, to Czechoslovak Group (CSG). The business manufactures nitrocellulose strictly for industrial purposes, serving customers primarily in coatings and printing inks, and had been part of IFF’s Pharma Solutions business unit. “The divestiture of our nitrocellulose business builds upon our deleveraging journey and enables us to focus on our core businesses,” said Erik Fyrwald, IFF CEO. “I’d like to thank our nitrocellulose colleagues for their dedication and wish them continued success as part of CSG.” Welcome to IFF At IFF (NYSE: IFF), we make joy through science, creativity and heart. As the global leader in flavors, fragrances, food ingredients, health and biosciences, we deliver groundbreaking, sustainable innovations that elevate everyday products—advancing wellness, delighting the senses and enhancing the human experience.Le

CPAC Systems AB Announces Strategic Minority Investment in Flying Fish9.5.2025 13:33:00 EEST | Press release

CPAC Systems AB, a leader in advanced control systems and embedded vessel and commercial vehicle technology, today announced a strategic minority investment in Flying Fish Maritime Innovations B.V., a pioneer in advanced shared water mobility solutions and robust, cost-effective hydrofoil technology. The investment underscores both companies’ commitment to redefining water-based transportation through innovation, sustainability, and seamless integration. It marks the beginning of a deeper collaboration aimed at enabling smarter, cleaner, and more connected mobility on the water — for both recreational and commercial applications. “We are thrilled to support Flying Fish and their impressive work in redefining water mobility,"saidMarcus Wingolf, CEO of CPAC Systems. "Our investment in Flying Fish represents a strategic alignment of our technical proficiencies and innovative ambitions. This partnership opens exciting new possibilities for integration and sustainable transformation across

Kioxia Receives IEEE Corporate Innovation Award9.5.2025 10:00:00 EEST | Press release

Kioxia Corporation, a world leader in memory solutions, today announced that it has received the IEEE Corporate Innovation Award from the Institute of Electrical & Electronics Engineers (IEEE), the world’s largest technical professional organization dedicated to advancing technology for the benefit of humanity. This award recognizes Kioxia's outstanding contribution in the field of electrical and electronics engineering through its BiCS FLASH™ technology, a low-cost, high-capacity 3D flash memory innovation. The award ceremony was held on April 24 in Tokyo. The IEEE Corporate Innovation Award is a globally-recognized honor bestowed upon organizations that have developed innovative technologies, products or services that have made a substantial contribution to the advancement of electrical and electronics engineering. Since its inception in 1985, the award has been presented to leading electronics manufacturers and IT companies worldwide, and Kioxia is proud to be the seventh Japanese c

In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.

Visit our pressroom
World GlobeA line styled icon from Orion Icon Library.HiddenA line styled icon from Orion Icon Library.Eye