What is an Author’s Note: A Concise Guide

By: Author Paul Jenkins

Posted on April 19, 2023

Categories Writing , Publishing

An author’s note is a statement from the author to the readers about a book, providing insights into the making of the book, the story behind writing it, and other relevant information. It can appear in fiction and nonfiction works and serves different purposes depending on the context. In nonfiction, an author’s note is typically used to explain or add context to their decisions, such as how they chose sources, decided what information to include, or why they used particular language.

In fiction, an author’s note may provide background information about the story, including the author’s inspiration, research, or creative process. This type of note is often intended to be shorter than a preface and sometimes replaces it. It is a channel for authors to communicate directly with their readers, telling them the story behind it.

An author’s note may also contain additional information, such as acknowledgments, disclaimers, or statements regarding conflicts of interest and help or funding that supported the research or writing process. Ultimately, an author’s note aims to provide transparency and context while enhancing the reader’s understanding and connection to the work.

Definition of an Author’s Note

An author’s note directly communicates with their readers, providing insights and context about their written work. This element can be found in fiction and nonfiction books, allowing authors to share the story behind it, their motivations, decisions, or even disclaimers and acknowledgments.

Although often shorter than a preface, an author’s note can sometimes replace it, depending on the author’s intent and the note’s content. In nonfiction books, it may be used to explain the author’s choice of sources, the rationale behind including specific information, or their use of particular language. In academic or research papers, the author’s note can include information about study registration, data sharing, funding sources, and conflicts of interest.

In summary, an author’s note is a way for authors to give readers a glimpse into the creative process and background of their written work. This meta-writing helps connect the author and the reader and adds depth to the reading experience.

Purpose of an Author’s Note

Context and background.

An author’s note provides context and background information about the author’s work. This can include details about the writing process, research, and the reasons behind confident stylistic or content choices. In fiction and nonfiction books, an author’s note can help clarify the author’s intentions, convey additional meaning to readers, and resolve possible confusion or unanswered questions in the text.

Acknowledgments

Another purpose of an author’s note is to acknowledge the people, resources, or funding that supported the book’s creation. This section allows the author to thank individuals who contributed to the work, such as editors, research participants, or mentors. It can also mention any grants, sponsorships, or financial assistance received during the development of the work.

Disclaimers

Disclaimers in the author’s note can address potential conflicts of interest, highlight limitations in work, or clarify any misconceptions readers might have. It might include details about the author’s personal experiences, biases, or perspectives that informed the work or highlight the tentative nature of specific findings or conclusions. By including such disclaimers, authors can demonstrate transparency in the research or writing process and maintain credibility with their readers.

Using an author’s note enables authors to engage with their readers while enhancing the overall experience and depth of understanding. Addressing context, acknowledgments, and disclaimers ensures the work is well-rounded and transparent.

Placement and Formatting

An author’s note can typically be found at the beginning or end of a book. Sometimes, it replaces a preface or appears as a separate section entirely. It serves as a direct communication between the author and the reader, providing insights into the making of the book, the author’s decisions, and other relevant information.

The length of an author’s note may vary depending on the content and the author’s intent. However, it is generally intended to be shorter than a preface. The author should aim to keep it brief and focused on the most important points they want to convey to the reader.

The style of an author’s note is usually more informal than the rest of the book. The author can address the reader directly and use a personal tone reflecting their experience with the book’s creation. It allows for more flexibility in expressing emotions, thought processes, and behind-the-scenes details that might not fit within the book’s main body.

Regarding formatting, keeping the author’s note easily readable and well-organized is essential. Suppose additional information needs to be conveyed, such as sources, acknowledgments, or statements of support. In that case, it can be incorporated using appropriate HTML elements like lists, tables, or emphasized text as necessary:

  • <ul> or <ol> for bulleted or numbered lists
  • <table> for organizing data in a table format
  • <em> or <strong> for emphasizing important text

In summary, an author’s note is valuable for providing readers with insights and context about a book’s creation. Its placement, length, and style can vary, but it should always aim to convey critical information clearly and effectively to the reader.

Writing an Effective Author’s Note

An author’s note provides valuable insight into the thought process and decisions behind a piece of writing. If approached carefully, an effective author’s note can help bridge the gap between the writer and the reader. This section will discuss three key elements to consider when writing an effective author’s note: addressing the reader, clarity and brevity, and tone and voice.

Addressing the Reader

Connecting with the reader in an author’s note is essential by speaking directly to them. Address any questions or concerns they might have, and provide insight into the creation process for the book. This can include discussions of research, source selection, and the reasons behind specific content choices. By acknowledging the reader and creating a conversation, an author’s note becomes more engaging and helpful to the audience.

Clarity and Brevity

Keeping an author’s note brief and clear is crucial to maintain the reader’s interest. Focus on essential points that give context to the work or help establish the author’s credibility. Avoid lengthy explanations or digressions, as they can detract from the book’s main content. Use simple language and concise sentences, ensuring the information is easily understood.

Tone and Voice

Striking the right balance between tone and voice is key to an effective author’s note. Use a confident and knowledgeable tone to establish the author’s expertise and reliability, but remain neutral to avoid alienating the reader. Express appreciation for the reader’s time and interest, and maintain a clear and straightforward writing style. By conveying information to the reader in a thoughtful and relatable manner, an author’s note becomes an essential part of the reading experience.

Examples of Author’s Notes

An author’s note is a statement from the author to the readers about the book, providing insights into the making of the book and communicating the story behind its writing.

One common type of author’s note is a disclaimer informing readers that the book is a work of fiction and not intended to represent real people, places, or events. This is often included to avoid potential legal issues or misunderstandings arising from the book’s content.

An author’s note can also address the writing product, such as summarizing the text as an abstract or retelling a poem in prose. This can help readers better understand the work and its structure.

In some cases, the author’s note may provide more details and context about the book’s topic that was not included in the main narrative. For example, in her biography of Isamu Noguchi, Christy Hale uses the author’s note to include dates, places, and additional details about the subject that were omitted from the main story.

Another example of an author’s note can be found in historical fiction or nonfiction books. The author may use the note to explain their research process, and the sources used, and any liberties that were taken with historical facts or events to enhance the narrative.

In summary, the author’s note can serve various purposes, such as providing disclaimers, additional context, details, insights into the writing process, or addressing the structure and content of the book. The format and content of an author’s note will vary depending on the author’s intentions and the specific needs of their work.

13.5 Research Process: Making Notes, Synthesizing Information, and Keeping a Research Log

Learning outcomes.

By the end of this section, you will be able to:

  • Employ the methods and technologies commonly used for research and communication within various fields.
  • Practice and apply strategies such as interpretation, synthesis, response, and critique to compose texts that integrate the writer’s ideas with those from appropriate sources.
  • Analyze and make informed decisions about intellectual property based on the concepts that motivate them.
  • Apply citation conventions systematically.

As you conduct research, you will work with a range of “texts” in various forms, including sources and documents from online databases as well as images, audio, and video files from the Internet. You may also work with archival materials and with transcribed and analyzed primary data. Additionally, you will be taking notes and recording quotations from secondary sources as you find materials that shape your understanding of your topic and, at the same time, provide you with facts and perspectives. You also may download articles as PDFs that you then annotate. Like many other students, you may find it challenging to keep so much material organized, accessible, and easy to work with while you write a major research paper. As it does for many of those students, a research log for your ideas and sources will help you keep track of the scope, purpose, and possibilities of any research project.

A research log is essentially a journal in which you collect information, ask questions, and monitor the results. Even if you are completing the annotated bibliography for Writing Process: Informing and Analyzing , keeping a research log is an effective organizational tool. Like Lily Tran’s research log entry, most entries have three parts: a part for notes on secondary sources, a part for connections to the thesis or main points, and a part for your own notes or questions. Record source notes by date, and allow room to add cross-references to other entries.

Summary of Assignment: Research Log

Your assignment is to create a research log similar to the student model. You will use it for the argumentative research project assigned in Writing Process: Integrating Research to record all secondary source information: your notes, complete publication data, relation to thesis, and other information as indicated in the right-hand column of the sample entry.

Another Lens. A somewhat different approach to maintaining a research log is to customize it to your needs or preferences. You can apply shading or color coding to headers, rows, and/or columns in the three-column format (for colors and shading). Or you can add columns to accommodate more information, analysis, synthesis, or commentary, formatting them as you wish. Consider adding a column for questions only or one for connections to other sources. Finally, consider a different visual format , such as one without columns. Another possibility is to record some of your comments and questions so that you have an aural rather than a written record of these.

Writing Center

At this point, or at any other point during the research and writing process, you may find that your school’s writing center can provide extensive assistance. If you are unfamiliar with the writing center, now is a good time to pay your first visit. Writing centers provide free peer tutoring for all types and phases of writing. Discussing your research with a trained writing center tutor can help you clarify, analyze, and connect ideas as well as provide feedback on works in progress.

Quick Launch: Beginning Questions

You may begin your research log with some open pages in which you freewrite, exploring answers to the following questions. Although you generally would do this at the beginning, it is a process to which you likely will return as you find more information about your topic and as your focus changes, as it may during the course of your research.

  • What information have I found so far?
  • What do I still need to find?
  • Where am I most likely to find it?

These are beginning questions. Like Lily Tran, however, you will come across general questions or issues that a quick note or freewrite may help you resolve. The key to this section is to revisit it regularly. Written answers to these and other self-generated questions in your log clarify your tasks as you go along, helping you articulate ideas and examine supporting evidence critically. As you move further into the process, consider answering the following questions in your freewrite:

  • What evidence looks as though it best supports my thesis?
  • What evidence challenges my working thesis?
  • How is my thesis changing from where it started?

Creating the Research Log

As you gather source material for your argumentative research paper, keep in mind that the research is intended to support original thinking. That is, you are not writing an informational report in which you simply supply facts to readers. Instead, you are writing to support a thesis that shows original thinking, and you are collecting and incorporating research into your paper to support that thinking. Therefore, a research log, whether digital or handwritten, is a great way to keep track of your thinking as well as your notes and bibliographic information.

In the model below, Lily Tran records the correct MLA bibliographic citation for the source. Then, she records a note and includes the in-text citation here to avoid having to retrieve this information later. Perhaps most important, Tran records why she noted this information—how it supports her thesis: The human race must turn to sustainable food systems that provide healthy diets with minimal environmental impact, starting now . Finally, she makes a note to herself about an additional visual to include in the final paper to reinforce the point regarding the current pressure on food systems. And she connects the information to other information she finds, thus cross-referencing and establishing a possible synthesis. Use a format similar to that in Table 13.4 to begin your own research log.

Types of Research Notes

Taking good notes will make the research process easier by enabling you to locate and remember sources and use them effectively. While some research projects requiring only a few sources may seem easily tracked, research projects requiring more than a few sources are more effectively managed when you take good bibliographic and informational notes. As you gather evidence for your argumentative research paper, follow the descriptions and the electronic model to record your notes. You can combine these with your research log, or you can use the research log for secondary sources and your own note-taking system for primary sources if a division of this kind is helpful. Either way, be sure to include all necessary information.

Bibliographic Notes

These identify the source you are using. When you locate a useful source, record the information necessary to find that source again. It is important to do this as you find each source, even before taking notes from it. If you create bibliographic notes as you go along, then you can easily arrange them in alphabetical order later to prepare the reference list required at the end of formal academic papers. If your instructor requires you to use MLA formatting for your essay, be sure to record the following information:

  • Title of source
  • Title of container (larger work in which source is included)
  • Other contributors
  • Publication date

When using MLA style with online sources, also record the following information:

  • Date of original publication
  • Date of access
  • DOI (A DOI, or digital object identifier, is a series of digits and letters that leads to the location of an online source. Articles in journals are often assigned DOIs to ensure that the source can be located, even if the URL changes. If your source is listed with a DOI, use that instead of a URL.)

It is important to understand which documentation style your instructor will require you to use. Check the Handbook for MLA Documentation and Format and APA Documentation and Format styles . In addition, you can check the style guide information provided by the Purdue Online Writing Lab .

Informational Notes

These notes record the relevant information found in your sources. When writing your essay, you will work from these notes, so be sure they contain all the information you need from every source you intend to use. Also try to focus your notes on your research question so that their relevance is clear when you read them later. To avoid confusion, work with separate entries for each piece of information recorded. At the top of each entry, identify the source through brief bibliographic identification (author and title), and note the page numbers on which the information appears. Also helpful is to add personal notes, including ideas for possible use of the information or cross-references to other information. As noted in Writing Process: Integrating Research , you will be using a variety of formats when borrowing from sources. Below is a quick review of these formats in terms of note-taking processes. By clarifying whether you are quoting directly, paraphrasing, or summarizing during these stages, you can record information accurately and thus take steps to avoid plagiarism.

Direct Quotations, Paraphrases, and Summaries

A direct quotation is an exact duplication of the author’s words as they appear in the original source. In your notes, put quotation marks around direct quotations so that you remember these words are the author’s, not yours. One advantage of copying exact quotations is that it allows you to decide later whether to include a quotation, paraphrase, or summary. ln general, though, use direct quotations only when the author’s words are particularly lively or persuasive.

A paraphrase is a restatement of the author’s words in your own words. Paraphrase to simplify or clarify the original author’s point. In your notes, use paraphrases when you need to record details but not exact words.

A summary is a brief condensation or distillation of the main point and most important details of the original source. Write a summary in your own words, with facts and ideas accurately represented. A summary is useful when specific details in the source are unimportant or irrelevant to your research question. You may find you can summarize several paragraphs or even an entire article or chapter in just a few sentences without losing useful information. It is a good idea to note when your entry contains a summary to remind you later that it omits detailed information. See Writing Process Integrating Research for more detailed information and examples of quotations, paraphrases, and summaries and when to use them.

Other Systems for Organizing Research Logs and Digital Note-Taking

Students often become frustrated and at times overwhelmed by the quantity of materials to be managed in the research process. If this is your first time working with both primary and secondary sources, finding ways to keep all of the information in one place and well organized is essential.

Because gathering primary evidence may be a relatively new practice, this section is designed to help you navigate the process. As mentioned earlier, information gathered in fieldwork is not cataloged, organized, indexed, or shelved for your convenience. Obtaining it requires diligence, energy, and planning. Online resources can assist you with keeping a research log. Your college library may have subscriptions to tools such as Todoist or EndNote. Consult with a librarian to find out whether you have access to any of these. If not, use something like the template shown in Figure 13.8 , or another like it, as a template for creating your own research notes and organizational tool. You will need to have a record of all field research data as well as the research log for all secondary sources.

As an Amazon Associate we earn from qualifying purchases.

This book may not be used in the training of large language models or otherwise be ingested into large language models or generative AI offerings without OpenStax's permission.

Want to cite, share, or modify this book? This book uses the Creative Commons Attribution License and you must attribute OpenStax.

Access for free at https://openstax.org/books/writing-guide/pages/1-unit-introduction
  • Authors: Michelle Bachelor Robinson, Maria Jerskey, featuring Toby Fulwiler
  • Publisher/website: OpenStax
  • Book title: Writing Guide with Handbook
  • Publication date: Dec 21, 2021
  • Location: Houston, Texas
  • Book URL: https://openstax.org/books/writing-guide/pages/1-unit-introduction
  • Section URL: https://openstax.org/books/writing-guide/pages/13-5-research-process-making-notes-synthesizing-information-and-keeping-a-research-log

© Dec 19, 2023 OpenStax. Textbook content produced by OpenStax is licensed under a Creative Commons Attribution License . The OpenStax name, OpenStax logo, OpenStax book covers, OpenStax CNX name, and OpenStax CNX logo are not subject to the Creative Commons license and may not be reproduced without the prior and express written consent of Rice University.

Tutorial Thursday: Adding an Author Note in APA Style CENTRAL

For today’s edition of Tutorial Thursday, we’d like to highlight a video on our APA Style CENTRAL YouTube channel:  APA Style CENTRAL: Adding an Author Note .

This brief video (2 min) demonstrates how to add an Author Note to your paper in APA Style CENTRAL, and includes:

  • The distinction between the options of “Student assignment” or “Professional manuscript” when starting a new paper;
  • Using the Customize Paper Sections tool to add an Author Note section to a student paper; and
  • Viewing the formatted Author Note on the manuscript’s title page via the Preview tab.

Please feel free to link to this tutorial in any location where students, faculty, and researchers will find it!

Want to learn more? View our related resources:

  • APA Style CENTRAL Handouts and Guides
  • A recorded session of our Writing Papers in APA Style CENTRAL webinar  (28 min)
  • Training schedule for APA Style CENTRAL Webinars

Follow

Patti Avellanet

Logo for M Libraries Publishing

Want to create or adapt books like this? Learn more about how Pressbooks supports open publishing practices.

13.1 Formatting a Research Paper

Learning objectives.

  • Identify the major components of a research paper written using American Psychological Association (APA) style.
  • Apply general APA style and formatting conventions in a research paper.

In this chapter, you will learn how to use APA style , the documentation and formatting style followed by the American Psychological Association, as well as MLA style , from the Modern Language Association. There are a few major formatting styles used in academic texts, including AMA, Chicago, and Turabian:

  • AMA (American Medical Association) for medicine, health, and biological sciences
  • APA (American Psychological Association) for education, psychology, and the social sciences
  • Chicago—a common style used in everyday publications like magazines, newspapers, and books
  • MLA (Modern Language Association) for English, literature, arts, and humanities
  • Turabian—another common style designed for its universal application across all subjects and disciplines

While all the formatting and citation styles have their own use and applications, in this chapter we focus our attention on the two styles you are most likely to use in your academic studies: APA and MLA.

If you find that the rules of proper source documentation are difficult to keep straight, you are not alone. Writing a good research paper is, in and of itself, a major intellectual challenge. Having to follow detailed citation and formatting guidelines as well may seem like just one more task to add to an already-too-long list of requirements.

Following these guidelines, however, serves several important purposes. First, it signals to your readers that your paper should be taken seriously as a student’s contribution to a given academic or professional field; it is the literary equivalent of wearing a tailored suit to a job interview. Second, it shows that you respect other people’s work enough to give them proper credit for it. Finally, it helps your reader find additional materials if he or she wishes to learn more about your topic.

Furthermore, producing a letter-perfect APA-style paper need not be burdensome. Yes, it requires careful attention to detail. However, you can simplify the process if you keep these broad guidelines in mind:

  • Work ahead whenever you can. Chapter 11 “Writing from Research: What Will I Learn?” includes tips for keeping track of your sources early in the research process, which will save time later on.
  • Get it right the first time. Apply APA guidelines as you write, so you will not have much to correct during the editing stage. Again, putting in a little extra time early on can save time later.
  • Use the resources available to you. In addition to the guidelines provided in this chapter, you may wish to consult the APA website at http://www.apa.org or the Purdue University Online Writing lab at http://owl.english.purdue.edu , which regularly updates its online style guidelines.

General Formatting Guidelines

This chapter provides detailed guidelines for using the citation and formatting conventions developed by the American Psychological Association, or APA. Writers in disciplines as diverse as astrophysics, biology, psychology, and education follow APA style. The major components of a paper written in APA style are listed in the following box.

These are the major components of an APA-style paper:

Body, which includes the following:

  • Headings and, if necessary, subheadings to organize the content
  • In-text citations of research sources
  • References page

All these components must be saved in one document, not as separate documents.

The title page of your paper includes the following information:

  • Title of the paper
  • Author’s name
  • Name of the institution with which the author is affiliated
  • Header at the top of the page with the paper title (in capital letters) and the page number (If the title is lengthy, you may use a shortened form of it in the header.)

List the first three elements in the order given in the previous list, centered about one third of the way down from the top of the page. Use the headers and footers tool of your word-processing program to add the header, with the title text at the left and the page number in the upper-right corner. Your title page should look like the following example.

Beyond the Hype: Evaluating Low-Carb Diets cover page

The next page of your paper provides an abstract , or brief summary of your findings. An abstract does not need to be provided in every paper, but an abstract should be used in papers that include a hypothesis. A good abstract is concise—about one hundred fifty to two hundred fifty words—and is written in an objective, impersonal style. Your writing voice will not be as apparent here as in the body of your paper. When writing the abstract, take a just-the-facts approach, and summarize your research question and your findings in a few sentences.

In Chapter 12 “Writing a Research Paper” , you read a paper written by a student named Jorge, who researched the effectiveness of low-carbohydrate diets. Read Jorge’s abstract. Note how it sums up the major ideas in his paper without going into excessive detail.

Beyond the Hype: Abstract

Write an abstract summarizing your paper. Briefly introduce the topic, state your findings, and sum up what conclusions you can draw from your research. Use the word count feature of your word-processing program to make sure your abstract does not exceed one hundred fifty words.

Depending on your field of study, you may sometimes write research papers that present extensive primary research, such as your own experiment or survey. In your abstract, summarize your research question and your findings, and briefly indicate how your study relates to prior research in the field.

Margins, Pagination, and Headings

APA style requirements also address specific formatting concerns, such as margins, pagination, and heading styles, within the body of the paper. Review the following APA guidelines.

Use these general guidelines to format the paper:

  • Set the top, bottom, and side margins of your paper at 1 inch.
  • Use double-spaced text throughout your paper.
  • Use a standard font, such as Times New Roman or Arial, in a legible size (10- to 12-point).
  • Use continuous pagination throughout the paper, including the title page and the references section. Page numbers appear flush right within your header.
  • Section headings and subsection headings within the body of your paper use different types of formatting depending on the level of information you are presenting. Additional details from Jorge’s paper are provided.

Cover Page

Begin formatting the final draft of your paper according to APA guidelines. You may work with an existing document or set up a new document if you choose. Include the following:

  • Your title page
  • The abstract you created in Note 13.8 “Exercise 1”
  • Correct headers and page numbers for your title page and abstract

APA style uses section headings to organize information, making it easy for the reader to follow the writer’s train of thought and to know immediately what major topics are covered. Depending on the length and complexity of the paper, its major sections may also be divided into subsections, sub-subsections, and so on. These smaller sections, in turn, use different heading styles to indicate different levels of information. In essence, you are using headings to create a hierarchy of information.

The following heading styles used in APA formatting are listed in order of greatest to least importance:

  • Section headings use centered, boldface type. Headings use title case, with important words in the heading capitalized.
  • Subsection headings use left-aligned, boldface type. Headings use title case.
  • The third level uses left-aligned, indented, boldface type. Headings use a capital letter only for the first word, and they end in a period.
  • The fourth level follows the same style used for the previous level, but the headings are boldfaced and italicized.
  • The fifth level follows the same style used for the previous level, but the headings are italicized and not boldfaced.

Visually, the hierarchy of information is organized as indicated in Table 13.1 “Section Headings” .

Table 13.1 Section Headings

A college research paper may not use all the heading levels shown in Table 13.1 “Section Headings” , but you are likely to encounter them in academic journal articles that use APA style. For a brief paper, you may find that level 1 headings suffice. Longer or more complex papers may need level 2 headings or other lower-level headings to organize information clearly. Use your outline to craft your major section headings and determine whether any subtopics are substantial enough to require additional levels of headings.

Working with the document you developed in Note 13.11 “Exercise 2” , begin setting up the heading structure of the final draft of your research paper according to APA guidelines. Include your title and at least two to three major section headings, and follow the formatting guidelines provided above. If your major sections should be broken into subsections, add those headings as well. Use your outline to help you.

Because Jorge used only level 1 headings, his Exercise 3 would look like the following:

Citation Guidelines

In-text citations.

Throughout the body of your paper, include a citation whenever you quote or paraphrase material from your research sources. As you learned in Chapter 11 “Writing from Research: What Will I Learn?” , the purpose of citations is twofold: to give credit to others for their ideas and to allow your reader to follow up and learn more about the topic if desired. Your in-text citations provide basic information about your source; each source you cite will have a longer entry in the references section that provides more detailed information.

In-text citations must provide the name of the author or authors and the year the source was published. (When a given source does not list an individual author, you may provide the source title or the name of the organization that published the material instead.) When directly quoting a source, it is also required that you include the page number where the quote appears in your citation.

This information may be included within the sentence or in a parenthetical reference at the end of the sentence, as in these examples.

Epstein (2010) points out that “junk food cannot be considered addictive in the same way that we think of psychoactive drugs as addictive” (p. 137).

Here, the writer names the source author when introducing the quote and provides the publication date in parentheses after the author’s name. The page number appears in parentheses after the closing quotation marks and before the period that ends the sentence.

Addiction researchers caution that “junk food cannot be considered addictive in the same way that we think of psychoactive drugs as addictive” (Epstein, 2010, p. 137).

Here, the writer provides a parenthetical citation at the end of the sentence that includes the author’s name, the year of publication, and the page number separated by commas. Again, the parenthetical citation is placed after the closing quotation marks and before the period at the end of the sentence.

As noted in the book Junk Food, Junk Science (Epstein, 2010, p. 137), “junk food cannot be considered addictive in the same way that we think of psychoactive drugs as addictive.”

Here, the writer chose to mention the source title in the sentence (an optional piece of information to include) and followed the title with a parenthetical citation. Note that the parenthetical citation is placed before the comma that signals the end of the introductory phrase.

David Epstein’s book Junk Food, Junk Science (2010) pointed out that “junk food cannot be considered addictive in the same way that we think of psychoactive drugs as addictive” (p. 137).

Another variation is to introduce the author and the source title in your sentence and include the publication date and page number in parentheses within the sentence or at the end of the sentence. As long as you have included the essential information, you can choose the option that works best for that particular sentence and source.

Citing a book with a single author is usually a straightforward task. Of course, your research may require that you cite many other types of sources, such as books or articles with more than one author or sources with no individual author listed. You may also need to cite sources available in both print and online and nonprint sources, such as websites and personal interviews. Chapter 13 “APA and MLA Documentation and Formatting” , Section 13.2 “Citing and Referencing Techniques” and Section 13.3 “Creating a References Section” provide extensive guidelines for citing a variety of source types.

Writing at Work

APA is just one of several different styles with its own guidelines for documentation, formatting, and language usage. Depending on your field of interest, you may be exposed to additional styles, such as the following:

  • MLA style. Determined by the Modern Languages Association and used for papers in literature, languages, and other disciplines in the humanities.
  • Chicago style. Outlined in the Chicago Manual of Style and sometimes used for papers in the humanities and the sciences; many professional organizations use this style for publications as well.
  • Associated Press (AP) style. Used by professional journalists.

References List

The brief citations included in the body of your paper correspond to the more detailed citations provided at the end of the paper in the references section. In-text citations provide basic information—the author’s name, the publication date, and the page number if necessary—while the references section provides more extensive bibliographical information. Again, this information allows your reader to follow up on the sources you cited and do additional reading about the topic if desired.

The specific format of entries in the list of references varies slightly for different source types, but the entries generally include the following information:

  • The name(s) of the author(s) or institution that wrote the source
  • The year of publication and, where applicable, the exact date of publication
  • The full title of the source
  • For books, the city of publication
  • For articles or essays, the name of the periodical or book in which the article or essay appears
  • For magazine and journal articles, the volume number, issue number, and pages where the article appears
  • For sources on the web, the URL where the source is located

The references page is double spaced and lists entries in alphabetical order by the author’s last name. If an entry continues for more than one line, the second line and each subsequent line are indented five spaces. Review the following example. ( Chapter 13 “APA and MLA Documentation and Formatting” , Section 13.3 “Creating a References Section” provides extensive guidelines for formatting reference entries for different types of sources.)

References Section

In APA style, book and article titles are formatted in sentence case, not title case. Sentence case means that only the first word is capitalized, along with any proper nouns.

Key Takeaways

  • Following proper citation and formatting guidelines helps writers ensure that their work will be taken seriously, give proper credit to other authors for their work, and provide valuable information to readers.
  • Working ahead and taking care to cite sources correctly the first time are ways writers can save time during the editing stage of writing a research paper.
  • APA papers usually include an abstract that concisely summarizes the paper.
  • APA papers use a specific headings structure to provide a clear hierarchy of information.
  • In APA papers, in-text citations usually include the name(s) of the author(s) and the year of publication.
  • In-text citations correspond to entries in the references section, which provide detailed bibliographical information about a source.

Writing for Success Copyright © 2015 by University of Minnesota is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License , except where otherwise noted.

  • Writing Home
  • Writing Advice Home

Taking Notes from Research Reading

  • Printable PDF Version
  • Fair-Use Policy

If you take notes efficiently, you can read with more understanding and also save time and frustration when you come to write your paper. These are three main principles

1. Know what kind of ideas you need to record

Focus your approach to the topic before you start detailed research. Then you will read with a purpose in mind, and you will be able to sort out relevant ideas.

  • First, review the commonly known facts about your topic, and also become aware of the range of thinking and opinions on it. Review your class notes and textbook and browse in an encyclopaedia or other reference work.
  • Try making a preliminary list of the subtopics you would expect to find in your reading. These will guide your attention and may come in handy as labels for notes.
  • Choose a component or angle that interests you, perhaps one on which there is already some controversy. Now formulate your research question. It should allow for reasoning as well as gathering of information—not just what the proto-Iroquoians ate, for instance, but how valid the evidence is for early introduction of corn. You may even want to jot down a tentative thesis statement as a preliminary answer to your question. (See Using Thesis Statements .)
  • Then you will know what to look for in your research reading: facts and theories that help answer your question, and other people’s opinions about whether specific answers are good ones.

2. Don’t write down too much

Your essay must be an expression of your own thinking, not a patchwork of borrowed ideas. Plan therefore to invest your research time in understanding your sources and integrating them into your own thinking. Your note cards or note sheets will record only ideas that are relevant to your focus on the topic; and they will mostly summarize rather than quote.

  • Copy out exact words only when the ideas are memorably phrased or surprisingly expressed—when you might use them as actual quotations in your essay.
  • Otherwise, compress ideas in your own words . Paraphrasing word by word is a waste of time. Choose the most important ideas and write them down as labels or headings. Then fill in with a few subpoints that explain or exemplify.
  • Don’t depend on underlining and highlighting. Find your own words for notes in the margin (or on “sticky” notes).

3. Label your notes intelligently

Whether you use cards or pages for note-taking, take notes in a way that allows for later use.

  • Save bother later by developing the habit of recording bibliographic information in a master list when you begin looking at each source (don’t forget to note book and journal information on photocopies). Then you can quickly identify each note by the author’s name and page number; when you refer to sources in the essay you can fill in details of publication easily from your master list. Keep a format guide handy (see Documentation Formats ).
  • Try as far as possible to put notes on separate cards or sheets. This will let you label the topic of each note. Not only will that keep your notetaking focussed, but it will also allow for grouping and synthesizing of ideas later. It is especially satisfying to shuffle notes and see how the conjunctions create new ideas—yours.
  • Leave lots of space in your notes for comments of your own—questions and reactions as you read, second thoughts and cross-references when you look back at what you’ve written. These comments can become a virtual first draft of your paper.

Want to create or adapt books like this? Learn more about how Pressbooks supports open publishing practices.

Note-taking for Research

As you determine which sources you will rely on most, it is important to establish a system for keeping track of your sources and taking notes. There are several ways to go about it, and no one system is necessarily superior. What matters is that you keep materials in order; record bibliographical information you will need later; and take detailed, organized notes.

Keeping Track of Your Sources

As you conduct research, taking time to keep track of source information and to organize that information now will help ensure that you are not scrambling to find it at the last minute, which easily leads to problems ranging from incomplete essays to plagiarism. Throughout your research, record bibliographical information for each source as soon as you begin using it. Maintaining an electronic list (even by copying and pasting information) can be quick and efficient, but you may instead feel more in control of the information you’ve collected by using pen-and-paper methods, such as a notebook or note cards.

The table below shows the kinds of details you should record for commonly used source types. Use these details to develop a working bibliography —a preliminary list of sources that you will later use to develop the final Works Cited page of your essay.

Details for Commonly Used Source Types

Your research may involve less common types of sources not listed above. For additional information on citing different sources, see the chapter MLA Format and Citation.

Taking Notes Efficiently

Good researchers stay focused and organized as they gather information from sources. Before you begin taking notes, take a moment to step back and think about your goal as a researcher—to find information that will help you answer your research question. When you write your essay, you will present your conclusions about the subject supported by your research. That goal will determine what information you record and how you organize it.

Writers sometimes get caught up in taking extensive notes, so much so that they lose sight of how their notes relate to the questions and ideas they started out with. Remember that you do not need to write down every detail from your reading. Focus on finding and recording details that will help you answer your research questions. The following strategies will help you take notes efficiently.

Use Headings to Organize Ideas

Whether you use old-fashioned index cards or organize your notes using word-processing software, such as MS Word or Google Docs, record just one major point from each source at a time, and use a heading to summarize the information covered. Keep all your notes in one file, digital or otherwise. Doing so will help you identify connections among different pieces of information. It will also help you make connections between your notes and the research questions and subtopics you identified earlier.

Know When to Summarize, Paraphrase, or Directly Quote a Source

Your notes will fall under three categories—summary notes, paraphrased information, and direct quotations from your sources. Effective researchers make choices about which type of notes is most appropriate for their purpose.

  • Summary notes give an overview of the main ideas in a source in a few sentences or a short paragraph. A summary is considerably shorter than the original text and captures only the major ideas. Use summary notes when you do not need to record specific details but you intend to refer to broad concepts the author discusses.
  • Paraphrased notes restate a fact or idea from a source using your own words and sentence structure, particularly in a way that better suits your purpose and audience than the way the original source said it.
  • Direct quotations use the exact wording used by the original source and enclose the quoted material in quotation marks. It is a good strategy to copy direct quotations when an author expresses an idea in an especially lively or memorable way. However, do not rely exclusively on direct quotations in your note taking.

Most of your notes should be paraphrased from the original source. Paraphrasing as you take notes is usually a better strategy than copying direct quotations, because it forces you to think through the information in your source and understand it well enough to restate it. In short, it helps you stay engaged with the material instead of simply copying and pasting. For more information on this, see the section Summary, Paraphrasis, and Quotation.

Maintain Complete, Accurate Notes

Regardless of the format used, any notes you take should include enough information to help you organize ideas and locate them instantly in the original text if you need to review them. Make sure your notes include the vital bibliographic information noted above.

Throughout the process of taking notes, be scrupulous about making sure you have correctly attributed each idea to its source. Always include source information so you know exactly which ideas came from which sources. Use quotation marks to set off any words for phrases taken directly from the original text. If you add your own responses and ideas, make sure they are distinct from ideas you quoted or paraphrased.

Finally, make sure your notes accurately reflect the content of the original text. Make sure quoted material is copied verbatim. If you omit words from a quotation, use ellipses to show the omission and make sure the omission does not change the author’s meaning. Paraphrase ideas carefully, and check your paraphrased notes against the original text to make sure that you have restated the author’s ideas accurately in your own words. For more information on this, see the section Summary, Paraphrasis, and Quotation.

Use a System That Works for You

There are several formats you can use to take notes. No technique is necessarily better than the others—it is more important to choose a format you are comfortable using. Choosing the format that works best for you will ensure your notes are organized, complete, and accurate. Consider implementing one of these formats when you begin taking notes:

  • Use index cards. This traditional format involves writing each note on a separate index card. It takes more time than copying and pasting into an electronic document, which encourages you to be selective in choosing which ideas to record. Recording notes on separate cards makes it easy to later organize your notes according to major topics. Some writers color-code their cards to make them still more organized.
  • Use note-taking software. Word-processing and office software packages often include different types of note-taking software. Although you may need to set aside some time to learn the software, this method combines the speed of typing with the same degree of organization associated with handwritten note cards.
  • Maintain a research notebook. Instead of using index cards or electronic note cards, you may wish to keep a notebook or electronic folder, allotting a few pages (or one file) for each of your sources. This method makes it easy to create a separate column or section of the document where you add your responses to the information you encounter in your research.
  • Annotate your sources. This method involves making handwritten notes in the margins of sources that you have printed or photocopied. If using electronic sources, you can make comments within the source document. For example, you might add comment boxes to a PDF version of an article. This method works best for experienced researchers who have already thought a great deal about the topic because it can be difficult to organize your notes later when starting your draft.

Choose one of the methods from the list to use for taking notes. Continue gathering sources and taking notes. In the next section, you will learn strategies for organizing and synthesizing the information you have found.

The Writing Textbook Copyright © 2021 by Josh Woods, editor and contributor, as well as an unnamed author (by request from the original publisher), and other authors named separately is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License , except where otherwise noted.

Share This Book

  • USC Libraries
  • Research Guides

Organizing Your Social Sciences Research Paper

  • Footnotes or Endnotes?
  • Purpose of Guide
  • Design Flaws to Avoid
  • Independent and Dependent Variables
  • Glossary of Research Terms
  • Reading Research Effectively
  • Narrowing a Topic Idea
  • Broadening a Topic Idea
  • Extending the Timeliness of a Topic Idea
  • Academic Writing Style
  • Choosing a Title
  • Making an Outline
  • Paragraph Development
  • Research Process Video Series
  • Executive Summary
  • The C.A.R.S. Model
  • Background Information
  • The Research Problem/Question
  • Theoretical Framework
  • Citation Tracking
  • Content Alert Services
  • Evaluating Sources
  • Primary Sources
  • Secondary Sources
  • Tiertiary Sources
  • Scholarly vs. Popular Publications
  • Qualitative Methods
  • Quantitative Methods
  • Insiderness
  • Using Non-Textual Elements
  • Limitations of the Study
  • Common Grammar Mistakes
  • Writing Concisely
  • Avoiding Plagiarism
  • Further Readings
  • Generative AI and Writing
  • USC Libraries Tutorials and Other Guides
  • Bibliography

Endnote Note citing a particular source or making a brief explanatory comment placed at the end of a research paper and arranged sequentially in relation to where the reference appears in the paper.

Footnote Note citing a particular source or making a brief explanatory comment placed at the bottom of a page corresponding to the item cited in the corresponding text above.

Fiske, Robert Hartwell. To the Point: A Dictionary of Concise Writing . New York: W.W. Norton and Company, 2014.

Structure and Writing Style

Advantages of Using Endnotes

  • Endnotes are less distracting to the reader and allows the narrative to flow better.
  • Endnotes don't clutter up the page.
  • As a separate section of a research paper, endnotes allow the reader to read and contemplate all the notes at once.

Disadvantages of Using Endnotes

  • If you want to look at the text of a particular endnote, you have to flip to the end of the research paper to find the information.
  • Depending on how they are created [i.e., continuous numbering or numbers that start over for each chapter], you may have to remember the chapter number as well as the endnote number in order to find the correct one.
  • Endnotes may carry a negative connotation much like the proverbial "fine print" or hidden disclaimers in advertising. A reader may believe you are trying to hide something by burying it in a hard-to-find endnote.

Advantages of Using Footnotes

  • Readers interested in identifying the source or note can quickly glance down the page to find what they are looking for.
  • It allows the reader to immediately link the footnote to the subject of the text without having to take the time to find the note at the back of the paper.
  • Footnotes are automatically included when printing off specific pages.

Disadvantages of Using Footnotes

  • Footnotes can clutter up the page and, thus, negatively impact the overall look of the page.
  • If there are multiple columns, charts, or tables below only a small segment of text that includes a footnote, then you must decide where the footnotes should appear.
  • If the footnotes are lengthy, there's a risk they could dominate the page, although this issue is considered acceptable in legal scholarship.
  • Adding lengthy footnotes after the paper has been completed can alter the page where other sources are located [i.e., a long footnote can push text to the next page].
  • It is more difficult learning how to insert footnotes using your word processing program than simply adding endnotes at the end of your paper.

Things to keep in mind when considering using either endnotes or footnotes in your research paper :

1.    Footnotes are numbered consecutively throughout a research paper, except for those notes accompanying special material (e.g., figures, tables, charts, etc.). Numbering of footnotes are "superscript"--Arabic numbers typed slightly above the line of text. Do not include periods, parentheses, or slashes. They can follow all punctuation marks except dashes. In general, to avoid interrupting the continuity of the text, footnote numbers are placed at the end of the sentence, clause, or phrase containing the quoted or paraphrased material. 2.    Depending on the writing style used in your class, endnotes may take the place of a list of resources cited in your paper or they may represent non-bibliographic items, such as comments or observations, followed by a separate list of references to the sources you cited and arranged alphabetically by the author's last name. If you are unsure about how to use endnotes, consult with your professor. 3.    In general, the use of footnotes in most academic writing is now considered a bit outdated and has been replaced by endnotes, which are much easier to place in your paper, even with the advent of word processing programs. However, some disciplines, such as law and history, still predominantly utilize footnotes. Consult with your professor about which form to use and always remember that, whichever style of citation you choose, apply it consistently throughout your paper.

NOTE:   Always think critically about the information you place in a footnote or endnote. Ask yourself, is this supplementary or tangential information that would otherwise disrupt the narrative flow of the text or is this essential information that I should integrate into the main text? If you are not sure, it's better to work it into the text. Too many notes implies a disorganized paper.

Cermak, Bonni and Jennifer Troxell. A Guide to Footnotes and Endnotes for NASA History Authors . NASA History Program. History Division; Hale, Ali. Should You Use Footnotes or Endnotes? DailyWritingTips.com; Tables, Appendices, Footnotes and Endnotes. The Writing Lab and The OWL. Purdue University; Lunsford, Andrea A. and Robert Connors. The St. Martin's Handbook . New York: St. Martin's Press, 1989; Saller, Carol. “Endnotes or Footnotes? Some Considerations.” The Chronicle of Higher Education 58 (January 6, 2012): http://chronicle.com/blogs/linguafranca/2012/01/06/endnotes-or-footnotes-some-considerations/.

  • << Previous: Avoiding Plagiarism
  • Next: Further Readings >>
  • Last Updated: Apr 5, 2024 1:38 PM
  • URL: https://libguides.usc.edu/writingguide

How to Write and Publish a Research Paper for a Peer-Reviewed Journal

  • Open access
  • Published: 30 April 2020
  • Volume 36 , pages 909–913, ( 2021 )

Cite this article

You have full access to this open access article

  • Clara Busse   ORCID: orcid.org/0000-0002-0178-1000 1 &
  • Ella August   ORCID: orcid.org/0000-0001-5151-1036 1 , 2  

263k Accesses

15 Citations

706 Altmetric

Explore all metrics

Communicating research findings is an essential step in the research process. Often, peer-reviewed journals are the forum for such communication, yet many researchers are never taught how to write a publishable scientific paper. In this article, we explain the basic structure of a scientific paper and describe the information that should be included in each section. We also identify common pitfalls for each section and recommend strategies to avoid them. Further, we give advice about target journal selection and authorship. In the online resource 1 , we provide an example of a high-quality scientific paper, with annotations identifying the elements we describe in this article.

Similar content being viewed by others

author's note in research paper

Literature reviews as independent studies: guidelines for academic practice

Sascha Kraus, Matthias Breier, … João J. Ferreira

author's note in research paper

Plagiarism in research

Gert Helgesson & Stefan Eriksson

author's note in research paper

Open peer review: promoting transparency in open science

Dietmar Wolfram, Peiling Wang, … Hyoungjoo Park

Avoid common mistakes on your manuscript.

Introduction

Writing a scientific paper is an important component of the research process, yet researchers often receive little formal training in scientific writing. This is especially true in low-resource settings. In this article, we explain why choosing a target journal is important, give advice about authorship, provide a basic structure for writing each section of a scientific paper, and describe common pitfalls and recommendations for each section. In the online resource 1 , we also include an annotated journal article that identifies the key elements and writing approaches that we detail here. Before you begin your research, make sure you have ethical clearance from all relevant ethical review boards.

Select a Target Journal Early in the Writing Process

We recommend that you select a “target journal” early in the writing process; a “target journal” is the journal to which you plan to submit your paper. Each journal has a set of core readers and you should tailor your writing to this readership. For example, if you plan to submit a manuscript about vaping during pregnancy to a pregnancy-focused journal, you will need to explain what vaping is because readers of this journal may not have a background in this topic. However, if you were to submit that same article to a tobacco journal, you would not need to provide as much background information about vaping.

Information about a journal’s core readership can be found on its website, usually in a section called “About this journal” or something similar. For example, the Journal of Cancer Education presents such information on the “Aims and Scope” page of its website, which can be found here: https://www.springer.com/journal/13187/aims-and-scope .

Peer reviewer guidelines from your target journal are an additional resource that can help you tailor your writing to the journal and provide additional advice about crafting an effective article [ 1 ]. These are not always available, but it is worth a quick web search to find out.

Identify Author Roles Early in the Process

Early in the writing process, identify authors, determine the order of authors, and discuss the responsibilities of each author. Standard author responsibilities have been identified by The International Committee of Medical Journal Editors (ICMJE) [ 2 ]. To set clear expectations about each team member’s responsibilities and prevent errors in communication, we also suggest outlining more detailed roles, such as who will draft each section of the manuscript, write the abstract, submit the paper electronically, serve as corresponding author, and write the cover letter. It is best to formalize this agreement in writing after discussing it, circulating the document to the author team for approval. We suggest creating a title page on which all authors are listed in the agreed-upon order. It may be necessary to adjust authorship roles and order during the development of the paper. If a new author order is agreed upon, be sure to update the title page in the manuscript draft.

In the case where multiple papers will result from a single study, authors should discuss who will author each paper. Additionally, authors should agree on a deadline for each paper and the lead author should take responsibility for producing an initial draft by this deadline.

Structure of the Introduction Section

The introduction section should be approximately three to five paragraphs in length. Look at examples from your target journal to decide the appropriate length. This section should include the elements shown in Fig.  1 . Begin with a general context, narrowing to the specific focus of the paper. Include five main elements: why your research is important, what is already known about the topic, the “gap” or what is not yet known about the topic, why it is important to learn the new information that your research adds, and the specific research aim(s) that your paper addresses. Your research aim should address the gap you identified. Be sure to add enough background information to enable readers to understand your study. Table 1 provides common introduction section pitfalls and recommendations for addressing them.

figure 1

The main elements of the introduction section of an original research article. Often, the elements overlap

Methods Section

The purpose of the methods section is twofold: to explain how the study was done in enough detail to enable its replication and to provide enough contextual detail to enable readers to understand and interpret the results. In general, the essential elements of a methods section are the following: a description of the setting and participants, the study design and timing, the recruitment and sampling, the data collection process, the dataset, the dependent and independent variables, the covariates, the analytic approach for each research objective, and the ethical approval. The hallmark of an exemplary methods section is the justification of why each method was used. Table 2 provides common methods section pitfalls and recommendations for addressing them.

Results Section

The focus of the results section should be associations, or lack thereof, rather than statistical tests. Two considerations should guide your writing here. First, the results should present answers to each part of the research aim. Second, return to the methods section to ensure that the analysis and variables for each result have been explained.

Begin the results section by describing the number of participants in the final sample and details such as the number who were approached to participate, the proportion who were eligible and who enrolled, and the number of participants who dropped out. The next part of the results should describe the participant characteristics. After that, you may organize your results by the aim or by putting the most exciting results first. Do not forget to report your non-significant associations. These are still findings.

Tables and figures capture the reader’s attention and efficiently communicate your main findings [ 3 ]. Each table and figure should have a clear message and should complement, rather than repeat, the text. Tables and figures should communicate all salient details necessary for a reader to understand the findings without consulting the text. Include information on comparisons and tests, as well as information about the sample and timing of the study in the title, legend, or in a footnote. Note that figures are often more visually interesting than tables, so if it is feasible to make a figure, make a figure. To avoid confusing the reader, either avoid abbreviations in tables and figures, or define them in a footnote. Note that there should not be citations in the results section and you should not interpret results here. Table 3 provides common results section pitfalls and recommendations for addressing them.

Discussion Section

Opposite the introduction section, the discussion should take the form of a right-side-up triangle beginning with interpretation of your results and moving to general implications (Fig.  2 ). This section typically begins with a restatement of the main findings, which can usually be accomplished with a few carefully-crafted sentences.

figure 2

Major elements of the discussion section of an original research article. Often, the elements overlap

Next, interpret the meaning or explain the significance of your results, lifting the reader’s gaze from the study’s specific findings to more general applications. Then, compare these study findings with other research. Are these findings in agreement or disagreement with those from other studies? Does this study impart additional nuance to well-accepted theories? Situate your findings within the broader context of scientific literature, then explain the pathways or mechanisms that might give rise to, or explain, the results.

Journals vary in their approach to strengths and limitations sections: some are embedded paragraphs within the discussion section, while some mandate separate section headings. Keep in mind that every study has strengths and limitations. Candidly reporting yours helps readers to correctly interpret your research findings.

The next element of the discussion is a summary of the potential impacts and applications of the research. Should these results be used to optimally design an intervention? Does the work have implications for clinical protocols or public policy? These considerations will help the reader to further grasp the possible impacts of the presented work.

Finally, the discussion should conclude with specific suggestions for future work. Here, you have an opportunity to illuminate specific gaps in the literature that compel further study. Avoid the phrase “future research is necessary” because the recommendation is too general to be helpful to readers. Instead, provide substantive and specific recommendations for future studies. Table 4 provides common discussion section pitfalls and recommendations for addressing them.

Follow the Journal’s Author Guidelines

After you select a target journal, identify the journal’s author guidelines to guide the formatting of your manuscript and references. Author guidelines will often (but not always) include instructions for titles, cover letters, and other components of a manuscript submission. Read the guidelines carefully. If you do not follow the guidelines, your article will be sent back to you.

Finally, do not submit your paper to more than one journal at a time. Even if this is not explicitly stated in the author guidelines of your target journal, it is considered inappropriate and unprofessional.

Your title should invite readers to continue reading beyond the first page [ 4 , 5 ]. It should be informative and interesting. Consider describing the independent and dependent variables, the population and setting, the study design, the timing, and even the main result in your title. Because the focus of the paper can change as you write and revise, we recommend you wait until you have finished writing your paper before composing the title.

Be sure that the title is useful for potential readers searching for your topic. The keywords you select should complement those in your title to maximize the likelihood that a researcher will find your paper through a database search. Avoid using abbreviations in your title unless they are very well known, such as SNP, because it is more likely that someone will use a complete word rather than an abbreviation as a search term to help readers find your paper.

After you have written a complete draft, use the checklist (Fig. 3 ) below to guide your revisions and editing. Additional resources are available on writing the abstract and citing references [ 5 ]. When you feel that your work is ready, ask a trusted colleague or two to read the work and provide informal feedback. The box below provides a checklist that summarizes the key points offered in this article.

figure 3

Checklist for manuscript quality

Data Availability

Michalek AM (2014) Down the rabbit hole…advice to reviewers. J Cancer Educ 29:4–5

Article   Google Scholar  

International Committee of Medical Journal Editors. Defining the role of authors and contributors: who is an author? http://www.icmje.org/recommendations/browse/roles-and-responsibilities/defining-the-role-of-authosrs-and-contributors.html . Accessed 15 January, 2020

Vetto JT (2014) Short and sweet: a short course on concise medical writing. J Cancer Educ 29(1):194–195

Brett M, Kording K (2017) Ten simple rules for structuring papers. PLoS ComputBiol. https://doi.org/10.1371/journal.pcbi.1005619

Lang TA (2017) Writing a better research article. J Public Health Emerg. https://doi.org/10.21037/jphe.2017.11.06

Download references

Acknowledgments

Ella August is grateful to the Sustainable Sciences Institute for mentoring her in training researchers on writing and publishing their research.

Code Availability

Not applicable.

Author information

Authors and affiliations.

Department of Maternal and Child Health, University of North Carolina Gillings School of Global Public Health, 135 Dauer Dr, 27599, Chapel Hill, NC, USA

Clara Busse & Ella August

Department of Epidemiology, University of Michigan School of Public Health, 1415 Washington Heights, Ann Arbor, MI, 48109-2029, USA

Ella August

You can also search for this author in PubMed   Google Scholar

Corresponding author

Correspondence to Ella August .

Ethics declarations

Conflicts of interests.

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

(PDF 362 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ .

Reprints and permissions

About this article

Busse, C., August, E. How to Write and Publish a Research Paper for a Peer-Reviewed Journal. J Canc Educ 36 , 909–913 (2021). https://doi.org/10.1007/s13187-020-01751-z

Download citation

Published : 30 April 2020

Issue Date : October 2021

DOI : https://doi.org/10.1007/s13187-020-01751-z

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Manuscripts
  • Scientific writing
  • Find a journal
  • Publish with us
  • Track your research

U.S. flag

An official website of the United States government

The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

  • Publications
  • Account settings

Preview improvements coming to the PMC website in October 2024. Learn More or Try it out now .

  • Advanced Search
  • Journal List
  • Yale J Biol Med
  • v.84(3); 2011 Sep

Logo of yjbm

Focus: Education — Career Advice

How to write your first research paper.

Writing a research manuscript is an intimidating process for many novice writers in the sciences. One of the stumbling blocks is the beginning of the process and creating the first draft. This paper presents guidelines on how to initiate the writing process and draft each section of a research manuscript. The paper discusses seven rules that allow the writer to prepare a well-structured and comprehensive manuscript for a publication submission. In addition, the author lists different strategies for successful revision. Each of those strategies represents a step in the revision process and should help the writer improve the quality of the manuscript. The paper could be considered a brief manual for publication.

It is late at night. You have been struggling with your project for a year. You generated an enormous amount of interesting data. Your pipette feels like an extension of your hand, and running western blots has become part of your daily routine, similar to brushing your teeth. Your colleagues think you are ready to write a paper, and your lab mates tease you about your “slow” writing progress. Yet days pass, and you cannot force yourself to sit down to write. You have not written anything for a while (lab reports do not count), and you feel you have lost your stamina. How does the writing process work? How can you fit your writing into a daily schedule packed with experiments? What section should you start with? What distinguishes a good research paper from a bad one? How should you revise your paper? These and many other questions buzz in your head and keep you stressed. As a result, you procrastinate. In this paper, I will discuss the issues related to the writing process of a scientific paper. Specifically, I will focus on the best approaches to start a scientific paper, tips for writing each section, and the best revision strategies.

1. Schedule your writing time in Outlook

Whether you have written 100 papers or you are struggling with your first, starting the process is the most difficult part unless you have a rigid writing schedule. Writing is hard. It is a very difficult process of intense concentration and brain work. As stated in Hayes’ framework for the study of writing: “It is a generative activity requiring motivation, and it is an intellectual activity requiring cognitive processes and memory” [ 1 ]. In his book How to Write a Lot: A Practical Guide to Productive Academic Writing , Paul Silvia says that for some, “it’s easier to embalm the dead than to write an article about it” [ 2 ]. Just as with any type of hard work, you will not succeed unless you practice regularly. If you have not done physical exercises for a year, only regular workouts can get you into good shape again. The same kind of regular exercises, or I call them “writing sessions,” are required to be a productive author. Choose from 1- to 2-hour blocks in your daily work schedule and consider them as non-cancellable appointments. When figuring out which blocks of time will be set for writing, you should select the time that works best for this type of work. For many people, mornings are more productive. One Yale University graduate student spent a semester writing from 8 a.m. to 9 a.m. when her lab was empty. At the end of the semester, she was amazed at how much she accomplished without even interrupting her regular lab hours. In addition, doing the hardest task first thing in the morning contributes to the sense of accomplishment during the rest of the day. This positive feeling spills over into our work and life and has a very positive effect on our overall attitude.

Rule 1: Create regular time blocks for writing as appointments in your calendar and keep these appointments.

2. start with an outline.

Now that you have scheduled time, you need to decide how to start writing. The best strategy is to start with an outline. This will not be an outline that you are used to, with Roman numerals for each section and neat parallel listing of topic sentences and supporting points. This outline will be similar to a template for your paper. Initially, the outline will form a structure for your paper; it will help generate ideas and formulate hypotheses. Following the advice of George M. Whitesides, “. . . start with a blank piece of paper, and write down, in any order, all important ideas that occur to you concerning the paper” [ 3 ]. Use Table 1 as a starting point for your outline. Include your visuals (figures, tables, formulas, equations, and algorithms), and list your findings. These will constitute the first level of your outline, which will eventually expand as you elaborate.

The next stage is to add context and structure. Here you will group all your ideas into sections: Introduction, Methods, Results, and Discussion/Conclusion ( Table 2 ). This step will help add coherence to your work and sift your ideas.

Now that you have expanded your outline, you are ready for the next step: discussing the ideas for your paper with your colleagues and mentor. Many universities have a writing center where graduate students can schedule individual consultations and receive assistance with their paper drafts. Getting feedback during early stages of your draft can save a lot of time. Talking through ideas allows people to conceptualize and organize thoughts to find their direction without wasting time on unnecessary writing. Outlining is the most effective way of communicating your ideas and exchanging thoughts. Moreover, it is also the best stage to decide to which publication you will submit the paper. Many people come up with three choices and discuss them with their mentors and colleagues. Having a list of journal priorities can help you quickly resubmit your paper if your paper is rejected.

Rule 2: Create a detailed outline and discuss it with your mentor and peers.

3. continue with drafts.

After you get enough feedback and decide on the journal you will submit to, the process of real writing begins. Copy your outline into a separate file and expand on each of the points, adding data and elaborating on the details. When you create the first draft, do not succumb to the temptation of editing. Do not slow down to choose a better word or better phrase; do not halt to improve your sentence structure. Pour your ideas into the paper and leave revision and editing for later. As Paul Silvia explains, “Revising while you generate text is like drinking decaffeinated coffee in the early morning: noble idea, wrong time” [ 2 ].

Many students complain that they are not productive writers because they experience writer’s block. Staring at an empty screen is frustrating, but your screen is not really empty: You have a template of your article, and all you need to do is fill in the blanks. Indeed, writer’s block is a logical fallacy for a scientist ― it is just an excuse to procrastinate. When scientists start writing a research paper, they already have their files with data, lab notes with materials and experimental designs, some visuals, and tables with results. All they need to do is scrutinize these pieces and put them together into a comprehensive paper.

3.1. Starting with Materials and Methods

If you still struggle with starting a paper, then write the Materials and Methods section first. Since you have all your notes, it should not be problematic for you to describe the experimental design and procedures. Your most important goal in this section is to be as explicit as possible by providing enough detail and references. In the end, the purpose of this section is to allow other researchers to evaluate and repeat your work. So do not run into the same problems as the writers of the sentences in (1):

1a. Bacteria were pelleted by centrifugation. 1b. To isolate T cells, lymph nodes were collected.

As you can see, crucial pieces of information are missing: the speed of centrifuging your bacteria, the time, and the temperature in (1a); the source of lymph nodes for collection in (b). The sentences can be improved when information is added, as in (2a) and (2b), respectfully:

2a. Bacteria were pelleted by centrifugation at 3000g for 15 min at 25°C. 2b. To isolate T cells, mediastinal and mesenteric lymph nodes from Balb/c mice were collected at day 7 after immunization with ovabumin.

If your method has previously been published and is well-known, then you should provide only the literature reference, as in (3a). If your method is unpublished, then you need to make sure you provide all essential details, as in (3b).

3a. Stem cells were isolated, according to Johnson [23]. 3b. Stem cells were isolated using biotinylated carbon nanotubes coated with anti-CD34 antibodies.

Furthermore, cohesion and fluency are crucial in this section. One of the malpractices resulting in disrupted fluency is switching from passive voice to active and vice versa within the same paragraph, as shown in (4). This switching misleads and distracts the reader.

4. Behavioral computer-based experiments of Study 1 were programmed by using E-Prime. We took ratings of enjoyment, mood, and arousal as the patients listened to preferred pleasant music and unpreferred music by using Visual Analogue Scales (SI Methods). The preferred and unpreferred status of the music was operationalized along a continuum of pleasantness [ 4 ].

The problem with (4) is that the reader has to switch from the point of view of the experiment (passive voice) to the point of view of the experimenter (active voice). This switch causes confusion about the performer of the actions in the first and the third sentences. To improve the coherence and fluency of the paragraph above, you should be consistent in choosing the point of view: first person “we” or passive voice [ 5 ]. Let’s consider two revised examples in (5).

5a. We programmed behavioral computer-based experiments of Study 1 by using E-Prime. We took ratings of enjoyment, mood, and arousal by using Visual Analogue Scales (SI Methods) as the patients listened to preferred pleasant music and unpreferred music. We operationalized the preferred and unpreferred status of the music along a continuum of pleasantness. 5b. Behavioral computer-based experiments of Study 1 were programmed by using E-Prime. Ratings of enjoyment, mood, and arousal were taken as the patients listened to preferred pleasant music and unpreferred music by using Visual Analogue Scales (SI Methods). The preferred and unpreferred status of the music was operationalized along a continuum of pleasantness.

If you choose the point of view of the experimenter, then you may end up with repetitive “we did this” sentences. For many readers, paragraphs with sentences all beginning with “we” may also sound disruptive. So if you choose active sentences, you need to keep the number of “we” subjects to a minimum and vary the beginnings of the sentences [ 6 ].

Interestingly, recent studies have reported that the Materials and Methods section is the only section in research papers in which passive voice predominantly overrides the use of the active voice [ 5 , 7 , 8 , 9 ]. For example, Martínez shows a significant drop in active voice use in the Methods sections based on the corpus of 1 million words of experimental full text research articles in the biological sciences [ 7 ]. According to the author, the active voice patterned with “we” is used only as a tool to reveal personal responsibility for the procedural decisions in designing and performing experimental work. This means that while all other sections of the research paper use active voice, passive voice is still the most predominant in Materials and Methods sections.

Writing Materials and Methods sections is a meticulous and time consuming task requiring extreme accuracy and clarity. This is why when you complete your draft, you should ask for as much feedback from your colleagues as possible. Numerous readers of this section will help you identify the missing links and improve the technical style of this section.

Rule 3: Be meticulous and accurate in describing the Materials and Methods. Do not change the point of view within one paragraph.

3.2. writing results section.

For many authors, writing the Results section is more intimidating than writing the Materials and Methods section . If people are interested in your paper, they are interested in your results. That is why it is vital to use all your writing skills to objectively present your key findings in an orderly and logical sequence using illustrative materials and text.

Your Results should be organized into different segments or subsections where each one presents the purpose of the experiment, your experimental approach, data including text and visuals (tables, figures, schematics, algorithms, and formulas), and data commentary. For most journals, your data commentary will include a meaningful summary of the data presented in the visuals and an explanation of the most significant findings. This data presentation should not repeat the data in the visuals, but rather highlight the most important points. In the “standard” research paper approach, your Results section should exclude data interpretation, leaving it for the Discussion section. However, interpretations gradually and secretly creep into research papers: “Reducing the data, generalizing from the data, and highlighting scientific cases are all highly interpretive processes. It should be clear by now that we do not let the data speak for themselves in research reports; in summarizing our results, we interpret them for the reader” [ 10 ]. As a result, many journals including the Journal of Experimental Medicine and the Journal of Clinical Investigation use joint Results/Discussion sections, where results are immediately followed by interpretations.

Another important aspect of this section is to create a comprehensive and supported argument or a well-researched case. This means that you should be selective in presenting data and choose only those experimental details that are essential for your reader to understand your findings. You might have conducted an experiment 20 times and collected numerous records, but this does not mean that you should present all those records in your paper. You need to distinguish your results from your data and be able to discard excessive experimental details that could distract and confuse the reader. However, creating a picture or an argument should not be confused with data manipulation or falsification, which is a willful distortion of data and results. If some of your findings contradict your ideas, you have to mention this and find a plausible explanation for the contradiction.

In addition, your text should not include irrelevant and peripheral information, including overview sentences, as in (6).

6. To show our results, we first introduce all components of experimental system and then describe the outcome of infections.

Indeed, wordiness convolutes your sentences and conceals your ideas from readers. One common source of wordiness is unnecessary intensifiers. Adverbial intensifiers such as “clearly,” “essential,” “quite,” “basically,” “rather,” “fairly,” “really,” and “virtually” not only add verbosity to your sentences, but also lower your results’ credibility. They appeal to the reader’s emotions but lower objectivity, as in the common examples in (7):

7a. Table 3 clearly shows that … 7b. It is obvious from figure 4 that …

Another source of wordiness is nominalizations, i.e., nouns derived from verbs and adjectives paired with weak verbs including “be,” “have,” “do,” “make,” “cause,” “provide,” and “get” and constructions such as “there is/are.”

8a. We tested the hypothesis that there is a disruption of membrane asymmetry. 8b. In this paper we provide an argument that stem cells repopulate injured organs.

In the sentences above, the abstract nominalizations “disruption” and “argument” do not contribute to the clarity of the sentences, but rather clutter them with useless vocabulary that distracts from the meaning. To improve your sentences, avoid unnecessary nominalizations and change passive verbs and constructions into active and direct sentences.

9a. We tested the hypothesis that the membrane asymmetry is disrupted. 9b. In this paper we argue that stem cells repopulate injured organs.

Your Results section is the heart of your paper, representing a year or more of your daily research. So lead your reader through your story by writing direct, concise, and clear sentences.

Rule 4: Be clear, concise, and objective in describing your Results.

3.3. now it is time for your introduction.

Now that you are almost half through drafting your research paper, it is time to update your outline. While describing your Methods and Results, many of you diverged from the original outline and re-focused your ideas. So before you move on to create your Introduction, re-read your Methods and Results sections and change your outline to match your research focus. The updated outline will help you review the general picture of your paper, the topic, the main idea, and the purpose, which are all important for writing your introduction.

The best way to structure your introduction is to follow the three-move approach shown in Table 3 .

Adapted from Swales and Feak [ 11 ].

The moves and information from your outline can help to create your Introduction efficiently and without missing steps. These moves are traffic signs that lead the reader through the road of your ideas. Each move plays an important role in your paper and should be presented with deep thought and care. When you establish the territory, you place your research in context and highlight the importance of your research topic. By finding the niche, you outline the scope of your research problem and enter the scientific dialogue. The final move, “occupying the niche,” is where you explain your research in a nutshell and highlight your paper’s significance. The three moves allow your readers to evaluate their interest in your paper and play a significant role in the paper review process, determining your paper reviewers.

Some academic writers assume that the reader “should follow the paper” to find the answers about your methodology and your findings. As a result, many novice writers do not present their experimental approach and the major findings, wrongly believing that the reader will locate the necessary information later while reading the subsequent sections [ 5 ]. However, this “suspense” approach is not appropriate for scientific writing. To interest the reader, scientific authors should be direct and straightforward and present informative one-sentence summaries of the results and the approach.

Another problem is that writers understate the significance of the Introduction. Many new researchers mistakenly think that all their readers understand the importance of the research question and omit this part. However, this assumption is faulty because the purpose of the section is not to evaluate the importance of the research question in general. The goal is to present the importance of your research contribution and your findings. Therefore, you should be explicit and clear in describing the benefit of the paper.

The Introduction should not be long. Indeed, for most journals, this is a very brief section of about 250 to 600 words, but it might be the most difficult section due to its importance.

Rule 5: Interest your reader in the Introduction section by signalling all its elements and stating the novelty of the work.

3.4. discussion of the results.

For many scientists, writing a Discussion section is as scary as starting a paper. Most of the fear comes from the variation in the section. Since every paper has its unique results and findings, the Discussion section differs in its length, shape, and structure. However, some general principles of writing this section still exist. Knowing these rules, or “moves,” can change your attitude about this section and help you create a comprehensive interpretation of your results.

The purpose of the Discussion section is to place your findings in the research context and “to explain the meaning of the findings and why they are important, without appearing arrogant, condescending, or patronizing” [ 11 ]. The structure of the first two moves is almost a mirror reflection of the one in the Introduction. In the Introduction, you zoom in from general to specific and from the background to your research question; in the Discussion section, you zoom out from the summary of your findings to the research context, as shown in Table 4 .

Adapted from Swales and Feak and Hess [ 11 , 12 ].

The biggest challenge for many writers is the opening paragraph of the Discussion section. Following the moves in Table 1 , the best choice is to start with the study’s major findings that provide the answer to the research question in your Introduction. The most common starting phrases are “Our findings demonstrate . . .,” or “In this study, we have shown that . . .,” or “Our results suggest . . .” In some cases, however, reminding the reader about the research question or even providing a brief context and then stating the answer would make more sense. This is important in those cases where the researcher presents a number of findings or where more than one research question was presented. Your summary of the study’s major findings should be followed by your presentation of the importance of these findings. One of the most frequent mistakes of the novice writer is to assume the importance of his findings. Even if the importance is clear to you, it may not be obvious to your reader. Digesting the findings and their importance to your reader is as crucial as stating your research question.

Another useful strategy is to be proactive in the first move by predicting and commenting on the alternative explanations of the results. Addressing potential doubts will save you from painful comments about the wrong interpretation of your results and will present you as a thoughtful and considerate researcher. Moreover, the evaluation of the alternative explanations might help you create a logical step to the next move of the discussion section: the research context.

The goal of the research context move is to show how your findings fit into the general picture of the current research and how you contribute to the existing knowledge on the topic. This is also the place to discuss any discrepancies and unexpected findings that may otherwise distort the general picture of your paper. Moreover, outlining the scope of your research by showing the limitations, weaknesses, and assumptions is essential and adds modesty to your image as a scientist. However, make sure that you do not end your paper with the problems that override your findings. Try to suggest feasible explanations and solutions.

If your submission does not require a separate Conclusion section, then adding another paragraph about the “take-home message” is a must. This should be a general statement reiterating your answer to the research question and adding its scientific implications, practical application, or advice.

Just as in all other sections of your paper, the clear and precise language and concise comprehensive sentences are vital. However, in addition to that, your writing should convey confidence and authority. The easiest way to illustrate your tone is to use the active voice and the first person pronouns. Accompanied by clarity and succinctness, these tools are the best to convince your readers of your point and your ideas.

Rule 6: Present the principles, relationships, and generalizations in a concise and convincing tone.

4. choosing the best working revision strategies.

Now that you have created the first draft, your attitude toward your writing should have improved. Moreover, you should feel more confident that you are able to accomplish your project and submit your paper within a reasonable timeframe. You also have worked out your writing schedule and followed it precisely. Do not stop ― you are only at the midpoint from your destination. Just as the best and most precious diamond is no more than an unattractive stone recognized only by trained professionals, your ideas and your results may go unnoticed if they are not polished and brushed. Despite your attempts to present your ideas in a logical and comprehensive way, first drafts are frequently a mess. Use the advice of Paul Silvia: “Your first drafts should sound like they were hastily translated from Icelandic by a non-native speaker” [ 2 ]. The degree of your success will depend on how you are able to revise and edit your paper.

The revision can be done at the macrostructure and the microstructure levels [ 13 ]. The macrostructure revision includes the revision of the organization, content, and flow. The microstructure level includes individual words, sentence structure, grammar, punctuation, and spelling.

The best way to approach the macrostructure revision is through the outline of the ideas in your paper. The last time you updated your outline was before writing the Introduction and the Discussion. Now that you have the beginning and the conclusion, you can take a bird’s-eye view of the whole paper. The outline will allow you to see if the ideas of your paper are coherently structured, if your results are logically built, and if the discussion is linked to the research question in the Introduction. You will be able to see if something is missing in any of the sections or if you need to rearrange your information to make your point.

The next step is to revise each of the sections starting from the beginning. Ideally, you should limit yourself to working on small sections of about five pages at a time [ 14 ]. After these short sections, your eyes get used to your writing and your efficiency in spotting problems decreases. When reading for content and organization, you should control your urge to edit your paper for sentence structure and grammar and focus only on the flow of your ideas and logic of your presentation. Experienced researchers tend to make almost three times the number of changes to meaning than novice writers [ 15 , 16 ]. Revising is a difficult but useful skill, which academic writers obtain with years of practice.

In contrast to the macrostructure revision, which is a linear process and is done usually through a detailed outline and by sections, microstructure revision is a non-linear process. While the goal of the macrostructure revision is to analyze your ideas and their logic, the goal of the microstructure editing is to scrutinize the form of your ideas: your paragraphs, sentences, and words. You do not need and are not recommended to follow the order of the paper to perform this type of revision. You can start from the end or from different sections. You can even revise by reading sentences backward, sentence by sentence and word by word.

One of the microstructure revision strategies frequently used during writing center consultations is to read the paper aloud [ 17 ]. You may read aloud to yourself, to a tape recorder, or to a colleague or friend. When reading and listening to your paper, you are more likely to notice the places where the fluency is disrupted and where you stumble because of a very long and unclear sentence or a wrong connector.

Another revision strategy is to learn your common errors and to do a targeted search for them [ 13 ]. All writers have a set of problems that are specific to them, i.e., their writing idiosyncrasies. Remembering these problems is as important for an academic writer as remembering your friends’ birthdays. Create a list of these idiosyncrasies and run a search for these problems using your word processor. If your problem is demonstrative pronouns without summary words, then search for “this/these/those” in your text and check if you used the word appropriately. If you have a problem with intensifiers, then search for “really” or “very” and delete them from the text. The same targeted search can be done to eliminate wordiness. Searching for “there is/are” or “and” can help you avoid the bulky sentences.

The final strategy is working with a hard copy and a pencil. Print a double space copy with font size 14 and re-read your paper in several steps. Try reading your paper line by line with the rest of the text covered with a piece of paper. When you are forced to see only a small portion of your writing, you are less likely to get distracted and are more likely to notice problems. You will end up spotting more unnecessary words, wrongly worded phrases, or unparallel constructions.

After you apply all these strategies, you are ready to share your writing with your friends, colleagues, and a writing advisor in the writing center. Get as much feedback as you can, especially from non-specialists in your field. Patiently listen to what others say to you ― you are not expected to defend your writing or explain what you wanted to say. You may decide what you want to change and how after you receive the feedback and sort it in your head. Even though some researchers make the revision an endless process and can hardly stop after a 14th draft; having from five to seven drafts of your paper is a norm in the sciences. If you can’t stop revising, then set a deadline for yourself and stick to it. Deadlines always help.

Rule 7: Revise your paper at the macrostructure and the microstructure level using different strategies and techniques. Receive feedback and revise again.

5. it is time to submit.

It is late at night again. You are still in your lab finishing revisions and getting ready to submit your paper. You feel happy ― you have finally finished a year’s worth of work. You will submit your paper tomorrow, and regardless of the outcome, you know that you can do it. If one journal does not take your paper, you will take advantage of the feedback and resubmit again. You will have a publication, and this is the most important achievement.

What is even more important is that you have your scheduled writing time that you are going to keep for your future publications, for reading and taking notes, for writing grants, and for reviewing papers. You are not going to lose stamina this time, and you will become a productive scientist. But for now, let’s celebrate the end of the paper.

  • Hayes JR. In: The Science of Writing: Theories, Methods, Individual Differences, and Applications. Levy CM, Ransdell SE, editors. Mahwah, NJ: Lawrence Erlbaum; 1996. A new framework for understanding cognition and affect in writing; pp. 1–28. [ Google Scholar ]
  • Silvia PJ. How to Write a Lot. Washington, DC: American Psychological Association; 2007. [ Google Scholar ]
  • Whitesides GM. Whitesides’ Group: Writing a Paper. Adv Mater. 2004; 16 (15):1375–1377. [ Google Scholar ]
  • Soto D, Funes MJ, Guzmán-García A, Warbrick T, Rotshtein T, Humphreys GW. Pleasant music overcomes the loss of awareness in patients with visual neglect. Proc Natl Acad Sci USA. 2009; 106 (14):6011–6016. [ PMC free article ] [ PubMed ] [ Google Scholar ]
  • Hofmann AH. Scientific Writing and Communication. Papers, Proposals, and Presentations. New York: Oxford University Press; 2010. [ Google Scholar ]
  • Zeiger M. Essentials of Writing Biomedical Research Papers. 2nd edition. San Francisco, CA: McGraw-Hill Companies, Inc.; 2000. [ Google Scholar ]
  • Martínez I. Native and non-native writers’ use of first person pronouns in the different sections of biology research articles in English. Journal of Second Language Writing. 2005; 14 (3):174–190. [ Google Scholar ]
  • Rodman L. The Active Voice In Scientific Articles: Frequency And Discourse Functions. Journal Of Technical Writing And Communication. 1994; 24 (3):309–331. [ Google Scholar ]
  • Tarone LE, Dwyer S, Gillette S, Icke V. On the use of the passive in two astrophysics journal papers with extensions to other languages and other fields. English for Specific Purposes. 1998; 17 :113–132. [ Google Scholar ]
  • Penrose AM, Katz SB. Writing in the sciences: Exploring conventions of scientific discourse. New York: St. Martin’s Press; 1998. [ Google Scholar ]
  • Swales JM, Feak CB. Academic Writing for Graduate Students. 2nd edition. Ann Arbor: University of Michigan Press; 2004. [ Google Scholar ]
  • Hess DR. How to Write an Effective Discussion. Respiratory Care. 2004; 29 (10):1238–1241. [ PubMed ] [ Google Scholar ]
  • Belcher WL. Writing Your Journal Article in 12 Weeks: a guide to academic publishing success. Thousand Oaks, CA: SAGE Publications; 2009. [ Google Scholar ]
  • Single PB. Demystifying Dissertation Writing: A Streamlined Process of Choice of Topic to Final Text. Virginia: Stylus Publishing LLC; 2010. [ Google Scholar ]
  • Faigley L, Witte SP. Analyzing revision. Composition and Communication. 1981; 32 :400–414. [ Google Scholar ]
  • Flower LS, Hayes JR, Carey L, Schriver KS, Stratman J. Detection, diagnosis, and the strategies of revision. College Composition and Communication. 1986; 37 (1):16–55. [ Google Scholar ]
  • Young BR. In: A Tutor’s Guide: Helping Writers One to One. Rafoth B, editor. Portsmouth, NH: Boynton/Cook Publishers; 2005. Can You Proofread This? pp. 140–158. [ Google Scholar ]

Have a language expert improve your writing

Run a free plagiarism check in 10 minutes, generate accurate citations for free.

  • Knowledge Base
  • Research paper

Writing a Research Paper Introduction | Step-by-Step Guide

Published on September 24, 2022 by Jack Caulfield . Revised on March 27, 2023.

Writing a Research Paper Introduction

The introduction to a research paper is where you set up your topic and approach for the reader. It has several key goals:

  • Present your topic and get the reader interested
  • Provide background or summarize existing research
  • Position your own approach
  • Detail your specific research problem and problem statement
  • Give an overview of the paper’s structure

The introduction looks slightly different depending on whether your paper presents the results of original empirical research or constructs an argument by engaging with a variety of sources.

Instantly correct all language mistakes in your text

Upload your document to correct all your mistakes in minutes

upload-your-document-ai-proofreader

Table of contents

Step 1: introduce your topic, step 2: describe the background, step 3: establish your research problem, step 4: specify your objective(s), step 5: map out your paper, research paper introduction examples, frequently asked questions about the research paper introduction.

The first job of the introduction is to tell the reader what your topic is and why it’s interesting or important. This is generally accomplished with a strong opening hook.

The hook is a striking opening sentence that clearly conveys the relevance of your topic. Think of an interesting fact or statistic, a strong statement, a question, or a brief anecdote that will get the reader wondering about your topic.

For example, the following could be an effective hook for an argumentative paper about the environmental impact of cattle farming:

A more empirical paper investigating the relationship of Instagram use with body image issues in adolescent girls might use the following hook:

Don’t feel that your hook necessarily has to be deeply impressive or creative. Clarity and relevance are still more important than catchiness. The key thing is to guide the reader into your topic and situate your ideas.

Receive feedback on language, structure, and formatting

Professional editors proofread and edit your paper by focusing on:

  • Academic style
  • Vague sentences
  • Style consistency

See an example

author's note in research paper

This part of the introduction differs depending on what approach your paper is taking.

In a more argumentative paper, you’ll explore some general background here. In a more empirical paper, this is the place to review previous research and establish how yours fits in.

Argumentative paper: Background information

After you’ve caught your reader’s attention, specify a bit more, providing context and narrowing down your topic.

Provide only the most relevant background information. The introduction isn’t the place to get too in-depth; if more background is essential to your paper, it can appear in the body .

Empirical paper: Describing previous research

For a paper describing original research, you’ll instead provide an overview of the most relevant research that has already been conducted. This is a sort of miniature literature review —a sketch of the current state of research into your topic, boiled down to a few sentences.

This should be informed by genuine engagement with the literature. Your search can be less extensive than in a full literature review, but a clear sense of the relevant research is crucial to inform your own work.

Begin by establishing the kinds of research that have been done, and end with limitations or gaps in the research that you intend to respond to.

The next step is to clarify how your own research fits in and what problem it addresses.

Argumentative paper: Emphasize importance

In an argumentative research paper, you can simply state the problem you intend to discuss, and what is original or important about your argument.

Empirical paper: Relate to the literature

In an empirical research paper, try to lead into the problem on the basis of your discussion of the literature. Think in terms of these questions:

  • What research gap is your work intended to fill?
  • What limitations in previous work does it address?
  • What contribution to knowledge does it make?

You can make the connection between your problem and the existing research using phrases like the following.

Now you’ll get into the specifics of what you intend to find out or express in your research paper.

The way you frame your research objectives varies. An argumentative paper presents a thesis statement, while an empirical paper generally poses a research question (sometimes with a hypothesis as to the answer).

Argumentative paper: Thesis statement

The thesis statement expresses the position that the rest of the paper will present evidence and arguments for. It can be presented in one or two sentences, and should state your position clearly and directly, without providing specific arguments for it at this point.

Empirical paper: Research question and hypothesis

The research question is the question you want to answer in an empirical research paper.

Present your research question clearly and directly, with a minimum of discussion at this point. The rest of the paper will be taken up with discussing and investigating this question; here you just need to express it.

A research question can be framed either directly or indirectly.

  • This study set out to answer the following question: What effects does daily use of Instagram have on the prevalence of body image issues among adolescent girls?
  • We investigated the effects of daily Instagram use on the prevalence of body image issues among adolescent girls.

If your research involved testing hypotheses , these should be stated along with your research question. They are usually presented in the past tense, since the hypothesis will already have been tested by the time you are writing up your paper.

For example, the following hypothesis might respond to the research question above:

Here's why students love Scribbr's proofreading services

Discover proofreading & editing

The final part of the introduction is often dedicated to a brief overview of the rest of the paper.

In a paper structured using the standard scientific “introduction, methods, results, discussion” format, this isn’t always necessary. But if your paper is structured in a less predictable way, it’s important to describe the shape of it for the reader.

If included, the overview should be concise, direct, and written in the present tense.

  • This paper will first discuss several examples of survey-based research into adolescent social media use, then will go on to …
  • This paper first discusses several examples of survey-based research into adolescent social media use, then goes on to …

Full examples of research paper introductions are shown in the tabs below: one for an argumentative paper, the other for an empirical paper.

  • Argumentative paper
  • Empirical paper

Are cows responsible for climate change? A recent study (RIVM, 2019) shows that cattle farmers account for two thirds of agricultural nitrogen emissions in the Netherlands. These emissions result from nitrogen in manure, which can degrade into ammonia and enter the atmosphere. The study’s calculations show that agriculture is the main source of nitrogen pollution, accounting for 46% of the country’s total emissions. By comparison, road traffic and households are responsible for 6.1% each, the industrial sector for 1%. While efforts are being made to mitigate these emissions, policymakers are reluctant to reckon with the scale of the problem. The approach presented here is a radical one, but commensurate with the issue. This paper argues that the Dutch government must stimulate and subsidize livestock farmers, especially cattle farmers, to transition to sustainable vegetable farming. It first establishes the inadequacy of current mitigation measures, then discusses the various advantages of the results proposed, and finally addresses potential objections to the plan on economic grounds.

The rise of social media has been accompanied by a sharp increase in the prevalence of body image issues among women and girls. This correlation has received significant academic attention: Various empirical studies have been conducted into Facebook usage among adolescent girls (Tiggermann & Slater, 2013; Meier & Gray, 2014). These studies have consistently found that the visual and interactive aspects of the platform have the greatest influence on body image issues. Despite this, highly visual social media (HVSM) such as Instagram have yet to be robustly researched. This paper sets out to address this research gap. We investigated the effects of daily Instagram use on the prevalence of body image issues among adolescent girls. It was hypothesized that daily Instagram use would be associated with an increase in body image concerns and a decrease in self-esteem ratings.

The introduction of a research paper includes several key elements:

  • A hook to catch the reader’s interest
  • Relevant background on the topic
  • Details of your research problem

and your problem statement

  • A thesis statement or research question
  • Sometimes an overview of the paper

Don’t feel that you have to write the introduction first. The introduction is often one of the last parts of the research paper you’ll write, along with the conclusion.

This is because it can be easier to introduce your paper once you’ve already written the body ; you may not have the clearest idea of your arguments until you’ve written them, and things can change during the writing process .

The way you present your research problem in your introduction varies depending on the nature of your research paper . A research paper that presents a sustained argument will usually encapsulate this argument in a thesis statement .

A research paper designed to present the results of empirical research tends to present a research question that it seeks to answer. It may also include a hypothesis —a prediction that will be confirmed or disproved by your research.

Cite this Scribbr article

If you want to cite this source, you can copy and paste the citation or click the “Cite this Scribbr article” button to automatically add the citation to our free Citation Generator.

Caulfield, J. (2023, March 27). Writing a Research Paper Introduction | Step-by-Step Guide. Scribbr. Retrieved April 6, 2024, from https://www.scribbr.com/research-paper/research-paper-introduction/

Is this article helpful?

Jack Caulfield

Jack Caulfield

Other students also liked, writing strong research questions | criteria & examples, writing a research paper conclusion | step-by-step guide, research paper format | apa, mla, & chicago templates, unlimited academic ai-proofreading.

✔ Document error-free in 5minutes ✔ Unlimited document corrections ✔ Specialized in correcting academic texts

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • View all journals
  • My Account Login
  • Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • Open access
  • Published: 26 March 2024

Predicting and improving complex beer flavor through machine learning

  • Michiel Schreurs   ORCID: orcid.org/0000-0002-9449-5619 1 , 2 , 3   na1 ,
  • Supinya Piampongsant 1 , 2 , 3   na1 ,
  • Miguel Roncoroni   ORCID: orcid.org/0000-0001-7461-1427 1 , 2 , 3   na1 ,
  • Lloyd Cool   ORCID: orcid.org/0000-0001-9936-3124 1 , 2 , 3 , 4 ,
  • Beatriz Herrera-Malaver   ORCID: orcid.org/0000-0002-5096-9974 1 , 2 , 3 ,
  • Christophe Vanderaa   ORCID: orcid.org/0000-0001-7443-5427 4 ,
  • Florian A. Theßeling 1 , 2 , 3 ,
  • Łukasz Kreft   ORCID: orcid.org/0000-0001-7620-4657 5 ,
  • Alexander Botzki   ORCID: orcid.org/0000-0001-6691-4233 5 ,
  • Philippe Malcorps 6 ,
  • Luk Daenen 6 ,
  • Tom Wenseleers   ORCID: orcid.org/0000-0002-1434-861X 4 &
  • Kevin J. Verstrepen   ORCID: orcid.org/0000-0002-3077-6219 1 , 2 , 3  

Nature Communications volume  15 , Article number:  2368 ( 2024 ) Cite this article

49k Accesses

847 Altmetric

Metrics details

  • Chemical engineering
  • Gas chromatography
  • Machine learning
  • Metabolomics
  • Taste receptors

The perception and appreciation of food flavor depends on many interacting chemical compounds and external factors, and therefore proves challenging to understand and predict. Here, we combine extensive chemical and sensory analyses of 250 different beers to train machine learning models that allow predicting flavor and consumer appreciation. For each beer, we measure over 200 chemical properties, perform quantitative descriptive sensory analysis with a trained tasting panel and map data from over 180,000 consumer reviews to train 10 different machine learning models. The best-performing algorithm, Gradient Boosting, yields models that significantly outperform predictions based on conventional statistics and accurately predict complex food features and consumer appreciation from chemical profiles. Model dissection allows identifying specific and unexpected compounds as drivers of beer flavor and appreciation. Adding these compounds results in variants of commercial alcoholic and non-alcoholic beers with improved consumer appreciation. Together, our study reveals how big data and machine learning uncover complex links between food chemistry, flavor and consumer perception, and lays the foundation to develop novel, tailored foods with superior flavors.

Similar content being viewed by others

author's note in research paper

BitterSweet: Building machine learning models for predicting the bitter and sweet taste of small molecules

Rudraksh Tuwani, Somin Wadhwa & Ganesh Bagler

author's note in research paper

Sensory lexicon and aroma volatiles analysis of brewing malt

Xiaoxia Su, Miao Yu, … Tianyi Du

author's note in research paper

Predicting odor from molecular structure: a multi-label classification approach

Kushagra Saini & Venkatnarayan Ramanathan

Introduction

Predicting and understanding food perception and appreciation is one of the major challenges in food science. Accurate modeling of food flavor and appreciation could yield important opportunities for both producers and consumers, including quality control, product fingerprinting, counterfeit detection, spoilage detection, and the development of new products and product combinations (food pairing) 1 , 2 , 3 , 4 , 5 , 6 . Accurate models for flavor and consumer appreciation would contribute greatly to our scientific understanding of how humans perceive and appreciate flavor. Moreover, accurate predictive models would also facilitate and standardize existing food assessment methods and could supplement or replace assessments by trained and consumer tasting panels, which are variable, expensive and time-consuming 7 , 8 , 9 . Lastly, apart from providing objective, quantitative, accurate and contextual information that can help producers, models can also guide consumers in understanding their personal preferences 10 .

Despite the myriad of applications, predicting food flavor and appreciation from its chemical properties remains a largely elusive goal in sensory science, especially for complex food and beverages 11 , 12 . A key obstacle is the immense number of flavor-active chemicals underlying food flavor. Flavor compounds can vary widely in chemical structure and concentration, making them technically challenging and labor-intensive to quantify, even in the face of innovations in metabolomics, such as non-targeted metabolic fingerprinting 13 , 14 . Moreover, sensory analysis is perhaps even more complicated. Flavor perception is highly complex, resulting from hundreds of different molecules interacting at the physiochemical and sensorial level. Sensory perception is often non-linear, characterized by complex and concentration-dependent synergistic and antagonistic effects 15 , 16 , 17 , 18 , 19 , 20 , 21 that are further convoluted by the genetics, environment, culture and psychology of consumers 22 , 23 , 24 . Perceived flavor is therefore difficult to measure, with problems of sensitivity, accuracy, and reproducibility that can only be resolved by gathering sufficiently large datasets 25 . Trained tasting panels are considered the prime source of quality sensory data, but require meticulous training, are low throughput and high cost. Public databases containing consumer reviews of food products could provide a valuable alternative, especially for studying appreciation scores, which do not require formal training 25 . Public databases offer the advantage of amassing large amounts of data, increasing the statistical power to identify potential drivers of appreciation. However, public datasets suffer from biases, including a bias in the volunteers that contribute to the database, as well as confounding factors such as price, cult status and psychological conformity towards previous ratings of the product.

Classical multivariate statistics and machine learning methods have been used to predict flavor of specific compounds by, for example, linking structural properties of a compound to its potential biological activities or linking concentrations of specific compounds to sensory profiles 1 , 26 . Importantly, most previous studies focused on predicting organoleptic properties of single compounds (often based on their chemical structure) 27 , 28 , 29 , 30 , 31 , 32 , 33 , thus ignoring the fact that these compounds are present in a complex matrix in food or beverages and excluding complex interactions between compounds. Moreover, the classical statistics commonly used in sensory science 34 , 35 , 36 , 37 , 38 , 39 require a large sample size and sufficient variance amongst predictors to create accurate models. They are not fit for studying an extensive set of hundreds of interacting flavor compounds, since they are sensitive to outliers, have a high tendency to overfit and are less suited for non-linear and discontinuous relationships 40 .

In this study, we combine extensive chemical analyses and sensory data of a set of different commercial beers with machine learning approaches to develop models that predict taste, smell, mouthfeel and appreciation from compound concentrations. Beer is particularly suited to model the relationship between chemistry, flavor and appreciation. First, beer is a complex product, consisting of thousands of flavor compounds that partake in complex sensory interactions 41 , 42 , 43 . This chemical diversity arises from the raw materials (malt, yeast, hops, water and spices) and biochemical conversions during the brewing process (kilning, mashing, boiling, fermentation, maturation and aging) 44 , 45 . Second, the advent of the internet saw beer consumers embrace online review platforms, such as RateBeer (ZX Ventures, Anheuser-Busch InBev SA/NV) and BeerAdvocate (Next Glass, inc.). In this way, the beer community provides massive data sets of beer flavor and appreciation scores, creating extraordinarily large sensory databases to complement the analyses of our professional sensory panel. Specifically, we characterize over 200 chemical properties of 250 commercial beers, spread across 22 beer styles, and link these to the descriptive sensory profiling data of a 16-person in-house trained tasting panel and data acquired from over 180,000 public consumer reviews. These unique and extensive datasets enable us to train a suite of machine learning models to predict flavor and appreciation from a beer’s chemical profile. Dissection of the best-performing models allows us to pinpoint specific compounds as potential drivers of beer flavor and appreciation. Follow-up experiments confirm the importance of these compounds and ultimately allow us to significantly improve the flavor and appreciation of selected commercial beers. Together, our study represents a significant step towards understanding complex flavors and reinforces the value of machine learning to develop and refine complex foods. In this way, it represents a stepping stone for further computer-aided food engineering applications 46 .

To generate a comprehensive dataset on beer flavor, we selected 250 commercial Belgian beers across 22 different beer styles (Supplementary Fig.  S1 ). Beers with ≤ 4.2% alcohol by volume (ABV) were classified as non-alcoholic and low-alcoholic. Blonds and Tripels constitute a significant portion of the dataset (12.4% and 11.2%, respectively) reflecting their presence on the Belgian beer market and the heterogeneity of beers within these styles. By contrast, lager beers are less diverse and dominated by a handful of brands. Rare styles such as Brut or Faro make up only a small fraction of the dataset (2% and 1%, respectively) because fewer of these beers are produced and because they are dominated by distinct characteristics in terms of flavor and chemical composition.

Extensive analysis identifies relationships between chemical compounds in beer

For each beer, we measured 226 different chemical properties, including common brewing parameters such as alcohol content, iso-alpha acids, pH, sugar concentration 47 , and over 200 flavor compounds (Methods, Supplementary Table  S1 ). A large portion (37.2%) are terpenoids arising from hopping, responsible for herbal and fruity flavors 16 , 48 . A second major category are yeast metabolites, such as esters and alcohols, that result in fruity and solvent notes 48 , 49 , 50 . Other measured compounds are primarily derived from malt, or other microbes such as non- Saccharomyces yeasts and bacteria (‘wild flora’). Compounds that arise from spices or staling are labeled under ‘Others’. Five attributes (caloric value, total acids and total ester, hop aroma and sulfur compounds) are calculated from multiple individually measured compounds.

As a first step in identifying relationships between chemical properties, we determined correlations between the concentrations of the compounds (Fig.  1 , upper panel, Supplementary Data  1 and 2 , and Supplementary Fig.  S2 . For the sake of clarity, only a subset of the measured compounds is shown in Fig.  1 ). Compounds of the same origin typically show a positive correlation, while absence of correlation hints at parameters varying independently. For example, the hop aroma compounds citronellol, and alpha-terpineol show moderate correlations with each other (Spearman’s rho=0.39 and 0.57), but not with the bittering hop component iso-alpha acids (Spearman’s rho=0.16 and −0.07). This illustrates how brewers can independently modify hop aroma and bitterness by selecting hop varieties and dosage time. If hops are added early in the boiling phase, chemical conversions increase bitterness while aromas evaporate, conversely, late addition of hops preserves aroma but limits bitterness 51 . Similarly, hop-derived iso-alpha acids show a strong anti-correlation with lactic acid and acetic acid, likely reflecting growth inhibition of lactic acid and acetic acid bacteria, or the consequent use of fewer hops in sour beer styles, such as West Flanders ales and Fruit beers, that rely on these bacteria for their distinct flavors 52 . Finally, yeast-derived esters (ethyl acetate, ethyl decanoate, ethyl hexanoate, ethyl octanoate) and alcohols (ethanol, isoamyl alcohol, isobutanol, and glycerol), correlate with Spearman coefficients above 0.5, suggesting that these secondary metabolites are correlated with the yeast genetic background and/or fermentation parameters and may be difficult to influence individually, although the choice of yeast strain may offer some control 53 .

figure 1

Spearman rank correlations are shown. Descriptors are grouped according to their origin (malt (blue), hops (green), yeast (red), wild flora (yellow), Others (black)), and sensory aspect (aroma, taste, palate, and overall appreciation). Please note that for the chemical compounds, for the sake of clarity, only a subset of the total number of measured compounds is shown, with an emphasis on the key compounds for each source. For more details, see the main text and Methods section. Chemical data can be found in Supplementary Data  1 , correlations between all chemical compounds are depicted in Supplementary Fig.  S2 and correlation values can be found in Supplementary Data  2 . See Supplementary Data  4 for sensory panel assessments and Supplementary Data  5 for correlation values between all sensory descriptors.

Interestingly, different beer styles show distinct patterns for some flavor compounds (Supplementary Fig.  S3 ). These observations agree with expectations for key beer styles, and serve as a control for our measurements. For instance, Stouts generally show high values for color (darker), while hoppy beers contain elevated levels of iso-alpha acids, compounds associated with bitter hop taste. Acetic and lactic acid are not prevalent in most beers, with notable exceptions such as Kriek, Lambic, Faro, West Flanders ales and Flanders Old Brown, which use acid-producing bacteria ( Lactobacillus and Pediococcus ) or unconventional yeast ( Brettanomyces ) 54 , 55 . Glycerol, ethanol and esters show similar distributions across all beer styles, reflecting their common origin as products of yeast metabolism during fermentation 45 , 53 . Finally, low/no-alcohol beers contain low concentrations of glycerol and esters. This is in line with the production process for most of the low/no-alcohol beers in our dataset, which are produced through limiting fermentation or by stripping away alcohol via evaporation or dialysis, with both methods having the unintended side-effect of reducing the amount of flavor compounds in the final beer 56 , 57 .

Besides expected associations, our data also reveals less trivial associations between beer styles and specific parameters. For example, geraniol and citronellol, two monoterpenoids responsible for citrus, floral and rose flavors and characteristic of Citra hops, are found in relatively high amounts in Christmas, Saison, and Brett/co-fermented beers, where they may originate from terpenoid-rich spices such as coriander seeds instead of hops 58 .

Tasting panel assessments reveal sensorial relationships in beer

To assess the sensory profile of each beer, a trained tasting panel evaluated each of the 250 beers for 50 sensory attributes, including different hop, malt and yeast flavors, off-flavors and spices. Panelists used a tasting sheet (Supplementary Data  3 ) to score the different attributes. Panel consistency was evaluated by repeating 12 samples across different sessions and performing ANOVA. In 95% of cases no significant difference was found across sessions ( p  > 0.05), indicating good panel consistency (Supplementary Table  S2 ).

Aroma and taste perception reported by the trained panel are often linked (Fig.  1 , bottom left panel and Supplementary Data  4 and 5 ), with high correlations between hops aroma and taste (Spearman’s rho=0.83). Bitter taste was found to correlate with hop aroma and taste in general (Spearman’s rho=0.80 and 0.69), and particularly with “grassy” noble hops (Spearman’s rho=0.75). Barnyard flavor, most often associated with sour beers, is identified together with stale hops (Spearman’s rho=0.97) that are used in these beers. Lactic and acetic acid, which often co-occur, are correlated (Spearman’s rho=0.66). Interestingly, sweetness and bitterness are anti-correlated (Spearman’s rho = −0.48), confirming the hypothesis that they mask each other 59 , 60 . Beer body is highly correlated with alcohol (Spearman’s rho = 0.79), and overall appreciation is found to correlate with multiple aspects that describe beer mouthfeel (alcohol, carbonation; Spearman’s rho= 0.32, 0.39), as well as with hop and ester aroma intensity (Spearman’s rho=0.39 and 0.35).

Similar to the chemical analyses, sensorial analyses confirmed typical features of specific beer styles (Supplementary Fig.  S4 ). For example, sour beers (Faro, Flanders Old Brown, Fruit beer, Kriek, Lambic, West Flanders ale) were rated acidic, with flavors of both acetic and lactic acid. Hoppy beers were found to be bitter and showed hop-associated aromas like citrus and tropical fruit. Malt taste is most detected among scotch, stout/porters, and strong ales, while low/no-alcohol beers, which often have a reputation for being ‘worty’ (reminiscent of unfermented, sweet malt extract) appear in the middle. Unsurprisingly, hop aromas are most strongly detected among hoppy beers. Like its chemical counterpart (Supplementary Fig.  S3 ), acidity shows a right-skewed distribution, with the most acidic beers being Krieks, Lambics, and West Flanders ales.

Tasting panel assessments of specific flavors correlate with chemical composition

We find that the concentrations of several chemical compounds strongly correlate with specific aroma or taste, as evaluated by the tasting panel (Fig.  2 , Supplementary Fig.  S5 , Supplementary Data  6 ). In some cases, these correlations confirm expectations and serve as a useful control for data quality. For example, iso-alpha acids, the bittering compounds in hops, strongly correlate with bitterness (Spearman’s rho=0.68), while ethanol and glycerol correlate with tasters’ perceptions of alcohol and body, the mouthfeel sensation of fullness (Spearman’s rho=0.82/0.62 and 0.72/0.57 respectively) and darker color from roasted malts is a good indication of malt perception (Spearman’s rho=0.54).

figure 2

Heatmap colors indicate Spearman’s Rho. Axes are organized according to sensory categories (aroma, taste, mouthfeel, overall), chemical categories and chemical sources in beer (malt (blue), hops (green), yeast (red), wild flora (yellow), Others (black)). See Supplementary Data  6 for all correlation values.

Interestingly, for some relationships between chemical compounds and perceived flavor, correlations are weaker than expected. For example, the rose-smelling phenethyl acetate only weakly correlates with floral aroma. This hints at more complex relationships and interactions between compounds and suggests a need for a more complex model than simple correlations. Lastly, we uncovered unexpected correlations. For instance, the esters ethyl decanoate and ethyl octanoate appear to correlate slightly with hop perception and bitterness, possibly due to their fruity flavor. Iron is anti-correlated with hop aromas and bitterness, most likely because it is also anti-correlated with iso-alpha acids. This could be a sign of metal chelation of hop acids 61 , given that our analyses measure unbound hop acids and total iron content, or could result from the higher iron content in dark and Fruit beers, which typically have less hoppy and bitter flavors 62 .

Public consumer reviews complement expert panel data

To complement and expand the sensory data of our trained tasting panel, we collected 180,000 reviews of our 250 beers from the online consumer review platform RateBeer. This provided numerical scores for beer appearance, aroma, taste, palate, overall quality as well as the average overall score.

Public datasets are known to suffer from biases, such as price, cult status and psychological conformity towards previous ratings of a product. For example, prices correlate with appreciation scores for these online consumer reviews (rho=0.49, Supplementary Fig.  S6 ), but not for our trained tasting panel (rho=0.19). This suggests that prices affect consumer appreciation, which has been reported in wine 63 , while blind tastings are unaffected. Moreover, we observe that some beer styles, like lagers and non-alcoholic beers, generally receive lower scores, reflecting that online reviewers are mostly beer aficionados with a preference for specialty beers over lager beers. In general, we find a modest correlation between our trained panel’s overall appreciation score and the online consumer appreciation scores (Fig.  3 , rho=0.29). Apart from the aforementioned biases in the online datasets, serving temperature, sample freshness and surroundings, which are all tightly controlled during the tasting panel sessions, can vary tremendously across online consumers and can further contribute to (among others, appreciation) differences between the two categories of tasters. Importantly, in contrast to the overall appreciation scores, for many sensory aspects the results from the professional panel correlated well with results obtained from RateBeer reviews. Correlations were highest for features that are relatively easy to recognize even for untrained tasters, like bitterness, sweetness, alcohol and malt aroma (Fig.  3 and below).

figure 3

RateBeer text mining results can be found in Supplementary Data  7 . Rho values shown are Spearman correlation values, with asterisks indicating significant correlations ( p  < 0.05, two-sided). All p values were smaller than 0.001, except for Esters aroma (0.0553), Esters taste (0.3275), Esters aroma—banana (0.0019), Coriander (0.0508) and Diacetyl (0.0134).

Besides collecting consumer appreciation from these online reviews, we developed automated text analysis tools to gather additional data from review texts (Supplementary Data  7 ). Processing review texts on the RateBeer database yielded comparable results to the scores given by the trained panel for many common sensory aspects, including acidity, bitterness, sweetness, alcohol, malt, and hop tastes (Fig.  3 ). This is in line with what would be expected, since these attributes require less training for accurate assessment and are less influenced by environmental factors such as temperature, serving glass and odors in the environment. Consumer reviews also correlate well with our trained panel for 4-vinyl guaiacol, a compound associated with a very characteristic aroma. By contrast, correlations for more specific aromas like ester, coriander or diacetyl are underrepresented in the online reviews, underscoring the importance of using a trained tasting panel and standardized tasting sheets with explicit factors to be scored for evaluating specific aspects of a beer. Taken together, our results suggest that public reviews are trustworthy for some, but not all, flavor features and can complement or substitute taste panel data for these sensory aspects.

Models can predict beer sensory profiles from chemical data

The rich datasets of chemical analyses, tasting panel assessments and public reviews gathered in the first part of this study provided us with a unique opportunity to develop predictive models that link chemical data to sensorial features. Given the complexity of beer flavor, basic statistical tools such as correlations or linear regression may not always be the most suitable for making accurate predictions. Instead, we applied different machine learning models that can model both simple linear and complex interactive relationships. Specifically, we constructed a set of regression models to predict (a) trained panel scores for beer flavor and quality and (b) public reviews’ appreciation scores from beer chemical profiles. We trained and tested 10 different models (Methods), 3 linear regression-based models (simple linear regression with first-order interactions (LR), lasso regression with first-order interactions (Lasso), partial least squares regressor (PLSR)), 5 decision tree models (AdaBoost regressor (ABR), extra trees (ET), gradient boosting regressor (GBR), random forest (RF) and XGBoost regressor (XGBR)), 1 support vector regression (SVR), and 1 artificial neural network (ANN) model.

To compare the performance of our machine learning models, the dataset was randomly split into a training and test set, stratified by beer style. After a model was trained on data in the training set, its performance was evaluated on its ability to predict the test dataset obtained from multi-output models (based on the coefficient of determination, see Methods). Additionally, individual-attribute models were ranked per descriptor and the average rank was calculated, as proposed by Korneva et al. 64 . Importantly, both ways of evaluating the models’ performance agreed in general. Performance of the different models varied (Table  1 ). It should be noted that all models perform better at predicting RateBeer results than results from our trained tasting panel. One reason could be that sensory data is inherently variable, and this variability is averaged out with the large number of public reviews from RateBeer. Additionally, all tree-based models perform better at predicting taste than aroma. Linear models (LR) performed particularly poorly, with negative R 2 values, due to severe overfitting (training set R 2  = 1). Overfitting is a common issue in linear models with many parameters and limited samples, especially with interaction terms further amplifying the number of parameters. L1 regularization (Lasso) successfully overcomes this overfitting, out-competing multiple tree-based models on the RateBeer dataset. Similarly, the dimensionality reduction of PLSR avoids overfitting and improves performance, to some extent. Still, tree-based models (ABR, ET, GBR, RF and XGBR) show the best performance, out-competing the linear models (LR, Lasso, PLSR) commonly used in sensory science 65 .

GBR models showed the best overall performance in predicting sensory responses from chemical information, with R 2 values up to 0.75 depending on the predicted sensory feature (Supplementary Table  S4 ). The GBR models predict consumer appreciation (RateBeer) better than our trained panel’s appreciation (R 2 value of 0.67 compared to R 2 value of 0.09) (Supplementary Table  S3 and Supplementary Table  S4 ). ANN models showed intermediate performance, likely because neural networks typically perform best with larger datasets 66 . The SVR shows intermediate performance, mostly due to the weak predictions of specific attributes that lower the overall performance (Supplementary Table  S4 ).

Model dissection identifies specific, unexpected compounds as drivers of consumer appreciation

Next, we leveraged our models to infer important contributors to sensory perception and consumer appreciation. Consumer preference is a crucial sensory aspects, because a product that shows low consumer appreciation scores often does not succeed commercially 25 . Additionally, the requirement for a large number of representative evaluators makes consumer trials one of the more costly and time-consuming aspects of product development. Hence, a model for predicting chemical drivers of overall appreciation would be a welcome addition to the available toolbox for food development and optimization.

Since GBR models on our RateBeer dataset showed the best overall performance, we focused on these models. Specifically, we used two approaches to identify important contributors. First, rankings of the most important predictors for each sensorial trait in the GBR models were obtained based on impurity-based feature importance (mean decrease in impurity). High-ranked parameters were hypothesized to be either the true causal chemical properties underlying the trait, to correlate with the actual causal properties, or to take part in sensory interactions affecting the trait 67 (Fig.  4A ). In a second approach, we used SHAP 68 to determine which parameters contributed most to the model for making predictions of consumer appreciation (Fig.  4B ). SHAP calculates parameter contributions to model predictions on a per-sample basis, which can be aggregated into an importance score.

figure 4

A The impurity-based feature importance (mean deviance in impurity, MDI) calculated from the Gradient Boosting Regression (GBR) model predicting RateBeer appreciation scores. The top 15 highest ranked chemical properties are shown. B SHAP summary plot for the top 15 parameters contributing to our GBR model. Each point on the graph represents a sample from our dataset. The color represents the concentration of that parameter, with bluer colors representing low values and redder colors representing higher values. Greater absolute values on the horizontal axis indicate a higher impact of the parameter on the prediction of the model. C Spearman correlations between the 15 most important chemical properties and consumer overall appreciation. Numbers indicate the Spearman Rho correlation coefficient, and the rank of this correlation compared to all other correlations. The top 15 important compounds were determined using SHAP (panel B).

Both approaches identified ethyl acetate as the most predictive parameter for beer appreciation (Fig.  4 ). Ethyl acetate is the most abundant ester in beer with a typical ‘fruity’, ‘solvent’ and ‘alcoholic’ flavor, but is often considered less important than other esters like isoamyl acetate. The second most important parameter identified by SHAP is ethanol, the most abundant beer compound after water. Apart from directly contributing to beer flavor and mouthfeel, ethanol drastically influences the physical properties of beer, dictating how easily volatile compounds escape the beer matrix to contribute to beer aroma 69 . Importantly, it should also be noted that the importance of ethanol for appreciation is likely inflated by the very low appreciation scores of non-alcoholic beers (Supplementary Fig.  S4 ). Despite not often being considered a driver of beer appreciation, protein level also ranks highly in both approaches, possibly due to its effect on mouthfeel and body 70 . Lactic acid, which contributes to the tart taste of sour beers, is the fourth most important parameter identified by SHAP, possibly due to the generally high appreciation of sour beers in our dataset.

Interestingly, some of the most important predictive parameters for our model are not well-established as beer flavors or are even commonly regarded as being negative for beer quality. For example, our models identify methanethiol and ethyl phenyl acetate, an ester commonly linked to beer staling 71 , as a key factor contributing to beer appreciation. Although there is no doubt that high concentrations of these compounds are considered unpleasant, the positive effects of modest concentrations are not yet known 72 , 73 .

To compare our approach to conventional statistics, we evaluated how well the 15 most important SHAP-derived parameters correlate with consumer appreciation (Fig.  4C ). Interestingly, only 6 of the properties derived by SHAP rank amongst the top 15 most correlated parameters. For some chemical compounds, the correlations are so low that they would have likely been considered unimportant. For example, lactic acid, the fourth most important parameter, shows a bimodal distribution for appreciation, with sour beers forming a separate cluster, that is missed entirely by the Spearman correlation. Additionally, the correlation plots reveal outliers, emphasizing the need for robust analysis tools. Together, this highlights the need for alternative models, like the Gradient Boosting model, that better grasp the complexity of (beer) flavor.

Finally, to observe the relationships between these chemical properties and their predicted targets, partial dependence plots were constructed for the six most important predictors of consumer appreciation 74 , 75 , 76 (Supplementary Fig.  S7 ). One-way partial dependence plots show how a change in concentration affects the predicted appreciation. These plots reveal an important limitation of our models: appreciation predictions remain constant at ever-increasing concentrations. This implies that once a threshold concentration is reached, further increasing the concentration does not affect appreciation. This is false, as it is well-documented that certain compounds become unpleasant at high concentrations, including ethyl acetate (‘nail polish’) 77 and methanethiol (‘sulfury’ and ‘rotten cabbage’) 78 . The inability of our models to grasp that flavor compounds have optimal levels, above which they become negative, is a consequence of working with commercial beer brands where (off-)flavors are rarely too high to negatively impact the product. The two-way partial dependence plots show how changing the concentration of two compounds influences predicted appreciation, visualizing their interactions (Supplementary Fig.  S7 ). In our case, the top 5 parameters are dominated by additive or synergistic interactions, with high concentrations for both compounds resulting in the highest predicted appreciation.

To assess the robustness of our best-performing models and model predictions, we performed 100 iterations of the GBR, RF and ET models. In general, all iterations of the models yielded similar performance (Supplementary Fig.  S8 ). Moreover, the main predictors (including the top predictors ethanol and ethyl acetate) remained virtually the same, especially for GBR and RF. For the iterations of the ET model, we did observe more variation in the top predictors, which is likely a consequence of the model’s inherent random architecture in combination with co-correlations between certain predictors. However, even in this case, several of the top predictors (ethanol and ethyl acetate) remain unchanged, although their rank in importance changes (Supplementary Fig.  S8 ).

Next, we investigated if a combination of RateBeer and trained panel data into one consolidated dataset would lead to stronger models, under the hypothesis that such a model would suffer less from bias in the datasets. A GBR model was trained to predict appreciation on the combined dataset. This model underperformed compared to the RateBeer model, both in the native case and when including a dataset identifier (R 2  = 0.67, 0.26 and 0.42 respectively). For the latter, the dataset identifier is the most important feature (Supplementary Fig.  S9 ), while most of the feature importance remains unchanged, with ethyl acetate and ethanol ranking highest, like in the original model trained only on RateBeer data. It seems that the large variation in the panel dataset introduces noise, weakening the models’ performances and reliability. In addition, it seems reasonable to assume that both datasets are fundamentally different, with the panel dataset obtained by blind tastings by a trained professional panel.

Lastly, we evaluated whether beer style identifiers would further enhance the model’s performance. A GBR model was trained with parameters that explicitly encoded the styles of the samples. This did not improve model performance (R2 = 0.66 with style information vs R2 = 0.67). The most important chemical features are consistent with the model trained without style information (eg. ethanol and ethyl acetate), and with the exception of the most preferred (strong ale) and least preferred (low/no-alcohol) styles, none of the styles were among the most important features (Supplementary Fig.  S9 , Supplementary Table  S5 and S6 ). This is likely due to a combination of style-specific chemical signatures, such as iso-alpha acids and lactic acid, that implicitly convey style information to the original models, as well as the low number of samples belonging to some styles, making it difficult for the model to learn style-specific patterns. Moreover, beer styles are not rigorously defined, with some styles overlapping in features and some beers being misattributed to a specific style, all of which leads to more noise in models that use style parameters.

Model validation

To test if our predictive models give insight into beer appreciation, we set up experiments aimed at improving existing commercial beers. We specifically selected overall appreciation as the trait to be examined because of its complexity and commercial relevance. Beer flavor comprises a complex bouquet rather than single aromas and tastes 53 . Hence, adding a single compound to the extent that a difference is noticeable may lead to an unbalanced, artificial flavor. Therefore, we evaluated the effect of combinations of compounds. Because Blond beers represent the most extensive style in our dataset, we selected a beer from this style as the starting material for these experiments (Beer 64 in Supplementary Data  1 ).

In the first set of experiments, we adjusted the concentrations of compounds that made up the most important predictors of overall appreciation (ethyl acetate, ethanol, lactic acid, ethyl phenyl acetate) together with correlated compounds (ethyl hexanoate, isoamyl acetate, glycerol), bringing them up to 95 th percentile ethanol-normalized concentrations (Methods) within the Blond group (‘Spiked’ concentration in Fig.  5A ). Compared to controls, the spiked beers were found to have significantly improved overall appreciation among trained panelists, with panelist noting increased intensity of ester flavors, sweetness, alcohol, and body fullness (Fig.  5B ). To disentangle the contribution of ethanol to these results, a second experiment was performed without the addition of ethanol. This resulted in a similar outcome, including increased perception of alcohol and overall appreciation.

figure 5

Adding the top chemical compounds, identified as best predictors of appreciation by our model, into poorly appreciated beers results in increased appreciation from our trained panel. Results of sensory tests between base beers and those spiked with compounds identified as the best predictors by the model. A Blond and Non/Low-alcohol (0.0% ABV) base beers were brought up to 95th-percentile ethanol-normalized concentrations within each style. B For each sensory attribute, tasters indicated the more intense sample and selected the sample they preferred. The numbers above the bars correspond to the p values that indicate significant changes in perceived flavor (two-sided binomial test: alpha 0.05, n  = 20 or 13).

In a last experiment, we tested whether using the model’s predictions can boost the appreciation of a non-alcoholic beer (beer 223 in Supplementary Data  1 ). Again, the addition of a mixture of predicted compounds (omitting ethanol, in this case) resulted in a significant increase in appreciation, body, ester flavor and sweetness.

Predicting flavor and consumer appreciation from chemical composition is one of the ultimate goals of sensory science. A reliable, systematic and unbiased way to link chemical profiles to flavor and food appreciation would be a significant asset to the food and beverage industry. Such tools would substantially aid in quality control and recipe development, offer an efficient and cost-effective alternative to pilot studies and consumer trials and would ultimately allow food manufacturers to produce superior, tailor-made products that better meet the demands of specific consumer groups more efficiently.

A limited set of studies have previously tried, to varying degrees of success, to predict beer flavor and beer popularity based on (a limited set of) chemical compounds and flavors 79 , 80 . Current sensitive, high-throughput technologies allow measuring an unprecedented number of chemical compounds and properties in a large set of samples, yielding a dataset that can train models that help close the gaps between chemistry and flavor, even for a complex natural product like beer. To our knowledge, no previous research gathered data at this scale (250 samples, 226 chemical parameters, 50 sensory attributes and 5 consumer scores) to disentangle and validate the chemical aspects driving beer preference using various machine-learning techniques. We find that modern machine learning models outperform conventional statistical tools, such as correlations and linear models, and can successfully predict flavor appreciation from chemical composition. This could be attributed to the natural incorporation of interactions and non-linear or discontinuous effects in machine learning models, which are not easily grasped by the linear model architecture. While linear models and partial least squares regression represent the most widespread statistical approaches in sensory science, in part because they allow interpretation 65 , 81 , 82 , modern machine learning methods allow for building better predictive models while preserving the possibility to dissect and exploit the underlying patterns. Of the 10 different models we trained, tree-based models, such as our best performing GBR, showed the best overall performance in predicting sensory responses from chemical information, outcompeting artificial neural networks. This agrees with previous reports for models trained on tabular data 83 . Our results are in line with the findings of Colantonio et al. who also identified the gradient boosting architecture as performing best at predicting appreciation and flavor (of tomatoes and blueberries, in their specific study) 26 . Importantly, besides our larger experimental scale, we were able to directly confirm our models’ predictions in vivo.

Our study confirms that flavor compound concentration does not always correlate with perception, suggesting complex interactions that are often missed by more conventional statistics and simple models. Specifically, we find that tree-based algorithms may perform best in developing models that link complex food chemistry with aroma. Furthermore, we show that massive datasets of untrained consumer reviews provide a valuable source of data, that can complement or even replace trained tasting panels, especially for appreciation and basic flavors, such as sweetness and bitterness. This holds despite biases that are known to occur in such datasets, such as price or conformity bias. Moreover, GBR models predict taste better than aroma. This is likely because taste (e.g. bitterness) often directly relates to the corresponding chemical measurements (e.g., iso-alpha acids), whereas such a link is less clear for aromas, which often result from the interplay between multiple volatile compounds. We also find that our models are best at predicting acidity and alcohol, likely because there is a direct relation between the measured chemical compounds (acids and ethanol) and the corresponding perceived sensorial attribute (acidity and alcohol), and because even untrained consumers are generally able to recognize these flavors and aromas.

The predictions of our final models, trained on review data, hold even for blind tastings with small groups of trained tasters, as demonstrated by our ability to validate specific compounds as drivers of beer flavor and appreciation. Since adding a single compound to the extent of a noticeable difference may result in an unbalanced flavor profile, we specifically tested our identified key drivers as a combination of compounds. While this approach does not allow us to validate if a particular single compound would affect flavor and/or appreciation, our experiments do show that this combination of compounds increases consumer appreciation.

It is important to stress that, while it represents an important step forward, our approach still has several major limitations. A key weakness of the GBR model architecture is that amongst co-correlating variables, the largest main effect is consistently preferred for model building. As a result, co-correlating variables often have artificially low importance scores, both for impurity and SHAP-based methods, like we observed in the comparison to the more randomized Extra Trees models. This implies that chemicals identified as key drivers of a specific sensory feature by GBR might not be the true causative compounds, but rather co-correlate with the actual causative chemical. For example, the high importance of ethyl acetate could be (partially) attributed to the total ester content, ethanol or ethyl hexanoate (rho=0.77, rho=0.72 and rho=0.68), while ethyl phenylacetate could hide the importance of prenyl isobutyrate and ethyl benzoate (rho=0.77 and rho=0.76). Expanding our GBR model to include beer style as a parameter did not yield additional power or insight. This is likely due to style-specific chemical signatures, such as iso-alpha acids and lactic acid, that implicitly convey style information to the original model, as well as the smaller sample size per style, limiting the power to uncover style-specific patterns. This can be partly attributed to the curse of dimensionality, where the high number of parameters results in the models mainly incorporating single parameter effects, rather than complex interactions such as style-dependent effects 67 . A larger number of samples may overcome some of these limitations and offer more insight into style-specific effects. On the other hand, beer style is not a rigid scientific classification, and beers within one style often differ a lot, which further complicates the analysis of style as a model factor.

Our study is limited to beers from Belgian breweries. Although these beers cover a large portion of the beer styles available globally, some beer styles and consumer patterns may be missing, while other features might be overrepresented. For example, many Belgian ales exhibit yeast-driven flavor profiles, which is reflected in the chemical drivers of appreciation discovered by this study. In future work, expanding the scope to include diverse markets and beer styles could lead to the identification of even more drivers of appreciation and better models for special niche products that were not present in our beer set.

In addition to inherent limitations of GBR models, there are also some limitations associated with studying food aroma. Even if our chemical analyses measured most of the known aroma compounds, the total number of flavor compounds in complex foods like beer is still larger than the subset we were able to measure in this study. For example, hop-derived thiols, that influence flavor at very low concentrations, are notoriously difficult to measure in a high-throughput experiment. Moreover, consumer perception remains subjective and prone to biases that are difficult to avoid. It is also important to stress that the models are still immature and that more extensive datasets will be crucial for developing more complete models in the future. Besides more samples and parameters, our dataset does not include any demographic information about the tasters. Including such data could lead to better models that grasp external factors like age and culture. Another limitation is that our set of beers consists of high-quality end-products and lacks beers that are unfit for sale, which limits the current model in accurately predicting products that are appreciated very badly. Finally, while models could be readily applied in quality control, their use in sensory science and product development is restrained by their inability to discern causal relationships. Given that the models cannot distinguish compounds that genuinely drive consumer perception from those that merely correlate, validation experiments are essential to identify true causative compounds.

Despite the inherent limitations, dissection of our models enabled us to pinpoint specific molecules as potential drivers of beer aroma and consumer appreciation, including compounds that were unexpected and would not have been identified using standard approaches. Important drivers of beer appreciation uncovered by our models include protein levels, ethyl acetate, ethyl phenyl acetate and lactic acid. Currently, many brewers already use lactic acid to acidify their brewing water and ensure optimal pH for enzymatic activity during the mashing process. Our results suggest that adding lactic acid can also improve beer appreciation, although its individual effect remains to be tested. Interestingly, ethanol appears to be unnecessary to improve beer appreciation, both for blond beer and alcohol-free beer. Given the growing consumer interest in alcohol-free beer, with a predicted annual market growth of >7% 84 , it is relevant for brewers to know what compounds can further increase consumer appreciation of these beers. Hence, our model may readily provide avenues to further improve the flavor and consumer appreciation of both alcoholic and non-alcoholic beers, which is generally considered one of the key challenges for future beer production.

Whereas we see a direct implementation of our results for the development of superior alcohol-free beverages and other food products, our study can also serve as a stepping stone for the development of novel alcohol-containing beverages. We want to echo the growing body of scientific evidence for the negative effects of alcohol consumption, both on the individual level by the mutagenic, teratogenic and carcinogenic effects of ethanol 85 , 86 , as well as the burden on society caused by alcohol abuse and addiction. We encourage the use of our results for the production of healthier, tastier products, including novel and improved beverages with lower alcohol contents. Furthermore, we strongly discourage the use of these technologies to improve the appreciation or addictive properties of harmful substances.

The present work demonstrates that despite some important remaining hurdles, combining the latest developments in chemical analyses, sensory analysis and modern machine learning methods offers exciting avenues for food chemistry and engineering. Soon, these tools may provide solutions in quality control and recipe development, as well as new approaches to sensory science and flavor research.

Beer selection

250 commercial Belgian beers were selected to cover the broad diversity of beer styles and corresponding diversity in chemical composition and aroma. See Supplementary Fig.  S1 .

Chemical dataset

Sample preparation.

Beers within their expiration date were purchased from commercial retailers. Samples were prepared in biological duplicates at room temperature, unless explicitly stated otherwise. Bottle pressure was measured with a manual pressure device (Steinfurth Mess-Systeme GmbH) and used to calculate CO 2 concentration. The beer was poured through two filter papers (Macherey-Nagel, 500713032 MN 713 ¼) to remove carbon dioxide and prevent spontaneous foaming. Samples were then prepared for measurements by targeted Headspace-Gas Chromatography-Flame Ionization Detector/Flame Photometric Detector (HS-GC-FID/FPD), Headspace-Solid Phase Microextraction-Gas Chromatography-Mass Spectrometry (HS-SPME-GC-MS), colorimetric analysis, enzymatic analysis, Near-Infrared (NIR) analysis, as described in the sections below. The mean values of biological duplicates are reported for each compound.

HS-GC-FID/FPD

HS-GC-FID/FPD (Shimadzu GC 2010 Plus) was used to measure higher alcohols, acetaldehyde, esters, 4-vinyl guaicol, and sulfur compounds. Each measurement comprised 5 ml of sample pipetted into a 20 ml glass vial containing 1.75 g NaCl (VWR, 27810.295). 100 µl of 2-heptanol (Sigma-Aldrich, H3003) (internal standard) solution in ethanol (Fisher Chemical, E/0650DF/C17) was added for a final concentration of 2.44 mg/L. Samples were flushed with nitrogen for 10 s, sealed with a silicone septum, stored at −80 °C and analyzed in batches of 20.

The GC was equipped with a DB-WAXetr column (length, 30 m; internal diameter, 0.32 mm; layer thickness, 0.50 µm; Agilent Technologies, Santa Clara, CA, USA) to the FID and an HP-5 column (length, 30 m; internal diameter, 0.25 mm; layer thickness, 0.25 µm; Agilent Technologies, Santa Clara, CA, USA) to the FPD. N 2 was used as the carrier gas. Samples were incubated for 20 min at 70 °C in the headspace autosampler (Flow rate, 35 cm/s; Injection volume, 1000 µL; Injection mode, split; Combi PAL autosampler, CTC analytics, Switzerland). The injector, FID and FPD temperatures were kept at 250 °C. The GC oven temperature was first held at 50 °C for 5 min and then allowed to rise to 80 °C at a rate of 5 °C/min, followed by a second ramp of 4 °C/min until 200 °C kept for 3 min and a final ramp of (4 °C/min) until 230 °C for 1 min. Results were analyzed with the GCSolution software version 2.4 (Shimadzu, Kyoto, Japan). The GC was calibrated with a 5% EtOH solution (VWR International) containing the volatiles under study (Supplementary Table  S7 ).

HS-SPME-GC-MS

HS-SPME-GC-MS (Shimadzu GCMS-QP-2010 Ultra) was used to measure additional volatile compounds, mainly comprising terpenoids and esters. Samples were analyzed by HS-SPME using a triphase DVB/Carboxen/PDMS 50/30 μm SPME fiber (Supelco Co., Bellefonte, PA, USA) followed by gas chromatography (Thermo Fisher Scientific Trace 1300 series, USA) coupled to a mass spectrometer (Thermo Fisher Scientific ISQ series MS) equipped with a TriPlus RSH autosampler. 5 ml of degassed beer sample was placed in 20 ml vials containing 1.75 g NaCl (VWR, 27810.295). 5 µl internal standard mix was added, containing 2-heptanol (1 g/L) (Sigma-Aldrich, H3003), 4-fluorobenzaldehyde (1 g/L) (Sigma-Aldrich, 128376), 2,3-hexanedione (1 g/L) (Sigma-Aldrich, 144169) and guaiacol (1 g/L) (Sigma-Aldrich, W253200) in ethanol (Fisher Chemical, E/0650DF/C17). Each sample was incubated at 60 °C in the autosampler oven with constant agitation. After 5 min equilibration, the SPME fiber was exposed to the sample headspace for 30 min. The compounds trapped on the fiber were thermally desorbed in the injection port of the chromatograph by heating the fiber for 15 min at 270 °C.

The GC-MS was equipped with a low polarity RXi-5Sil MS column (length, 20 m; internal diameter, 0.18 mm; layer thickness, 0.18 µm; Restek, Bellefonte, PA, USA). Injection was performed in splitless mode at 320 °C, a split flow of 9 ml/min, a purge flow of 5 ml/min and an open valve time of 3 min. To obtain a pulsed injection, a programmed gas flow was used whereby the helium gas flow was set at 2.7 mL/min for 0.1 min, followed by a decrease in flow of 20 ml/min to the normal 0.9 mL/min. The temperature was first held at 30 °C for 3 min and then allowed to rise to 80 °C at a rate of 7 °C/min, followed by a second ramp of 2 °C/min till 125 °C and a final ramp of 8 °C/min with a final temperature of 270 °C.

Mass acquisition range was 33 to 550 amu at a scan rate of 5 scans/s. Electron impact ionization energy was 70 eV. The interface and ion source were kept at 275 °C and 250 °C, respectively. A mix of linear n-alkanes (from C7 to C40, Supelco Co.) was injected into the GC-MS under identical conditions to serve as external retention index markers. Identification and quantification of the compounds were performed using an in-house developed R script as described in Goelen et al. and Reher et al. 87 , 88 (for package information, see Supplementary Table  S8 ). Briefly, chromatograms were analyzed using AMDIS (v2.71) 89 to separate overlapping peaks and obtain pure compound spectra. The NIST MS Search software (v2.0 g) in combination with the NIST2017, FFNSC3 and Adams4 libraries were used to manually identify the empirical spectra, taking into account the expected retention time. After background subtraction and correcting for retention time shifts between samples run on different days based on alkane ladders, compound elution profiles were extracted and integrated using a file with 284 target compounds of interest, which were either recovered in our identified AMDIS list of spectra or were known to occur in beer. Compound elution profiles were estimated for every peak in every chromatogram over a time-restricted window using weighted non-negative least square analysis after which peak areas were integrated 87 , 88 . Batch effect correction was performed by normalizing against the most stable internal standard compound, 4-fluorobenzaldehyde. Out of all 284 target compounds that were analyzed, 167 were visually judged to have reliable elution profiles and were used for final analysis.

Discrete photometric and enzymatic analysis

Discrete photometric and enzymatic analysis (Thermo Scientific TM Gallery TM Plus Beermaster Discrete Analyzer) was used to measure acetic acid, ammonia, beta-glucan, iso-alpha acids, color, sugars, glycerol, iron, pH, protein, and sulfite. 2 ml of sample volume was used for the analyses. Information regarding the reagents and standard solutions used for analyses and calibrations is included in Supplementary Table  S7 and Supplementary Table  S9 .

NIR analyses

NIR analysis (Anton Paar Alcolyzer Beer ME System) was used to measure ethanol. Measurements comprised 50 ml of sample, and a 10% EtOH solution was used for calibration.

Correlation calculations

Pairwise Spearman Rank correlations were calculated between all chemical properties.

Sensory dataset

Trained panel.

Our trained tasting panel consisted of volunteers who gave prior verbal informed consent. All compounds used for the validation experiment were of food-grade quality. The tasting sessions were approved by the Social and Societal Ethics Committee of the KU Leuven (G-2022-5677-R2(MAR)). All online reviewers agreed to the Terms and Conditions of the RateBeer website.

Sensory analysis was performed according to the American Society of Brewing Chemists (ASBC) Sensory Analysis Methods 90 . 30 volunteers were screened through a series of triangle tests. The sixteen most sensitive and consistent tasters were retained as taste panel members. The resulting panel was diverse in age [22–42, mean: 29], sex [56% male] and nationality [7 different countries]. The panel developed a consensus vocabulary to describe beer aroma, taste and mouthfeel. Panelists were trained to identify and score 50 different attributes, using a 7-point scale to rate attributes’ intensity. The scoring sheet is included as Supplementary Data  3 . Sensory assessments took place between 10–12 a.m. The beers were served in black-colored glasses. Per session, between 5 and 12 beers of the same style were tasted at 12 °C to 16 °C. Two reference beers were added to each set and indicated as ‘Reference 1 & 2’, allowing panel members to calibrate their ratings. Not all panelists were present at every tasting. Scores were scaled by standard deviation and mean-centered per taster. Values are represented as z-scores and clustered by Euclidean distance. Pairwise Spearman correlations were calculated between taste and aroma sensory attributes. Panel consistency was evaluated by repeating samples on different sessions and performing ANOVA to identify differences, using the ‘stats’ package (v4.2.2) in R (for package information, see Supplementary Table  S8 ).

Online reviews from a public database

The ‘scrapy’ package in Python (v3.6) (for package information, see Supplementary Table  S8 ). was used to collect 232,288 online reviews (mean=922, min=6, max=5343) from RateBeer, an online beer review database. Each review entry comprised 5 numerical scores (appearance, aroma, taste, palate and overall quality) and an optional review text. The total number of reviews per reviewer was collected separately. Numerical scores were scaled and centered per rater, and mean scores were calculated per beer.

For the review texts, the language was estimated using the packages ‘langdetect’ and ‘langid’ in Python. Reviews that were classified as English by both packages were kept. Reviewers with fewer than 100 entries overall were discarded. 181,025 reviews from >6000 reviewers from >40 countries remained. Text processing was done using the ‘nltk’ package in Python. Texts were corrected for slang and misspellings; proper nouns and rare words that are relevant to the beer context were specified and kept as-is (‘Chimay’,’Lambic’, etc.). A dictionary of semantically similar sensorial terms, for example ‘floral’ and ‘flower’, was created and collapsed together into one term. Words were stemmed and lemmatized to avoid identifying words such as ‘acid’ and ‘acidity’ as separate terms. Numbers and punctuation were removed.

Sentences from up to 50 randomly chosen reviews per beer were manually categorized according to the aspect of beer they describe (appearance, aroma, taste, palate, overall quality—not to be confused with the 5 numerical scores described above) or flagged as irrelevant if they contained no useful information. If a beer contained fewer than 50 reviews, all reviews were manually classified. This labeled data set was used to train a model that classified the rest of the sentences for all beers 91 . Sentences describing taste and aroma were extracted, and term frequency–inverse document frequency (TFIDF) was implemented to calculate enrichment scores for sensorial words per beer.

The sex of the tasting subject was not considered when building our sensory database. Instead, results from different panelists were averaged, both for our trained panel (56% male, 44% female) and the RateBeer reviews (70% male, 30% female for RateBeer as a whole).

Beer price collection and processing

Beer prices were collected from the following stores: Colruyt, Delhaize, Total Wine, BeerHawk, The Belgian Beer Shop, The Belgian Shop, and Beer of Belgium. Where applicable, prices were converted to Euros and normalized per liter. Spearman correlations were calculated between these prices and mean overall appreciation scores from RateBeer and the taste panel, respectively.

Pairwise Spearman Rank correlations were calculated between all sensory properties.

Machine learning models

Predictive modeling of sensory profiles from chemical data.

Regression models were constructed to predict (a) trained panel scores for beer flavors and quality from beer chemical profiles and (b) public reviews’ appreciation scores from beer chemical profiles. Z-scores were used to represent sensory attributes in both data sets. Chemical properties with log-normal distributions (Shapiro-Wilk test, p  <  0.05 ) were log-transformed. Missing chemical measurements (0.1% of all data) were replaced with mean values per attribute. Observations from 250 beers were randomly separated into a training set (70%, 175 beers) and a test set (30%, 75 beers), stratified per beer style. Chemical measurements (p = 231) were normalized based on the training set average and standard deviation. In total, three linear regression-based models: linear regression with first-order interaction terms (LR), lasso regression with first-order interaction terms (Lasso) and partial least squares regression (PLSR); five decision tree models, Adaboost regressor (ABR), Extra Trees (ET), Gradient Boosting regressor (GBR), Random Forest (RF) and XGBoost regressor (XGBR); one support vector machine model (SVR) and one artificial neural network model (ANN) were trained. The models were implemented using the ‘scikit-learn’ package (v1.2.2) and ‘xgboost’ package (v1.7.3) in Python (v3.9.16). Models were trained, and hyperparameters optimized, using five-fold cross-validated grid search with the coefficient of determination (R 2 ) as the evaluation metric. The ANN (scikit-learn’s MLPRegressor) was optimized using Bayesian Tree-Structured Parzen Estimator optimization with the ‘Optuna’ Python package (v3.2.0). Individual models were trained per attribute, and a multi-output model was trained on all attributes simultaneously.

Model dissection

GBR was found to outperform other methods, resulting in models with the highest average R 2 values in both trained panel and public review data sets. Impurity-based rankings of the most important predictors for each predicted sensorial trait were obtained using the ‘scikit-learn’ package. To observe the relationships between these chemical properties and their predicted targets, partial dependence plots (PDP) were constructed for the six most important predictors of consumer appreciation 74 , 75 .

The ‘SHAP’ package in Python (v0.41.0) was implemented to provide an alternative ranking of predictor importance and to visualize the predictors’ effects as a function of their concentration 68 .

Validation of causal chemical properties

To validate the effects of the most important model features on predicted sensory attributes, beers were spiked with the chemical compounds identified by the models and descriptive sensory analyses were carried out according to the American Society of Brewing Chemists (ASBC) protocol 90 .

Compound spiking was done 30 min before tasting. Compounds were spiked into fresh beer bottles, that were immediately resealed and inverted three times. Fresh bottles of beer were opened for the same duration, resealed, and inverted thrice, to serve as controls. Pairs of spiked samples and controls were served simultaneously, chilled and in dark glasses as outlined in the Trained panel section above. Tasters were instructed to select the glass with the higher flavor intensity for each attribute (directional difference test 92 ) and to select the glass they prefer.

The final concentration after spiking was equal to the within-style average, after normalizing by ethanol concentration. This was done to ensure balanced flavor profiles in the final spiked beer. The same methods were applied to improve a non-alcoholic beer. Compounds were the following: ethyl acetate (Merck KGaA, W241415), ethyl hexanoate (Merck KGaA, W243906), isoamyl acetate (Merck KGaA, W205508), phenethyl acetate (Merck KGaA, W285706), ethanol (96%, Colruyt), glycerol (Merck KGaA, W252506), lactic acid (Merck KGaA, 261106).

Significant differences in preference or perceived intensity were determined by performing the two-sided binomial test on each attribute.

Reporting summary

Further information on research design is available in the  Nature Portfolio Reporting Summary linked to this article.

Data availability

The data that support the findings of this work are available in the Supplementary Data files and have been deposited to Zenodo under accession code 10653704 93 . The RateBeer scores data are under restricted access, they are not publicly available as they are property of RateBeer (ZX Ventures, USA). Access can be obtained from the authors upon reasonable request and with permission of RateBeer (ZX Ventures, USA).  Source data are provided with this paper.

Code availability

The code for training the machine learning models, analyzing the models, and generating the figures has been deposited to Zenodo under accession code 10653704 93 .

Tieman, D. et al. A chemical genetic roadmap to improved tomato flavor. Science 355 , 391–394 (2017).

Article   ADS   CAS   PubMed   Google Scholar  

Plutowska, B. & Wardencki, W. Application of gas chromatography–olfactometry (GC–O) in analysis and quality assessment of alcoholic beverages – A review. Food Chem. 107 , 449–463 (2008).

Article   CAS   Google Scholar  

Legin, A., Rudnitskaya, A., Seleznev, B. & Vlasov, Y. Electronic tongue for quality assessment of ethanol, vodka and eau-de-vie. Anal. Chim. Acta 534 , 129–135 (2005).

Loutfi, A., Coradeschi, S., Mani, G. K., Shankar, P. & Rayappan, J. B. B. Electronic noses for food quality: A review. J. Food Eng. 144 , 103–111 (2015).

Ahn, Y.-Y., Ahnert, S. E., Bagrow, J. P. & Barabási, A.-L. Flavor network and the principles of food pairing. Sci. Rep. 1 , 196 (2011).

Article   CAS   PubMed   PubMed Central   Google Scholar  

Bartoshuk, L. M. & Klee, H. J. Better fruits and vegetables through sensory analysis. Curr. Biol. 23 , R374–R378 (2013).

Article   CAS   PubMed   Google Scholar  

Piggott, J. R. Design questions in sensory and consumer science. Food Qual. Prefer. 3293 , 217–220 (1995).

Article   Google Scholar  

Kermit, M. & Lengard, V. Assessing the performance of a sensory panel-panellist monitoring and tracking. J. Chemom. 19 , 154–161 (2005).

Cook, D. J., Hollowood, T. A., Linforth, R. S. T. & Taylor, A. J. Correlating instrumental measurements of texture and flavour release with human perception. Int. J. Food Sci. Technol. 40 , 631–641 (2005).

Chinchanachokchai, S., Thontirawong, P. & Chinchanachokchai, P. A tale of two recommender systems: The moderating role of consumer expertise on artificial intelligence based product recommendations. J. Retail. Consum. Serv. 61 , 1–12 (2021).

Ross, C. F. Sensory science at the human-machine interface. Trends Food Sci. Technol. 20 , 63–72 (2009).

Chambers, E. IV & Koppel, K. Associations of volatile compounds with sensory aroma and flavor: The complex nature of flavor. Molecules 18 , 4887–4905 (2013).

Pinu, F. R. Metabolomics—The new frontier in food safety and quality research. Food Res. Int. 72 , 80–81 (2015).

Danezis, G. P., Tsagkaris, A. S., Brusic, V. & Georgiou, C. A. Food authentication: state of the art and prospects. Curr. Opin. Food Sci. 10 , 22–31 (2016).

Shepherd, G. M. Smell images and the flavour system in the human brain. Nature 444 , 316–321 (2006).

Meilgaard, M. C. Prediction of flavor differences between beers from their chemical composition. J. Agric. Food Chem. 30 , 1009–1017 (1982).

Xu, L. et al. Widespread receptor-driven modulation in peripheral olfactory coding. Science 368 , eaaz5390 (2020).

Kupferschmidt, K. Following the flavor. Science 340 , 808–809 (2013).

Billesbølle, C. B. et al. Structural basis of odorant recognition by a human odorant receptor. Nature 615 , 742–749 (2023).

Article   ADS   PubMed   PubMed Central   Google Scholar  

Smith, B. Perspective: Complexities of flavour. Nature 486 , S6–S6 (2012).

Pfister, P. et al. Odorant receptor inhibition is fundamental to odor encoding. Curr. Biol. 30 , 2574–2587 (2020).

Moskowitz, H. W., Kumaraiah, V., Sharma, K. N., Jacobs, H. L. & Sharma, S. D. Cross-cultural differences in simple taste preferences. Science 190 , 1217–1218 (1975).

Eriksson, N. et al. A genetic variant near olfactory receptor genes influences cilantro preference. Flavour 1 , 22 (2012).

Ferdenzi, C. et al. Variability of affective responses to odors: Culture, gender, and olfactory knowledge. Chem. Senses 38 , 175–186 (2013).

Article   PubMed   Google Scholar  

Lawless, H. T. & Heymann, H. Sensory evaluation of food: Principles and practices. (Springer, New York, NY). https://doi.org/10.1007/978-1-4419-6488-5 (2010).

Colantonio, V. et al. Metabolomic selection for enhanced fruit flavor. Proc. Natl. Acad. Sci. 119 , e2115865119 (2022).

Fritz, F., Preissner, R. & Banerjee, P. VirtualTaste: a web server for the prediction of organoleptic properties of chemical compounds. Nucleic Acids Res 49 , W679–W684 (2021).

Tuwani, R., Wadhwa, S. & Bagler, G. BitterSweet: Building machine learning models for predicting the bitter and sweet taste of small molecules. Sci. Rep. 9 , 1–13 (2019).

Dagan-Wiener, A. et al. Bitter or not? BitterPredict, a tool for predicting taste from chemical structure. Sci. Rep. 7 , 1–13 (2017).

Pallante, L. et al. Toward a general and interpretable umami taste predictor using a multi-objective machine learning approach. Sci. Rep. 12 , 1–11 (2022).

Malavolta, M. et al. A survey on computational taste predictors. Eur. Food Res. Technol. 248 , 2215–2235 (2022).

Lee, B. K. et al. A principal odor map unifies diverse tasks in olfactory perception. Science 381 , 999–1006 (2023).

Mayhew, E. J. et al. Transport features predict if a molecule is odorous. Proc. Natl. Acad. Sci. 119 , e2116576119 (2022).

Niu, Y. et al. Sensory evaluation of the synergism among ester odorants in light aroma-type liquor by odor threshold, aroma intensity and flash GC electronic nose. Food Res. Int. 113 , 102–114 (2018).

Yu, P., Low, M. Y. & Zhou, W. Design of experiments and regression modelling in food flavour and sensory analysis: A review. Trends Food Sci. Technol. 71 , 202–215 (2018).

Oladokun, O. et al. The impact of hop bitter acid and polyphenol profiles on the perceived bitterness of beer. Food Chem. 205 , 212–220 (2016).

Linforth, R., Cabannes, M., Hewson, L., Yang, N. & Taylor, A. Effect of fat content on flavor delivery during consumption: An in vivo model. J. Agric. Food Chem. 58 , 6905–6911 (2010).

Guo, S., Na Jom, K. & Ge, Y. Influence of roasting condition on flavor profile of sunflower seeds: A flavoromics approach. Sci. Rep. 9 , 11295 (2019).

Ren, Q. et al. The changes of microbial community and flavor compound in the fermentation process of Chinese rice wine using Fagopyrum tataricum grain as feedstock. Sci. Rep. 9 , 3365 (2019).

Hastie, T., Friedman, J. & Tibshirani, R. The Elements of Statistical Learning. (Springer, New York, NY). https://doi.org/10.1007/978-0-387-21606-5 (2001).

Dietz, C., Cook, D., Huismann, M., Wilson, C. & Ford, R. The multisensory perception of hop essential oil: a review. J. Inst. Brew. 126 , 320–342 (2020).

CAS   Google Scholar  

Roncoroni, Miguel & Verstrepen, Kevin Joan. Belgian Beer: Tested and Tasted. (Lannoo, 2018).

Meilgaard, M. Flavor chemistry of beer: Part II: Flavor and threshold of 239 aroma volatiles. in (1975).

Bokulich, N. A. & Bamforth, C. W. The microbiology of malting and brewing. Microbiol. Mol. Biol. Rev. MMBR 77 , 157–172 (2013).

Dzialo, M. C., Park, R., Steensels, J., Lievens, B. & Verstrepen, K. J. Physiology, ecology and industrial applications of aroma formation in yeast. FEMS Microbiol. Rev. 41 , S95–S128 (2017).

Article   PubMed   PubMed Central   Google Scholar  

Datta, A. et al. Computer-aided food engineering. Nat. Food 3 , 894–904 (2022).

American Society of Brewing Chemists. Beer Methods. (American Society of Brewing Chemists, St. Paul, MN, U.S.A.).

Olaniran, A. O., Hiralal, L., Mokoena, M. P. & Pillay, B. Flavour-active volatile compounds in beer: production, regulation and control. J. Inst. Brew. 123 , 13–23 (2017).

Verstrepen, K. J. et al. Flavor-active esters: Adding fruitiness to beer. J. Biosci. Bioeng. 96 , 110–118 (2003).

Meilgaard, M. C. Flavour chemistry of beer. part I: flavour interaction between principal volatiles. Master Brew. Assoc. Am. Tech. Q 12 , 107–117 (1975).

Briggs, D. E., Boulton, C. A., Brookes, P. A. & Stevens, R. Brewing 227–254. (Woodhead Publishing). https://doi.org/10.1533/9781855739062.227 (2004).

Bossaert, S., Crauwels, S., De Rouck, G. & Lievens, B. The power of sour - A review: Old traditions, new opportunities. BrewingScience 72 , 78–88 (2019).

Google Scholar  

Verstrepen, K. J. et al. Flavor active esters: Adding fruitiness to beer. J. Biosci. Bioeng. 96 , 110–118 (2003).

Snauwaert, I. et al. Microbial diversity and metabolite composition of Belgian red-brown acidic ales. Int. J. Food Microbiol. 221 , 1–11 (2016).

Spitaels, F. et al. The microbial diversity of traditional spontaneously fermented lambic beer. PLoS ONE 9 , e95384 (2014).

Blanco, C. A., Andrés-Iglesias, C. & Montero, O. Low-alcohol Beers: Flavor Compounds, Defects, and Improvement Strategies. Crit. Rev. Food Sci. Nutr. 56 , 1379–1388 (2016).

Jackowski, M. & Trusek, A. Non-Alcohol. beer Prod. – Overv. 20 , 32–38 (2018).

Takoi, K. et al. The contribution of geraniol metabolism to the citrus flavour of beer: Synergy of geraniol and β-citronellol under coexistence with excess linalool. J. Inst. Brew. 116 , 251–260 (2010).

Kroeze, J. H. & Bartoshuk, L. M. Bitterness suppression as revealed by split-tongue taste stimulation in humans. Physiol. Behav. 35 , 779–783 (1985).

Mennella, J. A. et al. A spoonful of sugar helps the medicine go down”: Bitter masking bysucrose among children and adults. Chem. Senses 40 , 17–25 (2015).

Wietstock, P., Kunz, T., Perreira, F. & Methner, F.-J. Metal chelation behavior of hop acids in buffered model systems. BrewingScience 69 , 56–63 (2016).

Sancho, D., Blanco, C. A., Caballero, I. & Pascual, A. Free iron in pale, dark and alcohol-free commercial lager beers. J. Sci. Food Agric. 91 , 1142–1147 (2011).

Rodrigues, H. & Parr, W. V. Contribution of cross-cultural studies to understanding wine appreciation: A review. Food Res. Int. 115 , 251–258 (2019).

Korneva, E. & Blockeel, H. Towards better evaluation of multi-target regression models. in ECML PKDD 2020 Workshops (eds. Koprinska, I. et al.) 353–362 (Springer International Publishing, Cham, 2020). https://doi.org/10.1007/978-3-030-65965-3_23 .

Gastón Ares. Mathematical and Statistical Methods in Food Science and Technology. (Wiley, 2013).

Grinsztajn, L., Oyallon, E. & Varoquaux, G. Why do tree-based models still outperform deep learning on tabular data? Preprint at http://arxiv.org/abs/2207.08815 (2022).

Gries, S. T. Statistics for Linguistics with R: A Practical Introduction. in Statistics for Linguistics with R (De Gruyter Mouton, 2021). https://doi.org/10.1515/9783110718256 .

Lundberg, S. M. et al. From local explanations to global understanding with explainable AI for trees. Nat. Mach. Intell. 2 , 56–67 (2020).

Ickes, C. M. & Cadwallader, K. R. Effects of ethanol on flavor perception in alcoholic beverages. Chemosens. Percept. 10 , 119–134 (2017).

Kato, M. et al. Influence of high molecular weight polypeptides on the mouthfeel of commercial beer. J. Inst. Brew. 127 , 27–40 (2021).

Wauters, R. et al. Novel Saccharomyces cerevisiae variants slow down the accumulation of staling aldehydes and improve beer shelf-life. Food Chem. 398 , 1–11 (2023).

Li, H., Jia, S. & Zhang, W. Rapid determination of low-level sulfur compounds in beer by headspace gas chromatography with a pulsed flame photometric detector. J. Am. Soc. Brew. Chem. 66 , 188–191 (2008).

Dercksen, A., Laurens, J., Torline, P., Axcell, B. C. & Rohwer, E. Quantitative analysis of volatile sulfur compounds in beer using a membrane extraction interface. J. Am. Soc. Brew. Chem. 54 , 228–233 (1996).

Molnar, C. Interpretable Machine Learning: A Guide for Making Black-Box Models Interpretable. (2020).

Zhao, Q. & Hastie, T. Causal interpretations of black-box models. J. Bus. Econ. Stat. Publ. Am. Stat. Assoc. 39 , 272–281 (2019).

Article   MathSciNet   Google Scholar  

Hastie, T., Tibshirani, R. & Friedman, J. The Elements of Statistical Learning. (Springer, 2019).

Labrado, D. et al. Identification by NMR of key compounds present in beer distillates and residual phases after dealcoholization by vacuum distillation. J. Sci. Food Agric. 100 , 3971–3978 (2020).

Lusk, L. T., Kay, S. B., Porubcan, A. & Ryder, D. S. Key olfactory cues for beer oxidation. J. Am. Soc. Brew. Chem. 70 , 257–261 (2012).

Gonzalez Viejo, C., Torrico, D. D., Dunshea, F. R. & Fuentes, S. Development of artificial neural network models to assess beer acceptability based on sensory properties using a robotic pourer: A comparative model approach to achieve an artificial intelligence system. Beverages 5 , 33 (2019).

Gonzalez Viejo, C., Fuentes, S., Torrico, D. D., Godbole, A. & Dunshea, F. R. Chemical characterization of aromas in beer and their effect on consumers liking. Food Chem. 293 , 479–485 (2019).

Gilbert, J. L. et al. Identifying breeding priorities for blueberry flavor using biochemical, sensory, and genotype by environment analyses. PLOS ONE 10 , 1–21 (2015).

Goulet, C. et al. Role of an esterase in flavor volatile variation within the tomato clade. Proc. Natl. Acad. Sci. 109 , 19009–19014 (2012).

Article   ADS   CAS   PubMed   PubMed Central   Google Scholar  

Borisov, V. et al. Deep Neural Networks and Tabular Data: A Survey. IEEE Trans. Neural Netw. Learn. Syst. 1–21 https://doi.org/10.1109/TNNLS.2022.3229161 (2022).

Statista. Statista Consumer Market Outlook: Beer - Worldwide.

Seitz, H. K. & Stickel, F. Molecular mechanisms of alcoholmediated carcinogenesis. Nat. Rev. Cancer 7 , 599–612 (2007).

Voordeckers, K. et al. Ethanol exposure increases mutation rate through error-prone polymerases. Nat. Commun. 11 , 3664 (2020).

Goelen, T. et al. Bacterial phylogeny predicts volatile organic compound composition and olfactory response of an aphid parasitoid. Oikos 129 , 1415–1428 (2020).

Article   ADS   Google Scholar  

Reher, T. et al. Evaluation of hop (Humulus lupulus) as a repellent for the management of Drosophila suzukii. Crop Prot. 124 , 104839 (2019).

Stein, S. E. An integrated method for spectrum extraction and compound identification from gas chromatography/mass spectrometry data. J. Am. Soc. Mass Spectrom. 10 , 770–781 (1999).

American Society of Brewing Chemists. Sensory Analysis Methods. (American Society of Brewing Chemists, St. Paul, MN, U.S.A., 1992).

McAuley, J., Leskovec, J. & Jurafsky, D. Learning Attitudes and Attributes from Multi-Aspect Reviews. Preprint at https://doi.org/10.48550/arXiv.1210.3926 (2012).

Meilgaard, M. C., Carr, B. T. & Carr, B. T. Sensory Evaluation Techniques. (CRC Press, Boca Raton). https://doi.org/10.1201/b16452 (2014).

Schreurs, M. et al. Data from: Predicting and improving complex beer flavor through machine learning. Zenodo https://doi.org/10.5281/zenodo.10653704 (2024).

Download references

Acknowledgements

We thank all lab members for their discussions and thank all tasting panel members for their contributions. Special thanks go out to Dr. Karin Voordeckers for her tremendous help in proofreading and improving the manuscript. M.S. was supported by a Baillet-Latour fellowship, L.C. acknowledges financial support from KU Leuven (C16/17/006), F.A.T. was supported by a PhD fellowship from FWO (1S08821N). Research in the lab of K.J.V. is supported by KU Leuven, FWO, VIB, VLAIO and the Brewing Science Serves Health Fund. Research in the lab of T.W. is supported by FWO (G.0A51.15) and KU Leuven (C16/17/006).

Author information

These authors contributed equally: Michiel Schreurs, Supinya Piampongsant, Miguel Roncoroni.

Authors and Affiliations

VIB—KU Leuven Center for Microbiology, Gaston Geenslaan 1, B-3001, Leuven, Belgium

Michiel Schreurs, Supinya Piampongsant, Miguel Roncoroni, Lloyd Cool, Beatriz Herrera-Malaver, Florian A. Theßeling & Kevin J. Verstrepen

CMPG Laboratory of Genetics and Genomics, KU Leuven, Gaston Geenslaan 1, B-3001, Leuven, Belgium

Leuven Institute for Beer Research (LIBR), Gaston Geenslaan 1, B-3001, Leuven, Belgium

Laboratory of Socioecology and Social Evolution, KU Leuven, Naamsestraat 59, B-3000, Leuven, Belgium

Lloyd Cool, Christophe Vanderaa & Tom Wenseleers

VIB Bioinformatics Core, VIB, Rijvisschestraat 120, B-9052, Ghent, Belgium

Łukasz Kreft & Alexander Botzki

AB InBev SA/NV, Brouwerijplein 1, B-3000, Leuven, Belgium

Philippe Malcorps & Luk Daenen

You can also search for this author in PubMed   Google Scholar

Contributions

S.P., M.S. and K.J.V. conceived the experiments. S.P., M.S. and K.J.V. designed the experiments. S.P., M.S., M.R., B.H. and F.A.T. performed the experiments. S.P., M.S., L.C., C.V., L.K., A.B., P.M., L.D., T.W. and K.J.V. contributed analysis ideas. S.P., M.S., L.C., C.V., T.W. and K.J.V. analyzed the data. All authors contributed to writing the manuscript.

Corresponding author

Correspondence to Kevin J. Verstrepen .

Ethics declarations

Competing interests.

K.J.V. is affiliated with bar.on. The other authors declare no competing interests.

Peer review

Peer review information.

Nature Communications thanks Florian Bauer, Andrew John Macintosh and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary information, peer review file, description of additional supplementary files, supplementary data 1, supplementary data 2, supplementary data 3, supplementary data 4, supplementary data 5, supplementary data 6, supplementary data 7, reporting summary, source data, source data, rights and permissions.

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ .

Reprints and permissions

About this article

Cite this article.

Schreurs, M., Piampongsant, S., Roncoroni, M. et al. Predicting and improving complex beer flavor through machine learning. Nat Commun 15 , 2368 (2024). https://doi.org/10.1038/s41467-024-46346-0

Download citation

Received : 30 October 2023

Accepted : 21 February 2024

Published : 26 March 2024

DOI : https://doi.org/10.1038/s41467-024-46346-0

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

By submitting a comment you agree to abide by our Terms and Community Guidelines . If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Quick links

  • Explore articles by subject
  • Guide to authors
  • Editorial policies

Sign up for the Nature Briefing: Translational Research newsletter — top stories in biotechnology, drug discovery and pharma.

author's note in research paper

PenMyPaper

Customer Reviews

PenMyPaper

Finished Papers

author's note in research paper

IMAGES

  1. What's new in the APA 7th Edition?

    author's note in research paper

  2. Author`s Notes Guidelines

    author's note in research paper

  3. APA Format Template

    author's note in research paper

  4. American Psychological Association (APA) Style Guide Essay Example [Updated]

    author's note in research paper

  5. Systematic Guide to Author’s Note Writing

    author's note in research paper

  6. (PDF) RESEARCH NOTE: ACADEMIC AFFILIATIONS OF SOCIAL WORK JOURNAL ARTICLE AUTHORS, 2004-2008

    author's note in research paper

VIDEO

  1. Author's Note

  2. What is the Most Iconic Author's Note in Literary History?

  3. Response to Reviewers for Resubmitting a Paper: Tips for Graduate Students

  4. How to Read a Paper Efficiently (By Prof. Pete Carr)

  5. special paper note#shots #YouTube short #viral #paper notes #magical diary #art with craft deepanshi

  6. ₹2000 के Note बनाने में कितना खर्च ? 💸🤔

COMMENTS

  1. Who needs an author note? You … maybe

    An author note provides additional information about authors, study registration, data sharing, disclaimers or statements regarding conflicts of interest, and help or funding that supported the research. It also provides a point of contact for interested readers. Professional papers intended for submission to a journal must always include an author note.

  2. Systematic Guide to Author's Note Writing

    It depends on what the author wants the reader to know — as long as it is relevant to the core text. A successful author's note should establish a connection between the reader and the component (s) of the written work addressed in the note. Style basics of the author's note include placement, brevity, and voice.

  3. What Is an Author's Note and Do You Need One?

    The author's note can also take a more personal tone by thanking readers for their support, explaining something about the journey of writing the book, or perhaps why they were moved to write the book. ... If I am writing a research paper, am I considered the author. I am using APA 7th edition. I don't know why I need to specify an author ...

  4. What is an Author's Note: A Concise Guide

    In academic or research papers, the author's note can include information about study registration, data sharing, funding sources, and conflicts of interest. In summary, an author's note is a way for authors to give readers a glimpse into the creative process and background of their written work. This meta-writing helps connect the author ...

  5. PDF APA Style Guide (Professional Version)

    paper is DUE. Authors Note At the bottom of the title page, an Author's Note is required. Center and bold. An Author's Note provides additional information about authors, study registration, data sharing, disclaimers or statements regarding conflicts of interest, and help or funding that supported the research. It

  6. 13.5 Research Process: Making Notes, Synthesizing ...

    16.1 An Author's Choices: What Text Says and How It Says It; 16.2 Textual Analysis ... As you gather evidence for your argumentative research paper, follow the descriptions and the electronic model to record your notes. ... as a template for creating your own research notes and organizational tool. You will need to have a record of all field ...

  7. APA Title Page (7th edition)

    The student version of the APA title page should include the following information (double spaced and centered): Paper title. Author name. Department and university name. Course number and name. Instructor name. Due date of the assignment. The professional title page also includes an author note (flushed left), but not a course name, instructor ...

  8. Tutorial Thursday: Adding an Author Note in APA Style CENTRAL

    This brief video (2 min) demonstrates how to add an Author Note to your paper in APA Style CENTRAL, and includes: The distinction between the options of "Student assignment" or "Professional manuscript" when starting a new paper; Using the Customize Paper Sections tool to add an Author Note section to a student paper; and.

  9. Writing for publication: Structure, form, content, and journal

    Authors can then make a note of any recurring journal titles in which relevant papers on their topics appear to have been published, as these journals are likely to be within the scope of the content of their paper, and are hopefully more likely to be commonly read by other researchers in the specific field.

  10. PDF Taking and Organizing Notes for Research Papers

    memory source. Students take notes to record information and to aid in comprehension and reflection. Note taking is an essential part of writing any research paper because they give you a better understanding of course material. While writing a research paper, you will need to gather and synthesize information from various sources. Knowing what ...

  11. 13.1 Formatting a Research Paper

    Set the top, bottom, and side margins of your paper at 1 inch. Use double-spaced text throughout your paper. Use a standard font, such as Times New Roman or Arial, in a legible size (10- to 12-point). Use continuous pagination throughout the paper, including the title page and the references section.

  12. Taking Notes from Research Reading

    Taking Notes from Research Reading. If you take notes efficiently, you can read with more understanding and also save time and frustration when you come to write your paper. These are three main principles. 1. Know what kind of ideas you need to record. Focus your approach to the topic before you start detailed research.

  13. Essential Guide to Manuscript Writing for Academic Dummies: An Editor's

    Abstract. Writing an effective manuscript is one of the pivotal steps in the successful closure of the research project, and getting it published in a peer-reviewed and indexed journal adds to the academic profile of a researcher. Writing and publishing a scientific paper is a tough task that researchers and academicians must endure in staying ...

  14. Note-taking for Research

    Note-taking for Research. As you determine which sources you will rely on most, it is important to establish a system for keeping track of your sources and taking notes. There are several ways to go about it, and no one system is necessarily superior. What matters is that you keep materials in order; record bibliographical information you will ...

  15. Organizing Your Social Sciences Research Paper

    Things to keep in mind when considering using either endnotes or footnotes in your research paper:. 1. Footnotes are numbered consecutively throughout a research paper, except for those notes accompanying special material (e.g., figures, tables, charts, etc.). Numbering of footnotes are "superscript"--Arabic numbers typed slightly above the line of text.

  16. How to Write and Publish a Research Paper for a Peer ...

    Communicating research findings is an essential step in the research process. Often, peer-reviewed journals are the forum for such communication, yet many researchers are never taught how to write a publishable scientific paper. In this article, we explain the basic structure of a scientific paper and describe the information that should be included in each section. We also identify common ...

  17. How to Write Your First Research Paper

    This paper presents guidelines on how to initiate the writing process and draft each section of a research manuscript. The paper discusses seven rules that allow the writer to prepare a well-structured and comprehensive manuscript for a publication submission. In addition, the author lists different strategies for successful revision.

  18. PDF Running head: VARYING DEFINITIONS OF ONLINE COMMUNICATION 1

    The author note should appear on printed articles and identifies each author's department and institution affiliation and any changes in affiliation, contains ... all forms of CMC should be studied. This paper examines Cummings et al.'s research against other CMC research to propose that additional research be conducted to better understand ...

  19. Writing a Research Paper Introduction

    Step 1: Introduce your topic. Step 2: Describe the background. Step 3: Establish your research problem. Step 4: Specify your objective (s) Step 5: Map out your paper. Research paper introduction examples. Frequently asked questions about the research paper introduction.

  20. Reading Well and Taking Research Notes

    The Craft of Research, Third Edition addresses notetaking in a section called "Recording What You Find" (pp. 95-100). Below is a summary of the system outlined in the book. Take full notes. Whether you take notes on cards, in a notebook, or on the computer, it's vital to record information accurately and completely.

  21. To Co-Author or Not to Co-Author: How to Write, Publish, and ...

    As we approach 2010, a clearly discernible buzz of excitement and trepidation hovers over national discussions of biology education ().In 2003, BIO2010 (the National Research Council's treatise on biology education) set the stage for rethinking how biology educators might teach an increasingly content-rich field by putting a greater emphasis on scientific methodology and critical thinking ().

  22. HDL Therapeutics

    Disclosure forms provided by the authors are available with the full text of this editorial at NEJM.org. Supplementary Material Disclosure Forms (nejme2403036_disclosures.pdf)

  23. Predicting and improving complex beer flavor through machine ...

    The perception and appreciation of food flavor depends on many interacting chemical compounds and external factors, and therefore proves challenging to understand and predict. Here, we combine ...

  24. Author's Note Research Paper

    When working with EssayService you can be sure that our professional writers will adhere to your requirements and overcome your expectations. Pay your hard-earned money only for educational writers. 100% Success rate. 1084 Orders prepared. Learn How to Order. Level: College, University, Master's, High School, PHD, Undergraduate. 1800.

  25. Mandating indoor air quality for public buildings

    Science. 28 Mar 2024. Vol 383, Issue 6690. pp. 1418 - 1420. DOI: 10.1126/science.adl0677. People living in urban and industrialized societies, which are expanding globally, spend more than 90% of their time in the indoor environment, breathing indoor air (IA). Despite decades of research and advocacy, most countries do not have legislated ...