data gathering procedure in case study


Data Collection: What It Is, Methods & Tools + Examples

Let’s face it, no one wants to make decisions based on guesswork or gut feelings. The most important objective of data collection is to ensure that the data gathered is reliable and packed to the brim with juicy insights that can be analyzed and turned into data-driven decisions. There’s nothing better than good statistical analysis.


Collecting high-quality data is essential for conducting market research, analyzing user behavior, or just trying to get a handle on business operations. With the right approach and a few handy tools, gathering reliable and informative data becomes much easier.

So, let’s get ready to collect some data because when it comes to data collection, it’s all about the details.

What is Data Collection?


Data collection is the procedure of collecting, measuring, and analyzing accurate insights for research using standard validated techniques.

Put simply, data collection is the process of gathering information for a specific purpose. It can be used to answer research questions, make informed business decisions, or improve products and services.

To collect data, we must first identify what information we need and how we will collect it. We can also evaluate a hypothesis based on collected data. In most cases, data collection is the primary and most important step for research. The approach to data collection is different for different fields of study, depending on the required information.


There are many ways to collect information when doing research. The data collection methods that the researcher chooses will depend on the research question posed. Some data collection methods include surveys, interviews, tests, physiological evaluations, observations, reviews of existing records, and biological samples. Let’s explore them.


Phone vs. Online vs. In-Person Interviews

Essentially, there are four choices for data collection: in-person interviews, mail, phone, and online. There are pros and cons to each of these modes.

In-person interviews
Pros: In-depth and a high degree of confidence in the data
Cons: Time-consuming, expensive, and can be dismissed as anecdotal

Mail
Pros: Can reach anyone and everyone, with no barrier
Cons: Expensive, data collection errors, lag time

Phone
Pros: High degree of confidence in the data collected, reach almost anyone
Cons: Expensive, cannot self-administer, need to hire an agency

Online
Pros: Cheap, can self-administer, very low probability of data errors
Cons: Not all your customers might have an email address or be on the internet, and customers may be wary of divulging information online

In-person interviews are always better, but the big drawback is the trap you might fall into if you don’t do them regularly. Conducting interviews regularly is expensive, and not conducting enough of them might give you false positives. Validating your research is almost as important as designing and conducting it.

We’ve seen many instances where, after the research is conducted, results that do not match the “gut feel” of upper management are dismissed as anecdotal and a “one-time” phenomenon. To avoid such traps, we strongly recommend that data collection be done on an “ongoing and regular” basis.


This will help you compare and analyze the change in perceptions according to marketing for your products/services. The other issue here is sample size. To be confident with your research, you must interview enough people to weed out the fringe elements.
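
To put a rough number on “enough people,” one common approach is the standard sample-size formula for estimating a proportion, with an optional correction for small customer bases. The article does not prescribe this formula, so treat the sketch below as an assumption: the 95% confidence level (z = 1.96), the worst-case proportion p = 0.5, and the function name are all illustrative choices.

```python
import math


def sample_size(margin_of_error, population=None, z=1.96, p=0.5):
    """Estimate how many completed interviews are needed.

    margin_of_error: acceptable error as a fraction (0.05 means +/- 5 points).
    population: total number of customers, if known (enables the
        finite-population correction); None treats the population as very large.
    z: z-score for the confidence level (1.96 is roughly 95%).
    p: expected proportion; 0.5 is the most conservative assumption.
    """
    n0 = (z ** 2) * p * (1 - p) / margin_of_error ** 2
    if population is not None:
        # Finite-population correction: small customer bases need fewer interviews.
        n0 = n0 / (1 + (n0 - 1) / population)
    return math.ceil(n0)
```

For example, `sample_size(0.05)` suggests about 385 respondents for a large population, while `sample_size(0.05, population=2000)` suggests about 323 for a base of 2,000 customers.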

A couple of years ago there was a lot of discussion about online surveys and their statistical analysis plan. The fact that not every customer had internet connectivity was one of the main concerns.


Although some of those concerns are still valid, the internet has become a vital means of communication in the majority of customer interactions. According to the US Census Bureau, the number of households with computers doubled between 1997 and 2001.


In 2001, nearly 50% of households had a computer. Nearly 55% of all households with an annual income of more than $35,000 had internet access, which jumps to 70% for households with an annual income of $50,000. This data is from the US Census Bureau for 2001.

There are primarily three modes of data collection that can be employed to gather feedback – Mail, Phone, and Online. The method actually used for data collection is really a cost-benefit analysis. There is no slam-dunk solution but you can use the table below to understand the risks and advantages associated with each of the mediums:

Keep in mind, the reach here is defined as “All U.S. Households.” In most cases, you need to look at how many of your customers are online and determine your own reach. If all your customers have email addresses, you have a 100% reach of your customers.

Another important thing to keep in mind is the ever-increasing dominance of cellular phones over landline phones. United States FCC rules prevent automated dialing and calling cellular phone numbers and there is a noticeable trend towards people having cellular phones as the only voice communication device.

This makes it impossible to reach customers who are dropping home phone lines in favor of going entirely wireless. Even if automated dialing is not used, another FCC rule prohibits phoning anyone who would have to pay for the call.


Multi-Mode Surveys

Multi-mode surveys, where the data is collected via different modes (online, paper, phone, etc.), are another option. It is fairly straightforward to set up an online survey and have data-entry operators enter data from phone and paper surveys into the system. The same system can also be used to collect data directly from the respondents.


Data collection is an important aspect of research. Let’s consider an example of a mobile manufacturer, company X, which is launching a new product variant. To conduct research about features, price range, target market, competitor analysis, etc. data has to be collected from appropriate sources.

The marketing team can conduct various data collection activities such as online surveys or focus groups.

The survey should have all the right questions about features and pricing, such as “What are the top 3 features expected from an upcoming product?” or “How much are you likely to spend on this product?” or “Which competitors provide similar products?” etc.

For conducting a focus group, the marketing team should decide the participants and the mediator. The topic of discussion and objective behind conducting a focus group should be clarified beforehand to conduct a conclusive discussion.

Data collection methods are chosen depending on the available resources. For example, conducting questionnaires and surveys would require the least resources, while focus groups require moderately high resources.

Feedback is a vital part of any organization’s growth. Whether you conduct regular focus groups to elicit information from key players or your account manager calls all your marquee accounts to find out how things are going, these are all ways of seeing your business through your customers’ eyes: How are we doing? What can we do better?

Online surveys are just another medium to collect feedback from your customers, employees, and anyone your business interacts with. With the advent of Do-It-Yourself tools for online surveys, data collection on the internet has become really easy, cheap, and effective.


It is a well-established marketing fact that acquiring a new customer is 10 times more difficult and expensive than retaining an existing one. This is one of the fundamental driving forces behind the extensive adoption and interest in CRM and related customer retention tactics.

In a research study conducted by Rice University professor Dr. Paul Dholakia and Dr. Vicki Morwitz, published in Harvard Business Review, the experiment found that simply asking customers how an organization was performing proved, by itself, to be an effective customer retention strategy.

In the research study, conducted over the course of a year, one set of customers was sent a satisfaction and opinion survey and the other set was not surveyed. Over the following year, the group that took the survey saw twice the number of people continuing and renewing their loyalty towards the organization.


The research study provided a couple of interesting reasons on the basis of consumer psychology, behind this phenomenon:

Satisfaction surveys boost the customers’ desire to be coddled and induce positive feelings. This stems from a facet of human psychology: people like to “appreciate” a product or service they already like or prefer, and the survey is simply a medium to convey this. The survey is a vehicle to “interact” with the company and reinforces the customer’s commitment to the company.
Surveys may increase awareness of auxiliary products and services. Surveys can be considered modes of both inbound and outbound communication. They are generally considered a data collection and analysis source, but most people are unaware that consumer surveys can also serve as a medium for distributing information. It is important to note a few caveats here:
a. In most countries, including the US, “selling under the guise of research” is illegal.
b. However, we all know that information is distributed while collecting information.
c. Other disclaimers may be included in the survey to ensure users are aware of this fact. For example: “We will collect your opinion and inform you about products and services that have come online in the last year…”
Induced judgments: The entire procedure of asking people for their feedback can prompt them to build an opinion on something they otherwise would not have thought about. This is a subtle yet powerful effect that can be compared to the “product placement” strategy currently used for marketing products in mass media like movies and television shows. One example is the extensive and exclusive use of the Mini Cooper in the blockbuster movie “The Italian Job.” This strategy is questionable and should be used with great caution.

Surveys should be considered a critical tool in the customer journey dialog. The best thing about surveys is their ability to carry “bi-directional” information. The research conducted by Paul Dholakia and Vicki Morwitz shows that surveys not only get you the information that is critical for your business, but also enhance and build upon the established relationship you have with your customers.

Recent technological advances have made it incredibly easy to conduct real-time surveys and opinion polls. Online tools make it easy to frame questions and answers and create surveys on the Web. Distributing surveys via email, website links, or even integration with online CRM tools like Salesforce.com has made online surveying a quick-win solution.

So, you’ve decided to conduct an online survey. There are a few questions in your mind that you would like answered, and you are looking for a fast and inexpensive way to find out more about your customers, clients, etc.

First and foremost, you need to decide what the smart objectives of the study are. Ensure that you can phrase these objectives as questions or measurements. If you can’t, you are better off looking at other data sources like focus groups and other qualitative methods. The data collected via online surveys is predominantly quantitative in nature.

Review the basic objectives of the study. What are you trying to discover? What actions do you want to take as a result of the survey? Answers to these questions help in validating collected data. Online surveys are just one way of collecting and quantifying data.


Visualize all of the relevant information items you would like to have. What will the output survey research report look like? What charts and graphs will be prepared? What information do you need to be assured that action is warranted?
Assign ranks to each topic (1 and 2) according to their priority, including the most important topics first. Revisit these items again to ensure that the objectives, topics, and information you need are appropriate. Remember, you can’t solve the research problem if you ask the wrong questions.
How easy or difficult is it for the respondent to provide information on each topic? If it is difficult, is there an alternative medium to gain insights by asking a different question? This is probably the most important step. Online surveys have to be Precise, Clear and Concise. Due to the nature of the internet and the fluctuations involved, if your questions are too difficult to understand, the survey dropout rate will be high.
Create a sequence for the topics that are unbiased. Make sure that the questions asked first do not bias the results of the next questions. Sometimes providing too much information, or disclosing purpose of the study can create bias. Once you have a series of decided topics, you can have a basic structure of a survey. It is always advisable to add an “Introductory” paragraph before the survey to explain the project objective and what is expected of the respondent. It is also sensible to have a “Thank You” text as well as information about where to find the results of the survey when they are published.
Page Breaks – The attention span of respondents can be very low when it comes to a long scrolling survey. Add page breaks wherever possible. Having said that, a single question per page can also hamper response rates, as it increases both the time needed to complete the survey and the chances of dropouts.
Branching – Create smart and effective surveys by implementing branching wherever required. Eliminate the use of text such as “If you answered No to Q1 then Answer Q4”; this annoys respondents and increases survey dropout rates. Design online surveys using branching logic so that appropriate questions are automatically routed based on previous responses.
Write the questions. Initially, write a significant number of survey questions, from which you can pick the ones best suited for the survey. Divide the survey into sections so that respondents do not get confused by a long list of questions.
Sequence the questions so that they are unbiased.
Repeat all of the steps above to find any major holes. Are the questions really answered? Have someone review it for you.
Time the length of the survey. A survey should take less than five minutes. At three to four research questions per minute, you are limited to about 15 questions. One open-ended text question counts as three multiple-choice questions. Most online software tools will record the time respondents take to answer questions.
Include a few open-ended survey questions that support your survey objective. This will be a type of feedback survey.
Email the project survey to your test group, and then email the feedback survey afterward.
This way, your test group can use the feedback survey to give their opinion on the functionality and usability of your project survey.
Make changes to your questionnaire based on the received feedback.
Send the survey out to all your respondents!
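
The timing rule from the steps above (under five minutes, roughly three questions per minute, one open-ended question counting as three multiple-choice questions) can be sketched as a quick check. This is a minimal illustration, not a feature of any survey tool: the function names are hypothetical, and the default of three questions per minute is the conservative end of the article’s three-to-four range.

```python
def question_budget(max_minutes=5, questions_per_minute=3):
    """Maximum number of multiple-choice-equivalent questions."""
    return max_minutes * questions_per_minute


def survey_load(multiple_choice, open_ended, open_ended_weight=3):
    """Total load, counting each open-ended question as several
    multiple-choice questions (three, per the rule of thumb above)."""
    return multiple_choice + open_ended * open_ended_weight


def fits_budget(multiple_choice, open_ended, max_minutes=5, questions_per_minute=3):
    """True if the draft survey stays within the time budget."""
    load = survey_load(multiple_choice, open_ended)
    return load <= question_budget(max_minutes, questions_per_minute)
```

With these defaults the budget is 15 question-equivalents, so a draft with 9 multiple-choice and 2 open-ended questions just fits, while adding one more multiple-choice question pushes it over.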

Online surveys have, over the course of time, evolved into an effective alternative to expensive mail or telephone surveys. However, you must be aware of a few conditions that need to be met for online surveys. If you are trying to survey a sample representing the target population, please remember that not everyone is online.

Moreover, not everyone is receptive to an online survey. Generally, younger demographic segments are more inclined to respond to an online survey.


Good survey design is crucial for accurate data collection. From question-wording to response options, let’s explore how to create effective surveys that yield valuable insights with our tips to survey design.

Writing Great Questions for data collection

Writing great questions can be considered an art. Art always requires a significant amount of hard work, practice, and help from others.

The questions in a survey need to be clear, concise, and unbiased. A poorly worded question or a question with leading language can result in inaccurate or irrelevant responses, ultimately impacting the data’s validity.

Moreover, the questions should be relevant and specific to the research objectives. Questions that are irrelevant or do not capture the necessary information can lead to incomplete or inconsistent responses too.

Avoid loaded or leading words or questions

A small change in wording can produce a large change in results. Words such as could, should, and might are all used for almost the same purpose, but may produce a 20% difference in agreement with a question. For example, “The management could.. should.. might.. have shut the factory”.

Strong words that represent control or action, such as “prohibit,” produce similar results. For example, “Do you believe Donald Trump should prohibit insurance companies from raising rates?”.

Sometimes the content is just biased. For instance, “You wouldn’t want to go to Rudolpho’s Restaurant for the organization’s annual party, would you?”

Misplaced questions

Questions should always reference the intended context, and questions placed out of order or without a clear need should be avoided. Generally, a funnel approach should be implemented: generic questions should be included in the initial section of the questionnaire as a warm-up, and specific ones should follow. Toward the end, demographic or geographic questions should be included.

Mutually non-overlapping response categories

Multiple-choice answers should be mutually exclusive so that they offer distinct choices. Overlapping answer options frustrate the respondent and make interpretation difficult at best. Also, the questions should always be precise.

For example: “Do you like orange juice?”

This question is vague. In what terms is the liking for orange juice to be rated: sweetness, texture, price, nutrition, etc.?
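
For numeric answer options such as age or income brackets, the “mutually non-overlapping” requirement above can be checked mechanically before a survey goes out. The sketch below is illustrative only: the function name and the age brackets are hypothetical, and it assumes each option is described by inclusive lower and upper bounds.

```python
from itertools import combinations


def find_overlaps(ranges):
    """Return pairs of answer options whose numeric ranges overlap.

    ranges: list of (label, low, high) tuples with inclusive bounds.
    """
    overlaps = []
    for (label_a, low_a, high_a), (label_b, low_b, high_b) in combinations(ranges, 2):
        # Two inclusive ranges overlap when each starts before the other ends.
        if low_a <= high_b and low_b <= high_a:
            overlaps.append((label_a, label_b))
    return overlaps


# Hypothetical age brackets: a 25-year-old fits two options in the first set.
overlapping = [("18-25", 18, 25), ("25-35", 25, 35), ("35-50", 35, 50)]
distinct = [("18-24", 18, 24), ("25-34", 25, 34), ("35-50", 35, 50)]
```

Here `find_overlaps(overlapping)` flags the two problem pairs, while `find_overlaps(distinct)` returns an empty list.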

Avoid the use of confusing/unfamiliar words

Asking about industry-related terms such as caloric content, bits, bytes, and MBS, as well as other terms and acronyms, can confuse respondents. Ensure that the audience understands your language level, terminology, and, above all, the question you ask.

Non-directed questions give respondents excessive leeway

In survey design for data collection, non-directed questions can give respondents excessive leeway, which can lead to vague and unreliable data. These types of questions are also known as open-ended questions, and they do not provide any structure for the respondent to follow.

For instance, a non-directed question like “What suggestions do you have for improving our shoes?” can elicit a wide range of answers, some of which may not be relevant to the research objectives. Some respondents may give short answers, while others may provide lengthy and detailed responses, making comparing and analyzing the data challenging.

To avoid these issues, it’s essential to ask direct questions that are specific and have a clear structure. Closed-ended questions, for example, offer structured response options and can be easier to analyze as they provide a quantitative measure of respondents’ opinions.
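
To illustrate why closed-ended questions are easier to analyze, the sketch below tallies answers to a hypothetical closed-ended version of the shoe question (“Which aspect of our shoes most needs improvement?”). The option list and function name are assumptions made for the example.

```python
from collections import Counter


def summarize_closed_ended(responses, options):
    """Return the percentage of valid responses for each predefined option."""
    valid = [r for r in responses if r in options]  # ignore anything off-list
    counts = Counter(valid)
    total = len(valid)
    return {
        option: round(100 * counts[option] / total, 1) if total else 0.0
        for option in options
    }


# Hypothetical options and answers for the shoe question.
options = ["Comfort", "Price", "Durability", "Style"]
responses = ["Comfort", "Price", "Comfort", "Durability"]
summary = summarize_closed_ended(responses, options)
```

Because every answer falls into a known category, the result is directly comparable across respondents; free-text answers would first need to be read and coded by hand.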

Never force questions

There will always be certain questions that cross certain privacy lines. Since privacy is an important issue for most people, these questions should either be eliminated from the survey or not be made mandatory. Survey questions about personal income, family income, status, and religious or political beliefs are often considered intrusive; avoid them where possible, and where they are needed, let respondents choose not to answer them.

Unbalanced answer options in scales

Unbalanced answer options in scales such as Likert Scale and Semantic Scale may be appropriate for some situations and biased in others. When analyzing a pattern in eating habits, a study used a quantity scale that made obese people appear in the middle of the scale with the polar ends reflecting a state where people starve and an irrational amount to consume. There are cases where we usually do not expect poor service, such as hospitals.

Questions that cover two points

In survey design for data collection, questions that cover two points can be problematic for several reasons. These types of questions are often called “double-barreled” questions and can cause confusion for respondents, leading to inaccurate or irrelevant data.

For instance, a question like “Do you like the food and the service at the restaurant?” covers two points, the food and the service, and it assumes that the respondent has the same opinion about both. If the respondent only liked the food, their opinion of the service could affect their answer.

It’s important to ask one question at a time to avoid confusion and ensure that the respondent’s answer is focused and accurate. This also applies to questions with multiple concepts or ideas. In these cases, it’s best to break down the question into multiple questions that address each concept or idea separately.

Dichotomous questions

Dichotomous questions are used in case you want a distinct answer, such as Yes/No or Male/Female. For example, the question “Do you think this candidate will win the election?” can be answered Yes or No.

Avoid the use of long questions

The use of long questions will definitely increase the time taken for completion, which will generally lead to an increase in the survey dropout rate. Multiple-choice questions are the longest and most complex, and open-ended questions are the shortest and easiest to answer.

Data collection is an essential part of the research process, whether you’re conducting scientific experiments, market research, or surveys. The methods and tools used for data collection will vary depending on the research type, the sample size required, and the resources available.

Data collection methods include surveys, observations, interviews, and focus groups. Each method has advantages and disadvantages, and choosing the one that best suits the research goals is important.

With the rise of technology, many tools are now available to facilitate data collection, including online survey software and data visualization tools. These tools can help researchers collect, store, and analyze data more efficiently, providing greater results and accuracy.

By understanding the various methods and tools available for data collection, we can develop a solid foundation for conducting research. With these research skills, we can make informed decisions, solve problems, and contribute to advancing our understanding of the world around us.

Analyze your survey data to gauge in-depth market drivers, including competitive intelligence, purchasing behavior, and price sensitivity, with QuestionPro.

You will obtain accurate insights with various techniques, including conjoint analysis, MaxDiff analysis, sentiment analysis, TURF analysis, heatmap analysis, etc. Export quality data to external in-depth analysis tools such as SPSS and R Software, and integrate your research with external business applications. Everything you need for your data collection. Start today for free!


Data Collection Methods: A Comprehensive View

Written by John Terra
Updated on February 21, 2024

Companies that want to be competitive in today’s digital economy enjoy the benefit of countless reams of data available for market research. In fact, thanks to the advent of big data, there’s a veritable tidal wave of information ready to be put to good use, helping businesses make intelligent decisions and thrive.

Before that data can be used, it must be processed, and before it can be processed, it must be collected. That’s what we’re here for. This article explores the subject of data collection. We will learn about the types of data collection methods and why they are essential.

We will detail primary and secondary data collection methods and discuss data collection procedures. We’ll also share how you can learn practical skills through online data science training.

But first, let’s get the definition out of the way. What is data collection?

What is Data Collection?

Data collection is the act of collecting, measuring and analyzing different kinds of information using a set of validated standard procedures and techniques. The primary objective of data collection procedures is to gather reliable, information-rich data and analyze it to make critical business decisions. Once the desired data is collected, it undergoes a process of data cleaning and processing to make the information actionable and valuable for businesses.

Your choice of data collection method (also called a data gathering procedure) depends on the research questions you’re working on, the type of data required, and the available time and resources. You can categorize data-gathering procedures into two main methods:

Primary data collection. Primary data is collected via first-hand experiences and does not reference or use the past. The data obtained by primary data collection methods is exceptionally accurate and geared to the research’s motive. Primary methods are divided into two categories: quantitative and qualitative. We’ll explore the specifics later.
Secondary data collection. Secondary data is the information that’s been used in the past. The researcher can obtain data from internal and external sources, including organizational data.

Let’s take a closer look at specific examples of both data collection methods.


The Specific Types of Data Collection Methods

As mentioned, primary data collection methods are split into quantitative and qualitative. We will examine each method’s data collection tools separately. Then, we will discuss secondary data collection methods.

Quantitative Methods

Quantitative techniques for demand forecasting and market research typically use statistical tools. When using these techniques, historical data is used to forecast demand. These primary data-gathering procedures are most often used to make long-term forecasts. Statistical analysis methods are highly reliable because they carry minimal subjectivity.

Barometric Method. Also called the leading indicators approach, data analysts and researchers employ this method to speculate on future trends based on current developments. When past events are used to predict future events, they are considered leading indicators.
Smoothing Techniques. Smoothing techniques can be used in cases where the time series lacks significant trends. These techniques eliminate random variation from historical demand and help identify demand levels and patterns to estimate future demand. The most popular methods used in these techniques are the simple moving average and the weighted moving average methods.
Time Series Analysis. The term “time series” refers to the sequential order of values in a variable, also known as a trend, at equal time intervals. Using patterns, organizations can predict customer demand for their products and services during the projected time.
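
The two smoothing techniques named above, the simple moving average and the weighted moving average, can be sketched in a few lines. This is a minimal illustration on made-up demand figures; the function names, the window size of three, and the weights are assumptions for the example rather than recommended settings.

```python
def simple_moving_average(series, window):
    """Average of each run of `window` consecutive values."""
    if window <= 0 or window > len(series):
        raise ValueError("window must be between 1 and len(series)")
    return [
        sum(series[i - window + 1 : i + 1]) / window
        for i in range(window - 1, len(series))
    ]


def weighted_moving_average(series, weights):
    """Like the simple version, but each value in the window gets its own
    weight. Weights run oldest to newest, so the last weight applies to the
    most recent value."""
    window = len(weights)
    if window == 0 or window > len(series):
        raise ValueError("need between 1 and len(series) weights")
    total_weight = sum(weights)
    return [
        sum(w * x for w, x in zip(weights, series[i - window + 1 : i + 1])) / total_weight
        for i in range(window - 1, len(series))
    ]


# Hypothetical monthly demand figures.
demand = [100, 120, 110, 130, 150]
sma = simple_moving_average(demand, window=3)
wma = weighted_moving_average(demand, weights=[1, 2, 3])
```

The last value of each smoothed series is typically used as the estimate for the next period. Because the weighted version favors recent months, it reacts faster to the recent rise in demand than the simple average does.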

Qualitative Methods

Qualitative data collection methods are instrumental when no historical information is available, or numbers and mathematical calculations aren’t required. Qualitative research is closely linked to words, emotions, sounds, feelings, colors, and other non-quantifiable elements. These techniques rely on experience, conjecture, intuition, judgment, emotion, etc. Quantitative methods do not provide motives behind the participants’ responses. Additionally, they often don’t reach underrepresented populations and usually involve long data collection periods. Therefore, you get the best results using quantitative and qualitative methods together.

Questionnaires. Questionnaires are a printed set of either open-ended or closed-ended questions. Respondents must answer based on their experience and knowledge of the issue. A questionnaire can be part of a survey, but it doesn’t necessarily have to be used for one.
Surveys. Surveys collect data from target audiences, gathering insights into their opinions, preferences, choices, and feedback on the organization’s goods and services. Most survey software has a wide range of question types, or you can also use a ready-made survey template that saves time and effort. Surveys can be distributed via different channels such as e-mail, offline apps, websites, social media, QR codes, etc.

Once researchers collect the data, survey software generates reports and runs analytics algorithms to uncover hidden insights. Survey dashboards give you statistics relating to completion rates, response rates, filters based on demographics, export and sharing options, etc. Practical business intelligence depends on the synergy between analytics and reporting. Analytics uncovers valuable insights while reporting communicates these findings to the stakeholders.

Polls. Polls consist of one or more multiple-choice questions. Marketers can turn to polls when they want to take a quick snapshot of the audience’s sentiments. Since polls tend to be short, getting people to respond is more manageable. Like surveys, online polls can be embedded into various media and platforms. Once the respondents answer the question(s), they can be shown how they stand concerning other people’s responses.
Delphi Technique. The name is a callback to the Oracle of Delphi, a priestess at Apollo’s temple in ancient Greece, renowned for her prophecies. In this method, marketing experts are given the forecast estimates and assumptions made by other industry experts. The first batch of experts may then use the information provided by the other experts to revise and reconsider their estimates and assumptions. The total expert consensus on the demand forecasts creates the final demand forecast.
Interviews. In this method, interviewers talk to the respondents either face-to-face or by telephone. In the first case, the interviewer asks the interviewee a series of questions in person and notes the responses. The interviewer can opt for a telephone interview if the parties cannot meet in person. This data collection form is practical for use with only a few respondents; repeating the same process with a considerably larger group takes longer.
Focus Groups. Focus groups are one of the primary examples of qualitative data in education. In focus groups, small groups of people, usually around 8-10 members, discuss the research problem’s common aspects. Each person provides their insights on the issue, and a moderator regulates the discussion. When the discussion ends, the group reaches a consensus.
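The feedback rounds of the Delphi technique described above can be sketched as a small simulation. This is only an illustration under stated assumptions: real experts revise their estimates by judgment, whereas here each expert is assumed to move a fixed fraction (`pull`, a hypothetical parameter) toward the group median, and the rounds stop once the spread of estimates falls within an assumed tolerance.

```python
from statistics import median


def delphi_round(estimates, pull=0.5):
    """One feedback round: each expert sees the group median and revises toward it.

    `pull` is a hypothetical stand-in for how strongly experts revise.
    """
    centre = median(estimates)
    return [e + pull * (centre - e) for e in estimates]


def run_delphi(estimates, tolerance=5.0, max_rounds=10):
    """Repeat feedback rounds until the estimates are close enough together."""
    rounds = 0
    while max(estimates) - min(estimates) > tolerance and rounds < max_rounds:
        estimates = delphi_round(estimates)
        rounds += 1
    # The consensus forecast is taken to be the median of the final estimates.
    return median(estimates), rounds


forecast, rounds = run_delphi([100.0, 120.0, 150.0, 90.0, 130.0])
print(forecast, rounds)  # 120.0 4
```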

Secondary Data Collection Methods

Secondary data is information that has already been collected and used for another purpose. Secondary data collection methods can include both quantitative and qualitative techniques. Because secondary data is readily available, it is usually less time-consuming and less expensive to obtain than primary data. However, the authenticity of data gathered from secondary sources can be difficult to verify.

Internal secondary data sources:

CRM Software
Executive summaries
Financial Statements
Mission and vision statements
Organization’s health and safety records
Sales Reports

External secondary data sources:

Business journals
Government reports
Press releases

The Importance of Data Collection Methods

Data collection methods play a critical part in the research process because they determine the quality and accuracy of the collected data. Here are some reasons why data collection procedures are so important:

They determine the quality and accuracy of collected data
They ensure the data and the research findings are valid, relevant and reliable
They help reduce bias and increase the sample’s representation
They are crucial for making informed decisions and arriving at accurate conclusions
They provide accurate data, which facilitates the achievement of research objectives

So, What’s the Difference Between Data Collecting and Data Processing?

Data collection is the first step in data processing. It involves gathering raw data from sources such as interviews, surveys, and questionnaires. Data processing describes the steps taken to organize, manipulate, and transform the collected data into a useful and meaningful resource. This may include tasks such as cleaning and validating data, analyzing and summarizing data, and creating visualizations or reports.
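To make the distinction concrete, here is a minimal sketch of a processing pass over raw survey records: cleaning, validating, and summarizing. The field names, the 1 to 5 satisfaction scale, and the sample records are all invented for illustration.

```python
from statistics import mean

# Raw data as it might arrive from a survey export (hypothetical records).
raw_responses = [
    {"respondent": " alice ", "age": "34", "satisfaction": "4"},
    {"respondent": "bob", "age": "", "satisfaction": "5"},
    {"respondent": "Carla", "age": "29", "satisfaction": "9"},  # off the scale
]


def to_int(value):
    """Convert a numeric string to int; return None for blank or non-numeric input."""
    value = value.strip()
    return int(value) if value.isdigit() else None


def clean(record):
    """Cleaning: trim whitespace, normalize names, convert numeric fields."""
    return {
        "respondent": record["respondent"].strip().title(),
        "age": to_int(record["age"]),
        "satisfaction": to_int(record["satisfaction"]),
    }


def is_valid(record):
    """Validation: keep records whose satisfaction score is on the 1-5 scale."""
    score = record["satisfaction"]
    return score is not None and 1 <= score <= 5


cleaned = [clean(r) for r in raw_responses]
valid = [r for r in cleaned if is_valid(r)]

# Summarizing: turn the validated records into a small report.
summary = {
    "n_valid": len(valid),
    "mean_satisfaction": mean(r["satisfaction"] for r in valid),
}
print(summary)  # {'n_valid': 2, 'mean_satisfaction': 4.5}
```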

So, data collection is just one step in the overall data processing chain of events.

Qualitative Study Design and Data Collection

First Online: 10 February 2022

Charles P. Friedman, Jeremy C. Wyatt & Joan S. Ash

Part of the book series: Health Informatics ((HI))

While the prior chapter set the stage for an understanding of the nature of qualitative evaluation, this chapter will offer strategies for planning a study and making decisions about how to gather data. The process is depicted as an iterative looping through steps beginning with idea generation to dissemination of results. It is critical that strategies for rigor be incorporated throughout the process. This chapter outlines methods for data collection utilizing interviews, focus groups, observation, and naturally occurring data, and then it also describes combinations often used together, which constitute toolkits of complementary techniques.

This is of course a major point of departure between qualitative methods and their quantitative counterparts. In quantitative work, investigators rarely acknowledge bias, and if they do, they may be disqualified from participating in the study.

For the same reasons, the observers should not dress too formally. They should dress as comparably as possible to the workers being observed in the field. Always ask ahead of time about dress codes.

Author information

Authors and affiliations.

Department of Learning Health Sciences, University of Michigan Medical School, Ann Arbor, MI, USA

Charles P. Friedman

Department of Primary Care, Population Sciences and Medical Education, School of Medicine, University of Southampton, Southampton, UK

Jeremy C. Wyatt

Department of Medical Informatics and Clinical Epidemiology, School of Medicine, Oregon Health & Science University, Portland, OR, USA

Joan S. Ash

Corresponding author

Correspondence to Charles P. Friedman.

Answers to Self-Tests

Self-Test 15.1

Which of the strategies to ensure study rigor is primarily employed in the qualitative study scenarios below:

Data from interviews about the usability of a resource are analyzed thematically. The evaluation study team looks to see if and how similar themes have arisen in earlier meetings of the team.

Audit trail

A member of the study team, who has recently participated in another study of a similar kind of resource, becomes concerned that their views about the current study are being shaped by that previous experience. They sit with another member of the study team to share these concerns and put them in perspective.

Reflexivity

At a “town hall” meeting called to present the results of a qualitative study, the sponsor of the study raises deep and serious questions about the validity of the findings. The study team returns to notes from their team meetings to review how and based on what data they came to this conclusion.

Member checking

During an evaluation project team meeting, one of the study team members finds themselves deeply repelled by off-color comments made by one of the project staff. The team member makes a note of this personal response as part of field notes.

After interviewing 10 patients participating in a study, a study team member perceives that they are hearing the same points raised by all interviewees. The team member requests a study team meeting to consider reducing the total number of interviews from 20, as previously planned, to 12.

Data saturation

A study team member “corners” a participant in a system development effort following a meeting and asks for the participant’s impressions on what transpired in the meeting.

Self-Test 15.2

Label each of the following interview scenarios, conducted as part of a qualitative study, as representing the fully structured, semi-structured or unstructured approach.

A study team member “corners” a participant in a system development project following a meeting and asks for that person’s impressions on what transpired in the meeting.

A study team member schedules time with a patient who is using an information resource to acquire specific information about the patient’s medical history.

Likely fully structured, though it could generate discussion, in which case it could veer towards semi-structured.

A study team member works with partners on the study team to develop a set of questions to be asked to all interviewees. Each question is to be followed up with the question: “Why do you think this is the case?”. At the end of the interview, subjects will be asked: “What else would you like to tell us to shed light on these matters?”

Semi-structured

An interview begins with the statement: “In general, what has been your experience using this EHR?” The remaining questions depend on how the interviewee answers this opening question.

Unstructured

A set of specific questions are read verbatim from an interview guide. No other questions are asked. The interviewees’ responses are recorded.

Fully structured

About this chapter

Friedman, C.P., Wyatt, J.C., Ash, J.S. (2022). Qualitative Study Design and Data Collection. In: Evaluation Methods in Biomedical and Health Informatics. Health Informatics. Springer, Cham. https://doi.org/10.1007/978-3-030-86453-8_15

DOI: https://doi.org/10.1007/978-3-030-86453-8_15

Published: 10 February 2022

Publisher Name: Springer, Cham

Print ISBN: 978-3-030-86452-1

Online ISBN: 978-3-030-86453-8

eBook Packages: Medicine, Medicine (R0)

Data Collection Methods | Step-by-Step Guide & Examples

Published on 4 May 2022 by Pritha Bhandari.

Data collection is a systematic process of gathering observations or measurements. Whether you are performing research for business, governmental, or academic purposes, data collection allows you to gain first-hand knowledge and original insights into your research problem.

While methods and aims may differ between fields, the overall process of data collection remains largely the same. Before you begin collecting data, you need to consider:

The aim of the research
The type of data that you will collect
The methods and procedures you will use to collect, store, and process the data

To collect high-quality data that is relevant to your purposes, follow these four steps.

Step 1: Define the aim of your research
Step 2: Choose your data collection method
Step 3: Plan your data collection procedures
Step 4: Collect the data
Frequently asked questions about data collection

Step 1: Define the aim of your research

Before you start the process of data collection, you need to identify exactly what you want to achieve. You can start by writing a problem statement: what is the practical or scientific issue that you want to address, and why does it matter?

Next, formulate one or more research questions that precisely define what you want to find out. Depending on your research questions, you might need to collect quantitative or qualitative data:

Quantitative data is expressed in numbers and graphs and is analysed through statistical methods.
Qualitative data is expressed in words and analysed through interpretations and categorisations.

If your aim is to test a hypothesis, measure something precisely, or gain large-scale statistical insights, collect quantitative data. If your aim is to explore ideas, understand experiences, or gain detailed insights into a specific context, collect qualitative data.

If you have several aims, you can use a mixed methods approach that collects both types of data.

For example, in a study of how employees perceive their managers:

Your first aim is to assess whether there are significant differences in perceptions of managers across different departments and office locations.
Your second aim is to gather meaningful feedback from employees to explore new ideas for how managers can improve.

Step 2: Choose your data collection method

Based on the data you want to collect, decide which method is best suited for your research.

Experimental research is primarily a quantitative method.
Interviews, focus groups, and ethnographies are qualitative methods.
Surveys, observations, archival research, and secondary data collection can be quantitative or qualitative methods.

Carefully consider what method you will use to gather data that helps you directly answer your research questions.

Step 3: Plan your data collection procedures

When you know which method(s) you are using, you need to plan exactly how you will implement them. What procedures will you follow to make accurate observations or measurements of the variables you are interested in?

For instance, if you’re conducting surveys or interviews, decide what form the questions will take; if you’re conducting an experiment, make decisions about your experimental design.

Operationalisation

Sometimes your variables can be measured directly: for example, you can collect data on the average age of employees simply by asking for dates of birth. However, often you’ll be interested in collecting data on more abstract concepts or variables that can’t be directly observed.

Operationalisation means turning abstract conceptual ideas into measurable observations. When planning how you will collect data, you need to translate the conceptual definition of what you want to study into the operational definition of what you will actually measure.

For example, to measure managers’ leadership skills:

You ask managers to rate their own leadership skills on 5-point scales assessing the ability to delegate, decisiveness, and dependability.
You ask their direct employees to provide anonymous feedback on the managers regarding the same topics.
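One way to turn those ratings into a single measurable value is to average the three items. The sketch below assumes an unweighted mean on the 5-point scales; the `leadership_score` function, the item names, and the ratings are hypothetical and chosen only to show the idea.

```python
from statistics import mean

# The three 5-point scales used to operationalise "leadership skills".
ITEMS = ("delegation", "decisiveness", "dependability")


def leadership_score(ratings):
    """Average the three item ratings into one score (assumed: unweighted mean)."""
    for item in ITEMS:
        if not 1 <= ratings[item] <= 5:
            raise ValueError(f"{item} must be rated from 1 to 5")
    return mean(ratings[item] for item in ITEMS)


# A manager's self-rating and anonymous ratings from two direct employees.
self_rating = {"delegation": 4, "decisiveness": 5, "dependability": 3}
employee_ratings = [
    {"delegation": 3, "decisiveness": 4, "dependability": 4},
    {"delegation": 2, "decisiveness": 4, "dependability": 3},
]

self_score = leadership_score(self_rating)
employee_score = mean(leadership_score(r) for r in employee_ratings)
print(f"Self: {self_score:.2f}, employees: {employee_score:.2f}")
# Self: 4.00, employees: 3.33
```

Comparing the two scores gives a simple, quantitative view of how a manager's self-perception differs from the employees' perception.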

Sampling

You may need to develop a sampling plan to obtain data systematically. This involves defining a population, the group you want to draw conclusions about, and a sample, the group you will actually collect data from.

Your sampling method will determine how you recruit participants or obtain measurements for your study. To decide on a sampling method you will need to consider factors like the required sample size, accessibility of the sample, and time frame of the data collection.
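Two common sampling methods can be sketched with the standard library: a simple random sample and a proportionate stratified sample. The employee list, the department split, and the 20% sampling fraction below are invented for illustration, and the fixed seed is only there so the draw can be repeated.

```python
import random


def simple_random_sample(population, n, seed=0):
    """Draw n units without replacement; every unit has the same chance."""
    rng = random.Random(seed)  # fixed seed so the draw can be reproduced
    return rng.sample(population, n)


def stratified_sample(population, key, fraction, seed=0):
    """Sample the same fraction from each stratum (at least one unit each)."""
    rng = random.Random(seed)
    strata = {}
    for unit in population:
        strata.setdefault(key(unit), []).append(unit)
    sample = []
    for members in strata.values():
        k = max(1, round(len(members) * fraction))
        sample.extend(rng.sample(members, k))
    return sample


# Hypothetical population: 30 employees, 10 in IT and 20 in Sales.
employees = [
    {"id": i, "department": "IT" if i % 3 == 0 else "Sales"} for i in range(30)
]

random_draw = simple_random_sample(employees, 6)
stratified_draw = stratified_sample(
    employees, key=lambda e: e["department"], fraction=0.2
)
print(len(random_draw), len(stratified_draw))  # 6 6
```

The stratified draw always contains 2 IT and 4 Sales employees, whereas the simple random draw may over- or under-represent a department by chance.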

Standardising procedures

If multiple researchers are involved, write a detailed manual to standardise data collection procedures in your study.

This means laying out specific step-by-step instructions so that everyone in your research team collects data in a consistent way – for example, by conducting experiments under the same conditions and using objective criteria to record and categorise observations.

This helps ensure the reliability of your data, and you can also use it to replicate the study in the future.

Creating a data management plan

Before beginning data collection, you should also decide how you will organise and store your data.

If you are collecting data from people, you will likely need to anonymise and safeguard the data to prevent leaks of sensitive information (e.g. names or identity numbers).
If you are collecting data via interviews or pencil-and-paper formats, you will need to perform transcriptions or data entry in systematic ways to minimise distortion.
You can prevent loss of data by having an organisation system that is routinely backed up.
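As one possible safeguard for the first point, direct identifiers can be replaced with pseudonyms before the data is stored. The sketch below uses a keyed hash from Python's standard library; the key value, the field names, and the 12-character ID length are assumptions. Note that this is pseudonymisation rather than full anonymisation: anyone holding the key can recompute the mapping, so the key must be stored separately from the data.

```python
import hashlib
import hmac

# Hypothetical secret; in practice, keep it outside the dataset and the code.
SECRET_KEY = b"replace-with-a-secret-stored-separately"


def pseudonymise(identifier: str) -> str:
    """Map an identifier to a stable pseudonym using a keyed SHA-256 hash."""
    digest = hmac.new(SECRET_KEY, identifier.encode("utf-8"), hashlib.sha256)
    return digest.hexdigest()[:12]


def strip_identifiers(record, id_field="name"):
    """Return a copy of the record with the identifying field replaced."""
    safe = {k: v for k, v in record.items() if k != id_field}
    safe["participant_id"] = pseudonymise(record[id_field])
    return safe


record = {"name": "Jane Example", "department": "Sales", "rating": 4}
print(strip_identifiers(record))
```

Because the same name always maps to the same pseudonym, responses from one participant can still be linked across data files without storing the name itself.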

Step 4: Collect the data

Finally, you can implement your chosen methods to measure or observe the variables you are interested in.

For example, in a survey about managers’ leadership skills, the closed-ended questions ask participants to rate their manager on scales from 1 to 5. The data produced is numerical and can be statistically analysed for averages and patterns.

To ensure that high-quality data is recorded in a systematic way, here are some best practices:

Record all relevant information as and when you obtain data. For example, note down whether or how lab equipment is recalibrated during an experimental study.
Double-check manual data entry for errors.
If you collect quantitative data, you can assess the reliability and validity to get an indication of your data quality.
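One common way to double-check manual data entry is double entry: two people key in the same forms independently and the two versions are compared. The sketch below is a minimal illustration of that comparison; the function name and the sample records are hypothetical.

```python
def find_entry_mismatches(first_pass, second_pass):
    """Compare two independent entry passes keyed by record ID.

    Returns (record_id, first_value, second_value) for every record that
    differs between the passes or is missing from one of them.
    """
    mismatches = []
    for record_id in sorted(set(first_pass) | set(second_pass)):
        a = first_pass.get(record_id)
        b = second_pass.get(record_id)
        if a != b:
            mismatches.append((record_id, a, b))
    return mismatches


# Two passes over the same three paper questionnaires (invented data).
first_pass = {
    101: {"age": 34, "rating": 4},
    102: {"age": 29, "rating": 5},
    103: {"age": 41, "rating": 2},
}
second_pass = {
    101: {"age": 34, "rating": 4},
    102: {"age": 92, "rating": 5},  # digits transposed during entry
}

for record_id, a, b in find_entry_mismatches(first_pass, second_pass):
    print(record_id, a, b)
# 102 {'age': 29, 'rating': 5} {'age': 92, 'rating': 5}
# 103 {'age': 41, 'rating': 2} None
```

Each flagged record is then checked against the original paper form to decide which value is correct.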

Frequently asked questions about data collection

Data collection is the systematic process by which observations or measurements are gathered in research. It is used in many different contexts by academics, governments, businesses, and other organisations.

When conducting research, collecting original data has significant advantages:

You can tailor data collection to your specific research aims (e.g., understanding the needs of your consumers or user testing your website).
You can control and standardise the process for high reliability and validity (e.g., choosing appropriate measurements and sampling methods).

However, there are also some drawbacks: data collection can be time-consuming, labour-intensive, and expensive. In some cases, it’s more efficient to use secondary data that has already been collected by someone else, but the data might be less reliable.

Quantitative research deals with numbers and statistics, while qualitative research deals with words and meanings.

Quantitative methods allow you to test a hypothesis by systematically collecting and analysing data, while qualitative methods allow you to explore ideas and experiences in depth.

Reliability and validity are both about how well a method measures something:

Reliability refers to the consistency of a measure (whether the results can be reproduced under the same conditions).
Validity refers to the accuracy of a measure (whether the results really do represent what they are supposed to measure).
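Consistency can be quantified in several ways. One of them is test-retest reliability: the same measure is given to the same people twice and the two sets of scores are correlated. The sketch below computes a Pearson correlation by hand; the scores are invented, and which reliability statistic is appropriate depends on the study.

```python
from math import sqrt
from statistics import mean


def pearson_r(x, y):
    """Pearson correlation between two equally long lists of scores."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two lists of equal length with at least 2 values")
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when one list is constant")
    return cov / sqrt(var_x * var_y)


# The same five participants rated on the same scale two weeks apart.
first_administration = [3, 4, 2, 5, 4]
second_administration = [3, 5, 2, 4, 4]

r = pearson_r(first_administration, second_administration)
print(f"Test-retest correlation: {r:.2f}")  # Test-retest correlation: 0.81
```

A value close to 1 suggests that the measure gives consistent results under the same conditions; it says nothing about validity, which has to be assessed separately.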

If you are doing experimental research, you also have to consider the internal and external validity of your experiment.

In mixed methods research, you use both qualitative and quantitative data collection and analysis methods to answer your research question.

Operationalisation means turning abstract conceptual ideas into measurable observations.

For example, the concept of social anxiety isn’t directly observable, but it can be operationally defined in terms of self-rating scores, behavioural avoidance of crowded places, or physical anxiety symptoms in social situations.

Before collecting data, it’s important to consider how you will operationalise the variables that you want to measure.

Cite this Scribbr article

Bhandari, P. (2022, May 04). Data Collection Methods | Step-by-Step Guide & Examples. Scribbr. Retrieved 27 May 2024, from https://www.scribbr.co.uk/research-methods/data-collection-guide/

Data Collection – Methods Types and Examples

Data Collection

Definition:

Data collection is the process of gathering and collecting information from various sources to analyze and make informed decisions based on the data collected. This can involve various methods, such as surveys, interviews, experiments, and observation.

In order for data collection to be effective, it is important to have a clear understanding of what data is needed and what the purpose of the data collection is. This can involve identifying the population or sample being studied, determining the variables to be measured, and selecting appropriate methods for collecting and recording data.

Types of Data Collection

Types of Data Collection are as follows:

Primary Data Collection

Primary data collection is the process of gathering original and firsthand information directly from the source or target population. This type of data collection involves collecting data that has not been previously gathered, recorded, or published. Primary data can be collected through various methods such as surveys, interviews, observations, experiments, and focus groups. The data collected is usually specific to the research question or objective and can provide valuable insights that cannot be obtained from secondary data sources. Primary data collection is often used in market research, social research, and scientific research.

Secondary Data Collection

Secondary data collection is the process of gathering information from existing sources that have already been collected and analyzed by someone else, rather than conducting new research to collect primary data. Secondary data can be collected from various sources, such as published reports, books, journals, newspapers, websites, government publications, and other documents.

Qualitative Data Collection

Qualitative data collection is used to gather non-numerical data such as opinions, experiences, perceptions, and feelings, through techniques such as interviews, focus groups, observations, and document analysis. It seeks to understand the deeper meaning and context of a phenomenon or situation and is often used in social sciences, psychology, and humanities. Qualitative data collection methods allow for a more in-depth and holistic exploration of research questions and can provide rich and nuanced insights into human behavior and experiences.

Quantitative Data Collection

Quantitative data collection is used to gather numerical data that can be analyzed using statistical methods. This data is typically collected through surveys, experiments, and other structured data collection methods. Quantitative data collection seeks to quantify and measure variables, such as behaviors, attitudes, and opinions, in a systematic and objective way. This data is often used to test hypotheses, identify patterns, and establish correlations between variables. Quantitative data collection methods allow for precise measurement and generalization of findings to a larger population. It is commonly used in fields such as economics, psychology, and natural sciences.

Data Collection Methods

Data Collection Methods are as follows:

Surveys

Surveys involve asking questions to a sample of individuals or organizations to collect data. Surveys can be conducted in person, over the phone, or online.

Interviews

Interviews involve a one-on-one conversation between the interviewer and the respondent. Interviews can be structured or unstructured and can be conducted in person or over the phone.

Focus Groups

Focus groups are group discussions that are moderated by a facilitator. Focus groups are used to collect qualitative data on a specific topic.

Observation

Observation involves watching and recording the behavior of people, objects, or events in their natural setting. Observation can be done overtly or covertly, depending on the research question.

Experiments

Experiments involve manipulating one or more variables and observing the effect on another variable. Experiments are commonly used in scientific research.

Case Studies

Case studies involve in-depth analysis of a single individual, organization, or event. Case studies are used to gain detailed information about a specific phenomenon.

Secondary Data Analysis

Secondary data analysis involves using existing data that was collected for another purpose. Secondary data can come from various sources, such as government agencies, academic institutions, or private companies.

How to Collect Data

The following are some steps to consider when collecting data:

1. Define the objective: Before you start collecting data, you need to define the objective of the study. This will help you determine what data you need to collect and how to collect it.
2. Identify the data sources: Identify the sources of data that will help you achieve your objective. These sources can be primary sources, such as surveys, interviews, and observations, or secondary sources, such as books, articles, and databases.
3. Determine the data collection method: Once you have identified the data sources, you need to determine the data collection method. This could be through online surveys, phone interviews, or face-to-face meetings.
4. Develop a data collection plan: Develop a plan that outlines the steps you will take to collect the data. This plan should include the timeline, the tools and equipment needed, and the personnel involved.
5. Test the data collection process: Before you start collecting data, test the data collection process to ensure that it is effective and efficient.
6. Collect the data: Collect the data according to the plan you developed in step 4. Make sure you record the data accurately and consistently.
7. Analyze the data: Once you have collected the data, analyze it to draw conclusions and make recommendations.
8. Report the findings: Report the findings of your data analysis to the relevant stakeholders. This could be in the form of a report, a presentation, or a publication.
9. Monitor and evaluate the data collection process: After the data collection process is complete, monitor and evaluate the process to identify areas for improvement in future data collection efforts.
10. Ensure data quality: Ensure that the collected data is of high quality and free from errors. This can be achieved by validating the data for accuracy, completeness, and consistency.
11. Maintain data security: Ensure that the collected data is secure and protected from unauthorized access or disclosure. This can be achieved by implementing data security protocols and using secure storage and transmission methods.
12. Follow ethical considerations: Follow ethical considerations when collecting data, such as obtaining informed consent from participants, protecting their privacy and confidentiality, and ensuring that the research does not cause harm to participants.
13. Use appropriate data analysis methods: Use appropriate data analysis methods based on the type of data collected and the research objectives. This could include statistical analysis, qualitative analysis, or a combination of both.
14. Record and store data properly: Record and store the collected data properly, in a structured and organized format. This will make it easier to retrieve and use the data in future research or analysis.
15. Collaborate with other stakeholders: Collaborate with other stakeholders, such as colleagues, experts, or community members, to ensure that the data collected is relevant and useful for the intended purpose.
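The data quality step in the list above (validating for accuracy, completeness, and consistency) can be sketched as a small report function. The required fields, the plausible age range, and the sample records are assumptions made for illustration; real checks depend on the study's own data dictionary.

```python
# Hypothetical required fields for each collected record.
REQUIRED_FIELDS = ("id", "age", "country")


def quality_report(records):
    """Flag records that fail simple completeness, accuracy, or consistency checks."""
    # Completeness: every required field must be present and non-empty.
    incomplete = [
        r.get("id")
        for r in records
        if any(r.get(field) in (None, "") for field in REQUIRED_FIELDS)
    ]
    # Accuracy: ages outside an assumed plausible range are suspicious.
    out_of_range = [
        r.get("id")
        for r in records
        if isinstance(r.get("age"), int) and not 0 <= r["age"] <= 120
    ]
    # Consistency: the same ID should not appear more than once.
    seen, duplicates = set(), []
    for r in records:
        if r.get("id") in seen:
            duplicates.append(r.get("id"))
        seen.add(r.get("id"))
    return {
        "incomplete": incomplete,
        "out_of_range": out_of_range,
        "duplicates": duplicates,
    }


records = [
    {"id": 1, "age": 34, "country": "UK"},
    {"id": 2, "age": None, "country": "UK"},  # missing age
    {"id": 3, "age": 150, "country": "FR"},   # implausible age
    {"id": 1, "age": 34, "country": "UK"},    # duplicate ID
]
print(quality_report(records))
# {'incomplete': [2], 'out_of_range': [3], 'duplicates': [1]}
```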

Applications of Data Collection

Data collection methods are widely used in different fields, including social sciences, healthcare, business, education, and more. Here are some examples of how data collection methods are used in different fields:

Social sciences : Social scientists often use surveys, questionnaires, and interviews to collect data from individuals or groups. They may also use observation to collect data on social behaviors and interactions. This data is often used to study topics such as human behavior, attitudes, and beliefs.
Healthcare : Data collection methods are used in healthcare to monitor patient health and track treatment outcomes. Electronic health records and medical charts are commonly used to collect data on patients’ medical history, diagnoses, and treatments. Researchers may also use clinical trials and surveys to collect data on the effectiveness of different treatments.
Business : Businesses use data collection methods to gather information on consumer behavior, market trends, and competitor activity. They may collect data through customer surveys, sales reports, and market research studies. This data is used to inform business decisions, develop marketing strategies, and improve products and services.
Education : In education, data collection methods are used to assess student performance and measure the effectiveness of teaching methods. Standardized tests, quizzes, and exams are commonly used to collect data on student learning outcomes. Teachers may also use classroom observation and student feedback to gather data on teaching effectiveness.
Agriculture : Farmers use data collection methods to monitor crop growth and health. Sensors and remote sensing technology can be used to collect data on soil moisture, temperature, and nutrient levels. This data is used to optimize crop yields and minimize waste.
Environmental sciences : Environmental scientists use data collection methods to monitor air and water quality, track climate patterns, and measure the impact of human activity on the environment. They may use sensors, satellite imagery, and laboratory analysis to collect data on environmental factors.
Transportation : Transportation companies use data collection methods to track vehicle performance, optimize routes, and improve safety. GPS systems, on-board sensors, and other tracking technologies are used to collect data on vehicle speed, fuel consumption, and driver behavior.

Examples of Data Collection

Examples of Data Collection are as follows:

Traffic Monitoring: Cities collect real-time data on traffic patterns and congestion through sensors on roads and cameras at intersections. This information can be used to optimize traffic flow and improve safety.
Social Media Monitoring : Companies can collect real-time data on social media platforms such as Twitter and Facebook to monitor their brand reputation, track customer sentiment, and respond to customer inquiries and complaints in real-time.
Weather Monitoring: Weather agencies collect real-time data on temperature, humidity, air pressure, and precipitation through weather stations and satellites. This information is used to provide accurate weather forecasts and warnings.
Stock Market Monitoring : Financial institutions collect real-time data on stock prices, trading volumes, and other market indicators to make informed investment decisions and respond to market fluctuations in real-time.
Health Monitoring : Medical devices such as wearable fitness trackers and smartwatches can collect real-time data on a person’s heart rate, blood pressure, and other vital signs. This information can be used to monitor health conditions and detect early warning signs of health issues.

Purpose of Data Collection

The purpose of data collection can vary depending on the context and goals of the study, but generally, it serves to:

Provide information: Data collection provides information about a particular phenomenon or behavior that can be used to better understand it.
Measure progress : Data collection can be used to measure the effectiveness of interventions or programs designed to address a particular issue or problem.
Support decision-making : Data collection provides decision-makers with evidence-based information that can be used to inform policies, strategies, and actions.
Identify trends : Data collection can help identify trends and patterns over time that may indicate changes in behaviors or outcomes.
Monitor and evaluate : Data collection can be used to monitor and evaluate the implementation and impact of policies, programs, and initiatives.

When to use Data Collection

Data collection is used when there is a need to gather information or data on a specific topic or phenomenon. It is typically used in research, evaluation, and monitoring and is important for making informed decisions and improving outcomes.

Data collection is particularly useful in the following scenarios:

Research : When conducting research, data collection is used to gather information on variables of interest to answer research questions and test hypotheses.
Evaluation : Data collection is used in program evaluation to assess the effectiveness of programs or interventions, and to identify areas for improvement.
Monitoring : Data collection is used in monitoring to track progress towards achieving goals or targets, and to identify any areas that require attention.
Decision-making: Data collection is used to provide decision-makers with information that can be used to inform policies, strategies, and actions.
Quality improvement : Data collection is used in quality improvement efforts to identify areas where improvements can be made and to measure progress towards achieving goals.

Characteristics of Data Collection

Data collection can be characterized by several important characteristics that help to ensure the quality and accuracy of the data gathered. These characteristics include:

Validity : Validity refers to the accuracy and relevance of the data collected in relation to the research question or objective.
Reliability : Reliability refers to the consistency and stability of the data collection process, ensuring that the results obtained are consistent over time and across different contexts.
Objectivity : Objectivity refers to the impartiality of the data collection process, ensuring that the data collected is not influenced by the biases or personal opinions of the data collector.
Precision: Precision refers to the level of detail and exactness of the data collected, ensuring that the data is specific enough to answer the research question or objective.
Timeliness : Timeliness refers to the efficiency and speed with which the data is collected, ensuring that the data is collected in a timely manner to meet the needs of the research or evaluation.
Ethical considerations : Ethical considerations refer to the ethical principles that must be followed when collecting data, such as ensuring confidentiality and obtaining informed consent from participants.

Advantages of Data Collection

There are several advantages of data collection that make it an important process in research, evaluation, and monitoring. These advantages include:

Better decision-making : Data collection provides decision-makers with evidence-based information that can be used to inform policies, strategies, and actions, leading to better decision-making.
Improved understanding: Data collection helps to improve our understanding of a particular phenomenon or behavior by providing empirical evidence that can be analyzed and interpreted.
Evaluation of interventions: Data collection is essential in evaluating the effectiveness of interventions or programs designed to address a particular issue or problem.
Identifying trends and patterns: Data collection can help identify trends and patterns over time that may indicate changes in behaviors or outcomes.
Increased accountability: Data collection increases accountability by providing evidence that can be used to monitor and evaluate the implementation and impact of policies, programs, and initiatives.
Validation of theories: Data collection can be used to test hypotheses and validate theories, leading to a better understanding of the phenomenon being studied.
Improved quality: Data collection is used in quality improvement efforts to identify areas where improvements can be made and to measure progress towards achieving goals.

Limitations of Data Collection

While data collection has several advantages, it also has some limitations that must be considered. These limitations include:

Bias : Data collection can be influenced by the biases and personal opinions of the data collector, which can lead to inaccurate or misleading results.
Sampling bias: The sample from which data is collected may not be representative of the entire population, resulting in sampling bias and inaccurate results.
Cost : Data collection can be expensive and time-consuming, particularly for large-scale studies.
Limited scope: Data collection is limited to the variables being measured, which may not capture the entire picture or context of the phenomenon being studied.
Ethical considerations : Data collection must follow ethical principles to protect the rights and confidentiality of the participants, which can limit the type of data that can be collected.
Data quality issues: Data collection may result in data quality issues such as missing or incomplete data, measurement errors, and inconsistencies.
Limited generalizability: Findings based on the collected data may not generalize to other contexts or populations.

About the author

Muhammad Hassan

Researcher, Academic Writer, Web developer


A Guide to Data Collection: Methods, Process, and Tools

Whether your field is development economics, international development, the nonprofit sector, or myriad other industries, effective data collection is essential. It informs decision-making and increases your organization’s impact. However, the process of data collection can be complex and challenging. If you’re in the beginning stages of creating a data collection process, this guide is for you. It outlines tested methods, efficient procedures, and effective tools to help you improve your data collection activities and outcomes. At SurveyCTO, we’ve used our years of experience and expertise to build a robust, secure, and scalable mobile data collection platform. It’s trusted by respected institutions like The World Bank, J-PAL, Oxfam, and the Gates Foundation, and it’s changed the way many organizations collect and use data. With this guide, we want to share what we know and help you get ready to take the first step in your data collection journey.

Main takeaways from this guide

Before starting the data collection process, define your goals and identify data sources, which can be primary (first-hand research) or secondary (existing resources).
Your data collection method should align with your goals, resources, and the nature of the data needed. Surveys, interviews, observations, focus groups, and forms are common data collection methods.
Sampling involves selecting a representative group from a larger population. Choosing the right sampling method to gather representative and relevant data is crucial.
Crafting effective data collection instruments like surveys and questionnaires is key. Instruments should undergo rigorous testing for reliability and accuracy.
Data collection is an ongoing, iterative process that demands real-time monitoring and adjustments to ensure high-quality, reliable results.
After data collection, data should be cleaned to eliminate errors and organized for efficient analysis. The data collection journey further extends into data analysis, where patterns and useful information that can inform decision-making are discovered.
Common challenges in data collection include data quality and consistency issues, data security concerns, and limitations with offline surveys. Employing robust data validation processes, implementing strong security protocols, and using offline-enabled data collection tools can help overcome these challenges.
Data collection, entry, and management tools and data analysis, visualization, reporting, and workflow tools can streamline the data collection process, improve data quality, and facilitate data analysis.

What is data collection?

The traditional definition of data collection might lead us to think of gathering information through surveys, observations, or interviews. However, the modern-age definition of data collection extends beyond conducting surveys and observations. It encompasses the systematic gathering and recording of any kind of information through digital or manual methods. Data collection can be as routine as a doctor logging a patient’s information into an electronic medical record system during each clinic visit, or as specific as keeping a record of mosquito nets delivered to a rural household.

Getting started with data collection

Before starting your data collection process, you must clearly understand what you aim to achieve and how you’ll get there. Below are some actionable steps to help you get started.

1. Define your goals

Defining your goals is a crucial first step. Engage relevant stakeholders and team members in an iterative and collaborative process to establish clear goals. It’s important that projects start with the identification of key questions and desired outcomes to ensure you focus your efforts on gathering the right information.

Start by understanding the purpose of your project: what problem are you trying to solve, or what change do you want to bring about? Think about your project’s potential outcomes and obstacles and try to anticipate what kind of data would be useful in these scenarios. Consider who will be using the data you collect and what data would be the most valuable to them. Think about the long-term effects of your project and how you will measure these over time. Lastly, leverage any historical data from previous projects to help you refine key questions that may have been overlooked previously.

Once questions and outcomes are established, your data collection goals may still vary based on the context of your work. To demonstrate, let’s use the example of an international organization working on a healthcare project in a remote area.

If you’re a researcher, your goal will revolve around collecting primary data to answer specific questions. This could involve designing a survey or conducting interviews to collect first-hand data on patient improvement, disease or illness prevalence, and behavior changes (such as an increase in patients seeking healthcare).
If you’re part of the monitoring and evaluation (M&E) team, your goal will revolve around measuring the success of your healthcare project. This could involve collecting primary data through surveys or observations and developing a dashboard to display real-time metrics like the number of patients treated, the percentage reduction in disease incidence, and average patient wait times. Your focus would be using this data to implement any needed program changes and ensure your project meets its objectives.
If you’re part of a field team, your goal will center around the efficient and accurate execution of project plans. You might be responsible for using data collection tools to capture pertinent information in different settings, such as in interviews conducted in person with the sample community or over the phone. The data you collect and manage will directly influence the operational efficiency of the project and assist in achieving the project’s overarching objectives.

2. Identify your data sources

The crucial next step in your research process is determining your data source. Essentially, there are two main data types to choose from: primary and secondary.

Primary data is the information you collect directly from first-hand engagements. It’s gathered specifically for your research and tailored to your research question. Primary data collection methods can range from surveys and interviews to focus groups and observations. Because you design the data collection process, primary data can offer precise, context-specific information directly related to your research objectives. For example, suppose you are investigating the impact of a new education policy. In that case, primary data might be collected through surveys distributed to teachers or interviews with school administrators dealing directly with the policy’s implementation.
Secondary data, on the other hand, is derived from resources that already exist. This can include information gathered for other research projects, administrative records, historical documents, statistical databases, and more. While not originally collected for your specific study, secondary data can offer valuable insights and background information that complement your primary data. For instance, continuing with the education policy example, secondary data might involve academic articles about similar policies, government reports on education, or previous survey data about teachers’ opinions on educational reforms.

While both types of data have their strengths, this guide will predominantly focus on primary data and the methods to collect it. Primary data is often emphasized in research because it provides fresh, first-hand insights that directly address your research questions. Primary data also allows for more control over the data collection process, ensuring data is relevant, accurate, and up-to-date.

However, secondary data can offer critical context, allow for longitudinal analysis, save time and resources, and provide a comparative framework for interpreting your primary data. It can be a crucial backdrop against which your primary data can be understood and analyzed. While we focus on primary data collection methods in this guide, we encourage you not to overlook the value of incorporating secondary data into your research design where appropriate.

3. Choose your data collection method

When choosing your data collection method, there are many options at your disposal. Data collection is not limited to methods like surveys and interviews. In fact, many of the processes in our daily lives serve the goal of collecting data, from intake forms to automated endpoints, such as payment terminals and mass transit card readers. Let us dive into some common types of data collection methods:

Surveys and Questionnaires

Surveys and questionnaires are tools for gathering information about a group of individuals, typically by asking them predefined questions. They can be used to collect quantitative and qualitative data and be administered in various ways, including online, over the phone, in person (offline), or by mail.

Advantages : They allow researchers to reach many participants quickly and cost-effectively, making them ideal for large-scale studies. The structured format of questions makes analysis easier.
Disadvantages : They may not capture complex or nuanced information as participants are limited to predefined response choices. Also, there can be issues with response bias, where participants might provide socially desirable answers rather than honest ones.

Interviews

Interviews involve a one-on-one conversation between the researcher and the participant. The interviewer asks open-ended questions to gain detailed information about the participant’s thoughts, feelings, experiences, and behaviors.

Advantages : They allow for an in-depth understanding of the topic at hand. The researcher can adapt the questioning in real time based on the participant’s responses, allowing for more flexibility.
Disadvantages : They can be time-consuming and resource-intensive, as they require trained interviewers and a significant amount of time for both conducting and analyzing responses. They may also introduce interviewer bias if not conducted carefully, due to how an interviewer presents questions and perceives the respondent, and how the respondent perceives the interviewer.

Observations

Observations involve directly observing and recording behavior or other phenomena as they occur in their natural settings.

Advantages : Observations can provide valuable contextual information, as researchers can study behavior in the environment where it naturally occurs, reducing the risk of artificiality associated with laboratory settings or self-reported measures.
Disadvantages : Observational studies may suffer from observer bias, where the observer’s expectations or biases could influence their interpretation of the data. Also, some behaviors might be altered if subjects are aware they are being observed.

Focus Groups

Focus groups are guided discussions among selected individuals to gain information about their views and experiences.

Advantages : Focus groups allow for interaction among participants, which can generate a diverse range of opinions and ideas. They are good for exploring new topics where there is little pre-existing knowledge.
Disadvantages : Dominant voices in the group can sway the discussion, potentially silencing less assertive participants. They also require skilled facilitators to moderate the discussion effectively.

Forms

Forms are standardized documents with blank fields for collecting data in a systematic manner. They are often used in fields like Customer Relationship Management (CRM) or Electronic Medical Records (EMR) data entry. Surveys may also be referred to as forms.

Advantages : Forms are versatile, easy to use, and efficient for data collection. They can streamline workflows by standardizing the data entry process.
Disadvantages : They may not provide in-depth insights as the responses are typically structured and limited. There is also potential for errors in data entry, especially when done manually.

Selecting the right data collection method should be an intentional process, taking into consideration the unique requirements of your project. The method selected should align with your goals, available resources, and the nature of the data you need to collect.

If you aim to collect quantitative data, surveys, questionnaires, and forms can be excellent tools, particularly for large-scale studies. These methods are suited to providing structured responses that can be analyzed statistically, delivering solid numerical data.

However, if you’re looking to uncover a deeper understanding of a subject, qualitative data might be more suitable. In such cases, interviews, observations, and focus groups can provide richer, more nuanced insights. These methods allow you to explore experiences, opinions, and behaviors deeply. Some surveys can also include open-ended questions that provide qualitative data.

The cost of data collection is also an important consideration. If you have budget constraints, in-depth, in-person conversations with every member of your target population may not be practical. In such cases, distributing questionnaires or forms can be a cost-saving approach.

Additional considerations include language barriers and connectivity issues. If your respondents speak different languages, consider translation services or multilingual data collection tools. If your target population resides in areas with limited connectivity and you plan to collect data using mobile devices, ensure your tool provides offline data collection, which will allow you to carry out your data collection plan without internet connectivity.

4. Determine your sampling method

Now that you’ve established your data collection goals and how you’ll collect your data, the next step is deciding whom to collect your data from. Sampling involves carefully selecting a representative group from a larger population. Choosing the right sampling method is crucial for gathering representative and relevant data that aligns with your data collection goal.

Consider the following guidelines to choose the appropriate sampling method for your research goal and data collection method:

Understand Your Target Population: Start by conducting thorough research of your target population. Understand who they are, their characteristics, and subgroups within the population.
Anticipate and Minimize Biases: Anticipate and address potential biases within the target population to help minimize their impact on the data. For example, will your sampling method accurately reflect all ages, genders, cultures, etc., of your target population? Are there barriers to participation for any subgroups? Your sampling method should allow you to capture the most accurate representation of your target population.
Maintain Cost-Effective Practices: Consider the cost implications of your chosen sampling methods. Some sampling methods will require more resources, time, and effort. Your chosen sampling method should balance the cost factors with the ability to collect your data effectively and accurately.
Consider Your Project’s Objectives: Tailor the sampling method to meet your specific objectives and constraints, such as M&E teams requiring real-time impact data and researchers needing representative samples for statistical analysis.

By adhering to these guidelines, you can make informed choices when selecting a sampling method, maximizing the quality and relevance of your data collection efforts.
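
To make the idea of a representative sample more concrete, here is a minimal Python sketch that contrasts two common approaches: simple random sampling and proportional stratified sampling. The guide does not prescribe either method; the population, the "region" field, the 10% sampling fraction, and the function names are all assumptions made for illustration.

```python
import random
from collections import defaultdict


def simple_random_sample(population, n, seed=0):
    """Draw n members at random; every member has the same chance."""
    rng = random.Random(seed)  # fixed seed so the draw is reproducible
    return rng.sample(population, n)


def stratified_sample(population, key, fraction, seed=0):
    """Draw the same fraction from each subgroup (stratum) separately."""
    rng = random.Random(seed)
    strata = defaultdict(list)
    for member in population:
        strata[key(member)].append(member)
    sample = []
    for members in strata.values():
        # At least one member per stratum so small subgroups are not lost.
        k = max(1, round(len(members) * fraction))
        sample.extend(rng.sample(members, k))
    return sample


# Hypothetical population: 80 urban and 20 rural households.
population = [
    {"id": i, "region": "urban" if i < 80 else "rural"} for i in range(100)
]

srs = simple_random_sample(population, 10)
strat = stratified_sample(population, key=lambda p: p["region"], fraction=0.1)

rural_in_strat = sum(1 for p in strat if p["region"] == "rural")
print(len(srs), len(strat), rural_in_strat)
```

With a simple random draw, the number of rural households in the sample varies from draw to draw. Stratifying fixes each subgroup's share in advance, which is one way to address the concern about subgroups being under-represented.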

5. Identify and train collectors

Not every data collection use case requires data collectors, but training individuals responsible for data collection becomes crucial in scenarios involving field presence.

The SurveyCTO platform supports both self-response survey modes and surveys that require a human field worker to do in-person interviews. Whether you’re hiring and training data collectors, utilizing an existing team, or training existing field staff, we offer comprehensive guidance and the right tools to ensure effective data collection practices.

Here are some common training approaches for data collectors:

In-Class Training: Comprehensive sessions covering protocols, survey instruments, and best practices empower data collectors with skills and knowledge.
Tests and Assessments: Assessments evaluate collectors’ understanding and competence, highlighting areas where additional support is needed.
Mock Interviews: Simulated interviews refine collectors’ techniques and communication skills.
Pre-Recorded Training Sessions: Accessible reinforcement and self-paced learning to refresh and stay updated.

Training data collectors is vital for successful data collection. Your training should focus on proper instrument usage and effective interaction with respondents, including communication skills, cultural literacy, and ethical considerations.

Remember, training is an ongoing process. Knowledge gaps and issues may arise in the field, necessitating further training.

Moving Ahead: Iterative Steps in Data Collection

Once you’ve established the preliminary elements of your data collection process, you’re ready to start your data collection journey. In this section, we’ll delve into the specifics of designing and testing your instruments, collecting data, and organizing data while embracing the iterative nature of the data collection process, which requires diligent monitoring and making adjustments when needed.

6. Design and test your instruments

Designing effective data collection instruments like surveys and questionnaires is key. It’s crucial to prioritize respondent consent and privacy to ensure the integrity of your research. Thoughtful design and careful testing of survey questions are essential for optimizing research insights. Other critical considerations are:

Clear and Unbiased Question Wording: Craft unambiguous, neutral questions free from bias to gather accurate and meaningful data. For example, instead of asking, “Shouldn’t we invest more into renewable energy that will combat the effects of climate change?” ask your question in a neutral way that allows the respondent to voice their thoughts, such as: “What are your thoughts on investing more in renewable energy?”
Logical Ordering and Appropriate Response Format: Arrange questions logically and choose response formats (such as multiple-choice, Likert scale, or open-ended) that suit the nature of the data you aim to collect.
Coverage of Relevant Topics: Ensure that your instrument covers all topics pertinent to your data collection goals while respecting cultural and social sensitivities. Make sure your instrument avoids assumptions, stereotypes, and languages or topics that could be considered offensive or taboo in certain contexts. The goal is to avoid marginalizing or offending respondents based on their social or cultural background.
Collect Only Necessary Data: Design survey instruments that focus solely on gathering the data required for your research objectives, avoiding unnecessary information.
Language(s) of the Respondent Population: Tailor your instruments to accommodate the languages your target respondents speak, offering translated versions if needed. Similarly, take into account accessibility for respondents who can’t read by offering alternative formats like images in place of text.
Desired Length of Time for Completion: Respect respondents’ time by designing instruments that can be completed within a reasonable timeframe, balancing thoroughness with engagement. Having an expected completion time will also help you weed out bad responses. For example, a response completed well outside your expected timeframe, such as one that was rushed, could indicate a response that needs to be excluded.
Collecting and Documenting Respondents’ Consent and Privacy: Ensure a robust consent process, transparent data usage communication, and privacy protection throughout data collection.
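
The completion-time check described in the list above can be sketched in a few lines of Python. This is only an illustration: the response records, the "duration_seconds" field, and the 3 to 30 minute window are assumptions, and a flagged response is a candidate for review rather than an automatic exclusion.

```python
# Hypothetical responses with the time each took to complete, in seconds.
responses = [
    {"id": "a", "duration_seconds": 640},
    {"id": "b", "duration_seconds": 45},    # far quicker than expected
    {"id": "c", "duration_seconds": 900},
    {"id": "d", "duration_seconds": 5400},  # left open for a long time
]


def flag_by_duration(responses, min_seconds, max_seconds):
    """Return (id, reason) pairs for responses outside the expected window."""
    flagged = []
    for response in responses:
        duration = response["duration_seconds"]
        if duration < min_seconds:
            flagged.append((response["id"], "too fast"))
        elif duration > max_seconds:
            flagged.append((response["id"], "too slow"))
    return flagged


# Assumed window: between 3 and 30 minutes for this instrument.
flagged = flag_by_duration(responses, min_seconds=180, max_seconds=1800)
print(flagged)
```

A sensible window usually comes from pilot testing, where you can observe how long careful respondents actually take.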

Perform Cognitive Interviewing

Cognitive interviewing is a method used to refine survey instruments and improve the accuracy of survey responses by evaluating how respondents understand, process, and respond to the instrument’s questions. In practice, cognitive interviewing involves an interview with the respondent, asking them to verbalize their thoughts as they interact with the instrument. By actively probing and observing their responses, you can identify and address ambiguities, ensuring accurate data collection.

Thoughtful question wording, well-organized response options, and logical sequencing enhance comprehension, minimize biases, and ensure accurate data collection. Iterative testing and refinement based on respondent feedback improve the validity, reliability, and actionability of insights obtained.

Put Your Instrument to the Test

Through rigorous testing, you can uncover flaws, ensure reliability, maximize accuracy, and validate your instrument’s performance. This can be achieved by:

Conducting pilot testing to enhance the reliability and effectiveness of data collection. Administer the instrument, identify difficulties, gather feedback, and assess performance in real-world conditions.
Making revisions based on pilot testing to enhance clarity, accuracy, usability, and participant satisfaction. Refine questions, instructions, and format for effective data collection.
Continuously iterating and refining your instrument based on feedback and real-world testing. This ensures reliable, accurate, and audience-aligned methods of data collection. Additionally, this ensures your instrument adapts to changes, incorporates insights, and maintains ongoing effectiveness.

7. Collect your data

Now that you have your well-designed survey, interview questions, observation plan, or form, it’s time to implement it and gather the needed data. Data collection is not a one-and-done deal; it’s an ongoing process that demands attention to detail. Imagine spending weeks collecting data, only to discover later that a significant portion is unusable due to incomplete responses, improper collection methods, or falsified responses. To avoid such setbacks, adopt an iterative approach.

Leverage data collection tools with real-time monitoring to proactively identify outliers and issues. Take immediate action by fine-tuning your instruments, optimizing the data collection process, addressing concerns like additional training, or reevaluating personnel responsible for inaccurate data (for example, a field worker who sits in a coffee shop entering fake responses rather than doing the work of knocking on doors).

SurveyCTO’s Data Explorer was specifically designed to fulfill this requirement, empowering you to monitor incoming data, gain valuable insights, and know where changes may be needed. Embracing this iterative approach ensures ongoing improvement in data collection, resulting in more reliable and precise results.

8. Clean and organize your data

After data collection, the next step is to clean and organize the data to ensure its integrity and usability.

Data Cleaning: This stage involves sifting through your data to identify and rectify any errors, inconsistencies, or missing values. It’s essential to maintain the accuracy of your data and ensure that it’s reliable for further analysis. Data cleaning can uncover duplicates, outliers, and gaps that could skew your results if left unchecked. With real-time data monitoring, this continuous cleaning process keeps your data precise and current throughout the data collection period. Similarly, review and corrections workflows allow you to monitor the quality of your incoming data.
Organizing Your Data: Post-cleaning, it’s time to organize your data for efficient analysis and interpretation. Labeling your data using appropriate codes or categorizations can simplify navigation and streamline the extraction of insights. When you use a survey or form, labeling your data is often not necessary because you can design the instrument to collect in the right categories or return the right codes. An organized dataset is easier to manage, analyze, and interpret, ensuring that your collection efforts are not wasted but lead to valuable, actionable insights.
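
As a rough illustration of the cleaning step, the sketch below uses only Python’s standard library to catch the three problems named above: missing values, duplicates, and outliers. The field names, the sample records, and the outlier rule (more than three median absolute deviations from the median) are assumptions chosen for the example; your project may call for different checks.

```python
from statistics import median

def clean(records, required=("id", "age")):
    """Return (clean_rows, issues) for a list of dict records."""
    seen, rows, issues = set(), [], []
    for rec in records:
        missing = [f for f in required if rec.get(f) in (None, "")]
        if missing:
            issues.append((rec.get("id"), "missing: " + ", ".join(missing)))
            continue
        if rec["id"] in seen:
            issues.append((rec["id"], "duplicate"))
            continue
        seen.add(rec["id"])
        rows.append(rec)
    return rows, issues

def flag_outliers(rows, field, k=3.0):
    """Return ids whose value lies more than k median absolute deviations from the median."""
    values = [r[field] for r in rows]
    med = median(values)
    mad = median(abs(v - med) for v in values)
    if mad == 0:
        return []
    return [r["id"] for r in rows if abs(r[field] - med) / mad > k]

# Hypothetical sample data.
records = [
    {"id": 1, "age": 34}, {"id": 2, "age": 29}, {"id": 2, "age": 29},
    {"id": 3, "age": None}, {"id": 4, "age": 31}, {"id": 5, "age": 240},
    {"id": 6, "age": 36},
]
rows, issues = clean(records)
outlier_ids = flag_outliers(rows, "age")
print(issues, outlier_ids)
```

Outliers are only flagged here, not removed, so that someone can decide whether each one is an entry error or a genuine value.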

Remember, each stage of the data collection process, from design to cleaning, is iterative and interconnected. By diligently cleaning and organizing your data, you are setting the stage for robust, meaningful analysis that can inform your data-driven decisions and actions.

What happens after data collection?


The data collection journey takes us next into data analysis, where you’ll uncover patterns, empowering informed decision-making for researchers, evaluation teams, and field personnel.

Process and Analyze Your Data

Explore data through statistical and qualitative techniques to discover patterns, correlations, and insights during this pivotal stage. It’s about extracting the essence of your data and translating numbers into knowledge. Whether applying descriptive statistics, conducting regression analysis, or using thematic coding for qualitative data, this process drives decision-making and charts the path toward actionable outcomes.

Interpret and Report Your Results

Interpreting and reporting your data brings meaning and context to the numbers. Translating raw data into digestible insights for informed decision-making and effective stakeholder communication is critical.

The approach to interpretation and reporting varies depending on the perspective and role:

Researchers often lean heavily on statistical methods to identify trends, extract meaningful conclusions, and share their findings in academic circles, contributing to their knowledge pool.
M&E teams typically produce comprehensive reports, shedding light on the effectiveness and impact of programs. These reports guide internal and sometimes external stakeholders, supporting informed decisions and driving program improvements.

Field teams provide a first-hand perspective. Since they are often the first to see the results of the practical implementation of data, field teams are instrumental in providing immediate feedback loops on project initiatives. Field teams do the work that provides context to help research and M&E teams understand external factors like the local environment, cultural nuances, and logistical challenges that impact data results.

Safely store and handle data

Throughout the data collection process, and after it has been collected, it is vital to follow best practices for storing and handling data to ensure the integrity of your research. While the specifics of how to best store and handle data will depend on your project, here are some important guidelines to keep in mind:

Use cloud storage to hold your data if possible, since this is safer than storing data on hard drives and keeps it more accessible,
Periodically back up and purge old data from your system, since it’s safer to not retain data longer than necessary,
If you use mobile devices to collect and store data, use private, internal, app-specific storage where possible,
Restrict access to stored data to only those who need to work with that data.

Further considerations for data safety are discussed below in the section on data security.

Remember to uphold ethical standards in interpreting and reporting your data, regardless of your role. Clear communication, respectful handling of sensitive information, and adhering to confidentiality and privacy rights are all essential to fostering trust, promoting transparency, and bolstering your work’s credibility.

Common Data Collection Challenges

Data collection is vital to data-driven initiatives, but it comes with challenges. Addressing common challenges such as poor data quality, privacy concerns, inadequate sample sizes, and bias is essential to ensure the collected data is reliable, trustworthy, and secure.

In this section, we’ll explore three major challenges: data quality and consistency issues, data security concerns, and limitations with offline data collection, along with strategies to overcome them.

Data Quality and Consistency

Data quality and consistency refer to data accuracy and reliability throughout the collection and analysis process.

Challenges such as incomplete or missing data, data entry errors, measurement errors, and data coding/categorization errors can impact the integrity and usefulness of the data.

To navigate these complexities and maintain high standards, consistency, and integrity in the dataset:

Implement robust data validation processes,
Ensure proper training for data entry personnel,
Employ automated data validation techniques, and
Conduct regular data quality audits.
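
One way to picture automated data validation is as a small set of rules applied to every incoming submission. The sketch below is an assumption-laden example, not the behavior of any specific tool: the field names and the accepted ranges are hypothetical.

```python
# Hypothetical validation rules: each maps a field name to a check.
RULES = {
    "age": lambda v: isinstance(v, int) and 0 <= v <= 120,
    "consent": lambda v: v in {"yes", "no"},
    "household_size": lambda v: isinstance(v, int) and v >= 1,
}

def validate(submission, rules=RULES):
    """Return the names of fields that are missing or fail their rule."""
    return [
        field
        for field, is_valid in rules.items()
        if field not in submission or not is_valid(submission[field])
    ]

good = {"age": 34, "consent": "yes", "household_size": 4}
bad = {"age": 150, "consent": "maybe"}
print(validate(good))
print(validate(bad))
```

Running checks like these at entry time, rather than after collection ends, lets field teams correct problems while the respondent is still available.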

Data security

Data security encompasses safeguarding data through ensuring data privacy and confidentiality, securing storage and backup, and controlling data sharing and access.

Challenges include the risk of potential breaches, unauthorized access, and the need to comply with data protection regulations.

To address these setbacks and maintain privacy, trust, and confidence during the data collection process:

Use encryption and authentication methods,
Implement robust security protocols,
Update security measures regularly,
Provide employee training on data security, and
Adopt secure cloud storage solutions.

Offline Data Collection

Offline data collection refers to gathering data using modes like mobile device-based computer-assisted personal interviewing (CAPI) when the internet connection is inconsistent or unreliable and the data collection tool used for CAPI can work offline.

Challenges associated with offline data collection include synchronization issues, difficulty transferring data, and compatibility problems between devices and data collection tools.

To overcome these challenges and enable efficient and reliable offline data collection processes, employ the following strategies:

Leverage offline-enabled data collection apps or tools that let you survey respondents even when there’s no internet connection and upload data to a central repository later,
Schedule periodic data synchronization in your data collection plan for times when connectivity is available,
Use offline, device-based storage for seamless data transfer and compatibility, and
Provide clear instructions to field personnel on handling offline data collection scenarios.
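
The store-now, sync-later pattern behind these strategies can be sketched with Python’s built-in sqlite3 module. This is a simplified illustration, not how any particular data collection app works: the table layout is hypothetical, and `upload` stands in for whatever function would send a submission to your central repository.

```python
import json
import sqlite3

def open_store(path=":memory:"):
    """Open (or create) a local store for submissions collected offline."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS submissions ("
        "id INTEGER PRIMARY KEY, "
        "payload TEXT NOT NULL, "
        "synced INTEGER NOT NULL DEFAULT 0)"
    )
    return conn

def save_offline(conn, submission):
    """Save a submission locally, marked as not yet synced."""
    conn.execute(
        "INSERT INTO submissions (payload) VALUES (?)",
        (json.dumps(submission),),
    )
    conn.commit()

def sync(conn, upload):
    """Try to upload every unsynced row; mark a row synced only on success."""
    rows = conn.execute(
        "SELECT id, payload FROM submissions WHERE synced = 0 ORDER BY id"
    ).fetchall()
    sent = 0
    for row_id, payload in rows:
        if upload(json.loads(payload)):
            conn.execute("UPDATE submissions SET synced = 1 WHERE id = ?", (row_id,))
            sent += 1
    conn.commit()
    return sent

# Hypothetical usage: a fake uploader that always succeeds.
received = []
def fake_upload(submission):
    received.append(submission)
    return True

conn = open_store()
save_offline(conn, {"household": "A-17", "members": 4})
save_offline(conn, {"household": "B-02", "members": 2})
first_sync = sync(conn, fake_upload)
second_sync = sync(conn, fake_upload)
print(first_sync, second_sync)
```

Because a row is marked as synced only after its upload succeeds, a dropped connection leaves the remaining submissions queued for the next attempt.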

Utilizing Technology in Data Collection


Embracing technology throughout your data collection process can help you overcome many challenges described in the previous section. Data collection tools can streamline your data collection, improve the quality and security of your data, and facilitate the analysis of your data. Let’s look at two broad categories of tools that are essential for data collection:

Data Collection, Entry, & Management Tools

These tools help with data collection, input, and organization. They can range from digital survey platforms to comprehensive database systems, allowing you to gather, enter, and manage your data effectively. They can significantly simplify the data collection process, minimize human error, and offer practical ways to organize and manage large volumes of data. Some of these tools are:

Microsoft Office
Google Docs
SurveyMonkey
Google Forms

Data Analysis, Visualization, Reporting, & Workflow Tools

These tools assist in processing and interpreting the collected data. They provide a way to visualize data in a user-friendly format, making it easier to identify trends and patterns. These tools can also generate comprehensive reports to share your findings with stakeholders and help manage your workflow efficiently. By automating complex tasks, they can help ensure accuracy and save time. Tools for these purposes include:

Google Sheets

Data collection tools like SurveyCTO often have integrations to help users seamlessly transition from data collection to data analysis, visualization, reporting, and managing workflows.

Master Your Data Collection Process With SurveyCTO

As we bring this guide to a close, you now possess a wealth of knowledge to develop your data collection process. From understanding the significance of setting clear goals to the crucial process of selecting your data collection methods and addressing common challenges, you are equipped to handle the intricate details of this dynamic process.

Remember, you’re not venturing into this complex process alone. At SurveyCTO, we offer not just a tool but an entire support system committed to your success. Beyond troubleshooting support, our success team serves as research advisors and expert partners, ready to provide guidance at every stage of your data collection journey.

With SurveyCTO, you can design flexible surveys in Microsoft Excel or Google Sheets, collect data online and offline with above-industry-standard security, monitor your data in real time, and effortlessly export it for further analysis in any tool of your choice. You also get access to our Data Explorer, which allows you to visualize incoming data at both individual survey and aggregate levels instantly.

In the iterative data collection process, our users tell us that SurveyCTO stands out with its capacity to establish review and correction workflows. It enables you to monitor incoming data and configure automated quality checks to flag error-prone submissions.

Finally, data security is of paramount importance to us. We ensure best-in-class security measures like SOC 2 compliance, end-to-end encryption, single sign-on (SSO), GDPR-compliant setups, customizable user roles, and self-hosting options to keep your data safe.

As you embark on your data collection journey, you can count on SurveyCTO’s experience and expertise to be by your side every step of the way. Our team would be excited and honored to be a part of your research project, offering you the tools and processes to gain informative insights and make effective decisions. Partner with us today and revolutionize the way you collect data.

Better data, better decision making, better world.


4 Gathering and Analyzing Qualitative Data


As the role of clinician researchers expands beyond the bedside, it is important to consider the possibilities of inquiry beyond the quantitative approach. In contrast to the quantitative approach, qualitative methodology is highly inductive and relies on the background and interpretation of the researcher to derive meaning from the gathering and analytic processes central to qualitative inquiry.

Chapter 4: Learning Objectives

As you explore the research opportunities central to your interests and consider whether a qualitative component would enrich your work, you’ll be able to:

Define what qualitative research is
Compare qualitative and quantitative approaches
Describe the process of creating themes from recurring ideas gleaned from narrative interviews

What Is Qualitative Research?

Quantitative researchers typically start with a focused research question or hypothesis, collect a small amount of numerical data from a large number of individuals, describe the resulting data using statistical techniques, and draw general conclusions about some large population. Although this method is by far the most common approach to conducting empirical research in fields such as respiratory care and other clinical fields, there is an important alternative called qualitative research. Qualitative research originated in the disciplines of anthropology and sociology but is now used to study psychological topics as well. Qualitative researchers generally begin with a less focused research question, collect large amounts of relatively “unfiltered” data from a relatively small number of individuals, and describe their data using nonstatistical techniques, such as grounded theory, thematic analysis, critical discourse analysis, or interpretative phenomenological analysis. They are usually less concerned with drawing general conclusions about human behavior than with understanding in detail the experience of their research participants.

Consider, for example, a study by researcher Per Lindqvist and his colleagues, who wanted to learn how the families of teenage suicide victims cope with their loss (Lindqvist, Johansson, & Karlsson, 2008). They did not have a specific research question or hypothesis, such as, What percentage of family members join suicide support groups? Instead, they wanted to understand the variety of reactions that families had, with a focus on what it is like from their perspectives. To address this question, they interviewed the families of 10 teenage suicide victims in their homes in rural Sweden. The interviews were relatively unstructured, beginning with a general request for the families to talk about the victim and ending with an invitation to talk about anything else that they wanted to tell the interviewer. One of the most important themes that emerged from these interviews was that even as life returned to “normal,” the families continued to struggle with the question of why their loved one committed suicide. This struggle appeared to be especially difficult for families in which the suicide was most unexpected.

The Purpose of Qualitative Research

The strength of quantitative research is its ability to provide precise answers to specific research questions and to draw general conclusions about human behavior. This method is how we know that people have a strong tendency to obey authority figures, for example, and that female undergraduate students are not substantially more talkative than male undergraduate students. But while quantitative research is good at providing precise answers to specific research questions, it is not nearly as good at generating novel and interesting research questions. Likewise, while quantitative research is good at drawing general conclusions about human behavior, it is not nearly as good at providing detailed descriptions of the behavior of particular groups in particular situations. And quantitative research is not very good at communicating what it is actually like to be a member of a particular group in a particular situation.

But the relative weaknesses of quantitative research are the relative strengths of qualitative research. Qualitative research can help researchers to generate new and interesting research questions and hypotheses. The research of Lindqvist and colleagues, for example, suggests that there may be a general relationship between how unexpected a suicide is and how consumed the family is with trying to understand why the teen committed suicide. This relationship can now be explored using quantitative research. But it is unclear whether this question would have arisen at all without the researchers sitting down with the families and listening to what they themselves wanted to say about their experience. Qualitative research can also provide rich and detailed descriptions of human behavior in the real-world contexts in which it occurs. Among qualitative researchers, this depth is often referred to as “thick description” (Geertz, 1973).

Similarly, qualitative research can convey a sense of what it is actually like to be a member of a particular group or in a particular situation, which qualitative researchers often refer to as the “lived experience” of the research participants. Lindqvist and colleagues, for example, describe how all the families spontaneously offered to show the interviewer the victim’s bedroom or the place where the suicide occurred, revealing the importance of these physical locations to the families. It seems unlikely that a quantitative study would have discovered this detail. The table below lists some contrasts between qualitative and quantitative research.

Table listing major differences between qualitative and quantitative approaches to research. Highlights of qualitative research include deep exploration of a very small sample, conclusions based on interpretation drawn by the investigator and that the focus is both global and exploratory.

Data Collection and Analysis in Qualitative Research

Data collection approaches in qualitative research are quite varied and can involve naturalistic observation, participant observation, archival data, artwork, and many other things. But one of the most common approaches, especially for psychological research, is to conduct interviews. Interviews in qualitative research can be unstructured—consisting of a small number of general questions or prompts that allow participants to talk about what is of interest to them—or structured, where there is a strict script that the interviewer does not deviate from. Most interviews are in between the two and are called semi-structured interviews, where the researcher has a few consistent questions and can follow up by asking more detailed questions about the topics that come up. Such interviews can be lengthy and detailed, but they are usually conducted with a relatively small sample. The unstructured interview was the approach used by Lindqvist and colleagues in their research on the families of suicide victims because the researchers were aware that how much was disclosed about such a sensitive topic should be led by the families, not by the researchers.

Another approach used in qualitative research involves small groups of people who participate together in interviews focused on a particular topic or issue, known as focus groups. The interaction among participants in a focus group can sometimes bring out more information than can be learned in a one-on-one interview. The use of focus groups has become a standard technique in business and industry among those who want to understand consumer tastes and preferences. The content of all focus group interviews is usually recorded and transcribed to facilitate later analyses. However, we know from social psychology that group dynamics are often at play in any group, including focus groups, and it is useful to be aware of those possibilities. For example, the desire to be liked by others can lead participants to provide inaccurate answers that they believe will be perceived favorably by the other participants. The same may be said for personality characteristics. For example, highly extraverted participants can sometimes dominate discussions within focus groups.

Data Analysis in Qualitative Research

Although quantitative and qualitative research generally differ along several important dimensions (e.g., the specificity of the research question, the type of data collected), it is the method of data analysis that distinguishes them more clearly than anything else. To illustrate this idea, imagine a team of researchers that conducts a series of unstructured interviews with people recovering from alcohol use disorder to learn about the role of their religious faith in their recovery. Although this project sounds like qualitative research, imagine further that once they collect the data, they code the data in terms of how often each participant mentions God (or a “higher power”), and they then use descriptive and inferential statistics to find out whether those who mention God more often are more successful in abstaining from alcohol. Now it sounds like quantitative research. In other words, the quantitative-qualitative distinction depends more on what researchers do with the data they have collected than with why or how they collected the data.

But what does qualitative data analysis look like? Just as there are many ways to collect data in qualitative research, there are many ways to analyze data. Here we focus on one general approach called grounded theory (Glaser & Strauss, 1967). This approach was developed within the field of sociology in the 1960s and has gradually gained popularity in psychology. Remember that in quantitative research, it is typical for the researcher to start with a theory, derive a hypothesis from that theory, and then collect data to test that specific hypothesis. In qualitative research using grounded theory, researchers start with the data and develop a theory or an interpretation that is “grounded in” those data. They do this analysis in stages. First, they identify ideas that are repeated throughout the data. Then they organize these ideas into a smaller number of broader themes. Finally, they write a theoretical narrative, an interpretation of the data in terms of the themes that they have identified. This theoretical narrative focuses on the subjective experience of the participants and is usually supported by many direct quotations from the participants themselves.

As an example, consider a study by researchers Laura Abrams and Laura Curran, who used the grounded theory approach to study the experience of postpartum depression symptoms among low-income mothers (Abrams & Curran, 2009). Their data were the result of unstructured interviews with 19 participants. The table below shows the five broad themes the researchers identified and the more specific repeating ideas that made up each of those themes. In their research report, they provide numerous quotations from their participants, such as this one from “Destiny”:

“Well, just recently my apartment was broken into and the fact that his Medicaid for some reason was cancelled so a lot of things was happening within the last two weeks all at one time. So that in itself I don’t want to say almost drove me mad but it put me in a funk…. Like I really was depressed.” (p. 357)

Their theoretical narrative focused on the participants’ experience of their symptoms, not as an abstract “affective disorder” but as closely tied to the daily struggle of raising children alone under often difficult circumstances. The table below illustrates the process of creating themes from repeating ideas in the qualitative research gathering and analysis process.

Table illustrates the process of grouping repeating ideas to identify recurring themes in the qualitative research gathering process. This requires a degree of interpretation of the data unique to the qualitative approach.
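
The bookkeeping side of this process (tallying which ideas repeat across participants and filing them under broader themes) can be sketched in Python. The interpretive work of assigning codes and naming themes remains the researcher’s; the participant labels, codes, and theme names below are hypothetical and are not taken from Abrams and Curran’s study.

```python
from collections import Counter, defaultdict

# Hypothetical codes a researcher assigned to interview excerpts.
coded_excerpts = [
    ("p01", "money worries"),
    ("p01", "no time to rest"),
    ("p02", "money worries"),
    ("p02", "feeling judged"),
    ("p03", "no time to rest"),
    ("p03", "money worries"),
    ("p03", "one-off remark"),
]

# Hypothetical researcher-defined mapping from repeating ideas to themes.
theme_of = {
    "money worries": "material hardship",
    "no time to rest": "demands of caregiving",
    "feeling judged": "demands of caregiving",
}

def repeating_ideas(excerpts, min_participants=2):
    """Keep ideas raised by at least `min_participants` different participants."""
    counts = Counter(idea for _, idea in set(excerpts))
    return {idea: n for idea, n in counts.items() if n >= min_participants}

def group_into_themes(ideas, mapping):
    """Group repeating ideas under the theme the researcher assigned to each."""
    themes = defaultdict(list)
    for idea in sorted(ideas):
        themes[mapping.get(idea, "unassigned")].append(idea)
    return dict(themes)

ideas = repeating_ideas(coded_excerpts)
themes = group_into_themes(ideas, theme_of)
print(themes)
```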

Given their differences, it may come as no surprise that quantitative and qualitative research do not coexist in complete harmony. Some quantitative researchers criticize qualitative methods on the grounds that they lack objectivity, are difficult to evaluate in terms of reliability and validity, and do not allow generalization to people or situations other than those actually studied. At the same time, some qualitative researchers criticize quantitative methods on the grounds that they overlook the richness of human behavior and experience and instead answer simple questions about easily quantifiable variables.

In general, however, qualitative researchers are well aware of the issues of objectivity, reliability, validity, and generalizability. In fact, they have developed a number of frameworks for addressing these issues (which are beyond the scope of our discussion). And in general, quantitative researchers are well aware of the issue of oversimplification. They do not believe that all human behavior and experience can be adequately described in terms of a small number of variables and the statistical relationships among them. Instead, they use simplification as a strategy for uncovering general principles of human behavior.

Many researchers from both the quantitative and qualitative camps now agree that the two approaches can and should be combined into what has come to be called mixed-methods research (Todd, Nerlich, McKeown, & Clarke, 2004). In fact, the studies by Lindqvist and colleagues and by Abrams and Curran both combined quantitative and qualitative approaches. One approach to combining quantitative and qualitative research is to use qualitative research for hypothesis generation and quantitative research for hypothesis testing. Again, while a qualitative study might suggest that families who experience an unexpected suicide have more difficulty resolving the question of why, a well-designed quantitative study could test a hypothesis by measuring these specific variables in a large sample. A second approach to combining quantitative and qualitative research is referred to as triangulation. The idea is to use both quantitative and qualitative methods simultaneously to study the same general questions and to compare the results. If the results of the quantitative and qualitative methods converge on the same general conclusion, they reinforce and enrich each other. If the results diverge, then they suggest an interesting new question: Why do the results diverge and how can they be reconciled?

Using qualitative research can often help clarify quantitative results via triangulation. Trenor, Yu, Waight, Zerda, and Sha (2008) investigated the experience of female engineering students at a university. In the first phase, female engineering students were asked to complete a survey, where they rated a number of their perceptions, including their sense of belonging. Their results were compared across the student ethnicities, and statistically, the various ethnic groups showed no differences in their ratings of their sense of belonging.

One might look at that result and conclude that ethnicity does not have anything to do with one’s sense of belonging. However, in the second phase, the authors also conducted interviews with the students, and in those interviews, many minority students reported how the diversity of cultures at the university enhanced their sense of belonging. Without the qualitative component, we might have drawn the wrong conclusion about the quantitative results.

This example shows how qualitative and quantitative research work together to help us understand human behavior. Some researchers have characterized qualitative research as best for identifying behaviors or the phenomenon whereas quantitative research is best for understanding meaning or identifying the mechanism. However, Bryman (2012) argues for breaking down the divide between these arbitrarily different ways of investigating the same questions.

Key Takeaways

The qualitative approach is centered on an inductive method of reasoning
The qualitative approach focuses on understanding a phenomenon through the perspective of those experiencing it
Researchers search for recurring topics and group themes to build upon theory to explain findings
A mixed methods approach uses both quantitative and qualitative methods to explain different aspects of a phenomenon, process, or practice
This chapter can be attributed to Research Methods in Psychology by Rajiv S. Jhangiani, I-Chant A. Chiang, Carrie Cuttler, & Dana C. Leighton, which is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License, except where otherwise noted. This adaptation constitutes the fourth edition of this textbook, and builds upon the second Canadian edition by Rajiv S. Jhangiani (Kwantlen Polytechnic University) and I-Chant A. Chiang (Quest University Canada), the second American edition by Dana C. Leighton (Texas A&M University-Texarkana), and the third American edition by Carrie Cuttler (Washington State University) and feedback from several peer reviewers coordinated by the Rebus Community. This edition is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.


Open access
Published: 23 May 2024

Investigating racial/ethnic differences in procedure experience in obstetrics & gynecology trainees at a single academic institution: a retrospective cohort study

Patricia GiglioAyers 1,2, Christine E. Foley 1,2, Beth Cronin 1,2 & Dayna Burrell 1,2

BMC Medical Education, volume 24, Article number: 561 (2024)


Background

Discrimination is common in medical education. Resident physicians of races and ethnicities underrepresented in medicine experience daily discrimination, which has been proven to negatively impact training. There is limited data on the impact of resident race/ethnicity on OB/GYN surgical training. The objective of this study was to investigate the impact of race/ethnicity on procedural experience in OB/GYN training.

Methods

A retrospective analysis of graduated OB/GYN resident case logs from 2009 to 2019 was performed at a single urban academic institution. Self-reported race/ethnicity data was collected. Trainees were categorized by self-reported race/ethnicity into underrepresented in medicine (URM) (Black, Hispanic, Native American) and non-URM (White, Asian). Differences between URM and non-URM trainees were analyzed using t-tests.

Results

The cohort consisted of 84 residents: 19% URM (n = 16) and 79% non-URM (n = 66). Differences in average case volume between URM and non-URM trainees were analyzed using t-tests. There was no difference between non-URM and URM trainees in reported mean number of Total GYN (349 vs. 334, p = 0.31) and Total OB (624 vs. 597, p = 0.11) case logs. However, compared with non-URM, on average URM performed fewer Total procedures (1562 vs. 1469, p = 0.04). Analyzing individual procedures showed a difference in average number of abortions performed between URM and non-URM (76 vs. 53, p = 0.02). There were no other statistically significant differences between the two groups.
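
For readers unfamiliar with the comparison, a two-sample t-test of mean case volumes can be sketched as follows. The article does not say which t-test variant was used, so Welch’s unequal-variance form is an assumption here, and the case counts are invented for illustration; they are not the study’s data. The sketch computes only the test statistic and degrees of freedom. A p-value would additionally need the t distribution, available for example through `scipy.stats.ttest_ind` with `equal_var=False`.

```python
from math import sqrt
from statistics import mean, variance

def welch_t(a, b):
    """Return (t statistic, degrees of freedom) for Welch's two-sample t-test."""
    va = variance(a) / len(a)  # squared standard error of the mean, group a
    vb = variance(b) / len(b)  # squared standard error of the mean, group b
    t = (mean(a) - mean(b)) / sqrt(va + vb)
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df

# Hypothetical total case counts per trainee (not the study's data).
non_urm_counts = [1560, 1600, 1510, 1580, 1550, 1620]
urm_counts = [1450, 1490, 1380, 1520, 1505]

t_stat, df = welch_t(non_urm_counts, urm_counts)
print(f"t = {t_stat:.2f}, df = {df:.1f}")
```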

Conclusions

This single institution study highlights potential differences in trainee experience by race/ethnicity. Larger national studies are warranted to further explore these differences to identify bias and discrimination, and to ensure equitable experience for all trainees.

Introduction

Discrimination is common in medical education, with nearly 60% of medical trainees experiencing at least one form of harassment or discrimination during their training [ 1 ]. Race/ethnicity has been proven to negatively impact medical student experiences and evaluations [ 2 , 3 ]. Although data remains limited, a rising number of studies explore the impact of race/ethnicity on residency training.

Resident physicians of races and ethnicities underrepresented in medicine endure daily microaggressions and biases [ 4 ]. In general surgery, up to 24% of residents report experiencing discrimination based on race/ethnicity or religion, with highest rates (70%) reported among Black residents [ 5 , 6 , 7 ]. Black surgical residents are 4.2 times more likely to experience high levels of perceived daily discrimination [ 7 ]. Discriminatory acts include being mistaken for another person of the same race, mistaken for nonphysicians, and experiencing different standards of evaluation [ 5 ]. Compared with their White counterparts, non-White residents experience increased feelings of isolation and judgement [ 8 ]. Surgical residents who experience discrimination also reported higher rates of burnout, thoughts of attrition, and suicidal thoughts [ 5 , 6 ]. A recent study investigating the relationship between gender, race/ethnicity and general surgery resident case volume cites a correlation between racial/ethnic categories underrepresented in medicine (URM) (identified as Black, Hispanic or Native American) and lower operative volumes at graduation [ 9 ].

Data regarding the impact of race/ethnicity on training in Obstetrics and Gynecology (OB/GYN) is limited. OB/GYN is reported to have the highest percentage of trainees from racial and ethnic backgrounds underrepresented in medicine at 19% among the surgical subspecialties [ 10 ]. However, recent data from 2022 demonstrated there is a greater proportion of White physicians at the fellowship level compared to residency level [ 11 ]. This trend persists in academic medicine, with a higher proportion of White physicians in leadership positions and with higher academic ranks [ 12 ]. Despite multiple initiatives by national organizations within OB/GYN to address racial and ethnic disparities [ 13 , 14 ], studies exploring racial disparities and discrimination are sparse in OB/GYN literature. To the authors' knowledge, there is no published data on the impact of race/ethnicity on resident surgical training in OB/GYN. Specifically, there is no data on the impact of race on the fundamental metric of surgical volume during gynecology residency training. The aim of this study was to begin by exploring the impact of race/ethnicity on OB/GYN procedural experience in residency training at a single institution.

A retrospective analysis of graduated OB/GYN resident procedural case logs per the Accreditation Council for Graduate Medical Education (ACGME) from 2009 to 2019 at a single institution was performed. The research was deemed exempt by the IRB and was determined to be non-human subjects research. Self-reported race/ethnicity as limited by ERAS check boxes was collected. Trainees were categorized into URM (Black, Hispanic, Native American) and non-URM (White, Asian). The institution instructs residents to log a procedure if active participation as the primary surgeon is > 50% of the procedure. The primary outcome was total number of surgical procedures logged by a graduating resident. Secondary outcomes included procedure logs for the following ACGME categories: Normal spontaneous vaginal delivery (NSVD), Cesarean section (CS), Operative delivery (ODEL), Abdominal hysterectomy (AHYST), Vaginal hysterectomy (VHYST), Laparoscopic hysterectomy (LHYST), Minimally Invasive Hysterectomy (MIH), Total Hysterectomy (THYST), Incontinence and pelvic floor (ISPF), Laparoscopy (LAPS), Operative Hysteroscopy (OHYST), Abortion (ABORT), Transvaginal ultrasound (TVUS), Surgery for invasive cancer (SIC). Total numbers of cases, total obstetric (Total OB: CS, NSVD, ODEL), and total gynecologic (Total GYN: THYST, LAPS, OHYST) cases were collected. Residents in OB/GYN who completed the four-year residency training program were included in the analysis. Trainees who transferred training programs during residency or did not complete residency were excluded. Procedures were reported as mean number of procedures per ACGME category per group (URM vs. non-URM). Differences between URM and non-URM status and mean case volumes were analyzed using t-tests.
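The grouping, composite totals, and group comparison described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the article does not say which t-test variant was used, so Welch's unequal-variance form is assumed here; the record layout and every number in `residents` are hypothetical and are not study data; and the inclusion criteria and the > 50% logging rule are not modeled.

```python
from math import sqrt
from statistics import mean, variance

# Group definitions as stated in the Methods.
URM = {"Black", "Hispanic", "Native American"}
NON_URM = {"White", "Asian"}


def group_of(race_ethnicity):
    """Map a self-reported category to 'URM', 'non-URM', or None (excluded)."""
    if race_ethnicity in URM:
        return "URM"
    if race_ethnicity in NON_URM:
        return "non-URM"
    return None  # e.g. "none of the above"


def total_ob(log):
    """Total OB = CS + NSVD + ODEL."""
    return log["CS"] + log["NSVD"] + log["ODEL"]


def total_gyn(log):
    """Total GYN = THYST + LAPS + OHYST."""
    return log["THYST"] + log["LAPS"] + log["OHYST"]


def welch_t(a, b):
    """Return (t statistic, degrees of freedom) for Welch's two-sample t-test.

    Assumption: the unequal-variance form; the article only says "t-tests".
    """
    va, vb = variance(a) / len(a), variance(b) / len(b)
    t = (mean(a) - mean(b)) / sqrt(va + vb)
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df


# HYPOTHETICAL records, invented for illustration only (not study data).
residents = [
    {"race": "White", "log": {"CS": 200, "NSVD": 350, "ODEL": 30}},
    {"race": "Asian", "log": {"CS": 210, "NSVD": 380, "ODEL": 25}},
    {"race": "White", "log": {"CS": 190, "NSVD": 400, "ODEL": 35}},
    {"race": "Black", "log": {"CS": 195, "NSVD": 360, "ODEL": 20}},
    {"race": "Hispanic", "log": {"CS": 205, "NSVD": 340, "ODEL": 28}},
    {"race": "none of the above", "log": {"CS": 180, "NSVD": 330, "ODEL": 22}},
]

# Collect Total OB per group, skipping residents outside both groups.
by_group = {"URM": [], "non-URM": []}
for r in residents:
    g = group_of(r["race"])
    if g is not None:
        by_group[g].append(total_ob(r["log"]))

t, df = welch_t(by_group["non-URM"], by_group["URM"])
print(f"Total OB: t = {t:.2f}, df = {df:.1f}")
```

Turning the statistic into a P value requires the t distribution, which the standard library does not provide; with SciPy installed, `scipy.stats.ttest_ind(a, b, equal_var=False)` returns both the statistic and the P value for the same Welch test.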

The cohort consisted of 84 residents. Residents who self-selected the ACGME category of “none of the above” (n = 2) were excluded from the URM vs. non-URM analyses. A total of 82 residents were included in the final analysis: 16 URM and 66 non-URM (78.57% of the full cohort) (Table 1). There were no differences between non-URM and URM trainees in the reported mean number of Total GYN (349 vs. 334, P = 0.31) and Total OB (624 vs. 597, P = 0.11) case logs. However, URM trainees had significantly fewer Total procedures (1469 vs. 1562, P = 0.04) than their non-URM counterparts (Table 2). For specific procedures, URM trainees logged significantly fewer abortions than non-URM trainees (53 vs. 76, P = 0.02). No differences were found between non-URM and URM trainees in any other individual procedure category (Table 2).
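The reported percentages appear to use the full cohort of 84 as the denominator rather than the 82 residents analyzed. A quick arithmetic check, taking the URM count of 16 from the abstract (the variable and function names are illustrative only):

```python
cohort = 84          # all graduated residents in the cohort
excluded = 2         # selected "none of the above"
non_urm = 66
urm = 16             # count given in the abstract

analyzed = cohort - excluded
assert analyzed == urm + non_urm  # 82 residents in the final analysis


def pct(part, whole):
    """Percentage of `whole` represented by `part`, rounded to 2 decimals."""
    return round(100 * part / whole, 2)


print(pct(non_urm, cohort))    # share of the full cohort
print(pct(urm, cohort))
print(pct(non_urm, analyzed))  # share of the analyzed residents
```

Against the full cohort, 66 of 84 gives the reported 78.57% and 16 of 84 rounds to the 19% in the abstract; against the 82 analyzed residents the non-URM share would instead be about 80.5%.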

Resident trainees from races and ethnicities underrepresented in medicine experience daily discrimination; however, there is limited data on the impact of racial/ethnic discrimination on training and postgraduate experience within OB/GYN. The importance of identifying and addressing racial and ethnic disparities within OB/GYN and medical education is widely accepted. In 2021, the ACGME launched ACGME Equity Matters, an initiative focused on learning and improvement in areas of diversity, equity and inclusion, and antiracism practices [ 13 ]. In 2020, ACOG, along with leading national and international women’s health organizations, released a joint statement, “Collective Action Addressing Racism.” [ 14 ] This statement specifically cites commitment to education, recognition, and scholarship as ways to eliminate inequalities in women’s health. Despite these initiatives, published research is limited.

This single institution study highlights potential differences in trainee experience by race/ethnicity and calls for further review at training programs across our specialty. This study showed a difference in total procedure experience between URM and non-URM OB/GYN residents during the 10-year time period examined. These differences may suggest discriminatory practices which are limiting procedural experience for URM residents. These findings are similar to recently published data that demonstrated a correlation between general surgery residents underrepresented in medicine or who identified as female, and lower operative volumes at graduation [ 9 ].

Additionally, this study observed a significant difference in the number of abortion procedures logged by URM versus non-URM trainees. In our institution, trainees have the choice to opt out of abortion procedures. This choice is not recorded as a part of the operative log but may confound this particular data point. We are unaware of any correlation between a trainee’s self-identified race and choice to perform abortion procedures. Additional work is needed to evaluate the demonstrated differences on a qualitative level to better identify the root cause(s) of the variation demonstrated, including possible sociocultural influences. Further work must be done to identify unconscious and overt biases and address discrimination to ensure all residents, regardless of race/ethnicity or gender, have an equitable training experience.

This small, single institution study calls for further review of racial and ethnic differences in procedural experience at training programs across our specialty. Although OB/GYN does have the highest percent of URM trainees among the surgical subspecialties, the lower proportion of URM physicians in fellowships and in higher academic rank positions suggests persistent institutional and structural racism. Procedural case logs are an objective and nationally utilized measure which could be further analyzed to identify and ultimately address training differences. If publicly available, these case logs could hold programs accountable for ensuring equitable procedural experience. Addressing any identified differences would not only improve resident experience and skill, but also contribute to the goal of creating a racially and ethnically diverse workforce to improve patient care in OB/GYN.

There are several limitations to this study, including variation in the accuracy and reporting practices of resident procedure logs which may impact data. Although criteria at this institution exist instructing residents to log only procedures which they performed > 50% of as the primary surgeon, residents are individually responsible for tracking and logging procedures. Furthermore, the small sample size of this study at a single institution, coupled with the variation in resident surgical experience and reporting practices between OB/GYN programs nationally, prevents this study's findings from being generalized to all OB/GYN residency programs. This study analyzes total case logs at time of graduation, and therefore does not explore how race/ethnicity may impact procedural experience across the four years of residency and does not account for variation in logging during different times of residency. The authors also recognize that increased procedural numbers do not necessarily translate to procedural competency. Although differences may suggest training inequity among URM vs. non-URM residents, variation in procedural numbers may not reflect trainee competency at time of graduation.

Differences may exist in Obstetrics and Gynecology procedural experience by trainee race/ethnicity. Larger national studies are warranted to further explore these differences to identify bias and discrimination, and to ensure equitable experience for all trainees.

Data availability

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

1. Fnais N, Soobiah C, Chen MH, et al. Harassment and discrimination in medical training: a systematic review and meta-analysis. Acad Med. 2014;89(5):817–27. https://doi.org/10.1097/ACM.0000000000000200

2. Woolf K. Differential attainment in medical education and training. BMJ. 2020;368:m339. https://doi.org/10.1136/bmj.m339

3. Orom H, Semalulu T, Underwood W 3rd. The social and learning environments experienced by underrepresented minority medical students: a narrative review. Acad Med. 2013;88(11):1765–77. https://doi.org/10.1097/ACM.0b013e3182a7a3af

4. Osseo-Asare A, Balasuriya L, Huot SJ, et al. Minority Resident Physicians’ views on the role of Race/Ethnicity in their training experiences in the Workplace. JAMA Netw Open. 2018;1(5):e182723. https://doi.org/10.1001/jamanetworkopen.2018.2723

5. Yuce TK, Turner PL, Glass C, et al. National Evaluation of Racial/Ethnic Discrimination in US Surgical Residency Programs. JAMA Surg. 2020;155(6):526–8. https://doi.org/10.1001/jamasurg.2020.0260

6. Hu YY, Ellis RJ, Hewitt DB, et al. Discrimination, abuse, harassment, and Burnout in Surgical Residency Training. N Engl J Med. 2019;381(18):1741–52. https://doi.org/10.1056/NEJMsa1903759

7. Khubchandani JA, Atkinson RB, Ortega G, et al. Perceived discrimination among Surgical residents at Academic Medical centers. J Surg Res. 2022;272:79–87. https://doi.org/10.1016/j.jss.2021.10.029

8. Wong RL, Sullivan MC, Yeo HL, Roman SA, Bell RH Jr, Sosa JA. Race and surgical residency: results from a national survey of 4339 US general surgery residents. Ann Surg. 2013;257(4):782–7. https://doi.org/10.1097/sla.0b013e318269d2d0

9. Eruchalu CN, He K, Etheridge JC, et al. Gender and Racial/Ethnic disparities in operative volumes of graduating general surgery residents. J Surg Res. 2022;279:104–12. https://doi.org/10.1016/j.jss.2022.05.020

10. Nieblas-Bedolla E, Williams JR, Christophers B, Kweon CY, Williams EJ, Jimenez N. Trends in Race/Ethnicity among applicants and matriculants to US Surgical specialties, 2010–2018. JAMA Netw Open. 2020;3(11):e2023509. https://doi.org/10.1001/jamanetworkopen.2020.23509

11. Talbott JMV, Wasson MN. Sex and Racial/Ethnic Diversity in Accredited Obstetrics and Gynecology Specialty and Subspecialty Training in the United States. J Surg Educ. 2022;79(3):818–27. https://doi.org/10.1016/j.jsurg.2021.12.011

12. Wooding DJ, Das P, Tiwana S, Siddiqi J, Khosa F. Race, ethnicity, and gender in academic obstetrics and gynecology: 12-year trends. Am J Obstet Gynecol MFM. 2020;2(4):100178. https://doi.org/10.1016/j.ajogmf.2020.100178

13. Diversity, Equity, and Inclusion. Accessed December 6, 2022. https://www.acgme.org/what-we-do/diversity-equity-and-inclusion/

14. Joint Statement: Collective Action Addressing Racism. Accessed December 6, 2022. https://www.acog.org/news/news-articles/2020/08/joint-statement-obstetrics-and-gynecology-collective-action-addressing-racism

Acknowledgements

Not applicable.

There is no financial support or funding to report for this manuscript.

Author information

Authors and affiliations.

Department of Obstetrics and Gynecology, Women and Infants Hospital, 101 Dudley St, 02905, Providence, RI, USA

Patricia GiglioAyers, Christine E. Foley, Beth Cronin & Dayna Burrell

The Warren Alpert Medical School of Brown University, 222 Richmond Street, 02903, Providence, RI, USA

Contributions

PGA and DB were involved in the conception, design, interpretation of data, and manuscript writing. CF was involved in the design of this study, analysis, and editing of the manuscript. BC contributed to the conception, design, and editing of this work. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Patricia GiglioAyers .

Ethics declarations

Competing interests.

The authors declare no competing interests.

Ethics, approval, and consent to participate

The ethical approval for the study and informed consent were waived by the Women and Infants Institutional Review Board due to the retrospective nature of the study. All methods carried out in the study were performed in accordance with relevant guidelines and regulations.

Consent for publication

Competing interests.

The author(s) declare(s) that they have no competing interests. Dr. Dayna Burrell has acted as a BMC Education article reviewer in the past upon request. This data was accepted for oral presentation at the 2023 CREOG and APGO Annual Meeting. The conference took place February 27-March 1, 2023, in National Harbor, Maryland.

Additional information

Publisher’s note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ . The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/ ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

About this article

Cite this article.

GiglioAyers, P., Foley, C.E., Cronin, B. et al. Investigating racial/ethnic differences in procedure experience in obstetrics & gynecology trainees at a single academic institution: a retrospective cohort study. BMC Med Educ 24, 561 (2024). https://doi.org/10.1186/s12909-024-05363-9

Received : 07 June 2023

Accepted : 28 March 2024

Published : 23 May 2024

DOI : https://doi.org/10.1186/s12909-024-05363-9

Race/Ethnicity
Residency training
Surgical education
Obstetrics and gynecology
Operative logs

BMC Medical Education

ISSN: 1472-6920


COMMENTS

(PDF) Collecting data through case studies
The case study is a data collection method in which in-depth descriptive information about specific entities, or cases, is collected, organized, interpreted, and presented in a narrative format ...
Case Study
A case study is a research method that involves an in-depth examination and analysis of a particular phenomenon or case, such as an individual, organization, community, event, or situation. It is a qualitative research approach that aims to provide a detailed and comprehensive understanding of the case being studied.
Data Collection
Learn how to collect data systematically for your research project. Find out how to choose the right method, plan your procedures, operationalize your variables, and collect your data.
Case Study Method: A Step-by-Step Guide for Business Researchers
Although case studies have been discussed extensively in the literature, little has been written about the specific steps one may use to conduct case study research effectively (Gagnon, 2010; Hancock & Algozzine, 2016). Baskarada (2014) also emphasized the need to have a succinct guideline that can be practically followed as it is actually tough to execute a case study well in practice.
Planning Qualitative Research: Design and Decision Making for New
When conducting a case study, researchers use a variety of data collection procedures. Merriam and Tisdell (2015) and Creswell and Poth (2018) suggest multiple information sources for reconstructing and analyzing the case. Within the bounded system, one must investigate the perceptions of diverse participants, collect multiple types of evidence ...
Best Practices in Data Collection and Preparation: Recommendations for
We offer best-practice recommendations for journal reviewers, editors, and authors regarding data collection and preparation. Our recommendations are applicable to research adopting different epistemological and ontological perspectives—including both quantitative and qualitative approaches—as well as research addressing micro (i.e., individuals, teams) and macro (i.e., organizations ...
Data Collection Methods and Tools for Research; A Step-by-Step Guide to
It means the findings of case studies can be used just for the same issues as the general patterns for different studies. Qualitative methods encompass three main categories including observations, document reviews, and in-depth interviews in spite of the fact that there are less common ways to gather qualitative data.
Data Collection Methods: A Comprehensive View
Your choice of data collection method (alternatively called a data gathering procedure) depends on the research questions you're working on, the type of data required, and the available time and resources. You can categorize data-gathering procedures into two main methods: Primary data collection. Primary data is collected via first ...
What Data Gathering Strategies Should I Use?
In this chapter, we review many of the data gathering strategies that can be used by postgraduates in social and behavioural research. We explore three major domains of data gathering strategies: strategies for connecting with people (encompassing interaction-based and observation-based strategies), exploring people's handiworks (encompassing participant-centred and artefact-based strategies ...
PDF COLLECTING DATA IN MIXED METHODS RESEARCH
Researchers collect data in a mixed methods study to address the research questions or hypotheses. The data collection procedure needs to fit the type of mixed methods design in the study. This requires using procedures drawn from concurrent forms of data collection, in which both the quantitative and qualitative data are collected ...
Qualitative Study Design and Data Collection
5. Describe the processes of qualitative data collection for observing, interviewing, focus groups, and naturally occurring data. Given a study description, identify the processes employed in that study. 6. Explain why sometimes it is best to use a combination of qualitative strategies for data gathering.
Data Collection Methods
Table of contents. Step 1: Define the aim of your research. Step 2: Choose your data collection method. Step 3: Plan your data collection procedures. Step 4: Collect the data. Frequently asked questions about data collection.
Dissecting the Case Study Research: Stake and Merriam Approaches
The most common way of gathering data in qualitative case study research is interviews. The interviews are also the main element of subjectivity and relativity that the constructivist researcher ...
Data Collection
Data collection is the process of gathering and collecting information from various sources to analyze and make informed decisions based on the data collected. This can involve various methods, such as surveys, interviews, experiments, and observation. In order for data collection to be effective, it is important to have a clear understanding ...
Guide to Data Collection Methods and Tools
Surveys, interviews, observations, focus groups, and forms are common data collection methods. Sampling involves selecting a representative group from a larger population. Choosing the right sampling method to gather representative and relevant data is crucial. Crafting effective data collection instruments like surveys and questionnaires is ...
Stake Data Gathering in Case Study
Stake, R. (1995). The art of case study research. Thousand Oaks, CA: Sage Publications. Chapter 4: Data Gathering "It (data gathering) begins before there is commitment to do the study; back-grounding, acquaintance with other cases, first impressions. A considerable propor
Gathering and Analyzing Qualitative Data
Gathering and Analyzing Qualitative Data. As the role of clinician researchers expands beyond the bedside, it is important to consider the possibilities of inquiry beyond the quantitative approach. ... In fact, the studies by Lindqvist and colleagues and by Abrams and Curran both combined quantitative and qualitative approaches. One approach to ...
Case Study Methodology of Qualitative Research: Key Attributes and
A case study protocol should have the following constituent elements: (a) an overview of the entire study including its objectives, (b) a detailed description of field procedures including the techniques of data collection to be employed, and how one plans to move ahead and operate in the field, (c) a clearly and sharply developed questions ...
(PDF) Chapter 3 Research Design and Methodology
Research Design and Methodology. Chapter 3 consists of three parts: (1) Purpose of the study and research design, (2) Methods, and (3) Statistical Data analysis procedure. Part one, Purpose of ...
Demystification and Actualisation of Data Saturation in Qualitative
The state of theoretical saturation is achieved by the combined process of gathering and examining data (ibid., p. 61). ... (2021) employs a methodical qualitative case study to thoroughly analyse the process of collecting data and ensuring saturation. Wray et ... This procedure entails examining the data to formulate ideas based on the ...
Community mapping and data gathering for city planning in the
The assessment stage of the planning process consists of 1) data gathering through the mapping and profiling of settlements and communities, and 2) analysis of data outputs. Data outputs relate to specific activities, to be carried out as shown in Table 1. Table 1 Mapping project data outputs. Mapping activity.
Poor creativity in interictal migraine: A case-control pilot study
This study has the characteristics of a case-control study. Procedure. The data collection sessions were carried out in person and took place at the Hospital del Henares. The data compilation phase took place during April and May 2023. ... Parametric data are shown by mean and standard deviation (SD) and the parametric univariate test ...
