Logo image

  • Suggest edit

C769 Resources

C769 resources #.

Your Course Instructor

ugcapstoneit @ wgu . edu

These examples are on-par with average passing tasks. Use them as a guideline for what evaluators accept in fulfillment of the requirements.

BSNOS: task 1 , task 2 , and task 3

BSNOS: task 1

BSCIA: task 1 , task 2 , and task 3

Excellence Archive #

The Capstone Excellence Archive includes a wide range of completed conclusion reports (task 3). When reviewing archived capstones, keep in mind that they all are, by definition, above and beyond the requirements. Therefore, do not use these as examples of what’s needed to meet the requirements. For a more down-to-earth example of what’s required, see the above Examples section. Note that the archive has all recognized C769 capstones are categorized as BS Information Technology regardless of degree emphasis.

Task 1 Resources #

Topic approval form template #, waiver form #, overview video #, task 1 examples #.

See the Examples and Excellence Archive sections.

Task 2 Resources #

Task 2 template #.

Write your proposal following Task 2: The Proposal template :

Task 2 Examples #

Task 2 videos #, tasks 1-3 accelerated plan #.

Nearing the end of your term? Plan accordingly:

Sections A1, A2, and A3 #

Sections b and b1 #, sections c and d #, sections e and f #, sections g and h #, completing task 3 #.

IT Audio Series podcast Converting Task 2 to Task 3 ; view the transcript .

Task 3 Template #

Write your proposal following Task 3: The Conclusion template :

Task 3 Examples #

Grammar, sources, and apa #, wgu coaching center writing help #.

WGU Writing Help Appointments

WGU Writing Live Events

WGU Learning Hub: Writing Studio

grammarly_icon

Students have reported missed mistakes when using the Google doc Grammarly extension. Therefore, we advise copying content directly into the app or purchasing the premium version compatible with MS Word.

Get the best writing help from the writing experts: WGU Writing Center . While Writing Center Instructors cannot say whether a task will pass, they can help you revise your paper to meet WGU competency standards for professional communication, sources, and APA formatting. The Writing Center also offers live Q & A sessions covering a variety of topics which include general writing and grammar help. See a list of upcoming events here: Writing Center Live Events

Sources and APA formatting #

Sources and format should follow APA \(7^{\text{th}}\) edition guidelines . Outside of grammar mistakes, most APA errors involve formatting of the sources or in-text citations. In-text citations should be of the form, (Author, year) . For more details, see APA guidelines for citations and APA guide for missing reference information .

Avoid reference errors by using a referencing tool:

MS Word Reference Tool \(\leftarrow\) desktop version only.

JabRef \(\leftarrow\) free and works on most platforms.

LibreOffice Reference Tool . - Download LibreOffice for free.

General Resources #

Student resources #.

Academic Coaching Center . The Coaching Center provides help with writing, Microsoft products, tech, math, and more!

Student Resource Hub

Webinars and Cohorts #

5 days to finish c769 task 2 challenge cohort #.

Keep an eye out for the 5 Days to Finish Task 2 Challenge Cohort. You will usually see the enrollment on Mondays in the course of study in the section Labeled Explore Cohort . The Explore Cohort section will not be seen once enrollment is closed . It will start on Wednesdays and will kick off with a live webinar at 7 PM ET. Each day of the cohort I will email support for a section or 2 of the proposal (task 2) to help you finish within 5-10 days. Most tend to finish in 10 days.

Email Candice Allen ugcapstoneit @ wgu . edu for more information or questions.

Webinar recording: C769 cohort five days to finish!

WGU Academic Coaching Center Live Events #

Writing Live Events

All Live Events

Libraries #

WGU’s library

google.scholar.com

You can search WGU’s library and other open-source libraries using google.scholar.com Go to >’Google.scholar>setting>libraires>’ and then add WGU and other libraries.

Welcome Email #

We’ve tried to provide everything you need here on this website as a more complete and readily available resource than the “welcome email,” i.e., the non-automated introductory email from your assigned course instructor containing the tips and resources needed to get started in the right direction. However, we often get asked for the email, and I suppose this website would be incomplete without it. So here is an example of a “welcome email.”

A Welcome Email

Welcome to C769! See the C769 IT Capstone website for almost everything you need. To understand what the capstone requires (and does not require), start here and watch these videos: Project Idea and Task 2: Proposal Overview . Your first step is choosing a topic; we recommend a topic proposing a technical solution to a problem -avoiding those that deliver only information (i.e., assessments, policies, plans, training, etc.). Though they often need reframing to fit the capstone requirements, past or present work projects make great capstone topics. The capstone project can be very similar to your C850 Emerging Technologies project -only here you get to choose the client and problem to solve.

Any IT project is allowable provided it has the following:

A specific client and IT problem (you can fabricate a client whose problem needs your chosen solution.)

The implementation of a hard IT deliverable (software or hardware) that helps solve that problem. This is a wholly written exercise (an actual project is not required), but fabricated projects must be written as realistically as possible from reality. While it may need some revising, you can reuse any of your professional or academic work.

Understand what the project requires and doesn’t require! Watch Project Idea and all the Task 2 Videos (they are short). Then review these examples of actual passing projects.

Write the topic approval (task 1) following the Approval Form Template . This document only needs to be an outline of your proposed project. Changes from task 1 to task 2 are allowed and typical; evaluators will not rigorously compare tasks 1 and 2.

Check the IRB statement and email the completed form to your assigned course instructor for signature. Appointments before approval are not required but are encouraged if you have concerns.

Afterward, you can start working on Task 2 .

Pacing Guide #

Be aware that task 2 and 3 evaluations often take the full three days. Therefore, if you are approaching the end of your term, don’t wait on task 2 to start task 3 and submit task 3 early enough to have it returned, revised, and resubmitted.

Accelerated Plan #

Statistical Analysis of HDD Failure ¶

Matthew unrue, spring 2020 ¶, western govenors university msda capstone ¶, website version note: ¶.

This notebook is so large and works with so much that data that it was run in multiple settings with the kernel reset for memory management each time. As such, the code cell blocks have execution numbers that are not perfectly in order. Though these do not match up perfectly in this version, the code was and should only be excecuted from top to bottom.

Additional Resources: ¶

The 5-page project Executive Summary can be found here . The reveal.js based multimedia presentation notes can be found here . The 87-page report write-up of this project can be found here .

Table of Contents: ¶

  • I: Dataset Preparation
  • II: Data Tidying
  • III: Nan Value Management
  • IV: Analysis of Potential Predictors
  • V: Dataset Preparation
  • VI: PCA and MCA
  • VII: Model Creation
  • VIII: Conclusions

Introduction ¶

What factors indicate impending hard disk drive failure ¶, h0: study factors do not significantly indicate impending hard disk failure. ¶, h1: study factors do significantly indicate impending hard disk failure. ¶, context ¶.

Data helps businesses solve problems, make better decisions, and understand consumers, but a lot of data needs to be stored and available to enable these benefits. Hard drive failure is the most common form of data loss, which is one of the most impactful problems that businesses can experience today as simple drive recovery can cost up to $7,500 per drive (Painchaud, 2018). For cloud-based data centers, keeping multitudes of businesses’ data intact for their own operations is crucial. Being able to predict which hard drives are at the highest risk of failure based on understanding of the combinations of routine diagnostics test results is an ideal solution to backup and replace failing drives before the data is lost.

Data ¶

The dataset used is Backblaze’s 4th quarter data from 2019 (Backblaze, 2020). All of the needed data is contained within the .zip file that Backblaze provides to the public as .csv files split by day.

The dataset contains .csv files for each day of its corresponding quarter, from 2019-10-01 to 2019-12-31. As an example, the subsection of the dataset for 2019-10-01 contains 115,259 rows of data. However, as this data contains recorded readings from a live data center, the number of hard drives and thus rows, changed daily as failed drives were taken out and new drives were installed. The 129 column attributes are Date, Serial Number, Model Number, Capacity, Failure, 62 Self-Monitoring, Analysis and Reporting Technology (SMART) test results, and 62 normalized values of the SMART test values. The Failure attribute is the dependent variable of this study and is a qualitative binary categorical variable. The Date, Serial Number, and Model are nominal qualitative independent variables. Finally, and the SMART value columns are continuous quantitative independent variables.

As stated in Backblaze’s Hard Drive Data and Stats page (Backblaze, n.d.), this dataset is free for any use as long as Backblaze is cited as the data source, that users accept that they are solely responsible for how the data is used, and that the data cannot be sold to anybody as it publicly available.

Data Analytics Tools and Techniques ¶

Python, pandas, and the scikit-learn stack are extensively used for the loading, tidying, manipulation, and analysis of the datasets. PyTorch is used for all neural network related tasks of the analysis and model production. Matplotlib and seaborn are used to create charts and graphics for analysis and presentation of project findings. A needed algorithm, namely Fisher's Exact test for contingency tables greater than 2x2 dimensions, is unavailable in the scikit-learn ecosystem, and R.stats is used for this by using rpy2 to run the R code by embedding it in the Python process. Prince is used for factor analysis, and imbalanced-learn is used for the implementation of SMOTE.

Like R and unlike SAS, all of these packages are easily available, free, and open-source with Python. These methods have been chosen over R for ease of explanation, as Python code is often understood more readily than R, and because of the potential of integrating this project directly into a program or software for future use. While R is highly specialized for statistics and mathematics, Python is a general-purpose programming language with specialized libraries for the needed tools, and this facilitates project expansion in the future.

Synthetic Minority Over-Sampling Technique (SMOTE) is used specifically to handle the imbalanced classes for training and testing splits. PCA is used for dimensionality reduction. Predictor variables are examined through correlation coefficients and Fisher's exact test, as well as graphed univariate and bivariate distributions. A logistic regression model and a decision tree model are examined along with the results of the PCA to find predictor variables as well. For building a predictive model for future use, the logistic regression model, a random forest ensemble model, and neural networks are compared to determine which can produce the most useful model.

As HDD failure is an extremely rare event, the dependent variable class is extremely imbalanced and failing to control for the imbalance through techniques like boosting or oversampling would lead to ineffective models. As the dependent variable is a Boolean value, this task is a binary classification task. Logistic regression is an ideal predictive model for binary classification tasks that gives a probability for classification while also having a simplistic interpretation of coefficients that can be used for feature selection. Decision trees are also simple to understand and work well for classification tasks. Given the complexity of the various fields in the dataset, a more complicated model may work better for predictive power. Random forests and neural networks work very well for classification tasks under these circumstances.

Project Outcomes ¶

The key project outcomes are a deep understanding of the risk of hard drive failure based on the results of SMART test values regardless of manufacturer and predictive models that will be able to flag hard drives that are at high risk of failing. The understanding of the risk of failure based on test values will empower better business decisions by optimizing the choice of storage used based on projected lifetime. The predictive models will allow the business to proactively backup data from storage onto new storage devices before failure while also allowing hard drives to continue working closer to their end of life, minimizing waste from constantly replacing hard drives before it is needed. The combination of these two products will also enable the future creation of a more automated system that protects data from hard drive failure.

Dataset Preparation ¶

The dataset provided by Backblaze is made up of 92 .csv files, 1 for each day in the 2019 4th quarter, totaling 3.13GB of text data. As hard drive failure is an extremely rare event, all of these days will need to be considered together in order to have enough failures to draw conclusions. The project begins by combining all parts of the dataset from their .csv files into a single file.

Out of 10,991,209 hard drive days, there were only 678 failures, which gives a failure rate of 0.006169%.

Weiss (2013) defined the imbalance ratio as the ratio between majority and minority classes with a modestly imbalanced dataset having an imbalance ratio of 10:1, and extremely imbalanced datasets as having an imbalance ratio of 1000:1 or greater (pg. 15). This dataset has an imbalance ratio of approximately 16,210:1 and as such will require very careful cultivation in order for any predictive model to successfully learn from. The rarity of the positive failure cases is also the reason that the entire 4th quarter dataset is required.

Unfortunately, this combined file requires too much memory to load all at once for current hardware restraints. It needs 13.5GB for just the data, not including the memory needed for the OS and other software, nor memory for calculations.

As this dataset contains both raw and normalized values for all of the SMART values, a simple way to deal with the memory issues is to divide the dataset into a raw form and a normalized form.

Data Tidying ¶

The considerably smaller raw value subset of data is the main dataset of this project. As with nearly all real-world datasets, this one needs considerable cleaning and tidying in order to use for analysis.

30 rows × 68 columns

All SMART test columns have null values in some rows. The dataset notes state that this comes from differing manufacturer's standards despite the standardized nature of SMART tests.

Deriving the manufacturer from the model column will allow the dataset to be easily divided by manufacturer.

The "DELLBOSS VD" model value seems the be the only value potentially out of place.

60 rows × 68 columns

None of the SMART values exist for this hard drive model, but 60 of the drives have this model value. Additionally, no failures for this model exist in the dataset. Any row with this model value should be removed from the training data before any predictive analysis. Some searching online leads to the belief that it may be a RAID controller. ( https://www.dell.com/support/manuals/au/en/aubsd1/boss-s-1/boss_s1_ug_publication/overview?guid=guid-b20ef25b-b7e3-40f2-b7cd-e497358cd10a&lang=en-us )

0 rows × 68 columns

Additionally the "Seagate SSD" model seems to be missing information. Like the "DELLBOSS VD" model rows, this one also does not have any failures and will need to be removed before predictive analysis is performed.

96 rows × 68 columns

The rows not appropriate for analysis are deleted.

5 rows × 69 columns

Dtype Conversion ¶

Given the size of the dataset, a few minor changes to the columns may free up a considerable amount of memory. The date and capacity_bytes columns are two easy places to improve.

Here we can see that 1108 drive days have an error value rather than their actual capacity. These rows may need to be removed, but may also be an excellent signal for a failing drive.

Unfortunately, all drives experiencing this error do not fail and this can introduce problems in the final model. As it only affects 0.01% of the dataset, removing the affected rows seems best.

The capacity_bytes column is converted from bytes to terabytes to condense the information on disk.

5 rows × 70 columns

Raw Data Univariate Distributions ¶

With these things finished, the univariate distributions can be examined to gain a better sense of the data.

The first column, date shows some sort of testing or operational failure on November 5th.

Drive capacities are mostly 4, 8, and 12 TB, likely coinciding with large investments in new drives for the datacenter and possibly alongside the price lowering of specific models.

The manufacturer of the most drives in this dataset is Seagate at 72.59%. HGST is the second highest at 24.24%. Western Digital is the least represented manufacturer in the dataset with only 0.23%, but as HGST was acquired by Western Digital in 2012 (Sanders, 2018), the drives in this dataset will likely be quite similar between the two manufacturers given the seven-year timespan between then and the time of dataset recording and creation. Finally, Toshiba is the other manufacturer, with 2.94% of the dataset. This amount is quite low and may make it difficult to accurately predict their drives in comparison.

The SMART values vary greatly from the number of different types of drives that exist in this dataset. Before the columns can be graphed appropriately, the NaN/null values need to be examined. It's most likely that the missing data is most related to the hard drive's manufacturer or model.

Every single SMART figure column has null values.

The count row is equivalent to the number of non-null values. If a column has a count of 0, every single value in it is NaN or null, and should be deleted.

smart_13_raw, smart_15_raw, smart_179_raw, smart_181_raw, smart_182_raw, smart_201_raw, smart_250_raw, smart_251_raw, smart_252_raw, and smart_255_raw are all empty in this dataset, as all rows have NaN values in these columns.

Unfortunately, this operation takes too much memory to do in this manner. Each column will have to be graphed separately and then the graphs combined into a single graphic for the same effect.

5 rows × 59 columns

NaN Value Management ¶

With some dataset tidying complete, the final major dataset adjustments that need to be made before analysis can be performed is that the NaN values need dealt with. The rows or columns with them can be removed, or they can be filled in through interpolation or estimation.

The first five mostly complete columns all have two NaNs, which are the result of two rows that have no raw smart values at all. Both drives failed, making them quite important for predicting future failure. However, the lack of data makes them useless for predicting future failure in their current form.

The most likely scenario is that both drives failed just before the diagnostics were collected. As such, these two rows will be deleted and their associated row for the date before their currently marked failures will be updated to have failed that day.

2 rows × 59 columns

1 rows × 59 columns

The next section of columns all have 8792 rows with NaNs, ignoring the 2 rows just removed. Coincidentally, all of these columns share the same problematic rows.

8792 rows × 59 columns

This subset of drives are all manfactured by Seagate, and are 3 size variations of the same model line. There is not an updated model from this line in this dataset to interpolate values from.

Interpolating mean values from the same manufacturer, Seagate, and the models' respective capacity_TB categories would be a good way to estimate the missing values if enough data exists.

Additionally, creating a boolean column to flag interpolated data as missing may help the predictive models account for it.

smart_3_raw ¶

For the smart_3_raw data, the median for the manufacturer without regard for the drive capacity will be used for the first and third model of drive, as there are no appropriate rows with values to interpolate from.

smart_4_raw ¶

For the smart_4_raw data, the mean for the manufacturer and drive capacity will be used for the second model. The mean for the manufacturer without regard for the drive capacity will be used for the first and third model of drive, as there are no appropriate rows with values to interpolate from.

smart_5_raw ¶

For the smart_5_raw data, the median for the manufacturer and drive capacity will be used for the second model. The median for the manufacturer without regard for the drive capacity will be used for the first and third model of drive, as there are no appropriate rows with values to interpolate from.

smart_7_raw ¶

For the smart_7_raw data, the median for the manufacturer and drive capacity will be used for the second model. The median for the manufacturer without regard for the drive capacity will be used for the first and third model of drive, as there are no appropriate rows with values to interpolate from.

smart_10_raw ¶

For the smart_10_raw data, the median for the manufacturer will be used to fill the NaN values.

smart_197_raw ¶

For the smart_197_raw data, the median for the manufacturer and drive capacity will be used for the second model. The median for the manufacturer without regard for the drive capacity will be used for the first and third model of drive, as there are no appropriate rows with values to interpolate from.

smart_198_raw ¶

For the smart_198_raw data, the median for the manufacturer and drive capacity will be used for the second model. The median for the manufacturer without regard for the drive capacity will be used for the first and third model of drive, as there are no appropriate rows with values to interpolate from.

smart_199_raw ¶

For the smart_199_raw data, the median for the manufacturer and drive capacity will be used for the second model. The median for the manufacturer without regard for the drive capacity will be used for the first and third model of drive, as there are no appropriate rows with values to interpolate from.

smart_193_raw and smart_225_raw ¶

The smart_193_raw column is a different problem than the last group of columns. This group has 53985 rows with NaN values, which is still low enough in this large dataset to interpolate values without majorly ill effects, but still requires caution.

An important note here is that some manufacturers use different SMART attributes to represent the same information. Most Seagate and some Western Digital and Hitachi drives actually use 225 rather than 193 to store the Load/Unload Cycle Count value (Acronis, Knowledge Base 9128; Acronis, Knowledge Base 9152). We can see here that no row has both 193 and 225 values.

53985 rows × 59 columns

53985 rows × 2 columns

The only rows that do not have either value are the exact same rows as the last group. These will need interpolated if the rows are to be kept. The 45193 other rows can be filled by combining the two columns that represent the same information.

8792 rows × 2 columns

The smart_193_raw and smart_225_raw columns will be combined into a new smart_193_225 column and then the remaining values filled as in previous columns.

5 rows × 58 columns

Now that the two columns have been merged, the same process of interpolation by model and capacity can be used on the remaining group.

The remaining columns to examine all have over 2 million NaN value rows each. This level of missing data causes interpolation to skew results far more than the previous groups' levels of missing data. The following grouping of columns have at least 70% of their values.

smart_240_raw, smart_241_raw, and smart_242_raw ¶

Notably, none of the HGST drives have a value for the smart_240_raw column. Additionally, the drives that are missing the smart_241_raw data are also likely the drives missing the smart_242_raw data.

Seagate drives have enough filled values to use and Toshiba drives have no missing values, but the HGST and Western Digital drives do not have enough values to interpolate from. As such, all missing values will be filled in with the mean.

smart_187_raw, smart_188_raw, and smart_190_raw ¶

The group of the smart_187_raw, smart_188_raw, and smart_190_raw columns are divided by manufacturer, with all Seagate drives having the values and none of the other drive manufacturers having the values.

Given the column distributions, the smart_187_raw and smart_188_raw NaNs will be filled with the medians, and the smart_190_raw NaNs will be filled with the mean.

Memory Management and Reloading Checkpoint ¶

The remaining columns ¶.

These remaining columns have over 30% of their values missing, and an individualized approach will be taken with each of them. In some cases, categories of existing values may be helpful to preserve the potential for information with NaNs being their own category.

smart_195_raw ¶

This column only has values in a single manufacturer's drives, and even then only 77% of them. There appears to be virtually no difference in the column's distribution by failure status. Filling in NaNs with this information would only result in collinearity between the column and the manufacturer column, so it will be dropped from the dataframe.

5 rows × 57 columns

smart_191_raw ¶

This column is not split along manufacturer lines like many others, but still has a large percentage of missing values. A categorical column smart_191_cat will be created with the following categories and values.

The original smart_191_raw column will then be dropped.

smart_184_raw ¶

This column very rarely has any value other than 0 when it is available. However, whenever it is available and not 0, it has a disproportionate ratio of failures to nonfailures, making it a very useful measure for predicting failure. A categorical column smart_184_cat will be created with the following categories and values.

The original smart_184_raw column will then be dropped.

smart_189_raw ¶

This column only has values in a single manufacturer's drives, and even then only 38% of them. There is also little correlation between this column and the failure rate. Filling in NaNs with this information could result in collinearity between the column and the manufacturer column as well, so it will be dropped from the dataframe without a category column.

5 rows × 56 columns

smart_200_raw ¶

This column is not entirely split along manufacturer lines like many others, but still has a large percentage of missing values. Given the reasonably large correlation between a higher value and a higher failure rate, a categorical column smart_200_cat will be created with the following categories and values.

The original smart_200_raw column will then be dropped.

smart_196_raw ¶

This column is not split along manufacturer lines whatsoever, but still has a large percentage of missing values. Given the reasonably large correlation between a higher value and a higher failure rate, a categorical column smart_196_cat will be created with the following categories and values.

The original smart_196_raw column will then be dropped.

smart_8_raw ¶

This column is not strongly split along manufacturer lines, but still has a large percentage of missing values. Given the reasonably large negative correlation between a higher value and failure rate, a categorical column smart_8_cat will be created with the following categories and values.

The original smart_8_raw column will then be dropped.

smart_2_raw ¶

This column is not strongly split along manufacturer lines, but still has a large percentage of missing values. Given the reasonably large negative correlation between a higher value and failure rate, a categorical column smart_2_cat will be created with the following categories and values.

The original smart_2_raw column will then be dropped.

smart_183_raw ¶

This column only has values in a single manufacturer's drives, and even then only 23% of them. There is also little correlation between this column and the failure rate. Filling in NaNs with this information could result in collinearity between the column and the manufacturer column as well, so it will be dropped from the dataframe without a category column.

5 rows × 55 columns

smart_22_raw ¶

This column only has values in a single manufacturer's drives, as it is an indication of helium levels encased in certain HGST drives (Klein, 2015). Given this, it would make no sense to fill this column's NaN values in rows of drives from other manufacturers. Beyond that, the dataset does not have any failures with abnormal levels, making this column potentially a negative impact to the real-world effectiveness of a predictive model. Given this risk, the risk of collinearity with the manufacturer column, and the low correlation of this column and the failure rate, this column will be dropped from the dataframe without a category column for the simplification of the models.

5 rows × 54 columns

smart_223_raw ¶

This column is not strongly split along manufacturer lines, but still has a large percentage of missing values. Given the reasonably large negative correlation between a higher value and failure rate, a categorical column smart_223_cat will be created with the following categories and values.

The original smart_223_raw column will then be dropped.

smart_18_raw ¶

This column is not only missing 97% of its values, it also has no variance whatsoever, making it useless for analysis.

5 rows × 53 columns

smart_224_raw ¶

5 rows × 52 columns

smart_220_raw ¶

This column is entirely split along manufacturer lines and has a large percentage of missing values, but it seems to be one of the few predictors available for Toshiba drives. Given the relatively large negative correlation between a higher value and failure rate, a categorical column smart_220_cat will be created with the following categories and values.

The original smart_220_raw column will then be dropped.

smart_222_raw ¶

Although only available on the Toshiba drives, this is the highest correlation to failure rates yet. A categorical column smart_222_cat will be created with the following categories and values.

The original smart_222_raw column will then be dropped.

smart_226_raw ¶

Although only available on the Toshiba drives, this is the highest negative correlation to failure rates yet. A categorical column smart_226_cat will be created with the following categories and values.

The original smart_226_raw column will then be dropped.

smart_23_raw ¶

This column is not only missing 98% of its values, it also has no variance whatsoever, making it useless for analysis.

5 rows × 51 columns

smart_24_raw ¶

5 rows × 50 columns

smart_11_raw ¶

Although only available on 0.64% of drives, this is the highest correlation to failure rates yet. A categorical column smart_11_cat will be created with the following categories and values.

The original smart_11_raw column will then be dropped.

smart_254_raw ¶

This column is not only missing 99.75% of its values, it also has no variance whatsoever, making it useless for analysis.

5 rows × 49 columns

smart_235_raw ¶

This column represents an interesting report, in that the first 3 bytes of it is the drive's good block count, while the last 2 bytes is the drive's bad block count, but this column is missing 99.92% of its values, making it useless for this type of predictive analysis.

5 rows × 48 columns

smart_233_raw ¶

No failures exist in the drives that have any value for this column, and it is also missing 99.92% of its values, making it useless for analysis.

0 rows × 48 columns

5 rows × 47 columns

smart_232_raw ¶

0 rows × 47 columns

5 rows × 46 columns

smart_168_raw ¶

0 rows × 46 columns

5 rows × 45 columns

smart_170_raw ¶

0 rows × 45 columns

5 rows × 44 columns

smart_218_raw ¶

0 rows × 44 columns

5 rows × 43 columns

smart_174_raw ¶

0 rows × 43 columns

5 rows × 42 columns

smart_16_raw ¶

0 rows × 42 columns

5 rows × 41 columns

smart_17_raw ¶

0 rows × 41 columns

5 rows × 40 columns

smart_173_raw ¶

0 rows × 40 columns

5 rows × 39 columns

smart_231_raw ¶

0 rows × 39 columns

5 rows × 38 columns

smart_177_raw ¶

0 rows × 38 columns

5 rows × 37 columns

Analysis of Potential Predictors ¶

With all NaN values interpolated or their columns removed, correlations can be determined between the columns.

22 rows × 22 columns

A few of the columns' relations will need to be examined well based on these correlation coefficients.

A prominent feature is smart_9_raw as the column with the most extreme correlations with other columns, which is understandable given that SMART attribute 9 represents the total count of hours the drive has been in a power-on state (Acronis, Knowledge Base 9109). Most other issues worth measuring are likely correlated with the drive age and amount of operation. This column may also be a powerful predictor within predictive models as an older drive is more likely to wear down to failure suddenly than a newer drive in general even if other values are not present. Even if other predictors of failure are present in an instance, a drive with an average or lower smart_9_raw value may represent a drive that will fail far sooner than the average length of time to failure.

smart_240_raw also has quite high correlations with other independent variables.

smart_197_raw and smart_198_raw have a nearly perfect degree of collinearity with each other and little in comparison with any other column. smart_198_raw will be dropped as it has a lower correlation with the dependent variable failure.

Finally, smart_190_raw and smart_194_raw have a very high degree of collinearity with each other and little in comparison with any other column. One likely needs removed.

The dataset may be large enough to not need to worry about the multicollinearity affecting the predictive power of the models, but the redundancy of information may skew the results.

For potential predictors for failure, smart_5_raw and smart_197_raw have the highest positive correlations with failure, at 4.4% and 2.7%. SMART attribute 5 is the reallocated sectors count of drives, which triggers when a read, write, or verification error occurs (Acronis, Knowledge Base 9105). SMART attribute 197 is the current pending sector count, which is the count of unstable sectors that are awaiting remapping (Acronis, Knowledge Base 9133). This value decreases as sectors are remapped, but the value would remain consistently high if these sectors are unable to be remapped. Both columns make complete sense as the highest correlation with failure and will likely be the most important predictor variables for HDD failure.

manufacturer ¶

Model ¶.

The model column will ultimately be dropped even after all of the work that went into cleaning its data. The large amount of categories in it substantially adds complexity to the model while not improving enough. The manufacturer column, while less specific, contains all of the same variation with only four categories. Additionally, many of the models do not have a single failure, and even more have only thousands of hard drive days that they represent. Leaving the column in for predictive modeling and analysis will only hurt the overall results, and as such, is removed.

capacity_TB ¶

Smart_191_cat ¶.

This column has the highest p-value out of all of the category columns. While still statistically significant, this is likely from the size of the dataset and not out of pure correlation. smart_191_cat is not likely to be a good predictor variable.

smart_184_cat ¶

With a p-value of 0.0, this is likely the strongest relation to failure in the dataset.

smart_200_cat ¶

Smart_196_cat ¶, smart_8_cat ¶, smart_2_cat ¶, smart_223_cat ¶, smart_220_cat ¶, smart_222_cat ¶, smart_226_cat ¶, smart_11_cat ¶, dataset preparation for model creation ¶.

To begin performing factor analysis, the dataset will need to be prepared through standardization and normalization, as well as the test, train, and validation splits. Doing these before the PCA ensures that no data is contaminated with the influence of the testing and validation data.

Dataset Splitting: Train, Test, and Validation ¶

The first split is 80% Train and 20% Test, stratified on the y_df / failure series.

Verify the stratified splitting.

Note that while the ratio is not exact, it is the closest possible.

The second split is 87.5% Train and 12.5% Validation, stratified on the y_df / failure series, to result in 70% Train and 10% Validation overall.

Continuous Variable Standardization ¶

A scaler fit to the training data is created to standardize the continuous columns for model training. This avoids any contamination of the training data by ensuring that the test and validation datasets do not influence the training data at all.

This fits the scaler to the continuous columns of the training data. The fit scaler will then be used to scale the testing and validation datasets.

A mean as close to zero as possible given the dataset and a standard deviation of 1 is a successful standardization.

PCA and MCA ¶

The eigenvalues and explained inertia were used to create a scree plot, and this plot is then used alongside the cumulative inertia to determine that 13 principal components an appropriate amount of dimensionality reduction to use as these components made up 82.37% of the inertia of the dataset in only 13 out of the 20, or 65%, of the total components.

PCA as a form of dimensionality reduction ensures that as little information, in the form of inertia, is lost as possible for the given number of dimensions reduced. As this dataset is quite large, any amount of dimensionality reduction will greatly affect the speed and chance of proper convergence in the predictive models to come. The result is reducing of the data by 35% while only losing 17.63% of the information, a 2-for-1 trade.

Training set PCA transformation ¶

7682577 rows × 13 columns

5 rows × 25 columns

Test set PCA transformation ¶

2195023 rows × 13 columns

Validation set PCA transformation ¶

1097511 rows × 13 columns

To begin doing MCA, the categorical columns need converted to boolean encoding columns.

While Factor Analysis of Mixed Data (FAMD) would have been ideal for dimensionality reduction, the current hardware requirements and software availability do not allow for it with such a large dataset.

Why SMOTE (Synthetic Minority Oversampling Technique) is Needed ¶

Traditional training fails as hard drive failure is an extremely rare occurence. The model learns to only predict non-failure, making it useless for actually predicting failure. This is why a combination of undersampling the non-failures and oversampling the failures will improve the training and production of the predictive models.

SMOTE ¶

Model creation ¶, logistic regression with smote ¶, lbfgs solver logistic regression ¶, decision tree ¶, random forest ensemble ¶, random forest ensemble with class weights ¶.

In an attempt to train the ensemble in a way that prioritizes the true negative, or actual failure cases, this version of the random forest weights failures as twice as important as non-failures.

Compared to the unweighted Random Forest ensemble, this weighted ensemble gains another 6 true negative classifications for a true negative rate of 40% rather than 36%, but also gains 40,687 false positive classifications, for 0.028%, instead of 0.0097% false positives.

Neural Networks ¶

PyTorch requires the boolean values to be converted to floating point, so these dtypes will be changed before the neural network is defined.

Neural Network 1 ¶

While it may eventually improve with enough training, it's most likely that this architecture of neural network is too simple for the problem at hand. A more complex one will be built next.

Neural Network 2 ¶

It may seem quite odd that the testing loss is consistently lower than the training loss. In this case, it's quite likely the the sheer size of the training set causes this to occur. The model is constantly improving every training batch and the training loss is calculated from the entire epoch. The testing loss is calculated after the entire epoch of batches have all affected the model for the better.

While 50 epochs were originally planned, the test loss consistently decreases even at the 50th epoch. Additionally, when compared to the other models, this neural network has a very high amount of true negative predictions and a moderately low amount of false positive predictions. Additional training may result in a model that outperforms even the LBFGS solved logistic regression model for this task.

Validation Results ¶

Conclusions ¶.

Table 1 HDD Failure Predictive Model Testing Results

The Project Limitations ¶

A few limitations of this project exist. First, a very large amount of the dataset was made up of missing values. A second limitation that deserves caution is that the ratios of drives made by each manufacturer in the dataset is very imbalanced. No assumptions about value or reliability of the four manufacturers included in the dataset should be made from this data. A third limitation is that the dataset was extremely imbalanced in terms of the minority (failure) and majority (non-failure) classes. Though SMOTE succeeded exceptionally well at allowing predictive models to learn from the imbalanced data, it does introduce bias as the synthetically created instances of the minority classes overrepresent their information in the analysis. Finally, working computer memory was a great limitation throughout the project as the dataset is so large. This limitation prevented factor analysis of mixed data from being performed and PCA had to be selected as the alternative.

Actions Proposed and Expected Benefits ¶

It is highly recommended that either the logistic regression model or the more complex DNN model is added to the daily HDD diagnostics checks and backup procedure pipeline. The complex DNN will successfully flag 71.3% of drives expected to fail that day and the logistic regression 64%, allowing for total backup and retirement of the drive before the failure occurs. Do note that while more sensitive to detecting failure, the DNN does have a higher false positive rate, at 6.36%, than the more conservative logistic regression at 2.68%. Until this can be completed, special care should be given to drives with higher values of SMART attributes 5, 197, and 9 to reduce data loss and complications arising from the events of HDD failure.

Once implemented, an ensemble approach between the two should be tested to further reduce the false positive rate. Furthermore, additional research is warranted beyond the scope and limitations of the project. Taking an RNN approach to the data tidying and predictive modeling will almost certainly improve the results quite significantly, as they are specifically designed for time-series data such as this.

References ¶

Acronis. Knowledge Base 9105. S.M.A.R.T. Attribute: Reallocated Sectors Count | Knowledge Base. https://kb.acronis.com/content/9105 .

Acronis. Knowledge Base 9109. S.M.A.R.T. Attribute: Power-On Hours (POH) | Knowledge Base. https://kb.acronis.com/content/9109 .

Acronis. Knowledge Base 9128. S.M.A.R.T. Attribute: Load Cycle Count; Load/Unload Cycle Count | Knowledge Base. https://kb.acronis.com/content/9128 .

Acronis. Knowledge Base 9133. S.M.A.R.T. Attribute: Current Pending Sector Count | Knowledge Base. https://kb.acronis.com/content/9133 .

Acronis. Knowledge Base 9152. S.M.A.R.T. Attribute: Load/Unload Cycle Count | Knowledge Base. https://kb.acronis.com/content/9152 .

Backblaze. (2020). data_Q4_2019. San Mateo, CA; Backblaze. Klein, A. (2015, April 16). SMART Hard Drive Attributes: SMART 22 is a Gas Gas Gas. Backblaze Blog | Cloud Storage & Cloud Backup. https://www.backblaze.com/blog/smart-22-is-a-gas-gas-gas/ .

Painchaud, A. (2018, October 31). 8 Reasons on How Data Loss Can Negatively Impact Your Bussiness. https://www.sherweb.com/blog/security/statistics-on-data-loss/ .

Sanders, J. (2018, November 13). Western Digital spins down HGST and Tegile brands in hard disk market shuffle. TechRepublic. https://www.techrepublic.com/article/western-digital-spins-down-hgst-and-tegile-brands-in-hard-disk-market-shuffle/ .

Weiss, G. M. (2013). Foundations of Imbalanced Learning. Imbalanced Learning, 13–41. https://doi.org/10.1002/9781118646106.ch2

You are using an outdated browser. Please upgrade your browser to improve your experience.

wgu capstone project examples

Health & Nursing

Courses and certificates.

  • Bachelor's Degrees
  • View all Business Bachelor's Degrees
  • Business Management – B.S. Business Administration
  • Healthcare Administration – B.S.
  • Human Resource Management – B.S. Business Administration
  • Information Technology Management – B.S. Business Administration
  • Marketing – B.S. Business Administration
  • Accounting – B.S. Business Administration
  • Finance – B.S.
  • Supply Chain and Operations Management – B.S.
  • Accelerated Information Technology Bachelor's and Master's Degree (from the School of Technology)
  • Health Information Management – B.S. (from the Leavitt School of Health)

Master's Degrees

  • View all Business Master's Degrees
  • Master of Business Administration (MBA)
  • MBA Information Technology Management
  • MBA Healthcare Management
  • Management and Leadership – M.S.
  • Accounting – M.S.
  • Marketing – M.S.
  • Human Resource Management – M.S.
  • Master of Healthcare Administration (from the Leavitt School of Health)
  • Data Analytics – M.S. (from the School of Technology)
  • Information Technology Management – M.S. (from the School of Technology)
  • Education Technology and Instructional Design – M.Ed. (from the School of Education)

Certificates

  • Supply Chain
  • Accounting Fundamentals
  • View all Business Degrees

Bachelor's Preparing For Licensure

  • View all Education Bachelor's Degrees
  • Elementary Education – B.A.
  • Special Education and Elementary Education (Dual Licensure) – B.A.
  • Special Education (Mild-to-Moderate) – B.A.
  • Mathematics Education (Middle Grades) – B.S.
  • Mathematics Education (Secondary)– B.S.
  • Science Education (Middle Grades) – B.S.
  • Science Education (Secondary Chemistry) – B.S.
  • Science Education (Secondary Physics) – B.S.
  • Science Education (Secondary Biological Sciences) – B.S.
  • Science Education (Secondary Earth Science)– B.S.
  • View all Education Degrees

Bachelor of Arts in Education Degrees

  • Educational Studies – B.A.

Master of Science in Education Degrees

  • View all Education Master's Degrees
  • Curriculum and Instruction – M.S.
  • Educational Leadership – M.S.
  • Education Technology and Instructional Design – M.Ed.

Master's Preparing for Licensure

  • Teaching, Elementary Education – M.A.
  • Teaching, English Education (Secondary) – M.A.
  • Teaching, Mathematics Education (Middle Grades) – M.A.
  • Teaching, Mathematics Education (Secondary) – M.A.
  • Teaching, Science Education (Secondary) – M.A.
  • Teaching, Special Education (K-12) – M.A.

Licensure Information

  • State Teaching Licensure Information

Master's Degrees for Teachers

  • Mathematics Education (K-6) – M.A.
  • Mathematics Education (Middle Grade) – M.A.
  • Mathematics Education (Secondary) – M.A.
  • English Language Learning (PreK-12) – M.A.
  • Endorsement Preparation Program, English Language Learning (PreK-12)
  • Science Education (Middle Grades) – M.A.
  • Science Education (Secondary Chemistry) – M.A.
  • Science Education (Secondary Physics) – M.A.
  • Science Education (Secondary Biological Sciences) – M.A.
  • Science Education (Secondary Earth Science)– M.A.
  • View all Technology Bachelor's Degrees
  • Cloud Computing – B.S.
  • Computer Science – B.S.
  • Cybersecurity and Information Assurance – B.S.
  • Data Analytics – B.S.
  • Information Technology – B.S.
  • Network Engineering and Security – B.S.
  • Software Engineering – B.S.
  • Accelerated Information Technology Bachelor's and Master's Degree
  • Information Technology Management – B.S. Business Administration (from the School of Business)
  • View all Technology Master's Degrees
  • Cybersecurity and Information Assurance – M.S.
  • Data Analytics – M.S.
  • Information Technology Management – M.S.
  • MBA Information Technology Management (from the School of Business)
  • Full Stack Engineering
  • Web Application Deployment and Support
  • Front End Web Development
  • Back End Web Development

3rd Party Certifications

  • IT Certifications Included in WGU Degrees
  • View all Technology Degrees
  • View all Health & Nursing Bachelor's Degrees
  • Nursing (RN-to-BSN online) – B.S.
  • Nursing (Prelicensure) – B.S. (Available in select states)
  • Health Information Management – B.S.
  • Health and Human Services – B.S.
  • Psychology – B.S.
  • Health Science – B.S.
  • Healthcare Administration – B.S. (from the School of Business)
  • View all Nursing Post-Master's Certificates
  • Nursing Education—Post-Master's Certificate
  • Nursing Leadership and Management—Post-Master's Certificate
  • Family Nurse Practitioner—Post-Master's Certificate
  • Psychiatric Mental Health Nurse Practitioner —Post-Master's Certificate
  • View all Health & Nursing Degrees
  • View all Nursing & Health Master's Degrees
  • Nursing – Education (BSN-to-MSN Program) – M.S.
  • Nursing – Leadership and Management (BSN-to-MSN Program) – M.S.
  • Nursing – Nursing Informatics (BSN-to-MSN Program) – M.S.
  • Nursing – Family Nurse Practitioner (BSN-to-MSN Program) – M.S. (Available in select states)
  • Nursing – Psychiatric Mental Health Nurse Practitioner (BSN-to-MSN Program) – M.S. (Available in select states)
  • Nursing – Education (RN-to-MSN Program) – M.S.
  • Nursing – Leadership and Management (RN-to-MSN Program) – M.S.
  • Nursing – Nursing Informatics (RN-to-MSN Program) – M.S.
  • Master of Healthcare Administration
  • MBA Healthcare Management (from the School of Business)
  • Business Leadership (with the School of Business)
  • Supply Chain (with the School of Business)
  • Accounting Fundamentals (with the School of Business)
  • Back End Web Development (with the School of Technology)
  • Front End Web Development (with the School of Technology)
  • Web Application Deployment and Support (with the School of Technology)
  • Full Stack Engineering (with the School of Technology)
  • Single Courses
  • Course Bundles

Apply for Admission

Admission requirements.

  • New Students
  • WGU Returning Graduates
  • WGU Readmission
  • Enrollment Checklist
  • Accessibility
  • Accommodation Request
  • School of Education Admission Requirements
  • School of Business Admission Requirements
  • School of Technology Admission Requirements
  • Leavitt School of Health Admission Requirements

Additional Requirements

  • Computer Requirements
  • No Standardized Testing
  • Clinical and Student Teaching Information

Transferring

  • FAQs about Transferring
  • Transfer to WGU
  • Transferrable Certifications
  • Request WGU Transcripts
  • International Transfer Credit
  • Tuition and Fees
  • Financial Aid
  • Scholarships

Other Ways to Pay for School

  • Tuition—School of Business
  • Tuition—School of Education
  • Tuition—School of Technology
  • Tuition—Leavitt School of Health
  • Your Financial Obligations
  • Tuition Comparison
  • Applying for Financial Aid
  • State Grants
  • Consumer Information Guide
  • Responsible Borrowing Initiative
  • Higher Education Relief Fund

FAFSA Support

  • Net Price Calculator
  • FAFSA Simplification
  • See All Scholarships
  • Military Scholarships
  • State Scholarships
  • Scholarship FAQs

Payment Options

  • Payment Plans
  • Corporate Reimbursement
  • Current Student Hardship Assistance
  • Military Tuition Assistance

WGU Experience

  • How You'll Learn
  • Scheduling/Assessments
  • Accreditation
  • Student Support/Faculty
  • Military Students
  • Part-Time Options
  • Virtual Military Education Resource Center
  • Student Outcomes
  • Return on Investment
  • Students and Gradutes
  • Career Growth
  • Student Resources
  • Communities
  • Testimonials
  • Career Guides
  • Skills Guides
  • Online Degrees
  • All Degrees
  • Explore Your Options

Admissions & Transfers

  • Admissions Overview

Tuition & Financial Aid

  • Student Success
  • Prospective Students
  • Current Students
  • Military and Veterans
  • Commencement
  • Careers at WGU
  • Advancement & Giving
  • Partnering with WGU

What Do You Know About The WGU Excellence Awards

  • Online University Experience
  • See More Tags

wgu capstone project examples

Excellence Award – You can achieve this award throughout your degree journey by submitting outstanding work.  If an evaluator identifies the work as exemplary, he or she may nominate an exceptional performance task submission that is passed on the first attempt with the highest available score on each aspect of the assessment.  The nomination will be shared through email, along with an e-certificate that you can print, share, or request a printed copy mailed to you.

Capstone Excellence Award – You can only earn this award for the capstone, which is the culminating project at the end of your degree journey. Besides Evaluator nomination, CEAs are reviewed and approved by Evaluation management.  If chosen, you will be mailed a certificate and this recognition will be noted on your transcript.  You may also be invited to have your capstone published in the Capstone Excellence Archive.

Capstone Simulation Excellence Award –If you are in WGU’s MBA program and your team scores 90th percentile or higher on the national business simulation, you will be honored with this achievement, once related performance tasks are completed.  Through email, you will be congratulated and receive an e-certificate that you can print, share, or request a printed copy mailed to you.

Students who receive these awards often feel more connected with WGU’s Evaluation faculty because their nomination comments show that they have taken a thorough look at their work and appreciated their efforts.  Did you know that you can also respond back to the congratulatory email you receive?  When you do so, your appreciation will be shared with the Evaluator who made the nomination and your student and/or course mentor if you name them. 

Here are some recent responses students gave upon receiving an Excellence award.

"I can't thank you enough for recognizing my efforts as it makes me even more determined to push thru all the hard work and earn my degree that I've wanted for so many years. Thank you for the boost of confidence!

Thank you so much for sharing this with me, and for choosing my task! This has definitely made my week if not my month.

Thank you so much for informing me of this award. This was a very testing term for me. When I submitted this paper I fully expected to get it back for revisions. When I saw the passing grade, I cried. And today as I read this email I cried again. Recognition of hard work does not flow my direction very often. It is very gratifying to receive positive feedback from a project like this one. Please thank the evaluator for me as well. Their comments were very kind and gracious.

Thank you so much for the kind words. That paper was a tough one but very enjoyable to strategize and write. I am immensely enjoying my WGU experience and it is an honor to receive the certificate.

Thank you for the recognition and honor. I greatly appreciate the evaluator that had taken the time to read my work. Please extend my warmest gratitude. You have made me smile so big today and had made my week!"

Upon receipt of the Capstone Excellence Award, students (now graduates) said:

"Thank you so much for this honor, and please extend my thanks to the evaluator who submitted my capstone. Finishing the capstone really did feel like the culmination of all my work at WGU thus far. The subject matter covered in the capstone covered numerous courses, stretching back to all my semesters at the school. With graduation in sight, I was able to apply for a promotion at my job. During the interview I got to explain the WGU concept to the panel interviewers, and they were quite impressed with the program's structure, as well as the knowledge I was able to demonstrate I had obtained. I got the promotion, contingent on finishing the degree (which I just did tonight!), and I will start my new position next week, with increased responsibilities and salary, all thanks to the degree earned at WGU. Thanks again for this honor, I have submitted the waiver/release and am flattered that my project may assist other students in the future."

"Just prior to graduation, I applied for a new job which was offered and accepted and I am now two weeks into my new role as a Senior Clinical Analyst developing nursing documentation screens and maintaining appropriate access for hospital staff to complete their roles. Although my previous position was when the facility required that I obtain a bachelor's degree, the education I acquired during the BSHI degree program has greatly contributed to me receiving more inquiries from others within my division of the HCA corporation.  For the first time in years, I feel that I now have something more to offer to future career opportunities.  Thank you WGU for all that you do!"

"I appreciate your email and the award that has been given. I would love to express the effect of my Capstone and my experience at WGU on my career and my professional dreams. I have been teaching for 13 years and although my students have done well overall and students have enjoyed my program, there has always been a small group of students who have "fallen through the cracks." They didn't require a lot of attention, and they weren't failing miserably, they simply didn't engage with the others and achieve at high levels. In beginning my masters' program I wanted to advance my career through education providing options for the future and learn how to better plan and design curriculum in order to ensure that all of my students were being "seen" by me as a teacher and provided with the best possible chance to succeed. The classes, tasks, and especially my capstone project provided the tools and the introspection and experimentation based on research to make this goal possible. Not only did I finish the program with a degree that advances my current career as a teacher and provides opportunities at a higher level if I would like to pursue that, but I was also given the tools necessary to begin a change that will allow me to motivate students who are lacking in natural motivation. I have been extremely impressed with WGU as a University and can't stop talking about it with everyone I know. I have felt completely supported with mentors, emails, materials, feedback, etc. I could not have been happier with the support that was available and given, nor with the classes and the requirements. I am so happy to have made this decision, and even happier to have gone through a process that has made me a better person!"

In addition, here is how a couple of our faculty members felt when receiving a student response back:

Evaluator: Thank you so much for this kind email.  It is such an amazing gift to be able to read this beautiful note from a student.  I am so grateful that you shared this with me.  I will continue to send excellence awards when appropriate and this note will definitely serve as an inspiration for those few extra moments it takes to make a difference in our important students' lives. 

Mentor: Thank you all as well, I want you to note that [Student Name] was accepted to WGU with no transfer credits and no previous college experience. But, he is my most hard working, diligent, accelerating student at the moment. And I am very proud of him! Thanks so much for appreciating his work.

Manager: I am so pleased our students appreciate that their excellent performances are recognized.  Their responses, like this and others, make my day.  I know that all evaluators and our managers spend many hours reviewing each submission and utilize our expertise to examine and assist the students towards educational competency.

If you want to earn these Excellence Awards, please speak with your mentors about ways that you may meet the quality standards associated with each Excellence Award type.  There is no application or self-selection process; these awards simply offer an exciting boost along your degree journey or a pleasing accomplishment for the capstone at the end.  Do your best in fully demonstrating the competency required by each task — even excel if you can within the time that you have committed, and they will come.  Remember that maintaining On Time Progress (OTP) to graduation is essential to attaining the most important recognition: your WGU diploma, the ultimate seal of your degree competence!

Ready to Start Your Journey?

HEALTH & NURSING

Recommended Articles

Take a look at other articles from WGU. Our articles feature information on a wide variety of subjects, written with the help of subject matter experts and researchers who are well-versed in their industries. This allows us to provide articles with interesting, relevant, and accurate information. 

{{item.date}}

{{item.preTitleTag}}

{{item.title}}

The university, for students.

  • Student Portal
  • Alumni Services

Most Visited Links

  • Business Programs
  • Student Experience
  • Diversity, Equity, and Inclusion
  • Student Communities

IMAGES

  1. WGU CAPSTONE PROJECT EXAMPLES

    wgu capstone project examples

  2. Wgu capstone mba

    wgu capstone project examples

  3. Effective Capstone Project Examples for You to Follow

    wgu capstone project examples

  4. Capstone Template

    wgu capstone project examples

  5. PASSED MY CAPSTONE! IT'S DONE! : r/WGU

    wgu capstone project examples

  6. WGU Capstone Project Writing Guide, Tips and Format

    wgu capstone project examples

VIDEO

  1. Tugas Uas POS (Point of Sale) Delphi

  2. خاوند کے بنیادی حقوق کیا ہیں؟

  3. UPCOMING AFFORDABLE PROJECT IN GURGAON || NEW AFFORDABLE

  4. FREE TEMPLATE SPARK AR INSTAGRAM FILTER MYRTOS

  5. Memory Grabber

  6. Girder Arch joining Bandra worli sea link and coastal road #trending #mumbai

COMMENTS

  1. New Way To Access Capstone Work Of Previous Grads

    Oct 20, 2015. To assist you with the culminating educational experience of completing a capstone for your degree, WGU has provided the Comprehensive Capstone Archive site. This Archive gives you access to the capstone work of other students in order to get an idea of how to proceed with your own individual capstone.

  2. For those who wanted to look at capstone examples. WGU has an ...

    115K subscribers in the WGU community. Place for Western Governors University students, faculty and alumni. ... For those who wanted to look at capstone examples. WGU has an archive of some of them. capstonearchives.wgu.edu Open. ... This helped me a lot when I did my capstone in regards to what exactly the people who receive your capstone are ...

  3. C769 IT Capstone (COMPLETED!!

    If you can't find them in Taskstream or elsewhere, don't hesitate to request them from the Course Mentors: For Task 1: Capstone Topic Approval form (IT_Capstone_Topic_Approval_Form.docx) (You should be able to find this in Taskstream.) C769 Pacing Guide (C769-Pacing Guide.pdf) (Ask the mentors for this!)

  4. Capstone guide

    The 5 Parts of the WGU Capstone Overview of the Capstone Process The Capstone Process. To complete the capstone project, candidates will follow a series of steps as presented below. The steps outlined are intended as a "big picture" overview of the journey that is the capstone project.

  5. C964 Resources

    WGU Capstone Excellence Archives#. The Capstone Excellence Archives include a wide range of completed projects. Reviewing them may help get ideas, provide inspiration, and understand the requirements. However, keep in mind that they all are above and beyond the requirements. Therefore, don't use these as examples of what's needed to meet the requirements.

  6. WGU B.Sc (IT) Degree

    In today's video, I tell you all about my experience studying for the IT Capstone Written Project (C769) class. I provide an overview of the course materia...

  7. Graduates Receive Recognition for Capstone Excellence

    Thirteen graduates' projects were chosen for exemplifying one or more dimensions of excellence: conquering challenges, expressing ingenuity, exhibiting mastery, and/or providing synergy. Projects are nominated for Capstone Excellence by their capstone evaluator and selected by the capstone facilitator in their program or college area.

  8. C769 Resources

    ugcapstoneit @ wgu. edu. Examples# These examples are on-par with average passing tasks. Use them as a guideline for what evaluators accept in fulfillment of the requirements. ... Though they often need reframing to fit the capstone requirements, past or present work projects make great capstone topics. The capstone project can be very similar ...

  9. QFT1 Business

    Time to Complete: 5 days. I actually had fun with this one. For my hypothetical company, I decided to create an IT consulting company. I already have a consulting LLC on the side so it was a logical conclusion. For the writing, I basically just wrote one section each day. It was a ton of writing but overall not too bad once I got into it.

  10. Repository/Database of Example Capstone Projects : r/WGU

    Repository/Database of Example Capstone Projects. When I first started at WGU, I seem to remember a repository of capstone projects. They were deemed to be the example of superior work by the faculty and made available for other students to get an idea of how to best craft their project. Now that I am getting ready to start my capstone, I ...

  11. QGT1

    QGT1 Business Management Capstone Business Plan-Passing Paper, No Revisions Needed! Coursework 91% (69) 14. Monica Cummings Task 2. Assignments 100% (8) 33. The Granite Guys Business Plan. Coursework 100% (50) 22.

  12. WGU

    Final Capstone project. Assignments 100% (5) 36. Dylan Adkins WGU Curriculum and Instruction Capstone Task 5. Coursework 95% (22) 7. Capstone Task 4 - Task 4. Coursework 80% (15) 13. C561 capstone task 1. Coursework 100% (18) Coursework. Date Rating. year. Ratings. C561 capstone task 1. 13 pages 2020/2021 100% (18)

  13. Graduates Receive Capstone Excellence Award

    The capstone is the culminating degree project where WGU students integrate all of their program competencies. It represents our candidates' best work, and most of their projects are archived with student permission, thereby available to other students, graduates, mentors, and accreditors to view.

  14. NURSING D226 :

    Change Proposal Template.docx. 1 Comprehensive Healthcare Change Proposal Student Name College of Health Professions, Western Governors University D226: BSNU Capstone Instructor Date f2 A1. Innovative Change Propose one innovative, value-based healthcare change that will affect patient. NURSING D226. Western Governors University. 440 views.

  15. New Capstone Archive and Excellence Awards Changes

    By focusing on the new Model and Excellence Archives, WGU will no longer routinely archive the work of its graduate students in the Comprehensive Capstone Archive. If you have already given permission by October 2015, your work will be added to the Comprehensive Capstone Archive; be advised that the site will be retired by the end of 2015.

  16. C868 Software Dev Capstone Ideas : r/WGU

    C868 Software Dev Capstone Ideas. Software Development Capstone. I'm starting on the Capstone project and want to reuse my Software II project. I was thinking that I could just add more search functionality, more user-friendly fields, and update the reporting functionality. Is that enough?

  17. WGU Capstone by Amanda Chenoweth on Prezi

    MSML Capstone Amanda Chenoweth VST2-C210 Western Governors University Core Competencies Core Competencies 3002.1.1: Practice of Management 3002.1.1: Practice of Management Identify patterns of organizational and employee behavior Apply theoretical actions to resolve or direct the

  18. WGU MSDA Capstone Project

    Western Govenors University MSDA Capstone ... The 5-page project Executive Summary can be found here. ... As an example, the subsection of the dataset for 2019-10-01 contains 115,259 rows of data. However, as this data contains recorded readings from a live data center, the number of hard drives and thus rows, changed daily as failed drives ...

  19. C769 IT Capstone Written Project (I Passed) : r/WGU

    C769 IT Capstone Written Project (I Passed) Information Technology. Hello Night Owls, I did the thing! I completed the capstone and completed my BS in IT degree! Pro Tip: Use the free version of grammarly for each section and correct all the obvious issues, then use the premium suggestions to correct the underlined sections to get the 100% ...

  20. What Do You Know About The WGU Excellence Awards

    Capstone Excellence Award - You can only earn this award for the capstone, which is the culminating project at the end of your degree journey. Besides Evaluator nomination, CEAs are reviewed and approved by Evaluation management. If chosen, you will be mailed a certificate and this recognition will be noted on your transcript.

  21. d181

    D181 Task 4 WGU Capstone Conclusions and Results New. 27 pages 2024/2025 None. 2024/2025 None. Save. ORM4 TASK 5 Presenting THE Capstone (Paper) ... WGU Capstone Research Template; TC Student IRB Application for Capstone Project; TC Stakeholder Information; Capstone Proposal revisednj; Proposal alignment to D180; Data Informed Practices- Task 2 ...

  22. CYBERSECURITY GRADUATE CAPSTONE

    CYBERSECURITY GRADUATE CAPSTONE — C796. So I had just submitted my Capstone proposal to the instructor for the approval and we had briefly discussed task 2 requirements. Task 2 lists sections a, b and c which appears not to be directly corresponding to the Rubric. The entire structure is so confusing it makes it difficult to even start ...

  23. Task 1

    WGU Capstone Proposal Template: Master of Science, Curriculum and Instruction Art and SEL in the Classroom Professional Development Emilee Adams December 12, 2022 Art and SEL in the Classroom Professional Development Research Type I will be conducting an action research study on the impact that a professional development program has on teachers ...