Group Project

After you have your group set, go to this link to accept the group project. Your team name should be your group name (Group 1 to Group 10). Note that this is an empty repository. You will fill it in. One of the group members will create a team and other members of the same group will join that team.

Key dates

Component	Due Date
First part - report submission	15/11/2023 23:59
First part - presentations	17/11/2023 During Class Hours
Final report submission	30/12/2023 23:59
Final presentation	3-5/1/2024 During Class/Lab Hours

What to do?

First Part

You should prepare a list for potential mutations that might have caused the disease.
You can investigate the pipelines for “Whole Exome Sequencing”, eg. GATK pipeline
You will use the troya server. You can connect the server through ssh. The server DNS address and password information will be provided by your instructor.
Your user name will be your group name. The data us shared with you at /disk2/ens210/data. Please do not duplicate this data.
You can use conda to install the tools that you need.
If you need to install a software that is not available through conda, you should ask your TAs to install it for you.

Second Part

You will choose one of the most critical missense mutations from Patient X that might have made him/her sick.
After you get an approval from your advisor, you will perform an evolutionary analysis on the protein that has the mutation of your interest.
This evolutionary research will include the following tasks:
- Homology search
- Phylogenetic tree construction
- Identification of paralogs and decide whether they have the “same” function with your protein of interest. If not, exclude them from MSA.
- Perform conservation analysis on the refined MSA.
- Write a report

Week 13

Complete the project.
Interpret the results as a group.
Write a report (see due date above).
Push your report.
Push your codes and plots.

Week 14

Make the final presentation.
Revise the report if asked by the instructor.
Push everything to the repository.

Report

Report should be submitted markdown format. The report should include the following sections.

Title
Introduction (½ page)
Results (2 pages)
Figures (min 4) with legends
Discussion (½ page)
Materials and Methods (½-1 page)
References

Consider one page as 500 words.

A markdown-formatted template is given below:

# Project Title

## Authors

## Introduction

* Introduce the data, the gene (protein), the disease

## Results

Figures should be embeded in this section.

## Discussion

* What are the limitations of this study? 
* What else would you do to confirm your mutation?

## Materials and Methods

* Explain your methids in detail, eg. what tools and parameters you used.

## References

First Presentation (17/11/2023)

5 min presentation
One group member can present.

Final Presentation (3-5/1/2024)

10 min presentation + 2 min QA ~ 12 min total.
It is recommended that each group member presents a part of the study.
The attendance will be taken and it will contribute to the final score on both days.

Method code and documentation on GitHub

All the code and steps should be given in an appropriate file such as python, bash. The code and steps should be documented in a markdown file such as method.md. The aim is to provide reproducible environment.

Grading

The first part will have 50% weight on your total score.
Report (20 points) Story flow, English, appropriate referencing
Depth of work (20 points)
GitHub documentation (20 points)
Presentation (20 points)
Individual contribution (20 points)
- Work load will also be considered here. Group work load should be equally distributed.
- Participation by asking questions to other groups will contribute to your score.

Notes on grading

Work load for each group member should be balanced. The work balance will be checked through the contributions of each member at GitHub repo. Each member should push their own work to the repository. The work load balance will be evaluated.
The group project will affect 10% of your total class score.
If you get 0 from any of the sections (see grading), the group will get 0 for the entire project.
Plagiarism will not be tolerated: the entire group will fail the project.