BIOL7200 Programming For Bioinformatics: Fasta Files

$20 Bonus + 25% OFF

Securing Higher Grades Costing Your Pocket?
Book Your Assignment at The Lowest Price
Now!

Students Who Viewed This Also Studied

BIOL7200 Programming For Bioinformatics

Task:

Assume that the user gives you correct inputs all the time. Your script will be graded on the output produced and not on how the errors are handled.

For this assignment, the module list is “multiprocessing”, “sys”, “re”, “os”, “subprocess”, and “argparse”.  Do not use input() for any input.

Instructions for submission

•Name your script as _parallel_ani.py

•Your code should run as ./ _parallel_ani.py -o [-t ] fasta_file1 fasta_file2 fasta_file3…

•Do not hardcode any file name!

•Use #!/usr/bin/env python3 as your shebang

•Your script should finish within 5min for 10 input genomes and 5 threads, partial credit will be awarded up to 10 min of run time.

Another common task in bioinformatics – running the same task for different inputs.  The use case here will be computing average nucleotide identity (ANI) for each pair (all-against-all pairs) of input fasta file.  The fasta files are microbial genome sequences.  The ANI is being calculated using MUMmer’s dnadiff.  Install the MUMmer package yourself from here: https://mummer4.github.io/.  Assume that dnadiff is in our environment PATH.

Your objective is simple – you are given a set of files, say A.fasta, B.fasta and C.fasta.  You must calculate the pairwise distance between them and print them in a matrix format as follows (Each column is tab-separated):

A.fasta

B.fasta

C.fasta

A.fasta

100

B.fasta

100

C.fasta

100

The threads argument is key.  The threads specify how many parallel instances of pairwise ANI calculations should be performed.  So, if the user says -t 3, launch 3 ANI computations (obviously for different pairs) simultaneously.  Your program should finish ~3 times faster for -t 3 when compared to a single thread (-t 1).

Helpful notes:

1.You will have to run dnadiff like this:

dnadiff -p file1.fasta file2.fasta

E.g., dnadiff -p output123 genome1.fasta genome2.fasta

This will create a bunch of files starting with output123

2.The file with extension .report will have the ANI (in the 19th line – both numeric columns will have the same values).

BIOL7200 Programming For Bioinformatics

Answer in Detail


Solved by qualified expert

Get Access to This Answer

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Hac habitasse platea dictumst vestibulum rhoncus est pellentesque. Amet dictum sit amet justo donec enim diam vulputate ut. Neque convallis a cras semper auctor neque vitae. Elit at imperdiet dui accumsan. Nisl condimentum id venenatis a condimentum vitae sapien pellentesque. Imperdiet massa tincidunt nunc pulvinar sapien et ligula. Malesuada fames ac turpis egestas maecenas pharetra convallis posuere. Et ultrices neque ornare aenean euismod. Suscipit tellus mauris a diam maecenas sed enim. Potenti nullam ac tortor vitae purus faucibus ornare. Morbi tristique senectus et netus et malesuada. Morbi tristique senectus et netus et malesuada. Tellus pellentesque eu tincidunt tortor aliquam. Sit amet purus gravida quis blandit. Nec feugiat in fermentum posuere urna. Vel orci porta non pulvinar neque laoreet suspendisse interdum. Ultricies tristique nulla aliquet enim tortor at auctor urna. Orci sagittis eu volutpat odio facilisis mauris sit amet.

Tellus molestie nunc non blandit massa enim nec dui. Tellus molestie nunc non blandit massa enim nec dui. Ac tortor vitae purus faucibus ornare suspendisse sed nisi. Pharetra et ultrices neque ornare aenean euismod. Pretium viverra suspendisse potenti nullam ac tortor vitae. Morbi quis commodo odio aenean sed. At consectetur lorem donec massa sapien faucibus et. Nisi quis eleifend quam adipiscing vitae proin sagittis nisl rhoncus. Duis at tellus at urna condimentum mattis pellentesque. Vivamus at augue eget arcu dictum varius duis at. Justo donec enim diam vulputate ut. Blandit libero volutpat sed cras ornare arcu. Ac felis donec et odio pellentesque diam volutpat commodo. Convallis a cras semper auctor neque. Tempus iaculis urna id volutpat lacus. Tortor consequat id porta nibh.

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Hac habitasse platea dictumst vestibulum rhoncus est pellentesque. Amet dictum sit amet justo donec enim diam vulputate ut. Neque convallis a cras semper auctor neque vitae. Elit at imperdiet dui accumsan. Nisl condimentum id venenatis a condimentum vitae sapien pellentesque. Imperdiet massa tincidunt nunc pulvinar sapien et ligula. Malesuada fames ac turpis egestas maecenas pharetra convallis posuere. Et ultrices neque ornare aenean euismod. Suscipit tellus mauris a diam maecenas sed enim. Potenti nullam ac tortor vitae purus faucibus ornare. Morbi tristique senectus et netus et malesuada. Morbi tristique senectus et netus et malesuada. Tellus pellentesque eu tincidunt tortor aliquam. Sit amet purus gravida quis blandit. Nec feugiat in fermentum posuere urna. Vel orci porta non pulvinar neque laoreet suspendisse interdum. Ultricies tristique nulla aliquet enim tortor at auctor urna. Orci sagittis eu volutpat odio facilisis mauris sit amet.

Tellus molestie nunc non blandit massa enim nec dui. Tellus molestie nunc non blandit massa enim nec dui. Ac tortor vitae purus faucibus ornare suspendisse sed nisi. Pharetra et ultrices neque ornare aenean euismod. Pretium viverra suspendisse potenti nullam ac tortor vitae. Morbi quis commodo odio aenean sed. At consectetur lorem donec massa sapien faucibus et. Nisi quis eleifend quam adipiscing vitae proin sagittis nisl rhoncus. Duis at tellus at urna condimentum mattis pellentesque. Vivamus at augue eget arcu dictum varius duis at. Justo donec enim diam vulputate ut. Blandit libero volutpat sed cras ornare arcu. Ac felis donec et odio pellentesque diam volutpat commodo. Convallis a cras semper auctor neque. Tempus iaculis urna id volutpat lacus. Tortor consequat id porta nibh.

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Hac habitasse platea dictumst vestibulum rhoncus est pellentesque. Amet dictum sit amet justo donec enim diam vulputate ut. Neque convallis a cras semper auctor neque vitae. Elit at imperdiet dui accumsan. Nisl condimentum id venenatis a condimentum vitae sapien pellentesque. Imperdiet massa tincidunt nunc pulvinar sapien et ligula. Malesuada fames ac turpis egestas maecenas pharetra convallis posuere. Et ultrices neque ornare aenean euismod. Suscipit tellus mauris a diam maecenas sed enim. Potenti nullam ac tortor vitae purus faucibus ornare. Morbi tristique senectus et netus et malesuada. Morbi tristique senectus et netus et malesuada. Tellus pellentesque eu tincidunt tortor aliquam. Sit amet purus gravida quis blandit. Nec feugiat in fermentum posuere urna. Vel orci porta non pulvinar neque laoreet suspendisse interdum. Ultricies tristique nulla aliquet enim tortor at auctor urna. Orci sagittis eu volutpat odio facilisis mauris sit amet.

Tellus molestie nunc non blandit massa enim nec dui. Tellus molestie nunc non blandit massa enim nec dui. Ac tortor vitae purus faucibus ornare suspendisse sed nisi. Pharetra et ultrices neque ornare aenean euismod. Pretium viverra suspendisse potenti nullam ac tortor vitae. Morbi quis commodo odio aenean sed. At consectetur lorem donec massa sapien faucibus et. Nisi quis eleifend quam adipiscing vitae proin sagittis nisl rhoncus. Duis at tellus at urna condimentum mattis pellentesque. Vivamus at augue eget arcu dictum varius duis at. Justo donec enim diam vulputate ut. Blandit libero volutpat sed cras ornare arcu. Ac felis donec et odio pellentesque diam volutpat commodo. Convallis a cras semper auctor neque. Tempus iaculis urna id volutpat lacus. Tortor consequat id porta nibh.

15 More Pages to Come in This Document. Get access to the complete answer.

More BIOL7200 BIOL7200 Programming For Bioinformatics: Questions & Answers

Management

This assessment item relates to the course learning outcomes as stated in the Course Profile.Assignment Specification For this assignment you are required to develop a Graphical User Interface (GUI) java program to demonstrate you can use Java constructs including input/output via a GUI, Java primit …

View Answer

CN6121 Artificial Intelligence

1. Build an Artificial Neural Network in Matlab to classify breast cancer data from the Wisconsin database. Use 4-fold cross validation for training, testing and validating the neural network.
2. Write a 2,500-word research report which should include the following sections:
a) Review of the relev …

View Answer

MOD003264 Digital Security

Question:
Whilst your target audience has some level of IT knowledge, they have employed you as the subject expert. Answers to each question should be provided at a level of technical detail sufficient to that target audience in the given scenario. You should apply the knowledge gained from the&n …

View Answer

Information Technology

This assignment is a case study report and you are required to working in a python.You must develop your own unique and original dataset – copying a dataset from the internet will result in reduced or zero marks. …

View Answer

Content Removal Request

If you are the original writer of this content and no longer wish to have your work published on Myassignmenthelp.com then please raise the
content removal request.

Choose Our Best Expert to Help You

Jane Sima

Ph.D in Psychology with Specialization in Industrial-Organizational Psychology

1592 – Completed Orders

Hire Me

Haidil Jiang

PhD in Law with Specialization in Civil Law and Aviation Law

1047 – Completed Orders

Hire Me

Still in Two Minds? The Proof is in Numbers!

33845 Genuine Reviews With a Rating of 4.9/5.

Management

Essay: 14 Pages, Deadline:
13 days

Its my 3rd time using myassignmenthelp. Fast in amendments and able to amend until you are satisfies with the work.

User ID: 8***97 Singapore

Project Management

Assignment: 1 Page, Deadline:
2 days

Thank you. You are just amazing. I can totally rely on you and never get late for any assignment and always get good marks with your work. on top of t …

User ID: 3***08 Australia

Management

Assignment: 0.4 Pages, Deadline:
4 days

the work was very good. I appreciate the effort of the writter. the price was also resonable

User ID: 8***85 United Kingdom, Great Britain

Management

Programming: 0 Pages, Deadline:
10 days

Good work team. At the end the assignment got satisfactory. Keep up with the work.

User ID: 8***99 Australia

Statistics

Programming: 1.4 Pages, Deadline:
1 day

I received an A on this assignment. It was well written, designed well, and informative.

User ID: 8***55 United States

Assignment

Essay: 4.8 Pages, Deadline:
21 days

Thanks for help I will choose with pleasure to do all my essay with Good and professional

User ID: 8***70 United Kingdom, Great Britain

Marketing

Assignment: 12 Pages, Deadline:
21 days

Executed right on time for me to hand in my work. Nonetheless i hope to get more help in the future. Thank you

User ID: 2***79 London, Great Britain

Pharmacy

Assignment: 6 Pages, Deadline:
18 days

good stuff, would use again in future. nice tutors and very helpful thank you to the team that helped me

User ID: 4***37 Sydney, Australia

Healthcare

Assignment: 3 Pages, Deadline:
13 days

Thanks for help I received a distinction for this assignment.Powerpoint was visually great and the transcript complemented it.

User ID: 4***42 Adelaide, Australia

Healthcare

Essay: 2 Pages, Deadline:
29 days

I received an A on this paper and infographic. It was well written and designed well.

User ID: 5***82 United States

Management

Essay: 14 Pages, Deadline:
13 days

Its my 3rd time using myassignmenthelp. Fast in amendments and able to amend until you are satisfies with the work.

User ID: 8***97 Singapore

Project Management

Assignment: 1 Page, Deadline:
2 days

Thank you. You are just amazing. I can totally rely on you and never get late for any assignment and always get good marks with your work. on top of t …

User ID: 3***08 Australia

Management

Assignment: 0.4 Pages, Deadline:
4 days

the work was very good. I appreciate the effort of the writter. the price was also resonable

User ID: 8***85 United Kingdom, Great Britain

Management

Programming: 0 Pages, Deadline:
10 days

Good work team. At the end the assignment got satisfactory. Keep up with the work.

User ID: 8***99 Australia

Statistics

Programming: 1.4 Pages, Deadline:
1 day

I received an A on this assignment. It was well written, designed well, and informative.

User ID: 8***55 United States

Assignment

Essay: 4.8 Pages, Deadline:
21 days

Thanks for help I will choose with pleasure to do all my essay with Good and professional

User ID: 8***70 United Kingdom, Great Britain

Marketing

Assignment: 12 Pages, Deadline:
21 days

Executed right on time for me to hand in my work. Nonetheless i hope to get more help in the future. Thank you

User ID: 2***79 London, Great Britain

Pharmacy

Assignment: 6 Pages, Deadline:
18 days

good stuff, would use again in future. nice tutors and very helpful thank you to the team that helped me

User ID: 4***37 Sydney, Australia

Healthcare

Assignment: 3 Pages, Deadline:
13 days

Thanks for help I received a distinction for this assignment.Powerpoint was visually great and the transcript complemented it.

User ID: 4***42 Adelaide, Australia

Healthcare

Essay: 2 Pages, Deadline:
29 days

I received an A on this paper and infographic. It was well written and designed well.

User ID: 5***82 United States

Have any Query?
Reference no: EM132069492

WhatsApp
Hello! Need help with your assignments? We are here

GRAB 25% OFF YOUR ORDERS TODAY

X