Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_52173 |
Symbol | MSH5 |
ID | 7202038 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 547987 |
End bp | 549271 |
Gene Length | 1285 bp |
Protein Length | 376 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | muts-like protein 5 |
Protein accession | XP_002181399 |
Protein GI | 219122117 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGAACG GCGCCTTGCC TGATGACTTC GAATACGTAT TTTCGGAATC GCTCTCATAC TTCAAGAGTG CGGAGATGCG TCAACTTGAT CAAAATATTG GTGACCTCGA TGCTTTCATC AAAGATGCGG AAACATTGAT TGTTGCAGAA TTGGAGGACG AAATCCTGGA CCACGAATCG GAGCTCCGTG AGACCTTCAT CGCCCTGGCC GAGCTGGATT GCATACTCTC CTTCGCTGGT GCTGCTGCAG ACTTAGATTT CGTGCGCCCG CGCGTTGTCT CGGCGTCTGA ACAGTGCGTT GAGATAAGCC GGGGCCGCCA TCCTTTGCAA GAAATTGTTC TCGACACAGC ATTTGTACCC AACGATACCA CTATGAACAC GACAAGTCGT GTAACCGTAA TCACAGGCCC AAACTTTAGT GGCAAGAGTT GCTTTGCTCG CCAAGGTCAG TTCTTGAGTG AGGGTTTAAT TGCGATTGCC AAAAAAAATT CTCGGCGTTG TCTGACTATT CTATTCCTTG TACCTTGTAA GTCGGAGTAC TCGTCTATAT GGCACACATT GGCTGTTTTC TCCCCTGCGA TGAAGCACGT ATTTCGCTGA CAGATCAAAT TTTCACACAA TTCAGCTCTA CTGAAACATG TGCTGTCCCT CAAAGCAGCT TCCAACTTGA TCTCAGCCGT ATGGGAGCTA TTCTCCGTCG AGCGAGTCAG CATTCGCTGG TTTTGATTGA CGAATTTGGG AAAGGTAAGG TTCGATTTCG AAGCACATAA AAGAAATTCA GTGGAAGCTG ACATGCTTAT TATCCGCACA AGGTACAAGC CCGGCATCAG GAATCTCCCT ACTCACGGCG GCTCTGCAAA AGTTGGTATC TAATCGCTCG AAAGTAATCT GCACGACTCA TTTCCTAGAG ATCTTTTCAA TAGGATTGCT TGTGGACTCT GAGAATGGAA TTTCCGCGAT GCATATGACT GTGCATGTCC CTGAGACGGC CAATGATAGT GCTGTCCCAC TATTCCGAAT GGAGCACGGG ATCGCAAATT CGTCCGCTGG ACTCGTTTGC GCGAAAATGG CTGGCGTAAA AAAAGCCATC GTCGATCGCG CCTACGAGAT AATTAAGGCA ATCAAGAAAC GTCAAAAGGT CCATCCGCTT GCTGAACTTT TGCGCAATGA TATACACATG ACTCTCGACT CGAAGCATGC CATCAGATCC TTCGTCAGCA CGAGTTGGAG GGATGCTAGC GACGATCAAA TTGATGCATT CTTTTCTATA ACTGAAAGGA TGTAG
|
Protein sequence | MENGALPDDF EYVFSESLSY FKSAEMRQLD QNIGDLDAFI KDAETLIVAE LEDEILDHES ELRETFIALA ELDCILSFAG AAADLDFVRP RVVSASEQCV EISRGRHPLQ EIVLDTAFVP NDTTMNTTSR VTVITGPNFS GKSCFARQVG VLVYMAHIGC FLPCDEARIS LTDQIFTQFS STETCAVPQS SFQLDLSRMG AILRRASQHS LVLIDEFGKG TSPASGISLL TAALQKLVSN RSKVICTTHF LEIFSIGLLV DSENGISAMH MTVHVPETAN DSAVPLFRME HGIANSSAGL VCAKMAGVKK AIVDRAYEII KAIKKRQKVH PLAELLRNDI HMTLDSKHAI RSFVSTSWRD ASDDQIDAFF SITERM
|
| |