Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_51848 |
Symbol | myoC3 |
ID | 7200538 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 110159 |
End bp | 113147 |
Gene Length | 2989 bp |
Protein Length | 932 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179793 |
Protein GI | 219118019 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGGCTG CCAATACGTC CAATTACGTT TACATTCGAT CCGAGGAGTA CGCGTGGGTC CCGGGTCGGT TGCTGGAACG CGACGGCACC CAAGCCATTG TTTCGGTACC CGTGTTTAAA AATGAAGAGG AGGTACAGTC TGACGGGGGC CGAATTAAAC GACACGAAAA GGTGACTGTC GATCTAGCGA CCTATCCGAA TGCGGCCTTG CTTCTACAGA ATGTGGACGA GCATGGCAAT CTCAATGAAG TGGAGGATAT GGTGGATCTG CCGTTTCTAC ACGAGGTACG TTCCTGCTTC TGGGTCCTTC CACCTTTCAA AATTTGACCG GGAAGCGCGG AAACGCTTAT TTACTGTTGG TCTATGCATA CATTTACTGA CTTGCCGCTT GTGTACGTAC CATCCTTTCC GTACGTAGGC TGCCATCCTC TACAACTTGA AAACTCGGCA TCAGCAACAA AAGCCTTATA CTCGCACCGG CGATATCGTC ATCGCGTGTA ACCCGTACCA ATGGTTCGAA CGATTGTACA ATGAAGAAAC ACGGGTTCAC TACTCACGAT CCCTCGTCTG GGATCCTCCG GACGGAGATC CGCGTCAGGG TCTGGAACCG CACATTTACG AAGCCAGTGC GCTGGCGTAC CGCGGGCTGG CGGTGGACGG GGAGGACCAG TCCATATTGG TGTCGGGAGA ATCCGGCGCG GGAAAAACCG AGTCGGTCAA AATTTGTCTC AATCACATTG CGAGCGTTCA GCAAGGGCAC GCCCATGGCT CGGATGATGT TGATTTTGAA TCGCCCATTG TGCAACGAGT CTTGGACAGC AATCCGCTTC TGGAGGCGTT TGGGAACGCC AAGACGGTCC GCAACGACAA CTCCTCGCGG TTCGGAAAGT ATATTCGTTT GCAGTTCGAC GCGGAGGATC CGGTAGATGC AGCGTACGCG GGTAGATCTG TTCCAAGCTG CAGACTAGCC GGAAGCAAGT GCGAAGTCTA CCTCCTCGAA AAATCCCGTG TCGTGACGCA CGAGGAGGAA GAACGAACCT ATCATATATT TTACCAGCTA CTGGCTGCGG ATGAGGACGT GAAAACAAAA ATTTGGGGAG GACTCGCTGA TACCGACAAC GAGTCCTTTT CGTACGTTGG ATTCACTGAT ACGGATACGA TTGAAGGAAA TAGCGACGCC GAAAGGTTTC AACACACAAT TGATTCTCTG GCTTTGATCG GTATCAAGGA CGAAAAATTG ATGAACCTCA TGAGAGCTAT ATGTATTGTC CTTCAACTGG GTAACCTGAT CTTTGAAAAA GACGAAAAAG ATGACACCCA TACCGCCATT ACGTCCGAAG ATGAATTTAC AGCATTGGCA GAACTTATGG ACATTCCGAA AGACGAACTC CTTCCAGCTC TTACGATTCG CACGATGCGA GCGCGAAATG AAGAATTCAA AGTTCCACTC AACGAAGTCC AATCCAAAGA CTCATGCGAT GCCTTTGCAA AGGAAATTTA TGCCAAAACC TTTTTGTGGC TGGTGCGCGC CATCAATGAT GCTACATGTG CCGAGCTGAA TTATGACGGG AAGAAAAAGG CAAATTTTGC AGTAATCGGA CTGTTGGATA TTTTTGGTTT CGAATCTTTT ACAACCAACC GTTTTGAGCA GCTTTGCATC AATTATGCCA ATGAAAAGCT TCAACAGAAA TTTACACAAG ACATATTCCG TTCGGTTCAA GCAGAGTATG AGACAGAAGG AATCGAATTG GAAGAGATCA CATACGATGA CAACACAGAT GTTTTGGATC TCGTGGAAGG GCGCATGGGA CTTTTAGCCG TCTTGAATGA GGAATGCGTG CGACCAGGTG GCTCGGATAG AGGATTTGTA TCAAAGGTGC AAGCAATGAA CAAAGAAAGT CCATGCTTTC TGCGAGAAAA GCAGTTTGAA GAGTGCGTGT TTGGAGTACG ACACTTCGCT GGAAGAGTAA TCTATGATGC GAATGGCTTC GTAACGAAAA ACATGGACAC GCTACCCTCA GACCTACAGG ATTGCGCGAA AAAAAGCTCG AACATGATTT TGGTGCATGA GCTGAGCAAC GAGGCGATGA TGAATTCATT GGAGGTAAAA ACGAAGAAAC CCCGTAAATC GTCACCAAAA GTAAAGAAAG CCCCACCTGC AAAACGCGGA AGCAACCTTG TCGGAGATAC TGTTTGGACC AAATTCAAGA GCCAACTTAC TTCGCTGATG ACGAACTTGA CCAAGACAAG GACGCGATAC ATCCGCTGTA TCAAACCCAA TCCCTTAAAG GCACCTCTTG TAATGCAGCA TGTCTCTACA ATTGAACAGC TCAGGTGCGC AGGTGTCGTT GCCGCAGTCA CCATCTCGCG TTCTGCCTTC CCCAATAGAT TAGAGCACGA AGCTGTGTTG TACCGATTTA AATCTCTTTG GGGTAAGGGT GAGCAGCACT TAGCGGATCT TAAAGTATTG GATATCGATG ATCCCGACCT AAAGTCAAGA ACTCTCGTCG ATCGACTTCT GGGTTCTGCA CTCAAAGATC TTCAGAACCA AATAAACGAC GAAACTTTAG TGAAGGCGTT TGTCATTGGG AACACAAGGG CTTACTTCCG GGCTGGTGCT CTTGAACATC TTGAGGCTGA GCGAGTGAAA AAGTTGGGTG TTTGGGTTGT AGAGATTCAA AAGATTGCTC GAAAGTACAT GGTTCGAGCT CGATACGGAA AAATGCGTTT TTGTACGATT GCGCTTCAAT CTTTTGCCCG GAAGCGTCAC GCGAGAAGAA CATTTACTAT ATTGCGAAAC GCTTCCATTC TTCTTACATG CTGGTATAGA TGTATCCGGG CAAAAAGAAA GCTTGCGAAG CTGAGCCGAG ATCAAAAGGC GAGCATGATT CAAACACACT GGAGAATGGC TATCGCTATA ACGGAACTAA AGCGCTGTCG TAAGGCCGCC GCCGTTATAC AGAGTATAGC CAGAGGAGCC TTGCAGCGC
|
Protein sequence | MVAANTSNYV YIRSEEYAWV PGRLLERDGT QAIVSVPVFK NEEEVQSDGG RIKRHEKVTV DLATYPNAAL LLQNVDEHGN LNEVEDMVDL PFLHEAAILY NLKTRHQQQK PYTRTGDIVI ACNPYQWFER LYNEETRVHY SRSLVWDPPD GDPRQGLEPH IYEASALAYR GLAVDGEDQS ILVSGESGAG KTESVKICLN HIASVQQGHA HGSDDVDFES PIVQRVLDSN PLLEAFGNAK TVRNDNSSRF GKYIRLQFDA EDPVDAAYAG RSVPSCRLAG SKCEVYLLEK SRVVTHEEEE RTYHIFYQLL AADEDVKTKI WGGLADTDNE SFSYVGFTDT DTIEGNSDAE RFQHTIDSLA LIGIKDEKLM NLMRAICIVL QLGNLIFEKD EKDDTHTAIT SEDEFTALAE LMDIPKDELL PALTIRTMRA RNEEFKVPLN EVQSKDSCDA FAKEIYAKTF LWLVRAINDA TCAELNYDGK KKANFAVIGL LDIFGFESFT TNRFEQLCIN YANEKLQQKF TQDIFRSVQA EYETEGIELE EITYDDNTDV LDLVEGRMGL LAVLNEECVR PGGSDRGFVS KVQAMNKESP CFLREKQFEE CVFGVRHFAG RVIYDANGFV TKNMDTLPSD LQDCAKKSSN MILVHELSNE AMMNSLEVKT KKPRKSSPKV KKAPPAKRGS NLVGDTVWTK FKSQLTSLMT NLTKTRTRYI RCIKPNPLKA PLVMQHVSTI EQLRCAGVVA AVTISRSAFP NRLEHEAVLY RFKSLWGKGE QHLADLKVLD IDDPDLKSRT LVDRLLGSAL KDLQNQINDE TLVKAFVIGN TRAYFRAGAL EHLEAERVKK LGVWVVEIQK IARKYMVRAR YGKMRFCTIA LQSFARKRHA RRTFTILRNA SILLTCCMIQ THWRMAIAIT ELKRCRKAAA VIQSIARGAL QR
|
| |