Gene PHATRDRAFT_51848 details

Gene Information

Locus tag: PHATRDRAFT_51848
Symbol: myoC3
ID: 7200538
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011675
Strand:
Start bp: 110159
End bp: 113147
Gene Length: 2989 bp
Protein Length: 932 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002179793
Protein GI: 219118019
COG category:
COG ID:
TIGRFAM ID:
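
The reported gene length follows from the start and end coordinates if they are read as 1-based and inclusive. That reading is an assumption, but it reproduces the listed 2989 bp. A minimal sketch of the arithmetic, using a hypothetical helper `feature_length`:

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values taken from the record above.
gene_length = feature_length(110159, 113147)
print(gene_length)  # 2989

# A 932 aa protein needs 932 * 3 = 2796 coding nucleotides.
coding_nt = 932 * 3
print(gene_length - coding_nt)  # 193
```

The 193 bp not accounted for by codons would be consistent with the record marking the gene as spliced, though the sketch does not locate any intron.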


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGGCTG CCAATACGTC CAATTACGTT TACATTCGAT CCGAGGAGTA CGCGTGGGTC 
CCGGGTCGGT TGCTGGAACG CGACGGCACC CAAGCCATTG TTTCGGTACC CGTGTTTAAA
AATGAAGAGG AGGTACAGTC TGACGGGGGC CGAATTAAAC GACACGAAAA GGTGACTGTC
GATCTAGCGA CCTATCCGAA TGCGGCCTTG CTTCTACAGA ATGTGGACGA GCATGGCAAT
CTCAATGAAG TGGAGGATAT GGTGGATCTG CCGTTTCTAC ACGAGGTACG TTCCTGCTTC
TGGGTCCTTC CACCTTTCAA AATTTGACCG GGAAGCGCGG AAACGCTTAT TTACTGTTGG
TCTATGCATA CATTTACTGA CTTGCCGCTT GTGTACGTAC CATCCTTTCC GTACGTAGGC
TGCCATCCTC TACAACTTGA AAACTCGGCA TCAGCAACAA AAGCCTTATA CTCGCACCGG
CGATATCGTC ATCGCGTGTA ACCCGTACCA ATGGTTCGAA CGATTGTACA ATGAAGAAAC
ACGGGTTCAC TACTCACGAT CCCTCGTCTG GGATCCTCCG GACGGAGATC CGCGTCAGGG
TCTGGAACCG CACATTTACG AAGCCAGTGC GCTGGCGTAC CGCGGGCTGG CGGTGGACGG
GGAGGACCAG TCCATATTGG TGTCGGGAGA ATCCGGCGCG GGAAAAACCG AGTCGGTCAA
AATTTGTCTC AATCACATTG CGAGCGTTCA GCAAGGGCAC GCCCATGGCT CGGATGATGT
TGATTTTGAA TCGCCCATTG TGCAACGAGT CTTGGACAGC AATCCGCTTC TGGAGGCGTT
TGGGAACGCC AAGACGGTCC GCAACGACAA CTCCTCGCGG TTCGGAAAGT ATATTCGTTT
GCAGTTCGAC GCGGAGGATC CGGTAGATGC AGCGTACGCG GGTAGATCTG TTCCAAGCTG
CAGACTAGCC GGAAGCAAGT GCGAAGTCTA CCTCCTCGAA AAATCCCGTG TCGTGACGCA
CGAGGAGGAA GAACGAACCT ATCATATATT TTACCAGCTA CTGGCTGCGG ATGAGGACGT
GAAAACAAAA ATTTGGGGAG GACTCGCTGA TACCGACAAC GAGTCCTTTT CGTACGTTGG
ATTCACTGAT ACGGATACGA TTGAAGGAAA TAGCGACGCC GAAAGGTTTC AACACACAAT
TGATTCTCTG GCTTTGATCG GTATCAAGGA CGAAAAATTG ATGAACCTCA TGAGAGCTAT
ATGTATTGTC CTTCAACTGG GTAACCTGAT CTTTGAAAAA GACGAAAAAG ATGACACCCA
TACCGCCATT ACGTCCGAAG ATGAATTTAC AGCATTGGCA GAACTTATGG ACATTCCGAA
AGACGAACTC CTTCCAGCTC TTACGATTCG CACGATGCGA GCGCGAAATG AAGAATTCAA
AGTTCCACTC AACGAAGTCC AATCCAAAGA CTCATGCGAT GCCTTTGCAA AGGAAATTTA
TGCCAAAACC TTTTTGTGGC TGGTGCGCGC CATCAATGAT GCTACATGTG CCGAGCTGAA
TTATGACGGG AAGAAAAAGG CAAATTTTGC AGTAATCGGA CTGTTGGATA TTTTTGGTTT
CGAATCTTTT ACAACCAACC GTTTTGAGCA GCTTTGCATC AATTATGCCA ATGAAAAGCT
TCAACAGAAA TTTACACAAG ACATATTCCG TTCGGTTCAA GCAGAGTATG AGACAGAAGG
AATCGAATTG GAAGAGATCA CATACGATGA CAACACAGAT GTTTTGGATC TCGTGGAAGG
GCGCATGGGA CTTTTAGCCG TCTTGAATGA GGAATGCGTG CGACCAGGTG GCTCGGATAG
AGGATTTGTA TCAAAGGTGC AAGCAATGAA CAAAGAAAGT CCATGCTTTC TGCGAGAAAA
GCAGTTTGAA GAGTGCGTGT TTGGAGTACG ACACTTCGCT GGAAGAGTAA TCTATGATGC
GAATGGCTTC GTAACGAAAA ACATGGACAC GCTACCCTCA GACCTACAGG ATTGCGCGAA
AAAAAGCTCG AACATGATTT TGGTGCATGA GCTGAGCAAC GAGGCGATGA TGAATTCATT
GGAGGTAAAA ACGAAGAAAC CCCGTAAATC GTCACCAAAA GTAAAGAAAG CCCCACCTGC
AAAACGCGGA AGCAACCTTG TCGGAGATAC TGTTTGGACC AAATTCAAGA GCCAACTTAC
TTCGCTGATG ACGAACTTGA CCAAGACAAG GACGCGATAC ATCCGCTGTA TCAAACCCAA
TCCCTTAAAG GCACCTCTTG TAATGCAGCA TGTCTCTACA ATTGAACAGC TCAGGTGCGC
AGGTGTCGTT GCCGCAGTCA CCATCTCGCG TTCTGCCTTC CCCAATAGAT TAGAGCACGA
AGCTGTGTTG TACCGATTTA AATCTCTTTG GGGTAAGGGT GAGCAGCACT TAGCGGATCT
TAAAGTATTG GATATCGATG ATCCCGACCT AAAGTCAAGA ACTCTCGTCG ATCGACTTCT
GGGTTCTGCA CTCAAAGATC TTCAGAACCA AATAAACGAC GAAACTTTAG TGAAGGCGTT
TGTCATTGGG AACACAAGGG CTTACTTCCG GGCTGGTGCT CTTGAACATC TTGAGGCTGA
GCGAGTGAAA AAGTTGGGTG TTTGGGTTGT AGAGATTCAA AAGATTGCTC GAAAGTACAT
GGTTCGAGCT CGATACGGAA AAATGCGTTT TTGTACGATT GCGCTTCAAT CTTTTGCCCG
GAAGCGTCAC GCGAGAAGAA CATTTACTAT ATTGCGAAAC GCTTCCATTC TTCTTACATG
CTGGTATAGA TGTATCCGGG CAAAAAGAAA GCTTGCGAAG CTGAGCCGAG ATCAAAAGGC
GAGCATGATT CAAACACACT GGAGAATGGC TATCGCTATA ACGGAACTAA AGCGCTGTCG
TAAGGCCGCC GCCGTTATAC AGAGTATAGC CAGAGGAGCC TTGCAGCGC
 
Protein sequence
MVAANTSNYV YIRSEEYAWV PGRLLERDGT QAIVSVPVFK NEEEVQSDGG RIKRHEKVTV 
DLATYPNAAL LLQNVDEHGN LNEVEDMVDL PFLHEAAILY NLKTRHQQQK PYTRTGDIVI
ACNPYQWFER LYNEETRVHY SRSLVWDPPD GDPRQGLEPH IYEASALAYR GLAVDGEDQS
ILVSGESGAG KTESVKICLN HIASVQQGHA HGSDDVDFES PIVQRVLDSN PLLEAFGNAK
TVRNDNSSRF GKYIRLQFDA EDPVDAAYAG RSVPSCRLAG SKCEVYLLEK SRVVTHEEEE
RTYHIFYQLL AADEDVKTKI WGGLADTDNE SFSYVGFTDT DTIEGNSDAE RFQHTIDSLA
LIGIKDEKLM NLMRAICIVL QLGNLIFEKD EKDDTHTAIT SEDEFTALAE LMDIPKDELL
PALTIRTMRA RNEEFKVPLN EVQSKDSCDA FAKEIYAKTF LWLVRAINDA TCAELNYDGK
KKANFAVIGL LDIFGFESFT TNRFEQLCIN YANEKLQQKF TQDIFRSVQA EYETEGIELE
EITYDDNTDV LDLVEGRMGL LAVLNEECVR PGGSDRGFVS KVQAMNKESP CFLREKQFEE
CVFGVRHFAG RVIYDANGFV TKNMDTLPSD LQDCAKKSSN MILVHELSNE AMMNSLEVKT
KKPRKSSPKV KKAPPAKRGS NLVGDTVWTK FKSQLTSLMT NLTKTRTRYI RCIKPNPLKA
PLVMQHVSTI EQLRCAGVVA AVTISRSAFP NRLEHEAVLY RFKSLWGKGE QHLADLKVLD
IDDPDLKSRT LVDRLLGSAL KDLQNQINDE TLVKAFVIGN TRAYFRAGAL EHLEAERVKK
LGVWVVEIQK IARKYMVRAR YGKMRFCTIA LQSFARKRHA RRTFTILRNA SILLTCCMIQ
THWRMAIAIT ELKRCRKAAA VIQSIARGAL QR
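
Both sequences above are laid out in space-separated blocks of ten residues, so the whitespace has to be removed before computing a length or GC content. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGGTGGCTG CCAATACGTC CAATTACGTT TACATTCGAT CCGAGGAGTA CGCGTGGGTC "
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Running the same two functions on the full gene sequence should give a length matching the Gene Length field and a GC percentage comparable to the GC content field, assuming that field was computed over this same sequence.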