Gene PHATRDRAFT_54688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_54688 
SymbolmyoA3 
ID7202004 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp353352 
End bp356620 
Gene Length3269 bp 
Protein Length1027 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181360 
Protein GI219122035 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAGCT CGCACGATTC ACACGTATGG GTCCGAACCG ACCTGGTCGA AAGCGTCTTG 
GGTAACGACG GGACTTTACC CAAGGGTTGG CGTCCTCGGA AACGCACACG GGTAAGCGGT
GATGAGTGGG GATGGGTTCG GGCGGTTGTC CACCAAACGA CCAACTCGAC GAATACCGTT
GATGAGGCAA CGGCTACGAG TGCCTCGGAA GCAACTTCGC CGTTTGCACA CGTAAAATTA
CGAAAGACGA ACGCATTATC GCCGGCACAA AAGTCGCCCT CGTCCAGATG GACCGGATCT
CCGACACGAC TCGCCAAGAC GGTACAGATT ACGCTCACGG TAGATGATCC CCTGCTGGCC
AATGGAAAAC TCAATGGTGA AACCGTTACG TTCAGCTATA ATACGGGGGA ACAATCCAAA
GTCTGCGTCG CTAACGCCTG GTGGCGCGAG GGGCAACCGC CACCGGAAGA CTTGACCAGT
TTGGAACAAT TGCACGAGCC AGCGGTAGTC TTTTGCCTCC TCCAACGTTA CCAGTTAGAC
CACGTTTACA CATACACGGG CAAGATTCTT CTGGCATTGA ATCCATTCCA AACGTTACCC
ATTTACGGGG AAGAGATTAT GCGATTGTAC TGGCACACAA CCGGCTCGTC GTCGCCGAAA
GCCCAATACG AACGCCCACC ACCTCACATT TACGCTATTG CGGAGGACGC TTACAGATCC
ATGATGCGCT CACTCCAAAT CAATGCCTCG CGTGGAGAAA ATCAATCCAT TCTAGTATCG
GGCGAAAGTG GTGCTGGAAA AACCGTCACC ACGAAAATCA TTATGCGCTA CCTGGCGACT
CTGTCGGAGC AACGCTCCCA CACTTCCAGG GTAGGCATCG AGTCGCAAGT ACTTCAGAGT
AATCCCATTT TGGAATCATT CGGCAACGCC CGTACCGTTC GAAACGATAA TTCGTCACGC
TTTGGCAAAT TCATTGAAAT ATCCTTCCGG GACGGGTCTC TCGTATCGGC ATCGGTTGAA
ACGTACCTAC TGGAAAAGGT TCGGCTGATT TCGCAGTCGC CGGGTGAACG GAATTATCAC
ATTTTCTACG AAGCCCTGGT AGGTTTATCT TCAAAAGATG CTCAAAGCTT GGGTATTGCC
GACTCATCAC CACGAGACTT CCGCATGACG GCCGTGTCCG GTACCTTCGA TCGCCGCGAT
CAGGTACGCG ATGTTGATAC ATACCGCGAT TTGCGACAGG CCTTAGATAC AGTTGGTTTT
TCGACAGAAG AGCAGCACGG CCTATTCGTG GTAGTATGCG CATTGTTGCA CGCGTCGAAT
TTAACTCTGA CCGAGTACGG TCACGATGCG AGTGCATTGG ATGAATCGAA CCCTAGTTTG
CCTGCGACAA TTGCTTTGCT CGGAGTCGAT CCCGAGGATT TAAACAATGC CGTCTGTAGC
TGCGCTATCG AAGCTGGGGG GGAAATCTTG TTCAAGAATT TACCTGTGGA GAAGGCACAT
AAGGCAATGG AAGCTTTGAT CAAGGCCACC TATGGTGCGC TCTTCACGTT TATTGTGCGC
AAGATCAACT CAAAGATACA AGCACAACAC GATACAAGCG GATTATGGCA AGCTTCGATT
GGCGTTTTGG ATATCTTTGG TTTTGAAAGC TTCGAAGTGA ACTCCTTCGA GCAGTTGTGC
ATCAATTACT GTAACGAGGC GTTACAGCAA CAATTCAACA GGTTTGTATT CAAGTTGGAA
CAGCAAGAAT ATCACAAGGA GGGAATTGAC TGGTCCTTCA TTGCGTTTCC TGACAATCAG
GATGTTCTTG ATTTGATTGA AAAGCGTCAC GATGGAATAT TGTCCGTACT TGACGAACAA
TCCCGGCTGG GCCGATGTAC GGACAAGTCT TTCGCTCAAG CTATTTATGA GAAGTGCGGT
GCCCACCCTC GTTTTGAATC TTCCAAATCA CAGCAAGCCA TACTAGCATT TGGAATTCAG
CACTATGCTG GCTCCGTTGA ATACAACACG GCTAACTTTT TGGAAAAGAA CAGGGACGAC
TTGCCGAAAG AGACAACAGA ACTGCTTATG TCGAGTTCCA ATCCGTTTTT GGTTGGCCTT
GGAAAGATAC TTTGTGAAAA ATCAGTGGCA TTGAATGCTT CAAACTCAGC CATGTCGAGG
GGAAACCGGA AACAGTTGCA ACGCGCCGCC AGTTCCATCT TACGGGACAG CGTCGGCAGC
CAGTTCAGCT CACAATTACA GTTGCTACGG AAACGTATAG AATCAACAGC TCCGCATTAC
GTCCGGTGTC TTAAACCCAA TGACGATTTG GTACCAAATA GCTTTGATCC TTTGGTGATT
GCCGATCAAT TACGCTGTGC TGGTGTTCTA GAGGCGATTC GAGTGTCTCG AGTCGGATTT
CCGCACCGAT ATTTTCACGA TCACTTTGTG CAGCGCTACA GTTTACTGGT AGCTAAGCGG
CTGACCAAGC GAGGGCGAGG GCTGAACGGT TGTGACTCTT GCGGAAGTTT AGTGGAAGAG
TTACTCCCTC AGATTTCGAG TATTCTGGAT GATGAGGCAG TCTCCCCTTC CAAGAATCAT
CGTCCTACCG CGTAAGTCAT CCCTGATTTG CTGGTTACAG TTATTGCATC ACGCTCTCAC
AATGCTTTTC AAACAGAATC TCTCTTCTGG GAATGCAAAT GGGCAAAACA AAAGTTTTCC
TTCGTCGTCG TGCATTTGAA GCCTTGGAAC ACCTACGAGG ACTCAAAATG GAAAAGGCCG
CCTCAAAAAT TCAAGCATTT GGACGAATGA TCGTCGCGAA ACTCAATTAT GATATATCTG
TGTACGCTGC CGTTTTAATA CAAAACTTCT TTCGACAAAT CGGTGCATTC CGTCTTGAAC
GTGCGCAGAG AATCGAAGAT GCTGCCGAGA GAATTCAGTG CAGCTGGAGA AGTTACGATG
CACGAAGGAC AATGCAAGCT GCGCGTTACG TTGCCTGGTG GTGTCAGAGT ACTTATAGGG
GAAGTGTCGC CCGTCAGTTA TGTGCCTATT TATTTTTGGA CCGTAAGGTG TTGACGATCC
AACATGCTTG GAAATATTAT GCATCAACTC GAACTTTTCG TAAGTTACGC AAAGCGGTGG
TCCTTCTACA GTGTCGACAC CGTGGTCGTG TTGCCTATCG CGACTTGTGC AGACTGCGCC
GCGAAGCTCG AGACCTGTCT ACCGTTGCTG CTGAACGCGA TCAGCTTCGC CAGGAATCTC
AGCGTCTTCG TCGAGCGCTT GAGCACGCG
 
Protein sequence
MASSHDSHVW VRTDLVESVL GNDGTLPKGW RPRKRTRVSG DEWGWVRAVV HQTTNSTNTV 
DEATATSASE ATSPFAHVKL RKTNALSPAQ KSPSSRWTGS PTRLAKTVQI TLTVDDPLLA
NGKLNGETVT FSYNTGEQSK VCVANAWWRE GQPPPEDLTS LEQLHEPAVV FCLLQRYQLD
HVYTYTGKIL LALNPFQTLP IYGEEIMRLY WHTTGSSSPK AQYERPPPHI YAIAEDAYRS
MMRSLQINAS RGENQSILVS GESGAGKTVT TKIIMRYLAT LSEQRSHTSR VGIESQVLQS
NPILESFGNA RTVRNDNSSR FGKFIEISFR DGSLVSASVE TYLLEKVRLI SQSPGERNYH
IFYEALVGLS SKDAQSLGIA DSSPRDFRMT AVSGTFDRRD QVRDVDTYRD LRQALDTVGF
STEEQHGLFV VVCALLHASN LTLTEYGHDA SALDESNPSL PATIALLGVD PEDLNNAVCS
CAIEAGGEIL FKNLPVEKAH KAMEALIKAT YGALFTFIVR KINSKIQAQH DTSGLWQASI
GVLDIFGFES FEVNSFEQLC INYCNEALQQ QFNRFVFKLE QQEYHKEGID WSFIAFPDNQ
DVLDLIEKRH DGILSVLDEQ SRLGRCTDKS FAQAIYEKCG AHPRFESSKS QQAILAFGIQ
HYAGSVEYNT ANFLEKNRDD LPKETTELLM SSSNPFLVGL GKILCEKSLQ RAASSILRDS
VGSQFSSQLQ LLRKRIESTA PHYVRCLKPN DDLVPNSFDP LVIADQLRCA GVLEAIRVSR
VGFPHRYFHD HFVQRYSLLV AKRLTKRGRG LNGCDSCGSL VEEISLLGMQ MGKTKVFLRR
RAFEALEHLR GLKMEKAASK IQAFGRMIVA KLNYDISVYA AVLIQNFFRQ IGAFRLERAQ
RIEDAAERIQ CSWRSYDARR TMQAARYVAW WCQSTYRGSV ARQLCAYLFL DRKVLTIQHA
WKYYASTRTF RKLRKAVVLL QCRHRGRVAY RDLCRLRREA RDLSTVAAER DQLRQESQRL
RRALEHA