Gene PHATRDRAFT_44555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44555 
Symbol 
ID7197800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp892640 
End bp894976 
Gene Length2337 bp 
Protein Length743 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178322 
Protein GI219115053 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.966071 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AATGTTTCAC TGGACATCCC ACTGATCTCA TTTTCTGCCT GAGGAAGTGT CCGATAGAAG 
GGCCCATTCA CCATGGCGGC ACAGCCAGCC GTAAGCCAAG GATCTACTCT TTTTGACTCA
GACAACGAGG ATGGACGGGT TTTCAAGGTA AACACGAAGT ACGCCAAAGA GTACGAGTCC
CGGAAACAAA AAGAGGAGTT GAGGCGAGTG CAGTTTCAAG GTGATGATGA CGATGGCGAG
AGTACTGACT CTTCTGAAGA CGAAGACGCA GAACTCCTTA CTGCCAAGCT GGACACGAAT
ATAATAAAGA CGATTAATAT ACTGCGAAGC AAAGACTCTC GTATTTACGA CCCTTCCGTA
AATTTCTTCG ACAACGCAGA AGACAGTGAA GAGGACTCAC TGCCTACGAA AGAGTCAAAG
TCGAAACCGA AGCGCTACAA GGATGTTGTC AGAGAACAGA TATTGAAACA AATGGACGAA
GATGTGCCAA TTGGAGAAGA AAACGATAAT GATTACGGGG TTCTGGCATC CGACGAAGCG
AAATCTCGAC TCGCCTACGA CAATCGCCAG CAAGAACTAC GAAAAGCCTT CGTCGACTCC
ACGACCGGCA AAAAGGGCTA TGGCATCGAT GACGACGGAG AAGATAGCGA CGATGATTCG
GATACCTTCT TAGTTGTTAA GAAAGTTAGC AAAGGAATTA CGGAAGACGA AGATACCGCA
GAAGCCCGTC AGGAATTTTT ACAAGAGATG GAAAAGCTTG AAAAGACGGC GCGCGACGAC
AACCGCGATT TCGTCGACCC AAAAGGCGAA GTAAAAGACG GCGAACGCTT TTTGCTTGAT
TTTTTCAAGC GACGTAATTG GCTAGAGAGA GATAATGGAG ATAGCGGGCC GGACGGAAAT
AGCATAAAGG GCATTCAACC CCCCAGACCG ATTGCGGGCG ATGGAAACGA ATCAGAGAAC
TCACTGGAAC AGCTGCACAA GACGGACGAC TTTGAAGCGC AGTATAACTT CAGATTTGAA
GAAGCTGCAG CGAAATCACA GTCAGGTGCG GATTTCTCAA TCATCGGTTA CGCCCGCGGG
CAAACAATGA ATACGCTCCG TCGTAAAGAC GAAAGTCGTA GAGATAAAAG ATTGAGTCGT
AAAGATCGCA AAGTAGCTGA TCGAACAGCC AAAGAAGAGC AGCTGAAGCG CCTGAAAAAT
GCCAAGAGAC AAGAAATGGA AGGAAAATTG AAGCAAGTTA AGTCAGTTCT TGGTGAGGTT
GAAAATCGTG GCGAAGCAGT GGACGAGGCT GCAATTCTGA AATTGCTCGA GGGTGATTTT
GATCCTGAAG AATTTGAGGT ACTGATGGAG AAAACGTACG GCGAAGATTT TTACGGAAAA
GAAGATTCGG AATGGCAGAA TGATAAGGAC GTGCGGGAGT CCTTAAAGCA CGATGAGGAC
GGCGACCTTC TCGTTGGCGA GGGTGACTCC GACGGTGGCT TATACGATAA CGTCGAAGAG
GATACCGAAA GCTACAAAGA TGGTCACGAA ACGCCGGCCG ATGAAAACGA CGAAGAAGGA
TGGCCAGAGG AAGAAGAAGT TAGAGAAGAA ATAGAAGAGA CAGAGCTGGA AAGAACTGTG
AAATTAAAAG TGGAAAACGA GCTATACAAA CTGGACTACG AAGACATTGT TGCAGATATT
CCCACTCGAT TTAAGTACCG TCAGGTTGAA GCCAATAATT TTGGGCTCTC TACGGAAGAG
ATTCTGCTAG CCCGGGATAC CACGTTGAAG CAGTTTGTTT CTCTGAAAAA ACTGGCGCCT
TATAACGAAG CCGGTGAGCA TTTTGTGGGC AGTAGGAAAC GGAGACGATT CCGCGATTTG
CTCAAGCAAG AGTTGGAAGA GACCGTAAAA AGCTCCAAAG CTGTGGCGGA AGAAGGCGCC
GACGAACCGG CAATGGAGGA TCGAACGCAA ACGAAAAAAC GTCGCCGCCT GAAGAAAGGA
AAGAAGGCTG AAAACGTACC TGGAGATGCT ACCGGTACGG CCAATTCTGA CATTTTGGAA
AGATCGGAGG AGACCGACGA GGGGCCGAAA ACGAAGCGGC GGCGAAGAAA GAAGCTGAAA
AAAGAAGAAC TAACCGATGG CAATCTCGAG AAGACGGAGA AAAAGACGAA CAAGCATAGT
ATCAAGGCAA GGCTTGAGTC CGAAGCTCAA GAAGAAAACA AGGTTGATAA ACGAAATCAC
CACACCAAGA AGCCAAGGCA CAAGAAAAAA AAGTCCGGGA TTGAAGGAGT ATCGCATTCT
CGACTTGAAT CGTATGGGCT ATAGATTTGT AAATCTAGAC GACCTGTCTT TACAGTT
 
Protein sequence
MAAQPAVSQG STLFDSDNED GRVFKVNTKY AKEYESRKQK EELRRVQFQG DDDDGESTDS 
SEDEDAELLT AKLDTNIIKT INILRSKDSR IYDPSVNFFD NAEDSEEDSL PTKESKSKPK
RYKDVVREQI LKQMDEDVPI GEENDNDYGV LASDEAKSRL AYDNRQQELR KAFVDSTTGK
KGYGIDDDGE DSDDDSDTFL VVKKVSKGIT EDEDTAEARQ EFLQEMEKLE KTARDDNRDF
VDPKGEVKDG ERFLLDFFKR RNWLERDNGD SGPDGNSIKG IQPPRPIAGD GNESENSLEQ
LHKTDDFEAQ YNFRFEEAAA KSQSGADFSI IGYARGQTMN TLRRKDESRR DKRLSRKDRK
VADRTAKEEQ LKRLKNAKRQ EMEGKLKQVK SVLGEVENRG EAVDEAAILK LLEGDFDPEE
FEVLMEKTYG EDFYGKEDSE WQNDKDVRES LKHDEDGDLL VGEGDSDGGL YDNVEEDTES
YKDGHETPAD ENDEEGWPEE EEVREEIEET ELERTVKLKV ENELYKLDYE DIVADIPTRF
KYRQVEANNF GLSTEEILLA RDTTLKQFVS LKKLAPYNEA GEHFVGSRKR RRFRDLLKQE
LEETVKSSKA VAEEGADEPA MEDRTQTKKR RRLKKGKKAE NVPGDATGTA NSDILERSEE
TDEGPKTKRR RRKKLKKEEL TDGNLEKTEK KTNKHSIKAR LESEAQEENK VDKRNHHTKK
PRHKKKKSGI EGVSHSRLES YGL