Gene PHATRDRAFT_47666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47666 
Symbol 
ID7202856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp469045 
End bp470497 
Gene Length1453 bp 
Protein Length434 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181913 
Protein GI219123191 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000959743 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAGCGA ATATTGGTAC TTCAGTCACC AACACTATTG TCGCCATCGG ACAGATGGGC 
GATGGGGATC AGCTCGAACG TGCGTTTGCT GGTGCCACCG TCCACGATAT GTTTAACTTT
TTGTCGGTGG CCGTACTCCT TCCTGTAGAA GTCATTACTG GATACCTCTA CCACTTGACT
AAGGCAATGG TCAAGAATGC ATCCACTGAG AAAGGAGATA AATGGGAAGG TCCCGTCAAG
GTACTTGTTG CTCCTATCGG TTCGAAGATT ATCATCGCGA ATAAGGATAT TATCAAGGCA
ATTGCCCAGA ATAAAGGCAG CTGTGATGAG GGCGACGGCT TCTACCCCAT GAACTGTACC
GAGGACAGCT ATTCTGGCTG CGGCAAGTCA TTTGGTCTTA TAAGTTGTGA CAAGAAGTCT
GGAGACTGTC CTGCTTTCTT CCAGTCTGAT GCGTCTGCCA AGGATGACAA GGTCTCCGGA
GGTGTCGTCT TTTTCATTTC TATTGTCATT CTTTTCACTT GTCTTGCCGG TCTTGTGACT
GTCCTCCAGA AAATGTTGCT CGGTATGTCT TCTCGTATTG TCTACAAGGC AACAAACATT
AACGGATACC TTGCCATTGT GATTGGTGCT GGTATCACCA TGGTAGTACA GTCTTCTTCC
ATTACGACCT CTACGTTGAC TCCTTTGGTT GGTATGGGAG CTCTCCGTCT TGAACAGATG
TTACCCCTTA CTCTCGGCGC TAATATTGGA ACAACAATGA CTGCCATCTT GTCAGCACTT
GTCACAGAAG GAACTGGATC TCTCCAGGTT GCACTAGCTC ATTTGTTCTT CAACTTGACT
GGTATTGCCA TCTGGTATCC CCTTCCTTTT ATGCGCAACG TTCCGCTTGA AGCTGCTCGT
AAACTTGGGC GGACAACACG AATCTGGCGT GGCTTTCCTT TTCTTTATAT TGCCGTGATG
TTCTTTCTTA TTCCGCTGCT TTTGCTTGGT CTCTCTTCTC TCTTTGAGGA TGGCAGTAAG
GGTTTCACTG TTCTTGGATC ATTTCTCACT ATTATCCTTG GCCTTGGAAT CCTGTACGTC
ATGTACTGGT GTCGTTACAA GGAGGGCCGG GAGAAATGCT CAAGTTGCAT GGCCGAGCGT
GAGAAGAAGC GCGTCATTAT GAAGGAGCTT CCTGACGACA TGATTTACCT CAAGGAACAC
ATGAAACGCC TCATTGAGCA CACTGGTCTT CCAATTGTGG AAGAGGAAGA CGCTGAAGCT
GGCAAAGAGA TTGACGAAGG TGATTCCGAT GAGGTAGAGG CCTAGGCTTC CGCCCCGTGA
GGTAGATATC ACTGATTCTA GAATTGGTTT GAGGTCCCCT CTAAAATTTG GTTGTTTTAT
TTGAATATTG TGCCAAGTCT GTTCTCGCTT GTTAAAGCTA GAGTTACTAT AAACCTTTGG
AATATACCAC GCT
 
Protein sequence
MGANIGTSVT NTIVAIGQMG DGDQLERAFA GATVHDMFNF LSVAVLLPVE VITGYLYHLT 
KAMVKNASTE KGDKWEGPVK VLVAPIGSKI IIANKDIIKA IAQNKGSCDE GDGFYPMNCT
EDSYSGCGKS FGLISCDKKS GDCPAFFQSD ASAKDDKVSG GVVFFISIVI LFTCLAGLVT
VLQKMLLGMS SRIVYKATNI NGYLAIVIGA GITMVVQSSS ITTSTLTPLV GMGALRLEQM
LPLTLGANIG TTMTAILSAL VTEGTGSLQV ALAHLFFNLT GIAIWYPLPF MRNVPLEAAR
KLGRTTRIWR GFPFLYIAVM FFLIPLLLLG LSSLFEDGSK GFTVLGSFLT IILGLGILYV
MYWCRYKEGR EKCSSCMAER EKKRVIMKEL PDDMIYLKEH MKRLIEHTGL PIVEEEDAEA
GKEIDEGDSD EVEA