Gene PHATRDRAFT_49686 details

Gene Information

Locus tag: PHATRDRAFT_49686
Symbol:
ID: 7198318
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011691
Strand:
Start bp: 419068
End bp: 421458
Gene Length: 2391 bp
Protein Length: 686 aa
Translation table:
GC content: 56%
IMG OID:
Product: predicted protein
Protein accession: XP_002184460
Protein GI: 219128521
COG category:
COG ID:
TIGRFAM ID:
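
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive (which is consistent with the reported gene length), and the helper names `span_length` and `coding_length` are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 419068
END_BP = 421458
PROTEIN_LENGTH_AA = 686


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)

print(gene_len)            # 2391, matching the reported gene length
print(cds_len)             # 2061 nt for 686 codons plus a stop codon
print(gene_len - cds_len)  # 330 nt of the listed span lie outside the coding region
```

Under these assumptions, the listed gene span is longer than the protein strictly requires, which suggests the gene sequence includes some flanking non-coding bases around the open reading frame.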


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCACAGTCTC CCTGTTTTTT CAGCATGAGT CGGATTGCTG CACTGCGTGT GACTGTGCAT 
TCGTTCGCGA GTGTCAATAG CCGTTGGGTT GCCCGTCGCG GAGCGGGCCT CGATGGCGCT
CATCCACCAT CACAGCGACC CTTACCGGTA TCACCAGCAC ATCCACATTG CGTACAGGAA
CGTTCCTTCC AGTCCACTTC ACAACCGTTG TGGCTACCCA TCAAGGACGT CGAAGTGCTG
AGTCTCGTTG GCGATGGCGC TGTCCAGCAG AATGGATCGG ATCCCACCAT AGGCAGCGGC
CGCAGCACCG GCACCGCACG GTTGGTACAA ATCGTGGCCC CACCCGGATC GCTCGTCACC
GCGGGCGACG TCGTCGCCGT TCTCCAAACG GAACCGAACG GATCCACAAC GTCCGTACGG
TCCCCGCAAG ATGGGAAAAT TGTTGCCTGG ACCAAGACAC TCACGGAACG CGTGCAGCTC
GGCGACGTGC TCTTCCGCAT CGATACCGAC GCGGGGGACG ACGTCCCCCA CAACGACGTC
CACGAACTCG TGGCGCACGT CGCCACGCAC AACGGTACCC TACCCGACGA CGTCTTCCAG
ACGTACCTGG AGTGGACCGA CGTGCCGCGC ATACAAATCG TGGCCAACGT CTTGCGTGAT
CGCTTTCCCA CGCTGCGCGA ACGAGCCCTA GCGTTGTACA GCCGAGTCTT GGCCCTGCAG
CGATCCCAAC CGGAAACACC CGCTCGGCAA GTGGCGACCA CCGCCACCGA TATTGGTATA
CTCCTGTACC GGATGGGCGA TCTGGAACAA GCCTTGACGC ACTTGTCCTA CGCTCGGGAT
ATTCGCATCG AAACGTTGGG CCCGGAACAT CCCGAAACGG CGGCGGCACA CATACACATT
GGGGCCGTTT TGAACCAAAA AGGAGACTTG GACGGAGCCT TTGACAAATT TCAAACCGCC
TTGCAAGCAC AAATTGCGAA TCTCGGAGAA TCTCACGCCC TTGTGGCGGC TTCGTGGAAT
AATCTCGGAG CTATTCGCTA CCAAACTGGA CAATATGCCG AAGCACTCTC CCTGTACCGC
AAGGCACTCG CCATTCACCG AGAACTGCAC GGAGAAGCGC ACGCCGATAC CGCCGGGTCG
TACCACAATG TCAGTATTGC GCTCAAACAC GTCGGCGATA ACATGCCCAT GGCACTGGAG
CACTGTCAAA AGGCTTTGCA AATTCGTCGC GACGTTCTGG GTCCGGAAGC TCCCGATACA
GCCGCCAGTC ACTACGCACT CGGGCAGTTG TTGTCGGAAA TTGGACAGTG GGATGCCGCC
GTGGAGCAGT ACAAAGCGGC CGTGGCCATT CACGAGTCGG TCTATGGGCG ACAGTCCCCC
ATTACGGCGT CGGGATACAA CAATCTGGGT GCCGTCTACT ACCAACAACA GAACTACGCG
GCAGCCTTGA CGGAATACCG CAAGGGTTTG GACATTCTGC AAGCCGTACT ACCGTCCAAT
CACGCGGACG TGGCGGCGGC GTGGAACAAC GTCGGATTGG CACTAGCCCA ACAAGCGAGT
CGGGAACAAA ACGTGGCGAA ACTGGACGAA GCCTTGGCGG CGCACCGGCA CGCACGGGCG
ATTCTAGAAG AATCCTACGG ACCGGATCAT CCCAGTCTCG CAATGACCGT TGGGAGCATT
GGTAACGTAC TCAAGGCGCA ACAGCAATTT GATGCGGCCT TGTCCGAATT CCGTCACGCA
CACGTCCTCT TGGAAAAAGC GCTCGGTCCC GTCCACGCCG ACGTTGCTAG TTCGCACAAC
AACATTGGTC TCGTGTTGGC GCAACAAGCT CGTTTGGAAG AAGCACTGGC GGAGTACCGG
GCAGCTCAGA AAGCCTTTGC TGCGAGCTTG GGCGATACGC ATCCGCATAC GGGGTCGACG
CATTTCAATA TGGGTCTAGT TTTACAAGAA CTGCAGCGGA CTTCCGAGGC CAAGATCGAG
TATGACATGG CCCGTGCGGC GTGGACGGTA TCACTCGGCC GGGACCACGA TCATACGCAA
ATGGCCGCCC AAGCCGTGGA CATTCTGAAC GAGAATGGAT CCTGATGCTT GTAGAAGAGT
GGAACGTATT CGGATGGAAT TGAAAACTCT GATCGAGTTT CGAATGGTTT CGGCAGCATT
CCAATTGGGT ACGAGCACGC AACTTATCCT TTAAAGATTG TACTCAGCGG CCAGAAAGTG
GAACAGTGGA ACGTCTCTCG TGAAACTTGC TAGTTGGATC TGCCATCTAG TGTTTTTAGA
GTTTTCGCCT TCCCTTGGAC CGAAAGCTCG TGTCAGGTAA AAAATGTGCT CCAAAGGTTA
TTAATTTTTC TGTGGTTAAG GTGTAATCAG TAAAAGCAAT TCTTCAGAGT G
 
Protein sequence
MSRIAALRVT VHSFASVNSR WVARRGAGLD GAHPPSQRPL PVSPAHPHCV QERSFQSTSQ 
PLWLPIKDVE VLSLVGDGAV QQNGSDPTIG SGRSTGTARL VQIVAPPGSL VTAGDVVAVL
QTEPNGSTTS VRSPQDGKIV AWTKTLTERV QLGDVLFRID TDAGDDVPHN DVHELVAHVA
THNGTLPDDV FQTYLEWTDV PRIQIVANVL RDRFPTLRER ALALYSRVLA LQRSQPETPA
RQVATTATDI GILLYRMGDL EQALTHLSYA RDIRIETLGP EHPETAAAHI HIGAVLNQKG
DLDGAFDKFQ TALQAQIANL GESHALVAAS WNNLGAIRYQ TGQYAEALSL YRKALAIHRE
LHGEAHADTA GSYHNVSIAL KHVGDNMPMA LEHCQKALQI RRDVLGPEAP DTAASHYALG
QLLSEIGQWD AAVEQYKAAV AIHESVYGRQ SPITASGYNN LGAVYYQQQN YAAALTEYRK
GLDILQAVLP SNHADVAAAW NNVGLALAQQ ASREQNVAKL DEALAAHRHA RAILEESYGP
DHPSLAMTVG SIGNVLKAQQ QFDAALSEFR HAHVLLEKAL GPVHADVASS HNNIGLVLAQ
QARLEEALAE YRAAQKAFAA SLGDTHPHTG STHFNMGLVL QELQRTSEAK IEYDMARAAW
TVSLGRDHDH TQMAAQAVDI LNENGS