Gene PHATRDRAFT_46239 details

Gene Information

Locus tag: PHATRDRAFT_46239
Symbol:
ID: 7201197
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011677
Strand:
Start bp: 689896
End bp: 692319
Gene Length: 2424 bp
Protein Length: 778 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002180487
Protein GI: 219119454
COG category:
COG ID:
TIGRFAM ID:
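
The Start bp, End bp, Gene Length, and Protein Length fields are related by simple coordinate arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not say so, but this convention reproduces the listed 2424 bp). It checks the listed length and compares it with the number of bases that a 778 aa protein plus one stop codon would need. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 689896
end_bp = 692319
protein_length_aa = 778

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Bases needed to encode the protein plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)              # 2424
print(coding_bp)                   # 2337
print(gene_length_bp - coding_bp)  # 87
```

The listed span is 87 bp longer than 778 codons plus a stop codon would require, so the gene sequence given under Sequence may include bases beyond the stop codon.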


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.514117
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCGGT TGGCCTCCCG TAATCGAGGC TCATCGTTCC AAGGTCAGCT TGAGGCCGTT 
ATGGAGGTGG ACCTTCCTTG CACAAACATG AATACGGGAA GCAAATCTCC ACCTACCTCT
TCTTTCCGAA TCAATCCCGT GACCCCTTCG TCGAGAACGG TGGTCGCTGA GAAGAGTGCC
ATTTTGAGAA GACTACAAGG TCTTCCCGAA TCTGTTCAGG ACGATAACGA CACGGTAATG
TCTTCCGCAA CAACTTCTTC CAGGAAGTCG TGGTGGGGAC GACGCTCCGG AGCAGGAGTT
GTACGTGGCG GTAACGCGAG TTCTGCTGCT AGCGTTAACG CCAGTATGCG AAACACGAAG
AAATCCAAGC CTGAAAAGCA AAAAGTGGAA GCAGGCAAAG GGTTTCGAAG TCGGTTCAGG
CTGCAAGGAG CTCATAGTTC CCGCGACCCT GCATCGAGTA CCAAGACCAA CAAAAAGAAG
GCGCCTCAAG TAGACAGCGA TCAACTTGTC ATGGACTTCT TCGATGTACA AAGTATTGTA
TCGGAACCGC ACTTTGCACC GTTCGTAAAG TCGGCACGGA CGCGGGATCC AACCGAAACC
CATTCGGTCA CATCGCAACG ATCAATTGGG AGCTTTTTCA TGCGCTTCCG CGAACGAAAT
AATCGTGAAG AAGCTATATT GGCCACAGAA ATTACGGCTC ATCACGATGA AGAGGACGAA
GTCGATGATG ATATTACCCT CGCTTCCGGG CTGCTGTCAA GTAGCGGCTC ATATGGTTTT
ATACGCGATG AGGACTATTC GGATGGAGAC TTTTCGGCAT TTGTCCAACC CGAACCAAAA
CCCGGACAGC ACGACTCGTC TTTGCCAAAA GACCCGAAAG CTTCCCGTCT AGGGCATTTA
TTTGGTCGTA AGAAGAGATT CCGTAGGCCA CGTCGAGGCT CGGGAGATAG CAGCGCTTCC
TCTGTGACTG ATGGCAACGG TAGTCTTGCT AATGGAGCAG TTGCCACAAC TTCCACCAAC
GCTCTGCATT CTACAAAGGG TCGTAACAGT ATGCACAGCG GCGGTACCGA TTCTCCTACT
GAATCAGAGG ATCTTGAAAT TGAGGAAGAA GAGTTGGAAA AGCTATTGGA GATCTCGAAT
CACATCGCGA GCAATTCTAA CCGTGAAAAT GTGGCTCGTC CTATTCCCGT TTCCGAAAAG
AGACTTACTG CATTCCTGGC AGCGGCAGAA GCTGCTAGCA CCCGACCGTC CGTCTCCAAT
ACCATTGCAA ACTACGAAAT CGCTCCTCCG CCGGTCCTCA GCAACAAGCA AGAAAGCAAG
CAGGCTAAGG ATGGTATGAA ATTTGCCAAA AGTGCACGAA AAGCAGCAAA GGAAGCTCTT
AGCGCAGCAA AAGAAGCAAA GGCGGCCAAA GAAGCGGTGA ATTTCGCAAA CGATGATGTC
TTCATGCCAA TCATTGAAGA AATCGAGGAA GACGCGGAGG ATATCTCCTC CCAGATTCTT
TGCAAGCAAA CACCAGATCG GTGGTCTCCC TCGCCAACAC CTGATGTCAA TTCTGGATCT
TATGCACCAA AGCGACCACA TCGACGAATA CCCGAGGGGG ATGTTGGTGC AAACAAGAAT
AAGGATTGGC GCGCTCAAGT GTTCAGCCAT GTCACTTGTT CCAACGACGT TCACTGTGAT
GGAGTAGAAG AAGAGAAAGC AACTGAGTCT TGCTCCTTCA TGCCGGACGA GGACGACGAC
AGTGAGTCGG TATTTTCGCT GCTTAGTGGT CTCATGGCAA CCTGGGTTGC CGAGCAAGCT
CAGGAAGAGT TCAAGCAAGA TGGGATCGTT CCATTCCTGC AATTGCGAAG TTGCCTCAAG
CAAGGCGGAC CTATTGACAA CGCCCGCCTG TGCTACCGAG TCAGTTTTGC CAAGATTGAG
ATTCGGGAGT ACGAACGCAC GGTTGGCGAT AATCCTGCCT GTGGCTCCGG TCCTCCTATC
ACCATTGGAT GGGGTTACGT TCCCGGGGTA GAAGCCAACA TTGAAGAATA CGAAGCCACG
AGAGTGCCAA GGACCAAGAA GCAATACTAT TTGCCACCCG CCAAACGTAT ACACTTGCTT
ACTCAAGAAT GGCAATGTAC CGAAGAGCAA ATTCGAAAAG CCCGACGAGA GGCAACGTAC
ATCCAATATT GCCGTGAGAA GACAGCCTTT TCGAAAGCTG ACAAGGAAGC TGCCTTTTTG
CGCAAGGCAC AGCGACGGCA ACCAATTACC AACAACGCTA GCTGGCCTAC ATCAGATACC
AAGCGAGCAG TGTCGGCACC ACAGTCACCA GTGCTGCCGG GCATGAGCCT GGTTTAGATG
AAGAAAGTTT AGTTGGTAGT AGACGGGTCT AATTGCTGGT TGCACAACCA TTTTTGACGC
TGCCCCAAAC ATTAGTTGTA CTAC
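
The gene sequence above is printed in space-separated blocks of 10 bases, so the blocks have to be joined before the length or GC content can be compared with the Gene Length and GC content fields. A minimal Python sketch follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the last line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence into one uppercase string without whitespace."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# Last line of the gene sequence above, used as a small sample.
sample = clean_sequence("TGCCCCAAAC ATTAGTTGTA CTAC")
print(len(sample))                    # 24
print(round(gc_fraction(sample), 3))  # 0.417
```

Applied to every line of the gene sequence, the same helpers would be expected to return a length of 2424, matching the Gene Length field.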
 
Protein sequence
MNRLASRNRG SSFQGQLEAV MEVDLPCTNM NTGSKSPPTS SFRINPVTPS SRTVVAEKSA 
ILRRLQGLPE SVQDDNDTVM SSATTSSRKS WWGRRSGAGV VRGGNASSAA SVNASMRNTK
KSKPEKQKVE AGKGFRSRFR LQGAHSSRDP ASSTKTNKKK APQVDSDQLV MDFFDVQSIV
SEPHFAPFVK SARTRDPTET HSVTSQRSIG SFFMRFRERN NREEAILATE ITAHHDEEDE
VDDDITLASG LLSSSGSYGF IRDEDYSDGD FSAFVQPEPK PGQHDSSLPK DPKASRLGHL
FGRKKRFRRP RRGSGDSSAS SVTDGNGSLA NGAVATTSTN ALHSTKGRNS MHSGGTDSPT
ESEDLEIEEE ELEKLLEISN HIASNSNREN VARPIPVSEK RLTAFLAAAE AASTRPSVSN
TIANYEIAPP PVLSNKQESK QAKDGMKFAK SARKAAKEAL SAAKEAKAAK EAVNFANDDV
FMPIIEEIEE DAEDISSQIL CKQTPDRWSP SPTPDVNSGS YAPKRPHRRI PEGDVGANKN
KDWRAQVFSH VTCSNDVHCD GVEEEKATES CSFMPDEDDD SESVFSLLSG LMATWVAEQA
QEEFKQDGIV PFLQLRSCLK QGGPIDNARL CYRVSFAKIE IREYERTVGD NPACGSGPPI
TIGWGYVPGV EANIEEYEAT RVPRTKKQYY LPPAKRIHLL TQEWQCTEEQ IRKARREATY
IQYCREKTAF SKADKEAAFL RKAQRRQPIT NNASWPTSDT KRAVSAPQSP VLPGMSLV
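
The first line of the gene sequence translates, in frame, to the first 20 residues of the protein sequence. The sketch below illustrates this. It assumes the standard genetic code, because the Translation table field of this record is empty. `translate` and `CODON_TABLE` are illustrative names, not part of any database tool.

```python
from itertools import product

# Standard genetic code. This is an assumption: the record leaves
# "Translation table" empty.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGAATCGGT TGGCCTCCCG TAATCGAGGC TCATCGTTCC AAGGTCAGCT TGAGGCCGTT"
print(translate(first_line))  # MNRLASRNRGSSFQGQLEAV
```

Because `translate` stops at the first in-frame stop codon, any bases that follow the stop codon in the full gene sequence are ignored.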