Gene PHATRDRAFT_47154 details


Gene Information

Locus tag: PHATRDRAFT_47154
Symbol:
ID: 7202052
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 634128
End bp: 636204
Gene Length: 2077 bp
Protein Length: 639 aa
Translation table:
GC content: 53%
IMG OID:
Product: iron ion binding protein
Protein accession: XP_002181240
Protein GI: 219121785
COG category:
COG ID:
TIGRFAM ID:
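
The gene length listed above is consistent with the start and end coordinates if both are read as inclusive, 1-based positions. That reading is an assumption about this record's convention, not something the page states. A minimal Python sketch of the arithmetic (the function name `gene_length` is hypothetical):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates.

    Assumption: both coordinates count as part of the feature.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(634128, 636204))  # 2077
```

Under a half-open convention the same coordinates would give 2076, so the listed 2077 bp supports the inclusive reading.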


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACCCC AAATTCGTCG GCGGCAAACG AAAAGCCGAC GGAATCGCGA TGGCCTGATT 
CGGTTCGCCC GTGACAGAAC GAGCCAAAAT GGCGAAGATG GGATTATCGA ACGATTATTT
CAACTACTAC CAACCGAAAG CGAGCGTTGG TGCGTCGACT TGGGCGCCTG GGACGGAGTT
CATTTGAGCA ACACTAATTC GCTGCTGGTT GCCACAGCTG ATGAGCATGT GTCTGCCGCA
AACAGCACTC TATGGCATGG TGTGCTAGTG GAAGCCGACA CAGACCGCTT TCAACACTTG
CAACAGCTTT ACGTCAATAG AGGCAATGTT TGTTTGAACG TATCAGTTTC CGGCATGTCT
GATTCGCCGC ACACACTGGA AAATATTTTG AAGACACACG GTGACCGAAT TAGCTTACCC
AGTGACTTTG ATTTTCTATG CATCGATATT GACGGTGCCG ACTACTGGGT GTGGCACGAT
CTATTGAAGT CGGAATCCTA CCGCCCTAGG GTTGTCTGCG TGGAGTTCAA TCCGACCATA
CCGGATGATT TAATTTACAT TCCAGAAAGA AGTGACGTGA TACGACAAGG GTGCAGTCTA
GCGGCGTTGG TAGAGCTTGC CAACGAATAC GACTACGTCC TGGTCGAAAC GACCTTGTAC
AATGCTTTTT TCGTCCCGAT ATCACTCTAC GACTCCTATC TGGCTGACGA AATTCCGGAT
ACTTCGATTG AAGTCCTACA CGAGACTACA ATGGGTACAG CCCTTTATCA GCTTTATGAC
GGTTCCATCA AGTTGTGGGG TTGCAAAAAA CTCCTCTGGC ATCGACTACC GATGGACGAG
TCCAAGGTGC AGATGCTACC GAGAGGGCAG CGTCAGTTTC CATTTGCCCC CCGCGAGACC
AAAAATTCAT TGGCAATGAG CCACGCTGTT GACTTGCGCG TTTGCTACAG CGAAACATCT
TCCGCCGACC AACGTGCAAT ATGCTCCGCC AATCTCGTTC GTCAGTTACA GAAGGACGGC
TTTTGCTACG TGCGTGGGAC AGGAATTGCT CGACAAACGT GCCAAAGAGC ACTGGAGGCG
ACGCACTCGC TGCTTCAAGA TGCGGACGAA TGTGTGCGTC GGTCGTGTTT GACCACAGAC
CGTGCTCGGC GCGGGTACAG TCCCATGTGT ACCGAGAATT TCAGTTCGCT GTTGGGAGAA
ACGGGACCGA ACGATTTGGT GCGCAAGTTT CGGGTAGGAC CCGTCGATGG ACACGAGGGG
GGCGGTGCGC TGCTGCAGCC GAATGTCTGG CCAGTCGAAG GGACGTGGGA CGCGCCGACG
GCGGCCGCTT TTCGCGCACA CGTCGAAGCG TACTACGGTT CCATTTGTGC GGCAGCCACG
ACGATGGTGA CGACCATTTG CCAAGGTATC TTGGCGATGT ATCCGGATCT CGAGGCCGCA
TTGGCTCCAC TGATGAAGGA ATCACTGGCA CACTCATCGA TTCTAACGCT TCTGGGATAC
CGCGTTGGCT CTCGTCACAA GGGTCGATCG AAAGGTCCGT TGGTGGCGGC GCATACGGAC
GTGGGCGTGA TTACCGTGCT CGTCTTCGAT GACGGTGATT GCGCAACCTT GCAACGCCGT
ACCGGACAGG GAGACTGGGA GGACGTTGTT TTGCCCGCGT CGGTGCCGGA CGATCCCATT
TTTGTCGTGA ACGTGGCGGA CTGTTTTTCC GAATTGAGTG GTGGACGTTT GCCTTCGACG
ATTCATCGAG TAGTCGCGCG ACCGGGAAAG ACGCAACCGC GCAACGGCTG TGCCTTATTT
GTGGGACTGG ATCCTCACGA AATGCTATGT ATCCAGGACG AAGCGATGAC GTACGAATGC
TGGCGCAAAC GACGGATCGC ACGAGCGCAA ACGGTACACC GAGAGTCGTC GTCGTCATAG
TGGTCCACAT CGGCCGAGTC TTGCTCTAGC GCAAGAACGC CTGTGTAAAT CGGCGTAAAC
GACAGTCGAC TATAGTGTTG ACCGGAGGTG TACGACAATT GCGTCGCGTG TCTTTGAAAG
GGTATATAGT TTATACTATA ATACATACTT TGTAATT
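
The gene sequence above is laid out in blocks of 10 bases, 60 per line, so it has to be joined into one string before any computation. The following is a minimal sketch of that cleanup and of a GC-content calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sample is only the first line of the sequence, so its GC value is not expected to match the 53% reported for the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Join a space- and newline-separated sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used as a small sample.
sample = "ATGGAACCCC AAATTCGTCG GCGGCAAACG AAAAGCCGAC GGAATCGCGA TGGCCTGATT"
seq = clean_sequence(sample)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full block to `clean_sequence` should yield a string whose length equals the listed gene length.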
 
Protein sequence
MEPQIRRRQT KSRRNRDGLI RFARDRTSQN GEDGIIERLF QLLPTESERW CVDLGAWDGV 
HLSNTNSLLV ATADEHVSAA NSTLWHGVLV EADTDRFQHL QQLYVNRGNV CLNVSVSGMS
DSPHTLENIL KTHGDRISLP SDFDFLCIDI DGADYWVWHD LLKSESYRPR VVCVEFNPTI
PDDLIYIPER SDVIRQGCSL AALVELANEY DYVLVETTLY NAFFVPISLY DSYLADEIPD
TSIEVLHETT MGTALYQLYD GSIKLWGCKK LLWHRLPMDE SKVQMLPRGQ RQFPFAPRET
KNSLAMSHAV DLRVCYSETS SADQRAICSA NLVRQLQKDG FCYVRGTGIA RQTCQRALEA
THSLLQDADE CVRRSCLTTD RARRGYSPMC TENFSSLLGE TGPNDLVRKF RVGPVDGHEG
GGALLQPNVW PVEGTWDAPT AAAFRAHVEA YYGSICAAAT TMVTTICQGI LAMYPDLEAA
LAPLMKESLA HSSILTLLGY RVGSRHKGRS KGPLVAAHTD VGVITVLVFD DGDCATLQRR
TGQGDWEDVV LPASVPDDPI FVVNVADCFS ELSGGRLPST IHRVVARPGK TQPRNGCALF
VGLDPHEMLC IQDEAMTYEC WRKRRIARAQ TVHRESSSS
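
The listed protein length can be checked against the gene length with simple codon arithmetic. The sketch below assumes one stop codon follows the last amino acid; the function name `cds_length_nt` is hypothetical. In the gene sequence above, bases 1918 to 1920 read TAG, which fits that assumption.

```python
def cds_length_nt(protein_length_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    if protein_length_aa < 0:
        raise ValueError("protein length must be non-negative")
    return 3 * protein_length_aa + (3 if include_stop else 0)


gene_len = 2077  # Gene Length from the record
cds_len = cds_length_nt(639)  # Protein Length from the record

print(cds_len)             # 1920
print(gene_len - cds_len)  # 157
```

The remaining 157 bases lie after the stop codon, which accounts for an unspliced gene being longer than three times its protein length.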