Gene Tery_3205 details


Gene Information

Locus tag: Tery_3205
Symbol:
ID: 4243800
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 4899215
End bp: 4901035
Gene Length: 1821 bp
Protein Length: 606 aa
Translation table: 11
GC content: 35%
IMG OID: 638108207
Product: hypothetical protein
Protein accession: YP_722798
Protein GI: 113476737
COG category:
COG ID:
TIGRFAM ID:
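
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state its convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 4899215
END_BP = 4901035


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1821
print(protein_length(length_bp))  # 606
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.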


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.243951
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0338261
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCATAG AGATATTAAC CCTGTTTGCG GGTGCAGGAC TAGAACAAGC AACCAGTGTC 
ACATGGCATT GGGCAAGTAA ATGGGCAGCA AAGAAGGCTG ATGAAAAGAG TCAGGTAATA
CAGACAGCAT TACTAACTTC ATATTGTAAA GCTTTGCAAG TAATTCGTAA AGAGTCTCAA
CAACTACCTC AATGGAAACA AGAGAAAGCT GAAATTCAAG CTAGATTAAC AGAAATGGAG
AAAGATGCAC TCATGATATT AGATTTAGCA CAGACACAGA AGCATAGAAT GACTAGTATC
CCTCAAGGTG GAGAAGAACA AGCGATCACA AAAACAGTAA TGTCTAGACT GCAAAGTAAA
GATAAATGGC TCAAACATGC TCCAAAAGAA TTATCTTATT TAGTTGAAAC TAGATTAGTA
TCAATTACAG TGCATCAGTT TCGCCACGAA CTGAGAACAG AAAGCCGGGT ATTTCAAGCC
TTAGTCCACG ATATGTTAGC TACAGCAGGA GAAGATGTTA GTCGTTTGCT TGATGTCACT
AATAGAATAG AGAAAAGAGT AGATCAATTT GTGGGAGACT ACCGTCAAGA TATTGACAGA
ATTATTGTAG GTATTGGAGG AGCTAACTCC CACCTAGAAC AGTTAATATC ACTGTTAAGC
AGACAGATGA AGCAATCTCA AGCTATACCT GCTGACTTTG CTGCTCTGAT AGCGGATAAA
ACAGAAGGAT TTCTCGGACG AGGTTTTGTC TTTGACACGA TAGAGAATTT TATCCAAAAT
CAATCCAAAG GATATTTGAT CATTGAAGCA GATCCAGGAG TAGGTAAAAG TACAATTATA
GCCGAATATG TACGTCGCAC AGGTTGTATT GTCTATTTTA ATTTATTATC AGAAGGCAGA
ACTAGGGCTG AAGATTTTCT TAAGAGTGTG TGTACCCAAT TAATTGAACG TTATAATCTT
CCTTATGAAT TACCATTACA TCCAGATAAT ACCAGCAATG GGAACTTTCT GTCAAAACTA
CTGAATGAAA TTAGTATCCA ATTACTCCCT AAAGAAAAAC TTATAATAGC AGTGGATGCT
TTGGATGAAG TAGATTTAAG TAGTCAGAGT GGAGCAGCTA ATGTCCTATA TTTACCAGCA
AACTTACCTA ATAGAGTATA TTTTCTGCTT ACTAAGAGAT CGGATCCACT ACCATTAGTA
GTGTCGGCGC CACAGAAAAT TTTTGATTTA ATGATCTATC CTAATGAAAG TTTGAAAGAT
GTTAAACTAT TTATTTACCT AAGAACAAAG CGAACTACAG TTCAAGCATG GATTAAAACT
AGAGGATTAA CATTAGAAAA ATTTGTCGAA TCTATAGCTA GCAAAAGCGA AAATAATTTC
ATGTATTTAA AATATGTACT GGATGATATT GAGCAAGGCA GCTATAGAGA TATCAGCTTA
GAAAGTTTAC CCCAAGGCTT AGAGAAATAT TATGATCAAC ATTGGCATAG AATGAAAATG
AATGTTAAAC CATTGCCAAT TACTAAACTA AAAATTATAT ATTTCTTAAC AAAAACTCGT
AAGCCTGTAT CCTGTGGAAT GTTAGCTAAA TTTAGTGGAG AAATATCTTT GACAGTTCAA
GAAATATTGG ATGAATGGGC GCAATTTCTG CGGATTCATC AAACAGAAGG CAAGCCGGTG
TATAGCATAT ATCATGCTGC TTTTCAAGAT TTTCTCTACC GTAAAGATAT TGTTCAGGAA
GTAGAATCGT TAGTTGAAAT TGAAAGTATG GAAAAACAAA TCAAAAATAA TTTGTTAGGA
ATGGTTTATA AAAATGGTTA G
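
The GC content reported for this gene can in principle be recomputed from the gene sequence above. This is a minimal sketch, assuming GC content is defined as the percentage of G and C among all A, C, G and T bases; the record does not say how its 35% figure was derived or rounded. `gc_content` is a hypothetical helper, not part of any database tool.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T bases in seq.

    Spaces and line breaks (as in the 10-base blocks above) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


print(gc_content("ATGC"))  # 50.0
```

Passing the full gene sequence as a single string (pasted with its spaces and line breaks) gives the value to compare with the GC content field.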
 
Protein sequence
MLIEILTLFA GAGLEQATSV TWHWASKWAA KKADEKSQVI QTALLTSYCK ALQVIRKESQ 
QLPQWKQEKA EIQARLTEME KDALMILDLA QTQKHRMTSI PQGGEEQAIT KTVMSRLQSK
DKWLKHAPKE LSYLVETRLV SITVHQFRHE LRTESRVFQA LVHDMLATAG EDVSRLLDVT
NRIEKRVDQF VGDYRQDIDR IIVGIGGANS HLEQLISLLS RQMKQSQAIP ADFAALIADK
TEGFLGRGFV FDTIENFIQN QSKGYLIIEA DPGVGKSTII AEYVRRTGCI VYFNLLSEGR
TRAEDFLKSV CTQLIERYNL PYELPLHPDN TSNGNFLSKL LNEISIQLLP KEKLIIAVDA
LDEVDLSSQS GAANVLYLPA NLPNRVYFLL TKRSDPLPLV VSAPQKIFDL MIYPNESLKD
VKLFIYLRTK RTTVQAWIKT RGLTLEKFVE SIASKSENNF MYLKYVLDDI EQGSYRDISL
ESLPQGLEKY YDQHWHRMKM NVKPLPITKL KIIYFLTKTR KPVSCGMLAK FSGEISLTVQ
EILDEWAQFL RIHQTEGKPV YSIYHAAFQD FLYRKDIVQE VESLVEIESM EKQIKNNLLG
MVYKNG
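
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand rather than using a library. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). `translate` is an illustrative helper that stops at the first stop codon and does not handle ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first, second and third base each cycle T, C, A, G,
# with the third base varying fastest. "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above.
first_row = "ATGCTCATAG AGATATTAAC CCTGTTTGCG GGTGCAGGAC TAGAACAAGC AACCAGTGTC"
print(translate(first_row))  # MLIEILTLFAGAGLEQATSV
```

The output matches the first 20 residues of the protein sequence above. The final row of the gene sequence, which is in frame because every preceding row holds 60 bases, translates to the closing residues MVYKNG followed by the TAG stop codon.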