Gene Cthe_0193 details


Gene Information

Locus tag: Cthe_0193
Symbol:
ID: 4808611
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 234540
End bp: 235724
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 34%
IMG OID: 640105606
Product: sodium/hydrogen exchanger
Protein accession: YP_001036627
Protein GI: 125972717
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0025] NhaP-type Na+/H+ and K+/H+ antiporters
TIGRFAM ID:
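
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record for Cthe_0193.
length = gene_length_bp(234540, 235724)
print(length)                     # 1185
print(protein_length_aa(length))  # 394
```

Both results agree with the Gene Length (1185 bp) and Protein Length (394 aa) fields.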


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGCGA GTTTAGCAAT TATTTTATTA TTGGGATTAC CGGCAAAACG AATATTTGAA 
AAATTAAGAT TACCTGGTTT GTTGGGAATG TTGATATTAG GCATTCTTAT AGGACCTCAT
GTTCTTAATC TGCTTCAAAC GGATATGCTT CAAATATCTT CTGATTTAAG AAGTATTGCA
TTAATAATTA TTTTATTAAG AGCAGGACTC GGACTTAACA AGGATGAATT GAAAAGTATT
GGAATCCCTG CATTAAAGAT GAGCTGTATT CCTGGATTAT TTGAAGGTTT ATTTATTGCA
CTGGCTTCTG TATATTTTCT TGATTTTACA TTTGTTCAGG GAGGAATGTT AGGGTTTATT
ATTGCTGCTG TTTCTCCTGC TGTTGTAGTT CCTTTTATGC TTAAGTTAAA TGAAAATAAA
ATTGGTACAA AAAAAGGAGT ACCAACTTTG ATATTGGCAG GTGCATCTAT TGATGATGTC
TTTGCCATTA CGGTTTTTAG CGCCTTTTTG GGTTTATATT ATGGTTCTGA AATAAATATA
GGCATTCAAT TATTAAATAT TCCAATCTCG ATTTTATTGG GAATTTTGTC TGGAATTCTT
GTAGGGTTTT TACTGATAAA TATTTTTAAA AAATATAATA TCCCGGATAC AAAAAAGGTG
TTATTAATAC TTGGATTTTC AATATTACTC AATCAGTTGG AAAGTGTATT GAAAACTAAA
TTACGGATAG CATCCCTGCT TGGAGTTATG GCTATTGGCT TTATATTGAC TGAATATCTT
CCTGATACAG GCAAAAAACT TTCTGATAAA TTTAATAAGG TGTGGGTATT TGCAGAAATA
TTGCTGTTTG TATTAGTAGG AGCAGAGGTA AATGTAAATG TGGCATTAAA AGCAGGTGGA
ATAGGAATTA TATTGATATT GACAGGTTTA ATTGGGAGAA GCGTAGGTGT TGCTATTTCT
TTACTAGGCA CAGATTTTAA TTGGAAAGAA AGGGTGTTTT GTATTATTGC TTATATCCCA
AAGGCAACAG TGCAGGCTGC AATGGGTGCA GTGCCGCTGT CATTGGGAGT GGAATCAGGA
GACATAATCT TGGCGATTGC AGTATTAGCC ATATTAATTA CTGCACCGTT AGGAGCAATC
GGGATTCATT ATTCGGCAGA AAAGTTACTA ATAGAGAAAC AATAG
 
Protein sequence
MAASLAIILL LGLPAKRIFE KLRLPGLLGM LILGILIGPH VLNLLQTDML QISSDLRSIA 
LIIILLRAGL GLNKDELKSI GIPALKMSCI PGLFEGLFIA LASVYFLDFT FVQGGMLGFI
IAAVSPAVVV PFMLKLNENK IGTKKGVPTL ILAGASIDDV FAITVFSAFL GLYYGSEINI
GIQLLNIPIS ILLGILSGIL VGFLLINIFK KYNIPDTKKV LLILGFSILL NQLESVLKTK
LRIASLLGVM AIGFILTEYL PDTGKKLSDK FNKVWVFAEI LLFVLVGAEV NVNVALKAGG
IGIILILTGL IGRSVGVAIS LLGTDFNWKE RVFCIIAYIP KATVQAAMGA VPLSLGVESG
DIILAIAVLA ILITAPLGAI GIHYSAEKLL IEKQ
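
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch rather than the database's own method. It assumes the sequence is read in frame from its first base, and it uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in its set of permitted start codons, which does not matter here because the gene starts with ATG). The helper name `translate` is hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (also used by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the listing) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence listed above.
print(translate("ATGGCTGCGA GTTTAGCAAT TATTTTATTA"))  # MAASLAIILL
print(translate("GGGATTCATT ATTCGGCAGA AAAGTTACTA ATAGAGAAAC AATAG"))  # GIHYSAEKLLIEKQ
```

The two fragments match the beginning and end of the protein sequence in the record. Passing the full gene sequence to `translate` works the same way.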