Gene Cthe_0163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0163 
Symbol 
ID4808651 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp201657 
End bp202931 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content45% 
IMG OID640105574 
ProductGTP1/OBG subdomain-containing protein 
Protein accessionYP_001036597 
Protein GI125972687 
COG category[R] General function prediction only 
COG ID[COG0536] Predicted GTPase 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR02729] Obg family GTPase CgtA
[TIGR03595] Obg family GTPase CgtA, C-terminal extension 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGTTG ACAGAGCGAG GATATATATA AAAGCCGGGG ACGGCGGAGA CGGCGCGATA 
TCCTTTCACA GGGAAAAGTA CATAAGCAAG GGCGGCCCGG ACGGCGGAGA CGGCGGAAAA
GGCGGGGATG TAATCTTTGT TGTGGATGAA GGATTAAGAA CGCTTCAGGA CTTCAGATAT
AAAACCCGCT ACAGAGCCGA AGACGGGCAA AACGGCGGCA GCTCCAACTG TTCAGGCAGA
AGCGGGGAAG ATTTGATAAT AAAGGTCCCT CCGGGAACTT TGGTAAAGGA TGAGCAAACA
GGCAGGATTC TCGCCGACCT TGTAAAGCCC GGTAAGAAAG TTGTAATTGC AAAAGGCGGC
AAGGGTGGAG CCGGAAATCA GCATTTTGCG ACTCCGACAA GGCAGGTGCC AAGTTTTGCA
AAACCGGGGG AGCCGGGAGA AGAGCTGTGG GTGATATTGG AGCTTAAACT CCTGGCAGAC
GTGGGACTGA TAGGTTTTCC CAATGTGGGC AAATCCACAA TTCTTTCAAT GGTTACGGCT
GCCCAGCCCA AGATTGCAAA TTACCATTTT ACAACAATAA ACCCCAATTT GGGTGTTGTA
AACATTGACG CCGAGAACGC CTTTGTAATG GCAGACATAC CGGGACTTAT TGAAGGTGCG
CATCAAGGCG TGGGATTGGG TCATGAATTT TTAAAACATA TAGAAAGAAC AAAGCTTCTT
ATTCATGTGG TGGATATTTC AGGATCCGAG GGAAGAGACC CTGTCCAGGA TTTTGAAGTA
ATAAATGAAG AACTTAAAAA ATATAACCCT GTACTTTGTG AAAGGCCCCA GATTATTGCA
GCAAACAAGA TGGATGTCAC GGGAGCGGAG GAAAATCTTG AAAAATTCAG GAAGGTTATC
GAACCAAGAG GATACAAAAT TTTCCCTGTA TCCGCGGCAT CCAACAAAGG ACTTAAGGAA
TTGATATATT ATGCCGCACA AAAACTTAAG GAGCTGCCCG ATACCGTTCT TGTAAATGAC
CAGGACAATG AAGTTGTATA TACGGCGGTT GAAGAGGAAC CCTTTAATAT CAGGAAGGAA
AATGGTGTTT TTGTGGTTGA AGGAAGCTGG GTACAAAGGC TGGTAAGATC AGTGAACTTT
GACAATTATG AATCTCTGCA GTATTTCCAA AGAGCCATAA GGAGGAAAGG AATTGTTGAC
GCCCTGGAAA GTATGGGAAT TAATGAAGGG GATACCGTAA GAATGTATGA CCTGGAATTT
GAATATTTTA GATAA
 
Protein sequence
MFVDRARIYI KAGDGGDGAI SFHREKYISK GGPDGGDGGK GGDVIFVVDE GLRTLQDFRY 
KTRYRAEDGQ NGGSSNCSGR SGEDLIIKVP PGTLVKDEQT GRILADLVKP GKKVVIAKGG
KGGAGNQHFA TPTRQVPSFA KPGEPGEELW VILELKLLAD VGLIGFPNVG KSTILSMVTA
AQPKIANYHF TTINPNLGVV NIDAENAFVM ADIPGLIEGA HQGVGLGHEF LKHIERTKLL
IHVVDISGSE GRDPVQDFEV INEELKKYNP VLCERPQIIA ANKMDVTGAE ENLEKFRKVI
EPRGYKIFPV SAASNKGLKE LIYYAAQKLK ELPDTVLVND QDNEVVYTAV EEEPFNIRKE
NGVFVVEGSW VQRLVRSVNF DNYESLQYFQ RAIRRKGIVD ALESMGINEG DTVRMYDLEF
EYFR