Gene Cthe_0033 details


Gene Information

Locus tag: Cthe_0033
Symbol:
ID: 4808798
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 40454
End bp: 43120
Gene Length: 2667 bp
Protein Length: 888 aa
Translation table: 11
GC content: 43%
IMG OID: 640105442
Product: small GTP-binding protein
Protein accession: YP_001036467
Protein GI: 125972557
COG category: [J] Translation, ribosomal structure and biogenesis
              [R] General function prediction only
COG ID: [COG0480] Translation elongation factors (GTPases)
        [COG3688] Predicted RNA-binding protein containing a PIN domain
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
            [TIGR00484] translation elongation factor EF-G
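
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function name `cds_lengths` is hypothetical, and it assumes inclusive 1-based coordinates, an unspliced CDS (as the record states), and that the final codon is a stop codon that is not counted in the protein length.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes inclusive 1-based coordinates and a terminal stop codon
    that does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Start bp and End bp taken from the record above.
gene_len, prot_len = cds_lengths(40454, 43120)
print(gene_len, "bp;", prot_len, "aa")
```

With the record's Start bp and End bp this gives 2667 bp and 888 aa, matching the Gene Length and Protein Length fields.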


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.991842
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAGT TGGTAGTTGG AATATTGGCG CATGTTGACG CAGGAAAGAC AACATTATCA 
GAGGCCCTGC TGTATGTGAG CGGAAAAATA AGAAAATTGG GAAGAGTTGA TAACAAAGAT
GCTTACCTGG ACACCTATGA ACTCGAAAGG GCCAGGGGCA TCACTATTTT TTCCAAGCAG
GCCGTTTTTG AAGTGGGGGA CACCCGGATT ACCTTGCTGG ATACTCCGGG CCATGTGGAT
TTTTCGGCTG AGATGGAGAG AACCCTTCAA GTGCTGGATT ATGCCATTTT AGTTGTCAGC
GGTGCTGATG GAGTGCAGGG ACATACCAAA ACCCTGTGGC ATCTTCTTGA AATCTACAAG
ATACCTGTGT TTATATTTGT CAATAAAATG GACCAGAACG GGACAAACAA GGATAAAGTA
ATTAATGAAA TAAAAAAGCA GTTGGATGAC AGATGCATAG AATTTAGTGA AGAAAATCCA
GAAGAGTTTT TTGACCGGCT GGCAATGTGT GATGAAATGA TAATGGAGAC GTATCTTGAG
AAAGGACGGG TTGAAACCTC GCAGATTAGT GCTGCCGTAA AAGAACGAAA GGTATTTCCG
TGTTTTTTCG GTTCGGCATT GAAATTGGAA GGTGTCGAGG AATTTATTCA CGGTCTTATG
AAATATACTG TAGTCCCAAG TTACCCTAAT GAATTTGGTG CCAAGATATT TAAAATAACA
AGGGACGAGC AGGGAAACCG TCTTACCCAC ATGAAGCTGA CCGGCGGGAA ACTCAAGGTA
AGAGATGTTT TGACAAACGG CGTGTGGGAA GAAAAAGTGA ATCAAATCCG CATCTACTCC
GGAGAAAAAT TTGAAGCGGT AAGCGAGGTG GATGCGGGCA CCGTGTTTGC AGTGACCGGC
CTTACCCAAT CCAGACCGGG AGAAGGTCTT GGAATTGAGA AATCTTCAGG TGCGCCGCTG
TTGGAGCCTG TGCTGCAGTA TCAAATCATA CTTCCGGAAG GTTGTGACCC AAGAGCGATG
CTGCCCAAGC TTAGGCAGAT TGAAGATGAG GAACCGGAGC TTAACATTGT CTGGGATGAA
CAGTTGCAGG AAATCCGGGT CCGGGTCATG GGAGAGGTAC AGATTGAAAT TCTGCAAAGC
ATTATAAAAA GCCGTTTTGG AGTTGATGTT GCTTTTGACG ACGGAAGTAT AGTATATAAA
GAGACTATTG CCAATACTGT TGAAGGAGTG GGGCATTTTG AGCCACTCAG ACACTATGCG
GAAGTTCACT TGCTATTGAA GCCCGGAGAG AGGGGAAGCG GCCTTAAGTT TGATGTAAAC
TGCAGTGAGG ATGTTTTGGC TAAAAGTTGG CAGAGACTGG TTTTAACCCA CCTTGAAGAA
AAAGTTCATA AAGGGGTTTT GACGGGTTCG GCGATTACGG ATATGAAAAT TACTTTGGTA
TCCGGAAGGG CGCACAACAA GCATACTCAG GGCGGCGACT TCAGGGAGGC TACTTACCGT
GCGGTGCGTC AGGGTTTGAA AGAAGCGGAA TCCATACTGC TTGAGCCCTA TTATGACTTT
CAGCTGGAAG TGCCGGAAAA AATGGTGGGA AGAGCCATGA TGGATATTGA GAAAATGCAC
GGCACATGTG AGATATCCCA GATAAACGGC GATATGGCAG TTTTGGTTGG CAGCGCTCCC
GTTGTCACCA TGAGGAATTA TCAGAAAGAG GTTGTGGCAT ATACCAAAGG TCTTGGCAGA
CTGTTTTGCA GTTTTAAAGG GTATGAGCCG TGCCATAATG CGCAGGAAGT TATAGAGCGC
ATAGGATATG ATTCGGAAAG GGATGTGGAA AATCCCACAG GTTCCGTATT TTGCGCCAAT
GGTGTCAGTT TTTTGGTAAG TTGGGATGAA GTAAAGAATT ACATGCATGT GGAGAGCTAT
TTTCAGAAAA AAGAGGATGA AGAAAATTTA AACCAAAACC GGCCTGTGTA TTCGGGAGAA
AAAGCGATAA GCCTGGAAGA AATTGATCAA ATTATGAATA AAACCTTTTA TGCAAACCAG
GGAAGGAAAT CTGCATGGAA GAGGCAAAAG GCATCGGAAG AAAGGTATTA CAAACCTGCA
GCCGGTGCAA AAAGGCATGA GGCAAAAGAA GAGTATCTTT TGGTGGACGG ATATAACATC
ATCTATGCAT GGCCGGAGTT AAAAAAGCTT GCCGATGAGA ATTTGGACGG CGCAAGAATG
AAATTGCTTG ATATGCTGAG CAATTACCAA TGGATTCGAA GATGTCATGT CATTGTCGTA
TTTGATGCTT ACCGTGTTGA GGGACATAAA GAAGAGATAA TAGATTATTA TAATATCCAT
GTGGTTTATA CGAGAGAGGC TCAAACAGCG GATCAGTATA TTGAGAAATT TGCCTATGAA
AACAGTAAAA ACTATGATAT CACTGTTGCC ACATCCGACG GATTGCAGCA GATAATTGTA
AGGGGAGAAG GATGCGCACT GCTGTCCGCA AGGGAGCTTA AGGCTGAGCT TGAAGCTGCC
AATGAAAGAA TAAAACAGGA ATACCATAAA ATGAATAAAA TAGACCGCAA TTATTTGGGC
GATGCTTTGT CTACGAAAGC AAAACGGCAT ATGGAAGATT TGATTGAAGA AGAAAATAAG
AATAACGGAA TTAAAAAAGA TAAATAA
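
The sequence above is displayed in space-separated blocks of 10 bases, so it needs to be cleaned before any analysis. The following is a minimal sketch, not the database's own method: the helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first displayed row of the gene sequence rather than the full gene.

```python
def clean_sequence(text):
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed row of the gene sequence (60 bases), copied from above.
first_row = "ATGAAGAAGT TGGTAGTTGG AATATTGGCG CATGTTGACG CAGGAAAGAC AACATTATCA"
seq = clean_sequence(first_row)
print(len(seq), gc_percent(seq))
```

Passing the full gene sequence through the same two helpers would give a value to compare against the GC content field in the record.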
 
Protein sequence
MKKLVVGILA HVDAGKTTLS EALLYVSGKI RKLGRVDNKD AYLDTYELER ARGITIFSKQ 
AVFEVGDTRI TLLDTPGHVD FSAEMERTLQ VLDYAILVVS GADGVQGHTK TLWHLLEIYK
IPVFIFVNKM DQNGTNKDKV INEIKKQLDD RCIEFSEENP EEFFDRLAMC DEMIMETYLE
KGRVETSQIS AAVKERKVFP CFFGSALKLE GVEEFIHGLM KYTVVPSYPN EFGAKIFKIT
RDEQGNRLTH MKLTGGKLKV RDVLTNGVWE EKVNQIRIYS GEKFEAVSEV DAGTVFAVTG
LTQSRPGEGL GIEKSSGAPL LEPVLQYQII LPEGCDPRAM LPKLRQIEDE EPELNIVWDE
QLQEIRVRVM GEVQIEILQS IIKSRFGVDV AFDDGSIVYK ETIANTVEGV GHFEPLRHYA
EVHLLLKPGE RGSGLKFDVN CSEDVLAKSW QRLVLTHLEE KVHKGVLTGS AITDMKITLV
SGRAHNKHTQ GGDFREATYR AVRQGLKEAE SILLEPYYDF QLEVPEKMVG RAMMDIEKMH
GTCEISQING DMAVLVGSAP VVTMRNYQKE VVAYTKGLGR LFCSFKGYEP CHNAQEVIER
IGYDSERDVE NPTGSVFCAN GVSFLVSWDE VKNYMHVESY FQKKEDEENL NQNRPVYSGE
KAISLEEIDQ IMNKTFYANQ GRKSAWKRQK ASEERYYKPA AGAKRHEAKE EYLLVDGYNI
IYAWPELKKL ADENLDGARM KLLDMLSNYQ WIRRCHVIVV FDAYRVEGHK EEIIDYYNIH
VVYTREAQTA DQYIEKFAYE NSKNYDITVA TSDGLQQIIV RGEGCALLSA RELKAELEAA
NERIKQEYHK MNKIDRNYLG DALSTKAKRH MEDLIEEENK NNGIKKDK