Gene Information |
Locus tag | Cthe_0033 |
Symbol | |
ID | 4808798 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 40454 |
End bp | 43120 |
Gene Length | 2667 bp |
Protein Length | 888 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640105442 |
Product | small GTP-binding protein |
Protein accession | YP_001036467 |
Protein GI | 125972557 |
COG category | [J] Translation, ribosomal structure and biogenesis; [R] General function prediction only |
COG ID | [COG0480] Translation elongation factors (GTPases); [COG3688] Predicted RNA-binding protein containing a PIN domain |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain; [TIGR00484] translation elongation factor EF-G |
|
|
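The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes 1-based, inclusive start and end coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 40454, End bp 43120.
length_bp = gene_length(40454, 43120)
print(length_bp, protein_length(length_bp))
```

With the values in this record, the sketch reproduces the listed Gene Length (2667 bp) and Protein Length (888 aa).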
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.991842 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGT TGGTAGTTGG AATATTGGCG CATGTTGACG CAGGAAAGAC AACATTATCA GAGGCCCTGC TGTATGTGAG CGGAAAAATA AGAAAATTGG GAAGAGTTGA TAACAAAGAT GCTTACCTGG ACACCTATGA ACTCGAAAGG GCCAGGGGCA TCACTATTTT TTCCAAGCAG GCCGTTTTTG AAGTGGGGGA CACCCGGATT ACCTTGCTGG ATACTCCGGG CCATGTGGAT TTTTCGGCTG AGATGGAGAG AACCCTTCAA GTGCTGGATT ATGCCATTTT AGTTGTCAGC GGTGCTGATG GAGTGCAGGG ACATACCAAA ACCCTGTGGC ATCTTCTTGA AATCTACAAG ATACCTGTGT TTATATTTGT CAATAAAATG GACCAGAACG GGACAAACAA GGATAAAGTA ATTAATGAAA TAAAAAAGCA GTTGGATGAC AGATGCATAG AATTTAGTGA AGAAAATCCA GAAGAGTTTT TTGACCGGCT GGCAATGTGT GATGAAATGA TAATGGAGAC GTATCTTGAG AAAGGACGGG TTGAAACCTC GCAGATTAGT GCTGCCGTAA AAGAACGAAA GGTATTTCCG TGTTTTTTCG GTTCGGCATT GAAATTGGAA GGTGTCGAGG AATTTATTCA CGGTCTTATG AAATATACTG TAGTCCCAAG TTACCCTAAT GAATTTGGTG CCAAGATATT TAAAATAACA AGGGACGAGC AGGGAAACCG TCTTACCCAC ATGAAGCTGA CCGGCGGGAA ACTCAAGGTA AGAGATGTTT TGACAAACGG CGTGTGGGAA GAAAAAGTGA ATCAAATCCG CATCTACTCC GGAGAAAAAT TTGAAGCGGT AAGCGAGGTG GATGCGGGCA CCGTGTTTGC AGTGACCGGC CTTACCCAAT CCAGACCGGG AGAAGGTCTT GGAATTGAGA AATCTTCAGG TGCGCCGCTG TTGGAGCCTG TGCTGCAGTA TCAAATCATA CTTCCGGAAG GTTGTGACCC AAGAGCGATG CTGCCCAAGC TTAGGCAGAT TGAAGATGAG GAACCGGAGC TTAACATTGT CTGGGATGAA CAGTTGCAGG AAATCCGGGT CCGGGTCATG GGAGAGGTAC AGATTGAAAT TCTGCAAAGC ATTATAAAAA GCCGTTTTGG AGTTGATGTT GCTTTTGACG ACGGAAGTAT AGTATATAAA GAGACTATTG CCAATACTGT TGAAGGAGTG GGGCATTTTG AGCCACTCAG ACACTATGCG GAAGTTCACT TGCTATTGAA GCCCGGAGAG AGGGGAAGCG GCCTTAAGTT TGATGTAAAC TGCAGTGAGG ATGTTTTGGC TAAAAGTTGG CAGAGACTGG TTTTAACCCA CCTTGAAGAA AAAGTTCATA AAGGGGTTTT GACGGGTTCG GCGATTACGG ATATGAAAAT TACTTTGGTA TCCGGAAGGG CGCACAACAA GCATACTCAG GGCGGCGACT TCAGGGAGGC TACTTACCGT GCGGTGCGTC AGGGTTTGAA AGAAGCGGAA TCCATACTGC TTGAGCCCTA TTATGACTTT CAGCTGGAAG TGCCGGAAAA AATGGTGGGA AGAGCCATGA TGGATATTGA GAAAATGCAC GGCACATGTG AGATATCCCA GATAAACGGC GATATGGCAG TTTTGGTTGG CAGCGCTCCC GTTGTCACCA TGAGGAATTA TCAGAAAGAG GTTGTGGCAT ATACCAAAGG TCTTGGCAGA CTGTTTTGCA GTTTTAAAGG GTATGAGCCG TGCCATAATG CGCAGGAAGT TATAGAGCGC 
ATAGGATATG ATTCGGAAAG GGATGTGGAA AATCCCACAG GTTCCGTATT TTGCGCCAAT GGTGTCAGTT TTTTGGTAAG TTGGGATGAA GTAAAGAATT ACATGCATGT GGAGAGCTAT TTTCAGAAAA AAGAGGATGA AGAAAATTTA AACCAAAACC GGCCTGTGTA TTCGGGAGAA AAAGCGATAA GCCTGGAAGA AATTGATCAA ATTATGAATA AAACCTTTTA TGCAAACCAG GGAAGGAAAT CTGCATGGAA GAGGCAAAAG GCATCGGAAG AAAGGTATTA CAAACCTGCA GCCGGTGCAA AAAGGCATGA GGCAAAAGAA GAGTATCTTT TGGTGGACGG ATATAACATC ATCTATGCAT GGCCGGAGTT AAAAAAGCTT GCCGATGAGA ATTTGGACGG CGCAAGAATG AAATTGCTTG ATATGCTGAG CAATTACCAA TGGATTCGAA GATGTCATGT CATTGTCGTA TTTGATGCTT ACCGTGTTGA GGGACATAAA GAAGAGATAA TAGATTATTA TAATATCCAT GTGGTTTATA CGAGAGAGGC TCAAACAGCG GATCAGTATA TTGAGAAATT TGCCTATGAA AACAGTAAAA ACTATGATAT CACTGTTGCC ACATCCGACG GATTGCAGCA GATAATTGTA AGGGGAGAAG GATGCGCACT GCTGTCCGCA AGGGAGCTTA AGGCTGAGCT TGAAGCTGCC AATGAAAGAA TAAAACAGGA ATACCATAAA ATGAATAAAA TAGACCGCAA TTATTTGGGC GATGCTTTGT CTACGAAAGC AAAACGGCAT ATGGAAGATT TGATTGAAGA AGAAAATAAG AATAACGGAA TTAAAAAAGA TAAATAA
|
Protein sequence | MKKLVVGILA HVDAGKTTLS EALLYVSGKI RKLGRVDNKD AYLDTYELER ARGITIFSKQ AVFEVGDTRI TLLDTPGHVD FSAEMERTLQ VLDYAILVVS GADGVQGHTK TLWHLLEIYK IPVFIFVNKM DQNGTNKDKV INEIKKQLDD RCIEFSEENP EEFFDRLAMC DEMIMETYLE KGRVETSQIS AAVKERKVFP CFFGSALKLE GVEEFIHGLM KYTVVPSYPN EFGAKIFKIT RDEQGNRLTH MKLTGGKLKV RDVLTNGVWE EKVNQIRIYS GEKFEAVSEV DAGTVFAVTG LTQSRPGEGL GIEKSSGAPL LEPVLQYQII LPEGCDPRAM LPKLRQIEDE EPELNIVWDE QLQEIRVRVM GEVQIEILQS IIKSRFGVDV AFDDGSIVYK ETIANTVEGV GHFEPLRHYA EVHLLLKPGE RGSGLKFDVN CSEDVLAKSW QRLVLTHLEE KVHKGVLTGS AITDMKITLV SGRAHNKHTQ GGDFREATYR AVRQGLKEAE SILLEPYYDF QLEVPEKMVG RAMMDIEKMH GTCEISQING DMAVLVGSAP VVTMRNYQKE VVAYTKGLGR LFCSFKGYEP CHNAQEVIER IGYDSERDVE NPTGSVFCAN GVSFLVSWDE VKNYMHVESY FQKKEDEENL NQNRPVYSGE KAISLEEIDQ IMNKTFYANQ GRKSAWKRQK ASEERYYKPA AGAKRHEAKE EYLLVDGYNI IYAWPELKKL ADENLDGARM KLLDMLSNYQ WIRRCHVIVV FDAYRVEGHK EEIIDYYNIH VVYTREAQTA DQYIEKFAYE NSKNYDITVA TSDGLQQIIV RGEGCALLSA RELKAELEAA NERIKQEYHK MNKIDRNYLG DALSTKAKRH MEDLIEEENK NNGIKKDK
|
| |