Gene Cthe_2729 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2729 
Symbol 
ID4810231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3220322 
End bp3222415 
Gene Length2094 bp 
Protein Length697 aa 
Translation table11 
GC content43% 
IMG OID640108148 
Productelongation factor G 
Protein accessionYP_001039121 
Protein GI125975211 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0480] Translation elongation factors (GTPases) 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00484] translation elongation factor EF-G 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.255792 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCAGGC AATTTAGTTT AGAGAATACA AGAAATATTG GAATAATGGC TCACATAGAT 
GCGGGTAAAA CCACAACAAC TGAACGTATC CTGTTTTATA CCGGTAGAGT TCATAAAATA
GGAGAAACCC ATGAAGGTTC AGCAACCATG GACTGGATGG AACAGGAACA GGAAAGAGGT
ATAACCATAA CTTCAGCTGC TACTACCGCC CAGTGGAAAG GTACAAGAAT AAATATAATT
GATACACCAG GGCACGTTGA TTTTACAGTT GAAGTTGAAA GATCTCTTCG TGTCCTTGAC
GGAGCTGTTG CGGTTTTCTG TGCCAAGGGA GGAGTTGAGC CACAGTCTGA AACCGTATGG
AGACAGGCTG ACAAATACAA AGTTCCCCGT ATGGCATACG TCAACAAGAT GGATATAATG
GGAGCTGACT TTTTCAACTG TATCAAAATG ATGAAGGAAA GACTTCAGGC AAATCCGGTT
CCCATCCAGC TTCCGATAGG TAAAGAAGAT AATTTTCAGG GAATCATAGA CCTTATAGAA
ATGAAAGCTT ATTACTACAT GGACGATTTG GGAAAAGTTA TTGAACAAAG GGATATTCCT
GAGGATATGA GAGAACTGGC CGAGGAATAT CGTACAAATC TCCTTGAAAA TGTTGCAGAA
TATGACGAAG AGCTCATGAT GAAGTATCTT GAAGGTGAAG AAATTACAGA AGCCGAGATA
AAGGCGGCTT TAAGAAAAGG TACCATTGCC GTAAAGGCAA TACCTGTACT CTGTGGTTCT
TCATACAAAA ACAAAGGAGT TCAGCGTCTT CTTGATGCAA TTGTGGATTA TATGCCTTCT
CCTGTTGACA TTGAAGCCAT AAAAGGTGTG TCTGTCGACG GAGAGACCGA AATTGAAAGA
CATGCCAGTG ATGATGAACC GTTCTCGGCA TTGGCATTTA AGATTATGTC CGACCCGTAT
GTCGGTAAAC TCTGCTTTTT TAGGGTTTAC TCGGGAAAGC TCAGTTCCGG TTCCTATGTT
CTCAATGCCA CTAAAGGCAA GAGAGAAAGA ATAGGAAGGC TGCTGATGAT GCATGCCAAC
CACAGGGAAG AAGTCGACAT GGTTTATGCC GGTGATATAG CGGCGGCAGT TGGATTAAAG
GAAACTACAA CAGGAGATAC TCTCTGTGAT GAGGCAAATC CTGTTATTCT TGAATCCATG
AACTTCCCGG AACCGGTTAT CCATGTTGCC ATTGAGCCTA AGACAAAGGC CGGACAGGAG
AAAATGGCTC TGGCGCTTCA AAAATTGGCT GAAGAGGACC CAACGTTCAG GACATATACT
GATCAGGAAA CCGGACAGAC AATAATAGCC GGTATGGGAG AGCTTCACCT TGAAATAATT
GTTGACCGTC TTTTAAGAGA ATTCAAGGTG GAAGCAAATG TCGGTAATCC GCAGGTTGCT
TACAAGGAAA CAATCAGAAA GTCTGTCAAA TCAGAAGGTA AATATATCAG ACAGTCCGGT
GGTAAAGGTC AGTACGGTCA CTGCTGGATA GAAATCGAGC CTAAGGAACG TGGAACAGGA
TATGAATTTG TCAACAAAAT CGTCGGAGGT GTTATTCCGA AAGAATATAT CCCGGCGGTT
GACGCAGGTA TCCAAAGTGC CATGAACAAC GGTGTTCTGG CGGGATATCC GGTTGTTGAC
GTTAAAGTAA CCTTGTACGA CGGTTCATAC CATGAGGTTG ACTCTTCGGA AATGGCGTTT
AAAGTTGCCG CTTCCATGGC TTTCAAAGAA GGTATGAAAA AAGCCGATCC TGTGATTCTC
GAGCCCATAA TGAAAGTAGT TGTCACAGTT CCTGAAGACT ACATGGGCGA CGTTATAGGC
GACCTTAACT CGAGAAGAGG AAGAATTGAA GGAATGGAAG CAAGAGCGGG AGCACAGGTT
ATACATGCTT ATGTTCCTTT GGCGGAGATG TTTGGATATG CTACGGCCCT GCGTTCAAGA
TCACAGGGTA GGGGCGTATT CTCAATGGAA ATAAGCCATT TTGAGGAAGT GCCGAAAAAC
ATTCAGGAGC AGATAATAAG TGGAAGAGCT AAAAATAACA GTAGCGATGA ATAA
 
Protein sequence
MPRQFSLENT RNIGIMAHID AGKTTTTERI LFYTGRVHKI GETHEGSATM DWMEQEQERG 
ITITSAATTA QWKGTRINII DTPGHVDFTV EVERSLRVLD GAVAVFCAKG GVEPQSETVW
RQADKYKVPR MAYVNKMDIM GADFFNCIKM MKERLQANPV PIQLPIGKED NFQGIIDLIE
MKAYYYMDDL GKVIEQRDIP EDMRELAEEY RTNLLENVAE YDEELMMKYL EGEEITEAEI
KAALRKGTIA VKAIPVLCGS SYKNKGVQRL LDAIVDYMPS PVDIEAIKGV SVDGETEIER
HASDDEPFSA LAFKIMSDPY VGKLCFFRVY SGKLSSGSYV LNATKGKRER IGRLLMMHAN
HREEVDMVYA GDIAAAVGLK ETTTGDTLCD EANPVILESM NFPEPVIHVA IEPKTKAGQE
KMALALQKLA EEDPTFRTYT DQETGQTIIA GMGELHLEII VDRLLREFKV EANVGNPQVA
YKETIRKSVK SEGKYIRQSG GKGQYGHCWI EIEPKERGTG YEFVNKIVGG VIPKEYIPAV
DAGIQSAMNN GVLAGYPVVD VKVTLYDGSY HEVDSSEMAF KVAASMAFKE GMKKADPVIL
EPIMKVVVTV PEDYMGDVIG DLNSRRGRIE GMEARAGAQV IHAYVPLAEM FGYATALRSR
SQGRGVFSME ISHFEEVPKN IQEQIISGRA KNNSSDE