Gene Cthe_0714 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0714 
Symbol 
ID4810332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp869041 
End bp871125 
Gene Length2085 bp 
Protein Length694 aa 
Translation table11 
GC content38% 
IMG OID640106131 
Product4-hydroxy-3-methylbut-2-enyl diphosphate reductase/S1 RNA-binding domain protein 
Protein accessionYP_001037142 
Protein GI125973232 
COG category[I] Lipid transport and metabolism
[J] Translation, ribosomal structure and biogenesis
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0539] Ribosomal protein S1
[COG0761] Penicillin tolerance protein 
TIGRFAM ID[TIGR00216] (E)-4-hydroxy-3-methyl-but-2-enyl pyrophosphate reductase (IPP and DMAPP forming)
[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000369991 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGAGATAA TAGTTGCTAA ATCAGCCGGT TTTTGTTTCG GGGTAAGTAA CGCAGTTAAA 
ACCGTCAATA ATCTTCTGGA AAATCAAAAA GAGCCCATAT ACACATACGG TCCTATTATT
CACAATGCGC AAGTAGTAGA TCTTTTTACT TCCAAAGGTG TTAAAAAAAT TGATGATATT
GACGAAGCAG AGCCCAACGG TCATATTGTA ATCAGGGCTC ATGGTGTTAC ACCGGACATT
TATAAAAAAA TTTCAGACAA GGGGTTGATT TTGGAAGATG CTACATGTCC ATACGTAAAA
AAAATTCACA ACCTTGTAAA GGAGAAAAGT GAGGAAGGCT ACAAGATAAT AATTGTAGGC
GACAGGAATC ATCCTGAAGT TATAGGGATA AATGGATGGT GTAACAACCA GGCATATATA
GTTGACAGTG TTGACGATGT AGAAAAATTT CCCACAAGCG ACGAAAAAGT TTGCGTAGTT
GCACAAACAA CGATAACAAA CGAAAAGTGG CTCGAGGTAA ACACGGCATT AAAAAAGAAA
TTCAAAAATA TTTTAAAATT TGATACAATA TGTAGTGCAA CCAGCAGAAG ACAAAACGAA
GCAGAAGAAA TAGCAAAAAA TGTTGATATG ATGATTATTA TCGGGGGAAA AAACAGTTCA
AACACCCAAA AGCTTTATGA CATATGTAAA AAGCATTGCA ATCTGACGTA TAAAATTGAG
ACTTCCGGTG ATTTGCCACC GGTAGACATA AAAAAAATAA AAAAAGTAGG TATTTCTGCC
GGAGCTTCTA CGCCGGACTG GGTAATTGAG GAGGTTATTA AAAAAATGAG TGAATTAAAC
AAACAAGGAA TGGTCGATAT TTTAGAAAAT GATGGTGAAA TAGATTTCGC CAATGCTTTT
GAGAATTCGT TTGTCAGAAT ACATGCAGGA GACACTGTAA AGGGAAAAAT AATTGGCTTT
AACAGCAATG AGGTTTTCGT TGACTTGGGT TACAAGGCTG ACGGCATAAT TCCTCTCGAA
GAGTACACTG ACGACCCTAA CTTCAATATT GAAAAGGAAG TAAAGATTGG AGAAGAAGTC
GAAGTTCTCG TTGAAATGGT CAATGACGGC GAAGGCAACG TAAGACTCTC AAAGAGGAAA
GTGGATGCCA TAAAATCTTG GGATGACATA GTTAAAGCCT ATGAAAACAA GACTCCTGTA
AACGCTTATG TTGTTGAAGT TGTAAAGGGG GGCGTAATTG CCAGCTACAA GGGCGTAAGA
ATATTCGTTC CTGCTTCACA GGTAAGCGAT AGATATGTAA AGGATTTGAA CGAATTCCTG
AAGAGAAGCA TAACTGTCAG AATTTTAGAG CTTAATGAAA AAAGACGTAA AGTGGTAGGT
TCCGCAAGGG TGATTATCGA AGAGGAAAAG GAAGCTCTTG CCAACAGGAC ATGGAACAGC
ATGGAAGTTG GAAAAGTATT CAAAGGTACT GTAAAAAGTC TTACTGATTT CGGTGCTTTT
GTTGATATAG GCGGAGTTGA CGGACTTATC CATATTTCAG AACTTTCATG GACAAGAGTC
AAACATCCGT CTGAAGTATT AAAAGTCGGA GATGAAGTTG AAGTAACAGT TTTGGAGTTT
GACAAAGAAA AGAAAAAAGT ATCTTTGGGA TATAGAAAGA TGGAAGACAA TCCATGGTAT
AAAATAGAAG AAAAATACAA AGTTGGTGAT GTTGTCAAGG TAACCGTTCT CCGTTTTGCT
CCCTTTGGTG CTTTTGTTGA GCTGGAAAAA GGTGTGGACG GATTGGTTCA CATATCCCAG
ATATCTTCAA AGAGACTGGC AAAAGTTGAA GATGCTCTTG AGATTGGTAT GAAAGTTGAC
GCAAAAATAA TTGAAGTAGA CGGCGAAAAC AAGAAAATCA GCCTTAGTAT TAAAGAGGTT
ATGCCGATAG ATCCTCCTTC ATCAAAGAGT GAATCCAAAG CAAAGGACGG ATCCGAAGCA
AAAGAAACCT CTGCCAATAC TGAAGAGGAA GCTGAACCGA CAGAGCATAG GGAAGACATG
AACGTAACAG TTGAAGATTT GGTATCAAAA ACAACGCAAT CTTAA
 
Protein sequence
MEIIVAKSAG FCFGVSNAVK TVNNLLENQK EPIYTYGPII HNAQVVDLFT SKGVKKIDDI 
DEAEPNGHIV IRAHGVTPDI YKKISDKGLI LEDATCPYVK KIHNLVKEKS EEGYKIIIVG
DRNHPEVIGI NGWCNNQAYI VDSVDDVEKF PTSDEKVCVV AQTTITNEKW LEVNTALKKK
FKNILKFDTI CSATSRRQNE AEEIAKNVDM MIIIGGKNSS NTQKLYDICK KHCNLTYKIE
TSGDLPPVDI KKIKKVGISA GASTPDWVIE EVIKKMSELN KQGMVDILEN DGEIDFANAF
ENSFVRIHAG DTVKGKIIGF NSNEVFVDLG YKADGIIPLE EYTDDPNFNI EKEVKIGEEV
EVLVEMVNDG EGNVRLSKRK VDAIKSWDDI VKAYENKTPV NAYVVEVVKG GVIASYKGVR
IFVPASQVSD RYVKDLNEFL KRSITVRILE LNEKRRKVVG SARVIIEEEK EALANRTWNS
MEVGKVFKGT VKSLTDFGAF VDIGGVDGLI HISELSWTRV KHPSEVLKVG DEVEVTVLEF
DKEKKKVSLG YRKMEDNPWY KIEEKYKVGD VVKVTVLRFA PFGAFVELEK GVDGLVHISQ
ISSKRLAKVE DALEIGMKVD AKIIEVDGEN KKISLSIKEV MPIDPPSSKS ESKAKDGSEA
KETSANTEEE AEPTEHREDM NVTVEDLVSK TTQS