Gene Cthe_0158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0158 
Symbol 
ID4808646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp197773 
End bp199266 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content41% 
IMG OID640105569 
Productribonuclease G 
Protein accessionYP_001036592 
Protein GI125972682 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1530] Ribonucleases G and E 
TIGRFAM ID[TIGR00757] ribonuclease, Rne/Rng family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTAGTG AGATTGTGGT AGATGTGGGG ATTAATGAAA AAAGAGTGGC TCTGTTGGAA 
GACAGGGAAC TGGTTGAATT ATTTATAGAG AGAAATGATT GTGAAAGACT TGTGGGCAAT
ATTTACAGAG GCAGAGTGGA AAGTGTCCTG CCTGGAATGC AGGCGGCTTT TATAGATATC
GGATATGAAA AGAATGCTTT TTTGTATGTA GGTGATGCCA TACCTCAGAA AGAATTTTCC
GAAGATGATG AAGAAATCTA TAGCGATGTT AAGAGTTACA ATATCGAAGA GATATTAAGA
CCCGGTCAGG AAATAACGGT ACAGGTAACC AAAGAACCCA TAGGAACAAA AGGCCCAAGG
GTAACTACCC ATATAACGTT GCCGGGAAGG CAGATGGTGC TTCTTCCCAA CGCCGATTAC
ATTGGCATTT CTAGGAGAAT TGAGGATGAG GAGGAACGGG CAAAGCTAAG AAAAATAGCC
GAAAAGATCA AACCAAAGAA TATGGGAATT ATTGTAAGGA CTGTTTCGGA AGGCAAACGG
GAAGAAGATT TTAAAAGTGA TTTGAATTTT TTGGTCAAAC TTTGGGCAAA AATAAAACAA
AGAGAACAGA GCGGACCGGT TCCCAGGTGT TTGCACAAAG ATTTGAGTGT AATTTACAGA
GCAGTCAGGG ACATCTTTAC ATGGAACATT GACAGGTTTG TTATTAATGA CCGGCAGGAG
TACAATAAGG TTCTTGAGCT TGTTGAAATG ATTTCGCCGG CTTTGAAAAT GAGAGTGGAG
TATTTCAACA AAAATATTGA TTTGTTTGAG TACTACCAGA TTGACAGCAT GATACAGAAG
GCATTGGCCA AAAAGGTCTG GTTAAAATGC GGAGGATATA TTGTAATCGA GAGAACGGAA
GCTCTTACGG TTATTGATGT GAACACCGGG AAGTATGTGG GGGTAAACAA TCTCGAAGAC
ACCGTTTTAA GGACCAATCT TGATGCGGTC AAAGAAATCG GGAAACAATT GAGGCTAAGA
GACATCGGAG GAATAATTAT TATTGATTTT ATCGACATGC ATGATCCGGA ACATCAAAAA
CAGGTACTGG AAGCTTTAAA GCAGGTATTG AAAAAGGATC GCACCAAAAC CACTGTTGTC
GGCATGACCG GTCTTGGCCT TATTGAGATG ACGAGGAAAA AGGTTAGGGA AGGCTTGGAG
TCAATGATGC TTCAGGATTG TCCTTATTGT GAAGGAAGGG GGAAAATACT TTCGCCCGAG
TCTGTGGCAA GAAATGTTGA GAAAGAGATA AGCAAATACT TTACAAAAAC AATAGCAAAT
GCTATCATGG TTGAGGTTCA TCCTACTGTG GCCGAGGTGT TGAGAGGAGA AGACAACGAC
AACCTTGCAA GAATTCAGAA TCTGTTTAAC AAAAAAGTCA TAATAAAACC CTCGGCGGAA
GTGGGACATG AGGAAGTGAA GGGAAGTTGT AAAATAAAAT GCCGTAATAT ATAG
 
Protein sequence
MVSEIVVDVG INEKRVALLE DRELVELFIE RNDCERLVGN IYRGRVESVL PGMQAAFIDI 
GYEKNAFLYV GDAIPQKEFS EDDEEIYSDV KSYNIEEILR PGQEITVQVT KEPIGTKGPR
VTTHITLPGR QMVLLPNADY IGISRRIEDE EERAKLRKIA EKIKPKNMGI IVRTVSEGKR
EEDFKSDLNF LVKLWAKIKQ REQSGPVPRC LHKDLSVIYR AVRDIFTWNI DRFVINDRQE
YNKVLELVEM ISPALKMRVE YFNKNIDLFE YYQIDSMIQK ALAKKVWLKC GGYIVIERTE
ALTVIDVNTG KYVGVNNLED TVLRTNLDAV KEIGKQLRLR DIGGIIIIDF IDMHDPEHQK
QVLEALKQVL KKDRTKTTVV GMTGLGLIEM TRKKVREGLE SMMLQDCPYC EGRGKILSPE
SVARNVEKEI SKYFTKTIAN AIMVEVHPTV AEVLRGEDND NLARIQNLFN KKVIIKPSAE
VGHEEVKGSC KIKCRNI