Gene Cthe_0185 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0185 
Symbol 
ID4808673 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp221399 
End bp224245 
Gene Length2847 bp 
Protein Length948 aa 
Translation table11 
GC content42% 
IMG OID640105596 
Productcell wall hydrolase/autolysin 
Protein accessionYP_001036619 
Protein GI125972709 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0860] N-acetylmuramoyl-L-alanine amidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00443483 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAGGG GTCTTTTCGT TCTGTTTGTT GCAATGGTTC TTGTATTCAA TTCTTTCACT 
ATGTGTGCAG AAGCGCTGAA GCTTACTTAT GACGGCAAAG TGCATGAGTA TACGGGAAAT
ATTTACACAT TGAAAGTCGA CGGTGAAGTT GTCAATTCCG ATTTGCCTCC AATTATAATT
AACGGACGTT CTTTAGTCCC TGTCAGAGCG ATTTTTGAAA AGCTCGGTGC AAAGGTCGGA
TGGGATGCGG CGAGTAAAAA AGTGACAATA TCATATGACA GCAACACAAT AGAGCTCAAA
ATAAATGATG TAAATGCCCT GGTAAACGGA ACCAAAGTAG CGATGGAAGT TCCGGCAAAG
ATTATAAATG ACAGGACTGT GGTTCCTGTG CGCTTTGTCG GAGAGCAGCT GGGCATGGAA
GTCGGATGGA ATGCGGAAAA AGGCGAGATT ACTTTGGACA GCGGAAAAAG CGTTGTTGCC
AGCATAAATG ATATCAATTA TAAAATTGAC GGAAACTTGC ATCAGGTAAA AATAGAGCTC
GATAAATGCA GGGAATACAA AATAATGAGA GTTGCCAATC CCGACAGAAT TGTTGTGGAT
TTTCCCAATA CGAAGCTTTC AAATGCCAAC ACAAACATTA GTGTGGGCAG TGAGCTTATA
AGTTCGATAC GCTGCGGAAA TCCCGATACA AAAACCGCAA GAGTGGTGCT CGATGTGGTT
GGCCAGCCCC AGTACATAGT AAAAGAAGAG GGAAGCAATG TTGTATTGAA CCTTCAAAAG
CCAAATACGG GTAGACCATC CGGAACTGTG GGCAGCAATA AACTTGATGT TGAACATGTC
AGTAAAACCG ACCATGACGA AGTATATATA AAATGCGGGA CTGCTCCTGA ATACAATTCT
TTTACTCTTT CCGATCCTGA AAGAATTGTG ATTGACGTAT CAGGAGCGTA TATTGAGGGC
GAAATTAAAA ATATTGAAAC AAAAGGCAAC CTGGTAAAGT CCGTAAGATG TGCAAACCAA
AGCGGTAATG TGGCAAGAGT GGTGGTTGAT CTTCAGCAAA AGTTGAACCA TAAAATCATA
AAATCGGGAG AATATCTTAT TGTATACATA TCCAGAGCAC CTATTTCGGA AAATCCTGCA
GTTTCTTTGC CCAGCCGCGG AGGCACCGGC AAAGATGAAG GGTCAAGGGA CAATATTCTT
TATGTTGCCT ATGAACCTGA CGGTCAAAAG GACAAAGTTG TTTTAAGCCT TGACAGTTAC
GAGAATTACA ATATTGTAAA GAATGTTGAA AAAAATAAAA TTATTATCGA CATACCCAAC
GCCATAGGTC CTTCTGAGGC AAAAACAATC AGCGTTGACA GTGATATGGT CGGCAGCGTC
AAATATGTCG GCTTTGACAA GTCGTATGCA CAGGTGGAGA TTGGGCTTAA AAGCAAAATT
CAATACGAAG TAATTGAAAA AGAAGGCAGA CTTGAACTGG TTCTTTCCAA AGCTGCCGAA
GTTTCGCCGT CGCCATCTCC AACACCTACG CCTACACCGA CTCCGACACA GACACCTACC
TCTACGCCGA CACCGACGCA GTCGCCGACT CCAACTCCGA CTCCGACCCC GGTGCAGGTG
GTGAAGGATG GTTCTCTTTC AATAGCTTAT AACGTTGCAT CCACTTACAG CAAAGTCATA
CTTGGGATAC AGAATTATAA GAATTATAAT GTAAACCGTA TTTCAGACCC TGATCGTATT
GTGATTGACA TAACGGGGGC AAATGTTGAG AAAACTGCGA ATACCGTTGA GATAAAGAAG
GGATTTATAG AGGCAATAAG GTATTCGCAA TATGAAGTGG GAGTTGTCAG GGTTGTTATT
GATGTCAAAG ACAATCCGAA GCACGATGTG AAAAAATCGG GAGACAAATT GGAGATATAC
CTTAAAGATT CAGTTTCAGG TCAAAATTAT AAAAATATAA AGTATGTCAA CAACATGGAC
AGAATACACT TCATCTTGCA AGGAGCAAAG CTTACGGAGG GCGGAGCCGA CCTTAAGAAG
TTTTATACTG AAAAGTATGA CCTCGGCGGA AAAAGATATA CAATAACATT CCCTTCAAAC
CTTGCTGATA TTGGCAGCGG CATAATGCAG ATAAACGACG GAATAGTAGA TTATGTCAAG
ATTACTCAAA ATCCTGACAC AAAACAAACA AGCATGGAAT TTAACACAAA AGAAGCCTAT
AGTTATTTGA TAATTACCCG GGGTGACGTG AACAATACAA CCATCACACT GCTGAAGAAA
GCATCCAGGG ACGACAAGCT TGTGGTCATT GACCCCGGAC ACGGAGGTTT GGAAACCGGT
GCGGTGTACG GAGACTGCTA TGAAAAGGAT TTCAACCTCG ATATTGCAAA GAGACTGAAC
GCACTTTTAA AGAGTAAAGG TGTCAAAACT TATATGATTC GTGAGGATGA CAGTTATGTG
GGCTTGTATG AGAGAGCATA TATTGCAAAT ACTCTCAACG CCACACTGTT TTTAAGCATA
CACAATAATG CATATAATAC CAAGTCTCAT GGAACTGAGA CACTGTATTA TCCGACACCG
GCAGGAGCTA CCGGATTTAC CAGCAAGAGG TTTGCGCAGA TTATTCAAAG CCGCCTTGTC
AGCAAGCTGA AGACAAAGGA CAGGGGAATT GTGGAAAGAC CGAATTTGGT AGTTCTTAAA
GCGACTAAAA TGCCGGCGGC TTTGGCGGAA GTTGCGTTTA TGGACAATAG CGAAGAACTG
CAAAAGCTTA AAACGGAAGA ATTCAGGCAA AAAGCGGCGG AAGCACTGTG TGAAGCCGTA
ATTCAGGCTT TGGCGGAAGT AGAATAG
 
Protein sequence
MKRGLFVLFV AMVLVFNSFT MCAEALKLTY DGKVHEYTGN IYTLKVDGEV VNSDLPPIII 
NGRSLVPVRA IFEKLGAKVG WDAASKKVTI SYDSNTIELK INDVNALVNG TKVAMEVPAK
IINDRTVVPV RFVGEQLGME VGWNAEKGEI TLDSGKSVVA SINDINYKID GNLHQVKIEL
DKCREYKIMR VANPDRIVVD FPNTKLSNAN TNISVGSELI SSIRCGNPDT KTARVVLDVV
GQPQYIVKEE GSNVVLNLQK PNTGRPSGTV GSNKLDVEHV SKTDHDEVYI KCGTAPEYNS
FTLSDPERIV IDVSGAYIEG EIKNIETKGN LVKSVRCANQ SGNVARVVVD LQQKLNHKII
KSGEYLIVYI SRAPISENPA VSLPSRGGTG KDEGSRDNIL YVAYEPDGQK DKVVLSLDSY
ENYNIVKNVE KNKIIIDIPN AIGPSEAKTI SVDSDMVGSV KYVGFDKSYA QVEIGLKSKI
QYEVIEKEGR LELVLSKAAE VSPSPSPTPT PTPTPTQTPT STPTPTQSPT PTPTPTPVQV
VKDGSLSIAY NVASTYSKVI LGIQNYKNYN VNRISDPDRI VIDITGANVE KTANTVEIKK
GFIEAIRYSQ YEVGVVRVVI DVKDNPKHDV KKSGDKLEIY LKDSVSGQNY KNIKYVNNMD
RIHFILQGAK LTEGGADLKK FYTEKYDLGG KRYTITFPSN LADIGSGIMQ INDGIVDYVK
ITQNPDTKQT SMEFNTKEAY SYLIITRGDV NNTTITLLKK ASRDDKLVVI DPGHGGLETG
AVYGDCYEKD FNLDIAKRLN ALLKSKGVKT YMIREDDSYV GLYERAYIAN TLNATLFLSI
HNNAYNTKSH GTETLYYPTP AGATGFTSKR FAQIIQSRLV SKLKTKDRGI VERPNLVVLK
ATKMPAALAE VAFMDNSEEL QKLKTEEFRQ KAAEALCEAV IQALAEVE