Gene Information |
Locus tag | Cthe_0185 |
Symbol | |
ID | 4808673 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 221399 |
End bp | 224245 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640105596 |
Product | cell wall hydrolase/autolysin |
Protein accession | YP_001036619 |
Protein GI | 125972709 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0860] N-acetylmuramoyl-L-alanine amidase |
TIGRFAM ID | |
|
|
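The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 221399
END_BP = 224245
GENE_LENGTH_BP = 2847
PROTEIN_LENGTH_AA = 948


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumed to be included).
    return gene_length_bp // 3 - 1


# Both checks pass for this record: 224245 - 221399 + 1 = 2847 and 2847 / 3 - 1 = 948.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Because the record lists the gene as unspliced and on the + strand, no exon joining or reverse complementing is needed for this check.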
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00443483 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAGGG GTCTTTTCGT TCTGTTTGTT GCAATGGTTC TTGTATTCAA TTCTTTCACT ATGTGTGCAG AAGCGCTGAA GCTTACTTAT GACGGCAAAG TGCATGAGTA TACGGGAAAT ATTTACACAT TGAAAGTCGA CGGTGAAGTT GTCAATTCCG ATTTGCCTCC AATTATAATT AACGGACGTT CTTTAGTCCC TGTCAGAGCG ATTTTTGAAA AGCTCGGTGC AAAGGTCGGA TGGGATGCGG CGAGTAAAAA AGTGACAATA TCATATGACA GCAACACAAT AGAGCTCAAA ATAAATGATG TAAATGCCCT GGTAAACGGA ACCAAAGTAG CGATGGAAGT TCCGGCAAAG ATTATAAATG ACAGGACTGT GGTTCCTGTG CGCTTTGTCG GAGAGCAGCT GGGCATGGAA GTCGGATGGA ATGCGGAAAA AGGCGAGATT ACTTTGGACA GCGGAAAAAG CGTTGTTGCC AGCATAAATG ATATCAATTA TAAAATTGAC GGAAACTTGC ATCAGGTAAA AATAGAGCTC GATAAATGCA GGGAATACAA AATAATGAGA GTTGCCAATC CCGACAGAAT TGTTGTGGAT TTTCCCAATA CGAAGCTTTC AAATGCCAAC ACAAACATTA GTGTGGGCAG TGAGCTTATA AGTTCGATAC GCTGCGGAAA TCCCGATACA AAAACCGCAA GAGTGGTGCT CGATGTGGTT GGCCAGCCCC AGTACATAGT AAAAGAAGAG GGAAGCAATG TTGTATTGAA CCTTCAAAAG CCAAATACGG GTAGACCATC CGGAACTGTG GGCAGCAATA AACTTGATGT TGAACATGTC AGTAAAACCG ACCATGACGA AGTATATATA AAATGCGGGA CTGCTCCTGA ATACAATTCT TTTACTCTTT CCGATCCTGA AAGAATTGTG ATTGACGTAT CAGGAGCGTA TATTGAGGGC GAAATTAAAA ATATTGAAAC AAAAGGCAAC CTGGTAAAGT CCGTAAGATG TGCAAACCAA AGCGGTAATG TGGCAAGAGT GGTGGTTGAT CTTCAGCAAA AGTTGAACCA TAAAATCATA AAATCGGGAG AATATCTTAT TGTATACATA TCCAGAGCAC CTATTTCGGA AAATCCTGCA GTTTCTTTGC CCAGCCGCGG AGGCACCGGC AAAGATGAAG GGTCAAGGGA CAATATTCTT TATGTTGCCT ATGAACCTGA CGGTCAAAAG GACAAAGTTG TTTTAAGCCT TGACAGTTAC GAGAATTACA ATATTGTAAA GAATGTTGAA AAAAATAAAA TTATTATCGA CATACCCAAC GCCATAGGTC CTTCTGAGGC AAAAACAATC AGCGTTGACA GTGATATGGT CGGCAGCGTC AAATATGTCG GCTTTGACAA GTCGTATGCA CAGGTGGAGA TTGGGCTTAA AAGCAAAATT CAATACGAAG TAATTGAAAA AGAAGGCAGA CTTGAACTGG TTCTTTCCAA AGCTGCCGAA GTTTCGCCGT CGCCATCTCC AACACCTACG CCTACACCGA CTCCGACACA GACACCTACC TCTACGCCGA CACCGACGCA GTCGCCGACT CCAACTCCGA CTCCGACCCC GGTGCAGGTG GTGAAGGATG GTTCTCTTTC AATAGCTTAT AACGTTGCAT CCACTTACAG CAAAGTCATA CTTGGGATAC AGAATTATAA GAATTATAAT GTAAACCGTA TTTCAGACCC TGATCGTATT GTGATTGACA TAACGGGGGC AAATGTTGAG AAAACTGCGA ATACCGTTGA GATAAAGAAG 
GGATTTATAG AGGCAATAAG GTATTCGCAA TATGAAGTGG GAGTTGTCAG GGTTGTTATT GATGTCAAAG ACAATCCGAA GCACGATGTG AAAAAATCGG GAGACAAATT GGAGATATAC CTTAAAGATT CAGTTTCAGG TCAAAATTAT AAAAATATAA AGTATGTCAA CAACATGGAC AGAATACACT TCATCTTGCA AGGAGCAAAG CTTACGGAGG GCGGAGCCGA CCTTAAGAAG TTTTATACTG AAAAGTATGA CCTCGGCGGA AAAAGATATA CAATAACATT CCCTTCAAAC CTTGCTGATA TTGGCAGCGG CATAATGCAG ATAAACGACG GAATAGTAGA TTATGTCAAG ATTACTCAAA ATCCTGACAC AAAACAAACA AGCATGGAAT TTAACACAAA AGAAGCCTAT AGTTATTTGA TAATTACCCG GGGTGACGTG AACAATACAA CCATCACACT GCTGAAGAAA GCATCCAGGG ACGACAAGCT TGTGGTCATT GACCCCGGAC ACGGAGGTTT GGAAACCGGT GCGGTGTACG GAGACTGCTA TGAAAAGGAT TTCAACCTCG ATATTGCAAA GAGACTGAAC GCACTTTTAA AGAGTAAAGG TGTCAAAACT TATATGATTC GTGAGGATGA CAGTTATGTG GGCTTGTATG AGAGAGCATA TATTGCAAAT ACTCTCAACG CCACACTGTT TTTAAGCATA CACAATAATG CATATAATAC CAAGTCTCAT GGAACTGAGA CACTGTATTA TCCGACACCG GCAGGAGCTA CCGGATTTAC CAGCAAGAGG TTTGCGCAGA TTATTCAAAG CCGCCTTGTC AGCAAGCTGA AGACAAAGGA CAGGGGAATT GTGGAAAGAC CGAATTTGGT AGTTCTTAAA GCGACTAAAA TGCCGGCGGC TTTGGCGGAA GTTGCGTTTA TGGACAATAG CGAAGAACTG CAAAAGCTTA AAACGGAAGA ATTCAGGCAA AAAGCGGCGG AAGCACTGTG TGAAGCCGTA ATTCAGGCTT TGGCGGAAGT AGAATAG
|
Protein sequence | MKRGLFVLFV AMVLVFNSFT MCAEALKLTY DGKVHEYTGN IYTLKVDGEV VNSDLPPIII NGRSLVPVRA IFEKLGAKVG WDAASKKVTI SYDSNTIELK INDVNALVNG TKVAMEVPAK IINDRTVVPV RFVGEQLGME VGWNAEKGEI TLDSGKSVVA SINDINYKID GNLHQVKIEL DKCREYKIMR VANPDRIVVD FPNTKLSNAN TNISVGSELI SSIRCGNPDT KTARVVLDVV GQPQYIVKEE GSNVVLNLQK PNTGRPSGTV GSNKLDVEHV SKTDHDEVYI KCGTAPEYNS FTLSDPERIV IDVSGAYIEG EIKNIETKGN LVKSVRCANQ SGNVARVVVD LQQKLNHKII KSGEYLIVYI SRAPISENPA VSLPSRGGTG KDEGSRDNIL YVAYEPDGQK DKVVLSLDSY ENYNIVKNVE KNKIIIDIPN AIGPSEAKTI SVDSDMVGSV KYVGFDKSYA QVEIGLKSKI QYEVIEKEGR LELVLSKAAE VSPSPSPTPT PTPTPTQTPT STPTPTQSPT PTPTPTPVQV VKDGSLSIAY NVASTYSKVI LGIQNYKNYN VNRISDPDRI VIDITGANVE KTANTVEIKK GFIEAIRYSQ YEVGVVRVVI DVKDNPKHDV KKSGDKLEIY LKDSVSGQNY KNIKYVNNMD RIHFILQGAK LTEGGADLKK FYTEKYDLGG KRYTITFPSN LADIGSGIMQ INDGIVDYVK ITQNPDTKQT SMEFNTKEAY SYLIITRGDV NNTTITLLKK ASRDDKLVVI DPGHGGLETG AVYGDCYEKD FNLDIAKRLN ALLKSKGVKT YMIREDDSYV GLYERAYIAN TLNATLFLSI HNNAYNTKSH GTETLYYPTP AGATGFTSKR FAQIIQSRLV SKLKTKDRGI VERPNLVVLK ATKMPAALAE VAFMDNSEEL QKLKTEEFRQ KAAEALCEAV IQALAEVE
|
| |
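The relationship between the gene sequence and the protein sequence above can be sketched in a few lines. This is an illustrative example, not the database's own pipeline: it assumes the sequence is pasted with the space-separated 10-base blocks shown here, and it uses the codon assignments of NCBI translation table 11, which are the same as those of the standard code. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter for this gene because it starts with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence give the first protein block.
print(translate("ATGAAGAGGG GTCTTTTCGT TCTGTTTGTT"))  # MKRGLFVLFV
# Last blocks of the gene sequence, in frame, end at the TAG stop codon.
print(translate("ATTCAGGCTT TGGCGGAAGT AGAATAG"))  # IQALAEVE
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` applied to the same string can be compared with the GC content field.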