Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2617 |
Symbol | |
ID | 4809039 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3095742 |
End bp | 3096662 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640108031 |
Product | peptidase M23B |
Protein accession | YP_001039010 |
Protein GI | 125975100 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0739] Membrane proteins related to metalloendopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGATTTTA AAAAATTGTT TCCTGACAAG AAAATTTCAC AAAAGAAAGT GTTGGATTTT CTCGATAAAA AAGGATTCTA TATTGTTTTG GCTTTGTGTA TAACTGCAAT TGCAGCTACA GCCGCAGTAA TTACAGCTCA TAACATACGC TCATCGAGAA GTATTGGTGA GGAAGATATA ATCTCGCCTG ATATGGCAAG CAGTATAATG GATGAAAACG GAAATAGCAG TAGTTACTTG TTACCGCAGG ACGGTTTTGA AAATGAAGAT GCGGCAGAAA CACTCGGAAA GGCATCCGAA GCAGTAAAGC CGACCGAACC TGCTGCAAGT CCGAAAGTCC AGGAAAGTCC AAAACCTTCG GACAAGCCTC AGAAAACAGA AACATCAACA AAAACGGAAA ACAAGGACGA GAAAACTCAA AACACAAAAT CATCGGGAAA AACTGAAAGC AAATCCGCAA ACAAATCCGA AGGCAAGACT GAGAGTGGCA AGAAAGTTAC ATTTGTAATG CCTGTTTATG GAGAAGTGAC TTTTGAATAT GCTATGGACA GGCTGGTTTA TTCAAAAACT TTGGATGAAT GGCGCGCTCA CAGTGGTGTG GATTTGAGAG CGGACAGAGG TACTCCTGTC AAAGTTGTTG CCGACGGTGT GGTAACTGAA GTTAAAAACG ATCCCAGGTT TGGTGTGACT GTAATTGTTG AGCACGAAAA CGGCCTGAAA ACAGTGTATG CAAATCTTGC CAGCGGAGAC ATGGTAACTC CGAACCAGAA GGTCAAACAG GGTGAAATAA TCGGAAGCAT AGGGAATACC GCAATTATTG AATCAGCAGA ACCTGCACAC CTTCACTTTG AAGTTTTGAA AGACAATAAA CCTGTTGACC CGAAGGATTA TCTCCAGTTG CCTTCGGATG GCAAAAAATA G
|
Protein sequence | MDFKKLFPDK KISQKKVLDF LDKKGFYIVL ALCITAIAAT AAVITAHNIR SSRSIGEEDI ISPDMASSIM DENGNSSSYL LPQDGFENED AAETLGKASE AVKPTEPAAS PKVQESPKPS DKPQKTETST KTENKDEKTQ NTKSSGKTES KSANKSEGKT ESGKKVTFVM PVYGEVTFEY AMDRLVYSKT LDEWRAHSGV DLRADRGTPV KVVADGVVTE VKNDPRFGVT VIVEHENGLK TVYANLASGD MVTPNQKVKQ GEIIGSIGNT AIIESAEPAH LHFEVLKDNK PVDPKDYLQL PSDGKK
|
| |