Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1757 |
Symbol | |
ID | 4810187 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2077401 |
End bp | 2078330 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640107170 |
Product | peptidase M23B |
Protein accession | YP_001038171 |
Protein GI | 125974261 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0739] Membrane proteins related to metalloendopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000249073 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT CAAGCCAGAG AAAAGAAAGT AAATATTTGT CAATAGTTGT TATACCTCAC AATACAGATG AGATAAAGGT ATTTAAAATC TCTTCTGTAA AGTACAAGTT GATGGCTCTC GGTGCAATTC TGCTGACCAT GTTTATATGC TCCGGGATTT TGATTACATA TTTGATATAC GAAAACCGTG TGTTAAATGA AGACAGAAAA AAGGTTATTG CCGTCAACAA CCAGCAAATG AACTTGATTA CCGAGAGAGA GGAAATCATT CAGTCCTATA TCAGGAAAGT TGAAGAGCAG GACAAGCTAA TCAAAAATTT CACAACTCTC TACAGGGATA TGACCGCAAA ATACATCGAC GGCAGCATGG AAAACGTTAC AGCTTCAAGA TCCGGTCTTA GAGATGATCG TGCCTTTATA AACGATATAA ACAAACTCAA AGGCATTTTA GACAAACTCG AAGAAATTAA CGGTGATGAC GCTGATATTC TTAGCAGTTT AAGCGAAACC CAATCCAAGC TGAAACAATA CATAGACTCC ATTCCGACTC TCTGGCCGGC TTCCGGAAGA ATCAGTTCTC GTTTCGGAAC CCGCTCGGAT CCTTTTAACT TCTCCCAGAA AGTACATGAA GGCATCGATA TTGCAGCAGA CTATGGTACA ACAATATTGG CTTCAGCCAC TGGAAAAGTT ACTCTTTCAG ACTGGTACGG CAATTACGGC AAATGTGTCA TTATCGACCA CGGGTACGGT CTTAGCACAC TTTACGGCCA TTGCCAGACA CTTCTGGTCA AAGAGGGACA AACTGTCAAA AAGGGAGATA AAATAGCGAC GGTCGGCAGC ACCGGAAGAA GCACAGGTCC CCATCTGCAT TTTGAAGTAA GACTCAACGG TGTTCCTGTA GACCCGCTTC AGTATCTGGA CAACAAATAG
|
Protein sequence | MKKSSQRKES KYLSIVVIPH NTDEIKVFKI SSVKYKLMAL GAILLTMFIC SGILITYLIY ENRVLNEDRK KVIAVNNQQM NLITEREEII QSYIRKVEEQ DKLIKNFTTL YRDMTAKYID GSMENVTASR SGLRDDRAFI NDINKLKGIL DKLEEINGDD ADILSSLSET QSKLKQYIDS IPTLWPASGR ISSRFGTRSD PFNFSQKVHE GIDIAADYGT TILASATGKV TLSDWYGNYG KCVIIDHGYG LSTLYGHCQT LLVKEGQTVK KGDKIATVGS TGRSTGPHLH FEVRLNGVPV DPLQYLDNK
|
| |