Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1858 |
Symbol | |
ID | 4809409 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2203047 |
End bp | 2204174 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640107277 |
Product | peptidase M23B |
Protein accession | YP_001038272 |
Protein GI | 125974362 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0739] Membrane proteins related to metalloendopeptidases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000219165 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGAAGG CAATATTGGT GGTTGTAGCG TTTGCGTTGA TTTTGTCGTC ATTGATGATA CCTGTATTTG CCAAAACCAT TTCCGATGTA CAAAAGGAGA AAAATACCGT TGACAGCAAA TTAAACAGCA TTACGAAACA AAAGAAAGAG GAAAAGCAAA AACTCAGTAA TATTGAGAGT GAAAAGAAAA AAATAGAGTC ACAGCAGGCG GAAAAGACCA GGGAATATAA TTCACTGAAT CAGCAGGTTG AAGAACTGAA CAAACATATA GAGGAAATAG ATGCCGCGAT AAAGGAAAGC GAAGACAGAT ACAACAAGCA GTTGGAACTG TTGAAAGTAA GAATCAATGT AATGTATCAG AATTCCGGAG CTACATATAT TCAAACACTG GCAGAATCGA AAAACTTTAT TGATTTTCTG AACAAACTTG AACTTGTTGC AGCCATAAGC AAAAGGGACA AAGAGATAAT TGAGGACCTC AAACAGGCTA AAGCGGATGT GGAGTTTAAA AAGAAACTGG CCGTTGAGAA GCGGGATACT GTTAAAGAGA AAGCGGAACA ATCGTTGAAG GCGTTAAATG AGCTCAGTGT CGCAAGATCA AAGCTTGACA GCCAAATAAA CAGTATAAAT GCCCAACTTA AGAAACTTGA ACAACAGGAA AATGAGTTGA TAAAACAGTC AAATGAACTT GCCGGTCAGA TAAGAAAACT TCAGCAAAGC GGAAGCTACG CCGGTGGAAC CATGCGCTGG CCTTTGCCTG GCAGTACCAA AATTTCATCT TACTTTGGCA ACAGGCTTCA TCCCATACTC AAAGTGTATA AGATGCATAC AGGTATAGAT ATTTCGGCAG CCACCGGAAC ATCGATAGTA GCCGCCAACA AAGGCGTGGT CATAATGTCA GGGTGGCAGA ACGGATACGG CTATACAGTG GTTGTGGACC ATGGAGGCGG AATTTCCACA TTGTATGCCC ATTGCAGCAA ACTGCTTGTC AAAGTGGGCG ATTCGGTTAA TGCCGGGGAT ACGATTGCAA AAGTAGGAAG CACCGGACTT GCCACCGGGC CTCACTTGCA CTTTGAGGTA AGAAAGAACG GCACTCCGGT AAATCCGCTC GACTATGTAA AGCCGTAA
|
Protein sequence | MKKAILVVVA FALILSSLMI PVFAKTISDV QKEKNTVDSK LNSITKQKKE EKQKLSNIES EKKKIESQQA EKTREYNSLN QQVEELNKHI EEIDAAIKES EDRYNKQLEL LKVRINVMYQ NSGATYIQTL AESKNFIDFL NKLELVAAIS KRDKEIIEDL KQAKADVEFK KKLAVEKRDT VKEKAEQSLK ALNELSVARS KLDSQINSIN AQLKKLEQQE NELIKQSNEL AGQIRKLQQS GSYAGGTMRW PLPGSTKISS YFGNRLHPIL KVYKMHTGID ISAATGTSIV AANKGVVIMS GWQNGYGYTV VVDHGGGIST LYAHCSKLLV KVGDSVNAGD TIAKVGSTGL ATGPHLHFEV RKNGTPVNPL DYVKP
|
| |