Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0734 |
Symbol | |
ID | 4810352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 891758 |
End bp | 892879 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640106151 |
Product | peptidase M23B |
Protein accession | YP_001037162 |
Protein GI | 125973252 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0739] Membrane proteins related to metalloendopeptidases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0522246 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAAGGC TTTTTATTAT TGTACTGATA GCGTCTTTGA CGGCGGCTTT GATTTTTCCT GTTATGGCTG ACAATAATTC TTTAAAAGAT AAAATGAGTG AAATAGACGA TAAATTGAAT GATATTAGCA AGCAAAAAGT AGAGATAGAT AAAGAAAAAA AGGAACTTGA ACGTGAGAAA AAAGAATTAA TAAATGCCGA AAATGAAGCA AACTTGGAGT ACCAGAATTT GGTTTCTGAA CTGGAGGCAT TGGACAGTCA AATAGAAAGC TATGAGACTT CGTTAAAAGA TACGGAAGAA AGATACGCCA AAACTTTAAA AGCCTTTGAA GAACGTCTTG TAACGATGTA CAGAAATTCA TATGTTTCAT ACCTTAATAT ATTAGCCGAT TCGGAGAATC TGATTAATTT TTTTGAAAGA CTTGAGTTGA TTTCATCAAT TGCGAAAAAA GATAAGGAAA TTGTAAAGAA AGTTCAGGAT ATAAAAAAGG ATCTTACATA TAAAAAGCAA TTGGTTCAAT ATATAAAAGG TGCAAAGCAA CTTGAATTAA TCAGAAAGAA AAACAATATT GATTCATTGG TTGCATCCAG GAGTGGGCTT GAAAACAAAA TCAAAGAAAG AGAAGAAGAA ATCAGACGTT TGGAAGAGCA GGAAGACAAG CTGATTCAGC AATCTTACGA AATAGCAAAT CAAATAAGGA GAAGTACCGG AAGTATCAAA AATTATGCCG GCGGTACAAT GGTGTGGCCG GTACCAAGTT CAAGAAAGAT AGATTCTCGA TTTGGCACGA GACTTCATCC TATATTCAAA AAGTATAAAA TGCATACCGG TGTTGACATT GATGCGGCTT ACGGAGCTTC AATAGTTGCT GCCAATAATG GAATTGTGAT TTTTTCGGGC TGGGAGGATG GATACGGTTA TACGGTTATT ATCGACCATG GCGGCGGAAT AACCACGCTG TATGCTCATT GCAGCAAGCT TCTTGTTAAC AAAGGTGACA AGGTTCGAAA GGGTCAAACC ATAGCCCAGG CCGGCAGTAC CGGAACGGCT ACAGGATCGC ATCTACATTT TGAAGTTCGA ATAGACGGAA ATGTAACCAA TCCTTTGGAT TATATTAAAT GA
|
Protein sequence | MKRLFIIVLI ASLTAALIFP VMADNNSLKD KMSEIDDKLN DISKQKVEID KEKKELEREK KELINAENEA NLEYQNLVSE LEALDSQIES YETSLKDTEE RYAKTLKAFE ERLVTMYRNS YVSYLNILAD SENLINFFER LELISSIAKK DKEIVKKVQD IKKDLTYKKQ LVQYIKGAKQ LELIRKKNNI DSLVASRSGL ENKIKEREEE IRRLEEQEDK LIQQSYEIAN QIRRSTGSIK NYAGGTMVWP VPSSRKIDSR FGTRLHPIFK KYKMHTGVDI DAAYGASIVA ANNGIVIFSG WEDGYGYTVI IDHGGGITTL YAHCSKLLVN KGDKVRKGQT IAQAGSTGTA TGSHLHFEVR IDGNVTNPLD YIK
|
| |