Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0315 |
Symbol | |
ID | 3831782 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 319073 |
End bp | 320113 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637828250 |
Product | peptidase M23B |
Protein accession | YP_429192 |
Protein GI | 83589183 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0739] Membrane proteins related to metalloendopeptidases [COG0741] Soluble lytic murein transglycosylase and related regulatory proteins (some contain LysM/invasin domains) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.612531 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAGGGA TACCTATGGA CCCGATGGCA GCGACAGCAA CAATCAGCAT AGCCAAAAAA GCCGGCCAGC TGGCCACGTC GGCCCGGGCC AGGGAGGGCT TGGCCGTAAA GATTGAATAC CTGACGATAG GTATCTTTTT TCTTATTCCT GCCCTGCTTA TTTCACTGGT CCTCATATTT TTCCTGGCCC TGGGAATGGA CGATCAAGGC AAAATCACCG GCTTCCAGTC CGGCGCCCCG ACGGCTTTCG CCGTGGCCGA TATACCGGCC CAGTACCTGC CTATTTTCTT AAAAGCCCAG GAAAAATACG GCGTCTCCTG GGCGGTGCTC GCGGCCATCG CCAAGATAGA GTCCGGTTTC GGGCGCGATA TGGGACCTTC CAGCGCCGGG GCGATAGGCT TCATGCAGTT CATGCCAGCC ACCTGGGAGC AATACAAGCA GGACGGCGAC GGGGACGGCC GGATGGACCC TTATAATCCC TACGACGCTA TCTTCACCGC CGCCAACATG CTTAAAACGG ACGGCTTCGC CACCGACCCC CGGGGAGCCA TCTTCGCCTA TAACCACGCC AACTGGTATG TAGATATGGT CATGAGCCAG GCGGCGGCCT ACGCCTCCAC CATGCTGCCG GTAGGCCAGG GCGCCTGGCC CCTCCCGGCT CAATATAAAA CAATCACTGA TGGTTACGGC ATGCGCTGGC ACCCGATTCT AAAAAAATTT AGCTTTCACG ACGGCATCGA CCTGCCGGCG CCTCAGGGAA CTCAGGTGTT CGCGGTAAAA GACGGTAAAG TGACCTGGGA CCGGGAGAAC GGCGCCTACG GTCTTACCGT CATGCTTGAC CACGGCGGCC TGGAGACGAA GTATTGTCAC CTGTCTATGG TAGCCGTAAG AAAAGGCGAA CAGGTCAAAG CCGGACAGGT AATCGGGTAT GTTGGGAATA CCGGGCTTTC CACCGGCCCC CACCTGCATT TCAGCGTTTA CATTAACGGC CGGCCGGCCA ACCCGGAAGA GTGGCTGAAG ATACCTTCGG GAAATAACTG A
|
Protein sequence | MGGIPMDPMA ATATISIAKK AGQLATSARA REGLAVKIEY LTIGIFFLIP ALLISLVLIF FLALGMDDQG KITGFQSGAP TAFAVADIPA QYLPIFLKAQ EKYGVSWAVL AAIAKIESGF GRDMGPSSAG AIGFMQFMPA TWEQYKQDGD GDGRMDPYNP YDAIFTAANM LKTDGFATDP RGAIFAYNHA NWYVDMVMSQ AAAYASTMLP VGQGAWPLPA QYKTITDGYG MRWHPILKKF SFHDGIDLPA PQGTQVFAVK DGKVTWDREN GAYGLTVMLD HGGLETKYCH LSMVAVRKGE QVKAGQVIGY VGNTGLSTGP HLHFSVYING RPANPEEWLK IPSGNN
|
| |