Gene Information |
Locus tag | Moth_2422 |
Symbol | |
ID | 3832173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2544351 |
End bp | 2545778 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637830341 |
Product | peptidase M23B |
Protein accession | YP_431247 |
Protein GI | 83591238 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0739] Membrane proteins related to metalloendopeptidases |
TIGRFAM ID | |
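
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes a single 3 bp stop codon; the function names are hypothetical and not part of the source record.

```python
# Values copied from the record above.
START_BP = 2544351
END_BP = 2545778


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, stop_codon_bp: int = 3) -> int:
    """Number of residues, assuming the gene ends with one stop codon."""
    return (gene_len_bp - stop_codon_bp) // 3


gene_len = gene_length_bp(START_BP, END_BP)
print(gene_len)                     # 1428
print(protein_length_aa(gene_len))  # 475
```

Both results agree with the Gene Length (1428 bp) and Protein Length (475 aa) fields.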
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000000638116 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCCGC CAGATAAGCT ACCGCAGCTA GCGGCCAGCG TAGGGCGCTT CCTAAAAAAG CTGCCCTTAC GGTGGGCAGG AATAAGGGAT AATAAAGGAA AAAGAGTATT AGTAATAACT GGTATTTTAG CCGGCGGCCT GCTCTTAGCA GCCTGGCACC AGCTTACAAC CCCGAATGCC CTGGCTGTTT TCATCAACGG CCAGCAGGTG GCAATAGTAG CCACCCGGGA ACAGTTCAAC CGGGCCTTAC AGGAGATCCT AAAAGAACGA GGGGCTGGCA GTTACCAGGG TGTGCGCTAT ACCGATCAGG TAGATTTTAA AGCAGTCCGG GCCAATCCGC AGGAAGTTAT AGCGGAGGAA CAATTAAAAA GCCTCCTGGC CGAAAGATTG CACCTGGTGG CGGCGGCAAC GGTCATTACC ATTGACGGGC AGCCGCGGCT GGTTTTGAAA GACGATGCTA CCGCTGACGC TGTTCTGGCC GCGTTTAAAC AGGCCTTTGA ACCACCGGCG GCTATAGGGC AGGTCCAGGA GGTCAAATTT TTGGAACAAG TGGCTTACGA GCACCGCCAG GCCAGTCCGG AAGAGATTTT AAGCCCGGAA GCGGCCCTTG CCAAGTTGAA GGGAACAGCA GCCAGCAGTA GCACCTATAC TGTTAAAGAA GGGGATTCCC TCTGGTCCAT TGCCAGGGAA CATAACCTGC TGGTTGACGA CATTAAAAGG GCCAACCCCG AGATCCAGGG AGAACGCCTG GATATCGGGC AGCAATTGAA ACTGACAACA GTACAACCAT TGCTTCAGGT GATGGTTGTT TATAATCAGG ATATCAAGGA ACCGGTACCC TTTGAAACCC GGGTAGAAAA AACGGCTGAC CTTTTACGAG GCCAGGAAAA GATAATTCAG GAGGGCACAG AGGGCGAACA ACTGGTTACT TACCAAGTAG TGACCAGAAA TGGTGTGGTC GTAGACAAAA AGATACTTCG GCAGCAGGTC CTTACGGAAC CGGTTGCCCG GGTGGTACAG CAGGGTACGG GGGTAACAGC CCTGGCTTCC CGGAGCGGCG GTTCCGGGCT CCTGGCCTGG CCCATCCGCG GGTATATCAC TTCCCCCTAT GGCTACCGGG GTAGCGAGTT TCACTCAGGC CTGGATATTG CGGGCAGTAT CGGCGAACCG GTGGGAGCGG CTGCAGGCGG AGTAGTTGTT AGCACCGGTT ACGACGGTGG CTACGGTCGG ATGGTAGTAA TCGACCATGG TGGCCTGGTT ACCAGGTATG CCCATTTATC TGGTTACAAT GTAAGGCCGG GCCAGCGGGT ATCTCAAGGC CAGATCATCG GTTATGTAGG GGTTAGCGGC CGTACCACAG GCCCCCACCT GCACTTTGAG GTCCTGGTCG GAGGCAGCTT CCGCAACCCG GCCAGTTATC TGAGGTGA
|
Protein sequence | MPPPDKLPQL AASVGRFLKK LPLRWAGIRD NKGKRVLVIT GILAGGLLLA AWHQLTTPNA LAVFINGQQV AIVATREQFN RALQEILKER GAGSYQGVRY TDQVDFKAVR ANPQEVIAEE QLKSLLAERL HLVAAATVIT IDGQPRLVLK DDATADAVLA AFKQAFEPPA AIGQVQEVKF LEQVAYEHRQ ASPEEILSPE AALAKLKGTA ASSSTYTVKE GDSLWSIARE HNLLVDDIKR ANPEIQGERL DIGQQLKLTT VQPLLQVMVV YNQDIKEPVP FETRVEKTAD LLRGQEKIIQ EGTEGEQLVT YQVVTRNGVV VDKKILRQQV LTEPVARVVQ QGTGVTALAS RSGGSGLLAW PIRGYITSPY GYRGSEFHSG LDIAGSIGEP VGAAAGGVVV STGYDGGYGR MVVIDHGGLV TRYAHLSGYN VRPGQRVSQG QIIGYVGVSG RTTGPHLHFE VLVGGSFRNP ASYLR
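
As a rough consistency check between the two sequences, the sketch below translates the first and last blocks of the gene sequence and compares them with the corresponding ends of the protein sequence. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand) and uses the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation codons. The helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 18 bases of the gene sequence, copied from the record.
gene_start = "ATGCCGCCGC CAGATAAGCT ACCGCAGCTA"
gene_end = "GCCAGTTATC TGAGGTGA"

print(translate(gene_start))  # MPPPDKLPQL
print(translate(gene_end))    # ASYLR
```

The two outputs match the first ten and last five residues of the protein sequence. Passing the full gene sequence to `translate` in the same way would extend the check to all 475 residues.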