Gene Information |
Locus tag | Moth_1042 |
Symbol | |
ID | 3831848 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1069558 |
End bp | 1070568 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637828970 |
Product | peptidase RseP |
Protein accession | YP_429899 |
Protein GI | 83589890 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0750] Predicted membrane-associated Zn-dependent proteases 1 |
TIGRFAM ID | [TIGR00054] RIP metalloprotease RseP |
|
|
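The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1069558
end_bp = 1070568

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is made of whole codons; the last codon is assumed to be the stop
# codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # → 1011 0 336
```

Under these assumptions the result agrees with the listed Gene Length (1011 bp) and Protein Length (336 aa).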
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.000000013997 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | GTGACCATTA TCCTGGCCCT GGTTATCTTC AGCATTCTGG TTATCGTCCA TGAAGGGGGG CACTACCTGG CGGCCAAGCG CGCCGGGATT AAGGTGGAGG AGTTCGCTAT CGGCATGGGC CCGGCCCTGT GGCAGGTTAA AAAGGGAGAA ACCATTTATT CCCTGCGGGC CTTTCCCCTG GGAGGGTTTA ACCGGATGGC CGGCATGGAG GGGCCAGACC TTGACGACCC ACGTGGCTTC AACCGCCAGC CGGTACTCGC CCGGATGGGG GTCATCGGCG CCGGTTCTGG TATGAACTTC CTCCTGGCGT TGTTCCTGTT TATTCTGGTC TTTATGGTCC TGGGGATACC GGCTGATATC AATATTATTG GCCGGGTCGA GCCGGGTATG CCGGCCGCCC TGGCCGGCTT GCAACCCGGG GATAAAATCC TTCAGGTTAA CGATACCCCG GTGAATACCT GGCGCGATAT GGTCGACCTG ATTTATAAAC ACCCGGAAGA AAAAATAACC CTGGTGATTG AACGGGACGG CCGGCAACAA CAGATCAACC TCACCACCGC CAGGGATCCC CAGACGGGGG TGGGATTGAT CGGCATCGGC CCCACCTGGG AGAGGCAGGG TTTCTGGCGC TCTATTGTCC TAGGCACCAG GCAGGCAATA GAGATCACCA GGCTCATTAT CCTGAGCTTG GTAGAGATGG TGACCGGCAA GGTGGCGGCG GAGGTAGTCG GTCCGGTGGG TATCGTCCAG CTGGTGGGCC AAGCGGCAGC CTTCGGCCTG GCCAATGTTT TGAACTTTAT GGCCGTCCTG AGCCTTGACC TGGGGATTAT TAACCTGCTG CCGGTCCCCG CCCTGGATGG CAGCCGGCTG GTGTTCCTGG GCCTGGAAGC AGTGCGCGGG CGACCCATTA ACCCGGAAAA GGAGAATTTT ATCCACCTGA TCGGCTTTGC CATCCTGATG GGCCTGTTAA TTCTCATTAC CTATAAGGAT TTAATCCGGA TCTTCAGCTG A
|
Protein sequence | MTIILALVIF SILVIVHEGG HYLAAKRAGI KVEEFAIGMG PALWQVKKGE TIYSLRAFPL GGFNRMAGME GPDLDDPRGF NRQPVLARMG VIGAGSGMNF LLALFLFILV FMVLGIPADI NIIGRVEPGM PAALAGLQPG DKILQVNDTP VNTWRDMVDL IYKHPEEKIT LVIERDGRQQ QINLTTARDP QTGVGLIGIG PTWERQGFWR SIVLGTRQAI EITRLIILSL VEMVTGKVAA EVVGPVGIVQ LVGQAAAFGL ANVLNFMAVL SLDLGIINLL PVPALDGSRL VFLGLEAVRG RPINPEKENF IHLIGFAILM GLLILITYKD LIRIFS
|
| |
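As a rough illustration of how the gene sequence relates to the protein sequence and the GC content field, the following sketch translates a CDS and computes its GC fraction. It is a minimal example under stated assumptions, not the database's own method: the helper names `translate_cds` and `gc_content` are hypothetical, the input is assumed to begin at an initiator codon (so the first codon is reported as M, as with the GTG start here), and the amino acid assignments of translation table 11 are taken to match the standard genetic code.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces used to group bases in blocks of ten and upper-case."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the sequence starts at an initiator codon, so the first
    codon is reported as M regardless of its usual amino acid.
    """
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two ten-base blocks of the gene sequence above.
print(translate_cds("GTGACCATTA TCCTGGCCCT"))  # → MTIILA
```

Passing the full gene sequence string to `translate_cds` and `gc_content` would allow the protein sequence and GC content fields to be compared against the record.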