Gene Mjls_2026 details

Gene Information

Locus tag: Mjls_2026
Symbol:
ID: 4877747
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 2133175
End bp: 2134389
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 68%
IMG OID: 640139324
Product: metallopeptidase MEROPS family protein
Protein accession: YP_001070304
Protein GI: 126434613
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0750] Predicted membrane-associated Zn-dependent proteases 1
TIGRFAM ID: [TIGR00054] RIP metalloprotease RseP
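
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2133175
end_bp = 2134389

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1215, matching "Gene Length"
print(protein_length)  # 404, matching "Protein Length"
```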


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.456946
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0994336
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGTTTG CTCTCGGCAT CGTGCTCTTC GCACTGGCCA TCCTGGTGTC GGTAGCCCTG 
CACGAATGCG GCCACATGTG GGTCGCGCGG GCCACCGGGA TGAAGGTGCG CCGGTACTTC
GTCGGGTTCG GGCCCACCCT GTGGTCGACT CACCGCCCCA ACCGCCTCGG CAGCACCGAG
TACGGCGTCA AGGCCGTACC GCTCGGCGGG TTCTGCGATA TCGCGGGTAT GACGTCGGTC
GAGGAACTCG CCCCGGAGGA CCGCCCGTAC GCCATGTACC GGCAGAAGGT GTGGAAGCGC
GTCGCCGTGC TGTTCGCCGG ACCGGGGATG AACTTCGTCA TCGGCCTGGT CCTCGTCTAC
GCCATCGCGG TGATCTGGGG CCTGCCGAAC CTGAACCCCC CGACCGCCGC GATCGTCGGC
CAGACCGGCT GTGTCGCACC GCAGCTCAGC AAGGACCAGA TCGGCGAGTG CACCGGGCCC
GGCCCGGCGG CGCAGGCCGG TATCCAGGCC GGCGACGTGA TCGTCAAGGT CGGCGACACC
GACGTCGCGA CGTTCGACGA GGCTCGGGTG ACCCTGCAGA AGTCCTCCGG CCCGACACCG
ATCGTCATCG AGCGGGACGG CCAGGAACTC ACCAAGGTGG TCGACGTCAC CCAGACCCAG
CGCTTCACCG GCGAGGGCGA CCAACCGACC ACCGTCGGCG CGATCGGCAT CGCCGCCGCG
CAGTTCGGGC CGACCCAGCA CAACGCGCTC TCGGCGGTGC CCGCCACGTT CGCGTTCACC
GGCGACCTCG CCGTCGAACT GGGTAAGTCG CTGGCCAAGA TCCCCACCAA GGTGGGCGCG
CTGGTGGACT CCATCGGCGG TGGTGAGCGT GATCCCGAGA CGCCGATCAG CGTCGTGGGC
GCCAGCATCA TCGGCGGCGA CACCGTCGAC GCGGGGCTGT GGGTGGCCTT CTGGTTCTTC
CTGGCCCAGC TCAACTTCGT CCTCGGCGCG GTGAACCTGG TGCCGCTGCT GCCGTTCGAC
GGTGGACACA TCGCGATCGC CGTGTTCGAG AAGATCCGCA ACATGATCCG GTCGGCCCGC
GGCATGGTGG CCGCGGCGCC GGTGAACTAC CTCAAGCTCA TGCCCGCCAC CTACGTAGTG
TTGGTGGTGG TGGTCGGCTA CATGCTGCTG ACCGTGACCG CTGACCTGGT CAACCCGATC
AGGTTGTTCC AATAG
 
Protein sequence
MMFALGIVLF ALAILVSVAL HECGHMWVAR ATGMKVRRYF VGFGPTLWST HRPNRLGSTE 
YGVKAVPLGG FCDIAGMTSV EELAPEDRPY AMYRQKVWKR VAVLFAGPGM NFVIGLVLVY
AIAVIWGLPN LNPPTAAIVG QTGCVAPQLS KDQIGECTGP GPAAQAGIQA GDVIVKVGDT
DVATFDEARV TLQKSSGPTP IVIERDGQEL TKVVDVTQTQ RFTGEGDQPT TVGAIGIAAA
QFGPTQHNAL SAVPATFAFT GDLAVELGKS LAKIPTKVGA LVDSIGGGER DPETPISVVG
ASIIGGDTVD AGLWVAFWFF LAQLNFVLGA VNLVPLLPFD GGHIAIAVFE KIRNMIRSAR
GMVAAAPVNY LKLMPATYVV LVVVVGYMLL TVTADLVNPI RLFQ
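
As a rough way to cross-check the gene sequence against the protein sequence and the GC content field, the following minimal sketch translates a nucleotide string and computes its GC fraction. It assumes the standard codon assignments, which translation table 11 shares (table 11 differs only in which codons may act as start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of any database tool. Only the first line of the gene sequence is used here; the full sequence can be pasted in the same way.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order. Assumption: translation
# table 11 uses these same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks that group the sequence in tens."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = "ATGATGTTTG CTCTCGGCAT CGTGCTCTTC GCACTGGCCA TCCTGGTGTC GGTAGCCCTG"

print(translate(first_line))    # MMFALGIVLFALAILVSVAL
print(gc_fraction(first_line))  # GC fraction of this fragment only
```

The translated fragment matches the first 20 residues of the protein sequence listed above.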