Gene Mkms_5781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_5781 
Symbol 
ID4610210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008703 
Strand
Start bp294881 
End bp296563 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content73% 
IMG OID639789436 
Productpeptidase S8 and S53, subtilisin, kexin, sedolisin 
Protein accessionYP_935771 
Protein GI119855166 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0404434 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.131255 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGGCA AAAGTGTGGC GGCCGGCCTG GCCGCATTCG GACTGCTCAG CGCCAACGTG 
TTGGCGCCCC CGGCGGCGCT GGCGGTGGCC CCGCCGGTGA TCGACCCGGG CGCACTGCCG
CCGGACGAGA CTCCGGGCCC GCCCCAGGAG ATGCGCCAAA CCAAAGCGTG CGTGACCCCG
GTGGTGGTGG GCGATCCCAA TGTGGCGCAA CCCGACCCGG GCAACACCAT GCTCAACATC
GAACAGGCCT GGCAGTACTC GACGGGCGCT GGAGTGACGG TGGCCATCAT CGACACCGGC
GTCACCCCCA ACCCGCGGTT CCCGCGGCTG TTTCCCGGCG GCGACTTCGT GCAGGGACTA
CCCGACGGCG GGCTGACCGA TTGCGAAAGC CACGGCACCA TTGTCGCCTC GATCATCGGC
GCCGCGCCGG CTAACCCGGC CGACCGCCCC ACGCCGCGTC CTGCGGGAGC AGGGGCGCCC
CCTCCGCCCC CGGGCGTGCC GGCCAATCCC GCGCCGCCAG CGTTTCCCCC GCCGCCGACG
ATCACCGCGA CCGCGACAGT GACCGCGCCC GCGCCGCCGC CCGAGCCGCC GCCACCTCCT
GCTGAGCCGC CGCCGGGGGG ACCGCCGCCC GGTGGACCGC CGCCGGCGGG ACCTCCGCCC
GCGCAGGGCC CGGGAGACAC CGGCGCCGCG CAGCCGTTGG TACCGGGGCC GCCGCCGGCG
GGACCGGACG GCGTGGTGGG CGTGGCCCCC GATGTCTCGC TGATTTCGAT TCGGCAGTCC
TCGACGGCGT TCACCCCGGC ACGACCAGCC CCCGGCGACA TCGAGGGTCA ACGCAAGGCC
GGCGACATTG CGACCCTGGC CAAGGCGATT CGCCATGCCG CAGACCTGCC CGGCGTTCGG
GTGATCAACG TGTCGTTGGC GTCGTGCATC AACGCCGCCG CCCCGGTCAA CCAGGACGCT
CTGGGCGCGG CGGTGCGCTA CGCCGCGGTG GACAAGGACA TCGTGATCGT GGCCGCCGCC
GGCAATCAGG GCGGCGGTGA CCAGGGACAG GACTGCGGAC AGAACCCCGC CTTCAATGCG
CTGGACCCCA ACGATCCCCG GGACTGGGCT GGGGTGCGCA CGATCGTGAC TCCGGCGTGG
TTCTCCGACT ACGTGCTCAC CGTCGGCGCG GTGACCCCCG AGGGGCTGCC GCTGCCGGAT
TCGATCAACG GCCCGTGGGT GTCGGTGGCC GCCCCTGGCT GGCGGATCAT GGGCCTGTCG
AACACCAACG GCGCCGCGGT CAACGCCCGA CCCGATGAAC CCGGCTTGGG TGCGGGCTTC
TGGGGGACCA GCTTCTCGGC CGCCTACGTC AGCGGCGTGG TCGCGCTGGT GCGGGCCAAA
TTCCCCGATC TCACCGCGGC CCAAGTCATG CGGCGCATCA CCGAGACCGC CCACAATCCG
GCCCGCGGGG TCGACAACCA GGTCGGCTAC GGCGTGGTCG ATCCGGTAGC GGCGCTGACC
TTCGATGTGC CGCTGGGCGA TCCGAAGCCA GTCGAGCGGC TCAGCACCGA CCTGTATGTA
CCTCCGCCCC CGCCGGGACC GGATCATCGC CCGCGCAACA GCGCCCTGCT CGCCGGCGCC
GCGGTCCTGC TCGTCGCCGC GGTCGCCGTG GCGGTCGTGG GCATGCGACG GAGGCTGCGA
TGA
 
Protein sequence
MRGKSVAAGL AAFGLLSANV LAPPAALAVA PPVIDPGALP PDETPGPPQE MRQTKACVTP 
VVVGDPNVAQ PDPGNTMLNI EQAWQYSTGA GVTVAIIDTG VTPNPRFPRL FPGGDFVQGL
PDGGLTDCES HGTIVASIIG AAPANPADRP TPRPAGAGAP PPPPGVPANP APPAFPPPPT
ITATATVTAP APPPEPPPPP AEPPPGGPPP GGPPPAGPPP AQGPGDTGAA QPLVPGPPPA
GPDGVVGVAP DVSLISIRQS STAFTPARPA PGDIEGQRKA GDIATLAKAI RHAADLPGVR
VINVSLASCI NAAAPVNQDA LGAAVRYAAV DKDIVIVAAA GNQGGGDQGQ DCGQNPAFNA
LDPNDPRDWA GVRTIVTPAW FSDYVLTVGA VTPEGLPLPD SINGPWVSVA APGWRIMGLS
NTNGAAVNAR PDEPGLGAGF WGTSFSAAYV SGVVALVRAK FPDLTAAQVM RRITETAHNP
ARGVDNQVGY GVVDPVAALT FDVPLGDPKP VERLSTDLYV PPPPPGPDHR PRNSALLAGA
AVLLVAAVAV AVVGMRRRLR