Gene Mmcs_5563 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_5563 
Symbol 
ID4114431 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008147 
Strand
Start bp139058 
End bp140440 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content67% 
IMG OID638034718 
Productpeptidase S8 and S53, subtilisin, kexin, sedolisin 
Protein accessionYP_642719 
Protein GI108802523 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value0.3647 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCGCC AAGTCGGTTC GATCGCGGCA GCGGTGGCGC TGCTGGTGGT ATGCAGCCCG 
CCGCCACAGG CCGCCGCCTT GACGATCCCC GTCGTCGATC CTGCCGCGCT GCCGCCCGAT
GGTCCTCCCT CGCCGAATCA GGAGATGCGT CAGAACGGCA CATGCTCGTT CTACGGCGCC
ATGCCCGGCT TCGACCCGGC CGCTGTTCCG CCCAGCCAGA ACATGCTCAA CCTGCCTGAG
GCGTGGAAGA CCTCACGCGG CAGGGGAGTG GCGGTAGCGG TCATCGACTC GGGAGTGACG
CCCCAGCCGC GGCTGCCCAA CCTCAACCCC GGCGGTGACT ACATCGATCC CCCCGCCAAC
GGCCTGGTCG ACTGTGACGG ACACGGGACA GTGGTCGCCG GCATCATCGG TGCTCAGCCT
GGACCTGACG GCTTCTCTGG AGTCGCCCCC GAGGCGTCCA TCGTGTCGAT CCGGCAGAGT
TCAGCGCAAT GGTCACCCAA GCAACCCCAA GGTGACGGTG ACCCACAGCA GATCAAGACC
GCCGGTGACG TCGCCACCCT GGCTCGCGCA GTACGCCATG CCGCCGACAT GCCCGCAGTA
AAGGTCATCA ACATCTCACT GACCGACTGC ATCCCGGTGT ACAAGCAGGT CGATCAGGCT
GCCTTGGGCG CCGCGCTGCG CTACGCCGCG ATCGATAAAG ACATCGTGGT CATCGCGGCC
GCCGGCAACA TCGGCGAAAA CAACTGCGAC AGTAACCCGT TGACAGATCC GAACCGCCCA
GACGACCCCC GCAACTGGGC TGGCGCCACG GTCATCTCCA CCCCGTCGTA CTGGCAGCCC
TACGTCCTGT CGGTGGCGTC ACTGACCCCT GAGGGCCAAC CGTCCGGCTT CACCATGGCT
GGTCCGTGGG TAGGAATCGC CGCGCCGGGG GAGCAGATCA CCTCCCTGGG CAACGCCCCG
AATTCCGGGC TGGTCAACGG CCAGCCCTCC AACCAGGAGC CTCTGGTGCC GATCAATGGG
ACCAGCTTCG CGGCGGCCTA CGTCTCCGGT GTCGCGGCGC TGGTGCGTTC AGCGCTGCCT
GAGCTGACCG CGCGCCAGGT CGTCAACCGG CTCGTCGCGT CAGCGCACAA CGCGGCCCGT
TCTCCGTCGA ACCTTGTCGG CGCCGGCGTC ATCGACCCGG TGGCGGCGCT GACCTGGGAC
ATCCCCGAGG GCGAAGAGGT GCCCGCCAAT ATGCCGGTCG TGCGGGTCGA ACCGCCGGCG
CCGCCGCCGC CGGACAACCG CCTGCCACGG TTCATCGCAT TCGGCGTGGG CATCGTCGCG
ATCGCCGCCG CCATCATCGC TGTGACCATG ATTGGAATGC GCCGTGGCAA CGACAACCGT
TAA
 
Protein sequence
MLRQVGSIAA AVALLVVCSP PPQAAALTIP VVDPAALPPD GPPSPNQEMR QNGTCSFYGA 
MPGFDPAAVP PSQNMLNLPE AWKTSRGRGV AVAVIDSGVT PQPRLPNLNP GGDYIDPPAN
GLVDCDGHGT VVAGIIGAQP GPDGFSGVAP EASIVSIRQS SAQWSPKQPQ GDGDPQQIKT
AGDVATLARA VRHAADMPAV KVINISLTDC IPVYKQVDQA ALGAALRYAA IDKDIVVIAA
AGNIGENNCD SNPLTDPNRP DDPRNWAGAT VISTPSYWQP YVLSVASLTP EGQPSGFTMA
GPWVGIAAPG EQITSLGNAP NSGLVNGQPS NQEPLVPING TSFAAAYVSG VAALVRSALP
ELTARQVVNR LVASAHNAAR SPSNLVGAGV IDPVAALTWD IPEGEEVPAN MPVVRVEPPA
PPPPDNRLPR FIAFGVGIVA IAAAIIAVTM IGMRRGNDNR