Gene Mmcs_0157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_0157 
Symbol 
ID4109003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp171679 
End bp173697 
Gene Length2019 bp 
Protein Length672 aa 
Translation table11 
GC content68% 
IMG OID638029282 
ProductPgPepO oligopeptidase 
Protein accessionYP_637334 
Protein GI108797137 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3590] Predicted metalloendopeptidase 
TIGRFAM ID[TIGR02543] Listeria/Bacterioides repeat 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCTTCT GGCAACATGT TGGGGTGACG GTAGAAGCGA CCATCAAATC CGGCATCGAC 
CTCAGTTACC TCGACACCGC GGCCCGCCCG CAGGACGACC TGTTCGGCCA CGTCAACGGC
CGCTGGCTCG CCGATTACGA GATCCCCGCC GACCGGGCCG CCGACGGCGC CTTCCGCACC
CTCTACGACC GTGCCGAAGA GCAGATCCGC GACATCATCA CCGAGGCGGC CGACGCGAAC
GCCGCGGACG GCACCGACGA GCAACGCATC GGCGACCTGT ACGCCAGCTT CCTCGACGAG
GCGACGGTGG CGCGCATCGG TGTGCAGCCG CTGCTCGACG AACTCGCGCT CGTCGACGCC
GCCGACAGCC CCGACGCGCT CGCGGCGGTC CTCGGCGGGC TGCAACGCAC CGGCGTCGGC
GGCGGCGCCG GTGTGTACGT CGACACCGAC TCCAAGAACT CGACCCGCTA CCTGCTGCAC
TTCAGCCAGT CCGGTATCGG GCTGCCCGAC GAATCGTATT TCCGCGACGA GCAGCACGCC
GAGATACTCG CCGCCTACCC CGGGCACATC GCGGCGATGT TCGCGCTGGT CTACGGGGGC
GATCACACGC AGACGGCCGA GCGCATCGTC GCGCTGGAGC GCAAGCTGGC CGCCGCGCAC
TGGGATGTGG TGAAGCGTCG CGACGCCGAC CTGACCTACA ACCTGCGCAC GTTCGCCGAT
CTGTCCGCCG AGGCGCCCGG CTTCGACTGG GCCGGCTGGG TGACGGCGCT GGGCACCACG
CCGGAGTCAG TGGCCGAGGT CGTGGTGCGC CAGCCCGACT ACCTCACCGA GTTCGCGGCG
GCCTGGTCGA GTGAACCGCT GGAGGACTGG AAGCACTGGG TGCGGTGGCG CCTCATCCAC
GCCCGCGCCT TCCTGCTGAC CGACGAGCTG GTGGCGGAGG ACTTCGCGTT CTACGGTCGC
CTGCTCTCAG GCACCGAGCA GATCCGCGAC CGCTGGAAGC GCGGGGTCTC CGTGGTGGAG
AACCTGATGG GCGAGGCGCT CGGCAAGCTC TACGTGCAGC GGCACTTCCC GCCGAATGCC
AAGGCGCGCA TGGACGAACT GGTCGCCAAC CTGCGCGAGG CCTACCGGGT GAGCATCAAC
CGGCTGGAGT GGATGACGCC GGAGACCCGC GAGAAGGCGC TGGCCAAGCT CGACAAGTTC
ACGCCGAAGA TCGGCTACCC GGTCCGCTGG AAGGACTACT CGCAGCTGGT CATCCGCCGC
GACGACCTCT ACGGCAACTA CCGCCGCGGC TACCAGCTGG CCTCCGACCG GGAAGTCCAG
AAGCTCGGCG GTCCCGTGGA CCGCGACGAA TGGTTCATGA CGCCGCAGAC GGTCAACGCG
TACTACAACC CGGGGATGAA CGAGATCGTC TTCCCCGCAG CGATTCTGCA GCCGCCGTTC
TTCGACGCCG ACGCCGACGA CGCCGCCAAC TACGGCGGTA TCGGCGCGGT CATCGGTCAC
GAGATCGGCC ACGGTTTCGA CGATCAGGGC GCCAAGTACG ACGGCGACGG CAACCTGGTC
GACTGGTGGA CCGACGCCGA CCGCACAGAG TTCGGCGCCC GCACCAAGGC GTTGATCGAA
CAATACGAGC AGTACACGCC GCGGGAACTC GAGGGCCAGA ACGGACACCA CGACGCGCAC
GTCAATGGGG CGTTCACCGT CGGCGAGAAC ATCGGCGACC TCGGCGGGTT GTCGATCGCA
CTGCTGGCCT ACGAGCTGTC GCTCAAGGGC GAACCGGCGC CGGTGATCGA CGGGTTGACC
GGTGTGCAGC GCGTGTTCTT CGGGTGGGCG CAGGTGTGGC GCACGAAATA CCGTTCGGCG
GAAGCGATCC GGCGCCTGGC CACCGATCCG CATTCGCCGC CGGAGTTCCG CTGCAACGGC
GTCATCCGCA ACCTCGACGC GTTCTATGAG GCGTTCGAGG TCGGCGCCGA CGATGCGCTC
TACCTGGAAC CCGAACGCCG CGTCCGCATC TGGAACTAG
 
Protein sequence
MRFWQHVGVT VEATIKSGID LSYLDTAARP QDDLFGHVNG RWLADYEIPA DRAADGAFRT 
LYDRAEEQIR DIITEAADAN AADGTDEQRI GDLYASFLDE ATVARIGVQP LLDELALVDA
ADSPDALAAV LGGLQRTGVG GGAGVYVDTD SKNSTRYLLH FSQSGIGLPD ESYFRDEQHA
EILAAYPGHI AAMFALVYGG DHTQTAERIV ALERKLAAAH WDVVKRRDAD LTYNLRTFAD
LSAEAPGFDW AGWVTALGTT PESVAEVVVR QPDYLTEFAA AWSSEPLEDW KHWVRWRLIH
ARAFLLTDEL VAEDFAFYGR LLSGTEQIRD RWKRGVSVVE NLMGEALGKL YVQRHFPPNA
KARMDELVAN LREAYRVSIN RLEWMTPETR EKALAKLDKF TPKIGYPVRW KDYSQLVIRR
DDLYGNYRRG YQLASDREVQ KLGGPVDRDE WFMTPQTVNA YYNPGMNEIV FPAAILQPPF
FDADADDAAN YGGIGAVIGH EIGHGFDDQG AKYDGDGNLV DWWTDADRTE FGARTKALIE
QYEQYTPREL EGQNGHHDAH VNGAFTVGEN IGDLGGLSIA LLAYELSLKG EPAPVIDGLT
GVQRVFFGWA QVWRTKYRSA EAIRRLATDP HSPPEFRCNG VIRNLDAFYE AFEVGADDAL
YLEPERRVRI WN