Gene Mkms_0166 details


Gene Information

Locus tag: Mkms_0166
Symbol:
ID: 4615395
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 179019
End bp: 181013
Gene Length: 1995 bp
Protein Length: 664 aa
Translation table: 11
GC content: 68%
IMG OID: 639789842
Product: PgPepO oligopeptidase
Protein accession: YP_936174
Protein GI: 119866222
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG3590] Predicted metalloendopeptidase
TIGRFAM ID: [TIGR02543] Listeria/Bacterioides repeat
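
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length counts the stop codon. Neither is stated in the record, although both agree with its numbers. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(179019, 181013)
print(length_bp)                  # 1995
print(protein_length(length_bp))  # 664
```

Under these assumptions, 181013 - 179019 + 1 = 1995 bp, and 1995 / 3 - 1 = 664 aa, matching the Gene Length and Protein Length fields.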


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGGTAG AAGCGACCAT CAAATCCGGC ATCGACCTCA GTTACCTCGA CACCGCGGCC 
CGCCCGCAGG ACGACCTGTT CGGCCACGTC AACGGCCGCT GGCTCGCCGA TTACGAGATC
CCCGCCGACC GGGCCGCCGA CGGCGCCTTC CGCACCCTCT ACGACCGTGC CGAAGAGCAG
ATCCGCGACA TCATCACCGA GGCGGCCGAC GCGAACGCCG CGGACGGCAC CGACGAGCAA
CGCATCGGCG ACCTGTACGC CAGCTTCCTC GACGAGGCGA CGGTGGCGCG CATCGGTGTG
CAGCCGCTGC TCGACGAACT CGCGCTCGTC GACGCCGCCG ACAGCCCCGA CGCGCTCGCG
GCGGTCCTCG GCGGGCTGCA ACGCACCGGC GTCGGCGGCG GCGCCGGTGT GTACGTCGAC
ACCGACTCCA AGAACTCGAC CCGCTACCTG CTGCACTTCA GCCAGTCCGG TATCGGGCTG
CCCGACGAAT CGTATTTCCG CGACGAGCAG CACGCCGAGA TACTCGCCGC CTACCCCGGG
CACATCGCGG CGATGTTCGC GCTGGTCTAC GGGGGCGATC ACACGCAGAC GGCCGAGCGC
ATCGTCGCGC TGGAGCGCAA GCTGGCCGCC GCGCACTGGG ATGTGGTGAA GCGTCGCGAC
GCCGACCTGA CCTACAACCT GCGCACGTTC GCCGATCTGT CCGCCGAGGC GCCCGGCTTC
GACTGGGCCG GCTGGGTGAC GGCGCTGGGC ACCACGCCGG AGTCAGTGGC CGAGGTCGTG
GTGCGCCAGC CCGACTACCT CACCGAGTTC GCGGCGGCCT GGTCGAGTGA ACCGCTGGAG
GACTGGAAGC ACTGGGTGCG GTGGCGCCTC ATCCACGCCC GCGCCTTCCT GCTGACCGAC
GAGCTGGTGG CGGAGGACTT CGCGTTCTAC GGTCGCCTGC TCTCAGGCAC CGAGCAGATC
CGCGACCGCT GGAAGCGCGG GGTCTCCGTG GTGGAGAACC TGATGGGCGA GGCGCTCGGC
AAGCTCTACG TGCAGCGGCA CTTCCCGCCG AATGCCAAGG CGCGCATGGA CGAACTGGTC
GCCAACCTGC GCGAGGCCTA CCGGGTGAGC ATCAACCGGC TGGAGTGGAT GACGCCGGAG
ACCCGCGAGA AGGCGCTGGC CAAGCTCGAC AAGTTCACGC CGAAGATCGG CTACCCGGTC
CGCTGGAAGG ACTACTCGCA GCTGGTCATC CGCCGCGACG ACCTCTACGG CAACTACCGC
CGCGGCTACC AGCTGGCCTC CGACCGGGAA GTCCAGAAGC TCGGCGGTCC CGTGGACCGC
GACGAATGGT TCATGACGCC GCAGACGGTC AACGCGTACT ACAACCCGGG GATGAACGAG
ATCGTCTTCC CCGCAGCGAT TCTGCAGCCG CCGTTCTTCG ACGCCGACGC CGACGACGCC
GCCAACTACG GCGGTATCGG CGCGGTCATC GGTCACGAGA TCGGCCACGG TTTCGACGAT
CAGGGCGCCA AGTACGACGG CGACGGCAAC CTGGTCGACT GGTGGACCGA CGCCGACCGC
ACAGAGTTCG GCGCCCGCAC CAAGGCGTTG ATCGAACAAT ACGAGCAGTA CACGCCGCGG
GAACTCGAGG GCCAGAACGG ACACCACGAC GCGCACGTCA ATGGGGCGTT CACCGTCGGC
GAGAACATCG GCGACCTCGG CGGGTTGTCG ATCGCACTGC TGGCCTACGA GCTGTCGCTC
AAGGGCGAAC CGGCGCCGGT GATCGACGGG TTGACCGGTG TGCAGCGCGT GTTCTTCGGG
TGGGCGCAGG TGTGGCGCAC GAAATACCGT TCGGCGGAAG CGATCCGGCG CCTGGCCACC
GATCCGCATT CGCCGCCGGA GTTCCGCTGC AACGGCGTCA TCCGCAACCT CGACGCGTTC
TATGAGGCGT TCGAGGTCGG CGCCGACGAT GCGCTCTACC TGGAACCCGA ACGCCGCGTC
CGCATCTGGA ACTAG
 
Protein sequence
MTVEATIKSG IDLSYLDTAA RPQDDLFGHV NGRWLADYEI PADRAADGAF RTLYDRAEEQ 
IRDIITEAAD ANAADGTDEQ RIGDLYASFL DEATVARIGV QPLLDELALV DAADSPDALA
AVLGGLQRTG VGGGAGVYVD TDSKNSTRYL LHFSQSGIGL PDESYFRDEQ HAEILAAYPG
HIAAMFALVY GGDHTQTAER IVALERKLAA AHWDVVKRRD ADLTYNLRTF ADLSAEAPGF
DWAGWVTALG TTPESVAEVV VRQPDYLTEF AAAWSSEPLE DWKHWVRWRL IHARAFLLTD
ELVAEDFAFY GRLLSGTEQI RDRWKRGVSV VENLMGEALG KLYVQRHFPP NAKARMDELV
ANLREAYRVS INRLEWMTPE TREKALAKLD KFTPKIGYPV RWKDYSQLVI RRDDLYGNYR
RGYQLASDRE VQKLGGPVDR DEWFMTPQTV NAYYNPGMNE IVFPAAILQP PFFDADADDA
ANYGGIGAVI GHEIGHGFDD QGAKYDGDGN LVDWWTDADR TEFGARTKAL IEQYEQYTPR
ELEGQNGHHD AHVNGAFTVG ENIGDLGGLS IALLAYELSL KGEPAPVIDG LTGVQRVFFG
WAQVWRTKYR SAEAIRRLAT DPHSPPEFRC NGVIRNLDAF YEAFEVGADD ALYLEPERRV
RIWN
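
The protein sequence can be reproduced from the gene sequence using translation table 11 (the bacterial, archaeal and plant plastid code). The following is a minimal sketch, not the database's own pipeline. It assumes the standard codon assignments, which table 11 shares, and treats the first codon as methionine when it is one of the table's alternative initiators, as with the GTG that opens this gene. `translate_cds` is a hypothetical helper name.

```python
BASES = "TCAG"
# Standard amino-acid assignments in TCAG codon order (shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons permitted by table 11; each is read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    if not codons:
        return ""
    first = "M" if codons[0] in START_CODONS else CODON_TABLE[codons[0]]
    residues = [first]
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 60 bases of the gene sequence listed above.
first_block = "GTGACGGTAG AAGCGACCAT CAAATCCGGC ATCGACCTCA GTTACCTCGA CACCGCGGCC"
print(translate_cds(first_block))  # MTVEATIKSGIDLSYLDTAA
```

Passing the full gene sequence to the same function should yield the full protein sequence shown above; the space-separated 10-base blocks can be pasted as they are because whitespace is stripped first.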