Gene Mkms_4647 details

Gene Information

Locus tag: Mkms_4647
Symbol:
ID: 4612595
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 4871845
End bp: 4873965
Gene Length: 2121 bp
Protein Length: 706 aa
Translation table: 11
GC content: 68%
IMG OID: 639794338
Product: oligopeptidase B
Protein accession: YP_940628
Protein GI: 119870676
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1770] Protease II
TIGRFAM ID:
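
The length fields in this record are consistent with the start and end coordinates. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (neither is stated in the record). The function names are illustrative only.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length):
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(4871845, 4873965)
print(length)                     # 2121
print(protein_length_aa(length))  # 706
```

Both results match the Gene Length and Protein Length fields above.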


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTCC CGCCGCCGCA GGCCAAACGG GTTGAACAGC GGCGCGAACA CCACGGCGAC 
GTGTTCATCG ACCCCTACGA GTGGCTGCGC GACAAATCCG ATCCCGAGGT GCTTGCCCAC
CTGGAGGCCG AGAACGCCTA CACCGAGGCG CAGACCGCGC ATCTCGCGCC GTTGCGGCAG
AAGATCTTCG ACGAGATCAA GGCCCGCACC AAGGAGACCG ACCTCTCGGT GCCCACCCGG
CGCGGCAACT GGTGGTACTA CGGGCGCAGC TTCGAGGGTA AGCAGTACGG CGTCCACTGC
CGTTGCCCTG TCGCAGATCC CGAGGACTGG ACACCTCCGG TGCTCGACGA GAACACCACC
ATCGACGGTG AACAGGTCCT GCTCGACGAG AACGTCGAAG CCGAGGGCCA CGAGTTCTTC
GCGCTCGGGG CGGCCACCGT CAGCATCGAC GGCAACGTCC TGGCCTACTC CGTCGACACA
GTGGGCGACG AGCGGTACAC ACTGCGGTTC AAAGACCTCC GCACCGGCGA GCAGTACGAC
GACACGATCG TCGGGATCGG TGCCGGCGCC ACCTGGGCCG CCGACAACCG CACCATCTAC
TACAGCACCG TCGACGACGC CTGGCGGCCC GACACCGTGT GGCGCCACCG GTTGGGGGCG
GGGTTGCCGG CCGAACGGGT GTACCACGAA CCCGACGAAC GGTACTGGCT GGGCGTCGGC
CGCACCCGCA GCGACAAGTA CCTGATCATC GCCGCGGGCA GCGCCGTCAC CTCGGAGGTG
CGCTACGCCG ACGCGAGTGA TCCGGAGGCG GAGTTCGCCG TGGTGTGGCC GCGCCGCGAC
GGGGTCGAGT ACTCCGTCGA ACACGCGGTG GTCGGCGGGG AGGACCGGTT CCTCATCATG
CACAACGACG GTGCGGAGAA CTTCGCACTC GTCGACGCGC CGGTGGCCGA TCCGGGCGCG
GCCCGCACGC TCATCGAACA CCGCGAGGAC GTGCGACTCG ACGCGGTCGA CGCGTTCGCC
GGACACCTCG TGCTGAGTTA CCGCGAAGAG GCACTGCCCA AGATCCAACT GTGGCCCCTG
GGCGCCGACG GATCATACGG TCGCGCCGAG GACATCACCT TCGACACCGA ACTGACCTCG
GCGGGGCTGG GCGGCAACCC GAACTGGGAC GCGCCGAAAA TGCGCATCGC GGCAACGTCT
TTCGTCACCC CGGTGCGGAT CTACGACCTC GACCTCGCAA CAGGGGAGCG CACGCTGCTG
CGCGAGCAGC CGGTGCTCGG CGGCTACCGG CCCGAGGACT ACGTCGAGCG CAGGGACTGG
GCGATCGCCC CGGACGGCGC GCGGGTGCCC ATCTCGATCG TCCACCGTGC CGGCCTTCAG
TTTCCGGCAC CGACCCTGCT CTACGGCTAC GGCGCCTACG AGTCGTGTGA GGACCCACGG
TTCTCGATCG CCCGGCTCTC CCTGCTGGAC CGCGGCATGG TGTTCGCCGT CGCCCACGTC
CGCGGCGGCG GGGAACTCGG CCGGCCGTGG TACGAACAGG GCAAGATGCT GGAGAAGGGC
AACACCTTCA CCGACTTCAT CGCGGTGGCA AGGCATCTCA TCGCCACCGA CACCACGCGG
CCGCAGAACC TGGTGGCGCT CGGCGGCAGC GCCGGTGGGT TGTTGATCGG TGCGGTCGCC
AACCTGGCGC CCGAACTGTT CGCCGGCTTC CTGGCACAGG TGCCGTTCGT CGATCCGTTG
ACCACTATCC TCGACCCGTC GCTGCCGCTG ACGGTCACCG AGTGGGACGA ATGGGGCAAC
CCGCTCGAGC ACGAGCACGT CTACCACTAC ATGAAGTCCT ACTCGCCGTA CGAGAACGTC
GTCGCCAAGG ACTATCCGCC GATCCTGGCG ATGACCTCGC TCAACGACAC CCGTGTGTAC
TACGTCGAAC CGGCGAAATG GATTGCCGCC CTTCGGCACA CGAAGACCGA CGGGAACCCG
GTGCTGCTCA AGACCGAGAT GAGCGCCGGA CACGGCGGGA TCAGCGGACG CTACGAACGG
TGGAAGGAAG CCGCGTTCCA GTACGCCTGG GTGCTCGCGA CCGCCGACCC GGAGACCTAC
GACAACCGAA GCCCGAACTA G
 
Protein sequence
MTVPPPQAKR VEQRREHHGD VFIDPYEWLR DKSDPEVLAH LEAENAYTEA QTAHLAPLRQ 
KIFDEIKART KETDLSVPTR RGNWWYYGRS FEGKQYGVHC RCPVADPEDW TPPVLDENTT
IDGEQVLLDE NVEAEGHEFF ALGAATVSID GNVLAYSVDT VGDERYTLRF KDLRTGEQYD
DTIVGIGAGA TWAADNRTIY YSTVDDAWRP DTVWRHRLGA GLPAERVYHE PDERYWLGVG
RTRSDKYLII AAGSAVTSEV RYADASDPEA EFAVVWPRRD GVEYSVEHAV VGGEDRFLIM
HNDGAENFAL VDAPVADPGA ARTLIEHRED VRLDAVDAFA GHLVLSYREE ALPKIQLWPL
GADGSYGRAE DITFDTELTS AGLGGNPNWD APKMRIAATS FVTPVRIYDL DLATGERTLL
REQPVLGGYR PEDYVERRDW AIAPDGARVP ISIVHRAGLQ FPAPTLLYGY GAYESCEDPR
FSIARLSLLD RGMVFAVAHV RGGGELGRPW YEQGKMLEKG NTFTDFIAVA RHLIATDTTR
PQNLVALGGS AGGLLIGAVA NLAPELFAGF LAQVPFVDPL TTILDPSLPL TVTEWDEWGN
PLEHEHVYHY MKSYSPYENV VAKDYPPILA MTSLNDTRVY YVEPAKWIAA LRHTKTDGNP
VLLKTEMSAG HGGISGRYER WKEAAFQYAW VLATADPETY DNRSPN
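
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The sketch below is one way to do that in plain Python, not the method the database itself uses. It assumes the sequences are pasted in as strings with the 10-character spacing shown here. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code. Its alternative start codons are not handled here, which is harmless for this gene because it begins with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. NCBI translation table 11
# uses the same codon-to-amino-acid assignments as the standard table.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed in this record.
first_codons = "ATGACCGTCC CGCCGCCGCA GGCCAAACGG"
print(translate(first_codons))  # MTVPPPQAKR
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.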