Gene Mkms_0636 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_0636 
Symbol 
ID4615019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp693178 
End bp695169 
Gene Length1992 bp 
Protein Length663 aa 
Translation table11 
GC content69% 
IMG OID639790311 
Productprolyl oligopeptidase 
Protein accessionYP_936642 
Protein GI119866690 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1505] Serine proteases of the peptidase family S9A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGTAG ACGACGCGCA CCTGTGGCTC GAGGACATCA CCGGCGACGA CGCCCTGGAC 
TGGGTGCGAC GGCACAACGA ACCGACCCTG GCCGACCTGG GCGGTGAGCG CTTCGAGCAG
ATGCGCGCCG AGGCGCTCGA GGTGCTCGAC ACCGACGCCC GCATCCCCTA CGTCCGGCGC
CGCGGTGAGT ACCTCTACAA CTTCTGGCGC GATGCGGCCA ACCCGCGGGG GCTGTGGCGA
CGCACCACGC TGGAGAGCTA CCGCACCGAG GAGCCCGACT GGGAGGTGGT CATCGACGTC
GACGCACTCG CCCGCGCCGA CGACGAGAAC TGGGTGTGGG CGGGCGCCGA CGTCATCGAC
CCCGACCACA CCCGTGCGCT GATCAGCCTC TCGCGCGGCG GTGCCGACGC CGCGATCGTG
CGCGAATTCG ACATGGTGTC AATGGAGTTC GTCGACGGCG GGTTCGAGCT GCCGGAGGCC
AAGACGGCGA TCACGTGGGA GGACGAGGAC ACCGTCCTGG TGGGAACGGA TTTCGGGGAG
GGCGCCCTGA CCGAATCCGG TTATCCGAGG CTGGTCAAGC GGTGGCGTCG CGGTACGCCG
CTCACCGAGG CCGAGACGGT CTACAGCGGC GAGCCCGCCG ACGTCATCGT CACCGCGTCG
GTCGATCGGA CACCGGGCTT CGAGCGCACC GTGGTGCGCC GCGCCGTCGA CTTCTTCAAC
GACGAGGTGT ACGAGTTGCG CGCCGGCGAA CTCAGCCGCA TCGACGCCCC GACCGACGCG
ACCGTGTCGG CACACCGCGA CTGGCTGCTC ATCGAGCTGC GCAGCGACTG GGACGGCTAC
CGGGCAGGAT CGCTGCTGGC CGCGAAATAC GACGAATACC TCGATGGCAC AAGGGCTCTG
CAGGTGGTGT TCGAACCCGA TGAGCACACG TGCCTGCACC ACTACGCGTG GACCAAGGAC
CGACTCGTGG TCGTCACGCT GGCCGACGTC GCGAGCCGCG TCGAGGTGTA CACCCCCGGC
GAGTGGACGG CGCAGCCCGT GCCGGGACTG CCGGACAACA CCAACACGGT GATCGTGGCG
GCCGACGACC TGGGCGACGA GATCTTCCTG GACTCCAGCG GTTTCGACAC CCCGTCGCGG
CTCCTGCAGG GCGCGGCCGG CGGTGAACTC ACCGAGATCA AGCGGGCGCC GTCGTTCTTC
GACGCCGCCG ATCTCAAGGT CGACCAGCAC TTCGCGACAT CAGCCGACGG CACCAAGATC
CCGTACTTCG TTGTCGGCCA CCGGCATCAG CAGGCGCCCG GGCCGACGCT GCTGGGCGGT
TACGGCGGGT TCGAGGTCGC GCGCACACCC GGTTACGACG GTGTGCTCGG CCGGCTCTGG
TTGTCTCGGG GCGGCACCTA CGTGCTGGCC AACATCCGCG GCGGCGGGGA GTACGGACCG
ACGTGGCATA CGCAGGCGAT GCGCGAGGGC CGCCACCTGG TGGGTGAGGA CTTCGCCGCC
GTCGCAGCCG ATCTCGTCGA ACGCGGAATC ACGACGGTCG ACCGGTTGGG CGCGCAGGGC
GGCAGCAACG GCGGGCTGCT GATGGGGATC ATGCTCACGC AGTACCCGGA GTTGTTCGGC
GCGCTGGTCT GCAGCGTGCC GCTGCTCGAC ATGCGCCGGT TCCACCTGCT GCTCGCCGGG
GCGTCCTGGG TGGCCGAGTA CGGCAACCCG GATGACCCGG ACGACTGGGA GTTCATCTCG
AAATACTCTC CCTATCAGAA CATCTCGGCC GAGCGCCGAT ACCCGCCGGT GCTGATCACC
ACCTCCACAC GCGACGACCG CGTGCATCCG GGACATGCGC GCAAGATGAC CGCAGCGCTC
GAGGATGCCG GACAGCCGGT GCAGTACTAC GAGAACATCG AGGGTGGGCA CGGCGGCGCC
GCGGACAATT CGCAGGCTGC GTTCCGCGCG GCGCTGATCT ACGAGTTCCT GTGGCGGAAG
CTGGGCGGAT AG
 
Protein sequence
MTVDDAHLWL EDITGDDALD WVRRHNEPTL ADLGGERFEQ MRAEALEVLD TDARIPYVRR 
RGEYLYNFWR DAANPRGLWR RTTLESYRTE EPDWEVVIDV DALARADDEN WVWAGADVID
PDHTRALISL SRGGADAAIV REFDMVSMEF VDGGFELPEA KTAITWEDED TVLVGTDFGE
GALTESGYPR LVKRWRRGTP LTEAETVYSG EPADVIVTAS VDRTPGFERT VVRRAVDFFN
DEVYELRAGE LSRIDAPTDA TVSAHRDWLL IELRSDWDGY RAGSLLAAKY DEYLDGTRAL
QVVFEPDEHT CLHHYAWTKD RLVVVTLADV ASRVEVYTPG EWTAQPVPGL PDNTNTVIVA
ADDLGDEIFL DSSGFDTPSR LLQGAAGGEL TEIKRAPSFF DAADLKVDQH FATSADGTKI
PYFVVGHRHQ QAPGPTLLGG YGGFEVARTP GYDGVLGRLW LSRGGTYVLA NIRGGGEYGP
TWHTQAMREG RHLVGEDFAA VAADLVERGI TTVDRLGAQG GSNGGLLMGI MLTQYPELFG
ALVCSVPLLD MRRFHLLLAG ASWVAEYGNP DDPDDWEFIS KYSPYQNISA ERRYPPVLIT
TSTRDDRVHP GHARKMTAAL EDAGQPVQYY ENIEGGHGGA ADNSQAAFRA ALIYEFLWRK
LGG