Gene Mmcs_0623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_0623 
Symbol 
ID4109469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp685874 
End bp687865 
Gene Length1992 bp 
Protein Length663 aa 
Translation table11 
GC content69% 
IMG OID638029749 
Productprolyl oligopeptidase 
Protein accessionYP_637800 
Protein GI108797603 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1505] Serine proteases of the peptidase family S9A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACGGTAG ACGACGCGCA CCTGTGGCTC GAGGACATCA CCGGCGACGA CGCCCTGGAC 
TGGGTGCGAC GGCACAACGA ACCGACCCTG GCCGACCTGG GCGGTGAGCG CTTCGAGCAG
ATGCGCGCCG AGGCGCTCGA GGTGCTCGAC ACCGACGCCC GCATCCCCTA CGTCCGGCGC
CGCGGTGAGT ACCTCTACAA CTTCTGGCGC GATGCGGCCA ACCCGCGGGG GCTGTGGCGA
CGCACCACGC TGGAGAGCTA CCGCACCGAG GAGCCCGACT GGGAGGTGGT CATCGACGTC
GACGCACTCG CCCGCGCCGA CGACGAGAAC TGGGTGTGGG CGGGCGCCGA CGTCATCGAC
CCCGACCACA CCCGTGCGCT GATCAGCCTC TCGCGCGGCG GTGCCGACGC CGCGATCGTG
CGCGAATTCG ACATGGTGTC AATGGAGTTC GTCGACGGCG GGTTCGAGCT GCCGGAGGCC
AAGACGGCGA TCACGTGGGA GGACGAGGAC ACCGTCCTGG TGGGAACGGA TTTCGGGGAG
GGCGCCCTGA CCGAATCCGG TTATCCGAGG CTGGTCAAGC GGTGGCGTCG CGGTACGCCG
CTCACCGAGG CCGAGACGGT CTACAGCGGC GAGCCCGCCG ACGTCATCGT CACCGCGTCG
GTCGATCGGA CACCGGGCTT CGAGCGCACC GTGGTGCGCC GCGCCGTCGA CTTCTTCAAC
GACGAGGTGT ACGAGTTGCG CGCCGGCGAA CTCAGCCGCA TCGACGCCCC GACCGACGCG
ACCGTGTCGG CACACCGCGA CTGGCTGCTC ATCGAGCTGC GCAGCGACTG GGACGGCTAC
CGGGCAGGAT CGCTGCTGGC CGCGAAATAC GACGAATACC TCGATGGCAC AAGGGCTCTG
CAGGTGGTGT TCGAACCCGA TGAGCACACG TGCCTGCACC ACTACGCGTG GACCAAGGAC
CGACTCGTGG TCGTCACGCT GGCCGACGTC GCGAGCCGCG TCGAGGTGTA CACCCCCGGC
GAGTGGACGG CGCAGCCCGT GCCGGGACTG CCGGACAACA CCAACACGGT GATCGTGGCG
GCCGACGACC TGGGCGACGA GATCTTCCTG GACTCCAGCG GTTTCGACAC CCCGTCGCGG
CTCCTGCAGG GCGCGGCCGG CGGTGAACTC ACCGAGATCA AGCGGGCGCC GTCGTTCTTC
GACGCCGCCG ATCTCAAGGT CGACCAGCAC TTCGCGACAT CAGCCGACGG CACCAAGATC
CCGTACTTCG TTGTCGGCCA CCGGCATCAG CAGGCGCCCG GGCCGACGCT GCTGGGCGGT
TACGGCGGGT TCGAGGTCGC GCGCACACCC GGTTACGACG GTGTGCTCGG CCGGCTCTGG
TTGTCTCGGG GCGGCACCTA CGTGCTGGCC AACATCCGCG GCGGCGGGGA GTACGGACCG
ACGTGGCATA CGCAGGCGAT GCGCGAGGGC CGCCACCTGG TGGGTGAGGA CTTCGCCGCC
GTCGCAGCCG ATCTCGTCGA ACGCGGAATC ACGACGGTCG ACCGGTTGGG CGCGCAGGGC
GGCAGCAACG GCGGGCTGCT GATGGGGATC ATGCTCACGC AGTACCCGGA GTTGTTCGGC
GCGCTGGTCT GCAGCGTGCC GCTGCTCGAC ATGCGCCGGT TCCACCTGCT GCTCGCCGGG
GCGTCCTGGG TGGCCGAGTA CGGCAACCCG GATGACCCGG ACGACTGGGA GTTCATCTCG
AAATACTCTC CCTATCAGAA CATCTCGGCC GAGCGCCGAT ACCCGCCGGT GCTGATCACC
ACCTCCACAC GCGACGACCG CGTGCATCCG GGACATGCGC GCAAGATGAC CGCAGCGCTC
GAGGATGCCG GACAGCCGGT GCAGTACTAC GAGAACATCG AGGGTGGGCA CGGCGGCGCC
GCGGACAATT CGCAGGCTGC GTTCCGCGCG GCGCTGATCT ACGAGTTCCT GTGGCGGAAG
CTGGGCGGAT AG
 
Protein sequence
MTVDDAHLWL EDITGDDALD WVRRHNEPTL ADLGGERFEQ MRAEALEVLD TDARIPYVRR 
RGEYLYNFWR DAANPRGLWR RTTLESYRTE EPDWEVVIDV DALARADDEN WVWAGADVID
PDHTRALISL SRGGADAAIV REFDMVSMEF VDGGFELPEA KTAITWEDED TVLVGTDFGE
GALTESGYPR LVKRWRRGTP LTEAETVYSG EPADVIVTAS VDRTPGFERT VVRRAVDFFN
DEVYELRAGE LSRIDAPTDA TVSAHRDWLL IELRSDWDGY RAGSLLAAKY DEYLDGTRAL
QVVFEPDEHT CLHHYAWTKD RLVVVTLADV ASRVEVYTPG EWTAQPVPGL PDNTNTVIVA
ADDLGDEIFL DSSGFDTPSR LLQGAAGGEL TEIKRAPSFF DAADLKVDQH FATSADGTKI
PYFVVGHRHQ QAPGPTLLGG YGGFEVARTP GYDGVLGRLW LSRGGTYVLA NIRGGGEYGP
TWHTQAMREG RHLVGEDFAA VAADLVERGI TTVDRLGAQG GSNGGLLMGI MLTQYPELFG
ALVCSVPLLD MRRFHLLLAG ASWVAEYGNP DDPDDWEFIS KYSPYQNISA ERRYPPVLIT
TSTRDDRVHP GHARKMTAAL EDAGQPVQYY ENIEGGHGGA ADNSQAAFRA ALIYEFLWRK
LGG