Gene Mmcs_3560 details

Gene Information

Locus tag: Mmcs_3560
Symbol:
ID: 4112392
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 3790093
End bp: 3792093
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table: 11
GC content: 72%
IMG OID: 638032695
Product: peptidase S9, prolyl oligopeptidase active site region
Protein accession: YP_640723
Protein GI: 108800526
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
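
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon, neither of which the record states explicitly. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a CDS, ASSUMING the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for Mmcs_3560.
length = gene_length_bp(3790093, 3792093)
print(length, protein_length_aa(length))  # prints: 2001 666
```

Under those assumptions the computed values match the 2001 bp and 666 aa reported in the record.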


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0779604
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGGACA CTGTCTCGCC GTTCGCGGAG CTCGACGCCT ACCTCGCACT CCCACGGGTG 
GCCGGGCTCG CCGTGTCTCC CGACGGGTCG CGGGTGGTGA CCACGATCAG CGAGCTCGAC
GACAAGCGCA CCGCGTTCGT CACCGCGATC TGGGAACTGG ACCCCGCCGG GAGGCGCCCC
GCCCGCCGCC TCACCCGCGG CGCGAAGGGG GAGCGGGCGC CGGCGTTCAC CCCGGGCGGA
GATCTGCTGT TCCTCGCGTC GCGCCCCACC GGGGATTCCG CCGAGGACGG GGACTCGCCG
CCCGCGGCGC TGTGGCGGCT GCCCGCACAG GGCGGGGAAG CGGTCGAGGA ACTCACCCCG
CCCGGCGGTG TAACGTCCGT GCGCTGCGCC CGGGCCGCGG GGGTCGCGGT GGTGAGCGCG
CCGATGCTGG TCTCGGCCGC CGACCTCGAC GACGACAAGA GGCTGCGTGC GCTGAGAAAG
GACAACAAGG TCTCCGCGGT CCTGCACAGC GGGTATCCGG TGCGCTCCTG GGATCACGAC
CTCGGACCCG ATCAGCCGCA TCTGCTCGAC GCCGCCGACG GCCGCGACCT CACACCCCGA
CCGGGCGGCG GTCTGCGCGA CGCCGCCGTC GACGTCAGCG ACGACGGCAG CTTTCTCGTC
ACCTCCTGGC AGAACCCGTC CGCCGGGGCG GCGCTGCGCG ACACCCTGGT ACGCGTCGAG
GTCGGCAGCG GTGAGCGCAC CACGGTCGCC GACGACCCCG GGGCCGATCT GGGCCATCCG
GCCATCTCCC CGGACGGCCG GATGCTGGCG TTCACCCGCG AGACGATCTC CACTCCGCTG
CAGGCCCCGC GAATCACATT GTGCTGCCTG CATTTCGGTG GTGAGGTGCG CGAACTGACA
GCCCACTGGG ACCGGTGGCC GACATCGGTC ACCTGGAGCC GCGACGGCGC GAAACTAATC
GTCACCGCCG ACGACAACGG CCGCGGGCCG ATCTTCCTGA TCGACCCGGA CACCGGCGCT
GTCACCAAGC TGACCGACGA CGACCACACC TACACCGACG TCGTCACCGC ACCCGGCGGT
GTGCTCTTCG CGATCCGCCA CAACTACGCC GCCCCACCGC ACCCGGTGCG CATCGACCCC
GACGGCACCG TCACCGTCCT GCCGACCGTC GACGCCCCGA GGCTGCCGGG CACGCTGAGC
GAGATCACCG CCACCGCACC CGACGGCGCC GCCGTGCGGT CCTGGCTGGC CCTGCCCGAC
GGCGCCGGCG AGAACGCCCC GGCGCCGCTG CTGCTGTGGA TCCACGGCGG ACCGCTCGCC
AGTTGGAACG CCTGGCACTG GCGGTGGAAT CCGTGGCTGA TGGTCGCGCA GGGCTACGCC
GTGCTGCTCC CCGATCCGGC CCTGTCCACC GGCTACGGCC AGGACTTCAT CCAGCGGGGC
TGGGGCGCCT GGGGCGAGGC GCCCTACACG GATCTGATGG CCGCCACCGA CGCGGCGACC
GCCGACCCGC GCATCGACGG CACCCGCACC GCGGCGATGG GTGGGTCGTT CGGCGGATAC
ATGGCCAACT GGATCGCCGG GCACACCGAC CGGTTCGATG CGATCGTCAC CCACGCCAGC
CTGTGGGCGC TCGATCAGTT CGGTCCCACC ACCGACGGCG CGTACTGGTG GGCGCGCGAG
ATGACACCCG AGATGGCCGA ACGCAATTCA CCGCACCTGT TCGTGGAGAA CATCGCCACG
CCGATGTTGG TGATCCACGG CGACAAGGAC TACCGGGTGC CGATCGGCGA AGCGCTGCGG
CTCTGGTACG AGCTGCTCAC CAGATCGCGC CTGCCCGCCG CGGACGACGG CACCGGACCG
CACCGCTTCC TCTACTACCC CTCGGAGAAC CACTGGGTGC TTGCTCCCCA GCATGCGAAG
CTCTGGTACC AGGTCGTCTT CGCATTCCTG GCCCGGCACG TGCTCGGGCG GGACGTCGAG
CTGCCCGAAC TGCTCGGGTA G
 
Protein sequence
MPDTVSPFAE LDAYLALPRV AGLAVSPDGS RVVTTISELD DKRTAFVTAI WELDPAGRRP 
ARRLTRGAKG ERAPAFTPGG DLLFLASRPT GDSAEDGDSP PAALWRLPAQ GGEAVEELTP
PGGVTSVRCA RAAGVAVVSA PMLVSAADLD DDKRLRALRK DNKVSAVLHS GYPVRSWDHD
LGPDQPHLLD AADGRDLTPR PGGGLRDAAV DVSDDGSFLV TSWQNPSAGA ALRDTLVRVE
VGSGERTTVA DDPGADLGHP AISPDGRMLA FTRETISTPL QAPRITLCCL HFGGEVRELT
AHWDRWPTSV TWSRDGAKLI VTADDNGRGP IFLIDPDTGA VTKLTDDDHT YTDVVTAPGG
VLFAIRHNYA APPHPVRIDP DGTVTVLPTV DAPRLPGTLS EITATAPDGA AVRSWLALPD
GAGENAPAPL LLWIHGGPLA SWNAWHWRWN PWLMVAQGYA VLLPDPALST GYGQDFIQRG
WGAWGEAPYT DLMAATDAAT ADPRIDGTRT AAMGGSFGGY MANWIAGHTD RFDAIVTHAS
LWALDQFGPT TDGAYWWARE MTPEMAERNS PHLFVENIAT PMLVIHGDKD YRVPIGEALR
LWYELLTRSR LPAADDGTGP HRFLYYPSEN HWVLAPQHAK LWYQVVFAFL ARHVLGRDVE
LPELLG
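
The sequences above are laid out in blocks of 10 characters, 60 per line, so they need to be joined before any analysis. The following is a minimal sketch of how the blocked text could be cleaned and a GC fraction computed for comparison with the GC content field. The helper names are hypothetical, and the database may compute its reported percentage differently (for example, in how it rounds).

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence by dropping all spaces and line breaks."""
    return "".join(blocked.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence from the record, as a small demonstration.
first_line = "ATGCCGGACA CTGTCTCGCC GTTCGCGGAG CTCGACGCCT ACCTCGCACT CCCACGGGTG"
print(len(clean_sequence(first_line)))  # prints: 60
print(round(gc_fraction(first_line), 3))
```

Passing the full gene sequence as one multi-line string to `gc_fraction` gives a value that can be compared with the reported GC content.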