Gene Mjls_3565 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_3565 
Symbol 
ID4879276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp3757554 
End bp3759554 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content72% 
IMG OID640140869 
Productpeptidase S9 prolyl oligopeptidase 
Protein accessionYP_001071833 
Protein GI126436142 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.533433 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGACA CTGTCTCGCC GTTCGCGGAG CTCGACGCCT ACCTCGCACT CCCACGGGTG 
GCCGGGCTCG CCGTGTCTCC CGACGGGTCG CGGGTGGTGA CCACGATCAG CGAGCTCGAC
GACAAGCGCA CCGCGTTCGT CACCGCGATC TGGGAACTGG ACCCCGCCGG GAGGCGCCCC
GCCCGCCGCC TCACCCGCGG CGCGAAGGGG GAGCGGGCGC CGGCGTTCAC CCCGGGCGGA
GATCTGCTGT TCCTCGCGTC GCGCCCCACC GGGGATTCCG CCGAGGACGG GGACTCGCCG
CCCGCGGCGC TGTGGCGGCT GCCCGCACAG GGCGGGGAAG CGGTCGAGGA ACTCACCCCG
CCCGGCGGTG TAACGTCCGT GCGCTGCGCC CGGGCCGCGG GGGTCGCGGT GGTGAGCGCG
CCGATGCTGG TCTCGGCCGC CGACCTCGAC GACGACAAGA GGCTGCGTGC GCTGAGAAAG
GACAACAAGG TCTCCGCGGT CCTGCACAGC GGGTATCCGG TGCGCTCCTG GGATCACGAC
CTCGGACCCG ATCAGCCGCA TCTGCTCGAC GCCGCCGACG GCCGCGACCT CACACCCCGA
CCGGGCGGCG GTCTGCGCGA CGCCGCCGTC GACGTCAGCG ACGACGGCAG CTTTCTCGTC
ACCTCCTGGC AGAACCCGTC CGCCGGGGCG GCGCTGCGCG ACACCCTGGT ACGCGTCGAG
GTCGGCAGCG GTGAGCGCAC CACGGTCGCC GACGACCCCG GGGCCGATCT GGGCCATCCG
GCCATCTCCC CGGACGGCCG GATGCTGGCG TTCACCCGCG AGACGATCTC CACTCCGCTG
CAGGCCCCGC GAATCACATT GTGCTGCCTG CATTTCGGTG GTGAGGTGCG CGAACTGACA
GCCCACTGGG ACCGGTGGCC GACATCGGTC ACCTGGAGCC GCGACGGCGC GAAACTAATC
GTCACCGCCG ACGACAACGG CCGCGGGCCG ATCTTCCTGA TCGACCCGGA CACCGGCGCT
GTCACCAAGC TGACCGACGA CGACCACACC TACACCGACG TCGTCACCGC ACCCGGCGGT
GTGCTCTTCG CGATCCGCCA CAACTACGCC GCCCCACCGC ACCCGGTGCG CATCGACCCC
GACGGCACCG TCACCGTCCT GCCGACCGTC GACGCCCCGA GGCTGCCGGG CACGCTGAGC
GAGATCACCG CCACCGCACC CGACGGCGCC GCCGTGCGGT CCTGGCTGGC CCTGCCCGAC
GGCGCCGGCG AGAACGCCCC GGCGCCGCTG CTGCTGTGGA TCCACGGCGG ACCGCTCGCC
AGTTGGAACG CCTGGCACTG GCGGTGGAAT CCGTGGCTGA TGGTCGCGCA GGGCTACGCC
GTGCTGCTCC CCGATCCGGC CCTGTCCACC GGCTACGGCC AGGACTTCAT CCAGCGGGGC
TGGGGCGCCT GGGGCGAGGC GCCCTACACG GATCTGATGG CCGCCACCGA CGCGGTGACC
GCCGACCCGC GCATCGACGG CACCCGCACC GCGGCGATGG GTGGGTCGTT CGGCGGATAC
ATGGCCAACT GGATCGCCGG GCACACCGAC CGGTTCGATG CGATCGTCAC CCACGCCAGC
CTGTGGGCGC TCGATCAGTT CGGTCCCACC ACCGACGGCG CGTACTGGTG GGCGCGCGAG
ATGACACCCG AGATGGCCGA ACGCAATTCA CCGCACCTGT TCGTGGAGAA CATCGCCACG
CCGATGTTGG TGATCCACGG CGACAAGGAC TACCGGGTGC CGATCGGCGA AGCGCTGCGG
CTCTGGTACG AGCTGCTCAC CAGATCGCGC CTGCCCGCCG CGGACGACGG CACCGGACCG
CACCGCTTCC TCTACTACCC CTCGGAGAAC CACTGGGTGC TTGCTCCCCA GCATGCGAAG
CTCTGGTACC AGGTCGTCTT CGCATTCCTG GCCCGGCACG TGCTCGGGCG GGACGTCGAG
CTGCCCGAAC TGCTCGGGTA G
 
Protein sequence
MPDTVSPFAE LDAYLALPRV AGLAVSPDGS RVVTTISELD DKRTAFVTAI WELDPAGRRP 
ARRLTRGAKG ERAPAFTPGG DLLFLASRPT GDSAEDGDSP PAALWRLPAQ GGEAVEELTP
PGGVTSVRCA RAAGVAVVSA PMLVSAADLD DDKRLRALRK DNKVSAVLHS GYPVRSWDHD
LGPDQPHLLD AADGRDLTPR PGGGLRDAAV DVSDDGSFLV TSWQNPSAGA ALRDTLVRVE
VGSGERTTVA DDPGADLGHP AISPDGRMLA FTRETISTPL QAPRITLCCL HFGGEVRELT
AHWDRWPTSV TWSRDGAKLI VTADDNGRGP IFLIDPDTGA VTKLTDDDHT YTDVVTAPGG
VLFAIRHNYA APPHPVRIDP DGTVTVLPTV DAPRLPGTLS EITATAPDGA AVRSWLALPD
GAGENAPAPL LLWIHGGPLA SWNAWHWRWN PWLMVAQGYA VLLPDPALST GYGQDFIQRG
WGAWGEAPYT DLMAATDAVT ADPRIDGTRT AAMGGSFGGY MANWIAGHTD RFDAIVTHAS
LWALDQFGPT TDGAYWWARE MTPEMAERNS PHLFVENIAT PMLVIHGDKD YRVPIGEALR
LWYELLTRSR LPAADDGTGP HRFLYYPSEN HWVLAPQHAK LWYQVVFAFL ARHVLGRDVE
LPELLG