Gene Mvan_0785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_0785 
Symbol 
ID4643581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp823055 
End bp825085 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content68% 
IMG OID639804285 
Productprolyl oligopeptidase 
Protein accessionYP_951629 
Protein GI120401800 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1505] Serine proteases of the peptidase family S9A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.770159 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGGCG TGACCGACAC CGACAGCAAG CAGACCGATC CCTACCTGTG GCTGGAGGAG 
GTCACCGGCG ACGATGCGCT GGCCTGGGTG CGTAAGCACA ACGACCCGAC GGTGGCCGAC
CTCGGCGGTG AACGCTTCGA GCAGATGCGC GCCGACGCGC TGGAGGTGCT CGACACCGAC
GCGCGGATTC CGTACGTCCG GCGCCGGGGT GCCCACCTCT ACAACTTCTG GCGCGACGCG
ACCAACCCCA AGGGGCTGTG GCGGCGCACC ACGCTGGAGA GCTACCTGAC CGAGAAGCCG
GCCTGGGACG TCATCATCGA CGTCGATGAG CTGGCCCGGG CTGACGGTCA GAACTGGGTG
TGGGCCGGCG CCGACGTCAT CGAACCGGAT CATTCGCTGG CGCTGATCAG CCTGTCGCGC
GGTGGCTCCG ATGCCGCCGT GGTGAGGGAA TTCGACATGC GGACACGGGA ATTCGTCACC
GGCGGGTTCG AGCTGTCCGA GGCCAAGTCG CAGGTCTCGT GGGAGGACGC CGACACGCTG
CTGGTCGGCA CCGACTTCGG CGAGGGATCG CTGACCGAAT CGGGATATCC CCGTCTGGTG
AAGCGGTGGC GGCGCGGGCA GCCGCTCGCC GAGGCCGAGA CCGTGTTCAG CGGTTCCGAA
TCGGACGTGG TGGTGGCCGG TTCCCGGGAC CGCACCGACG GATTCGAACG CACGCTGGTC
AGCCGCGCCC TGGACTTCTT CAACGAGCAG GTCTACGAGC TGCGCGACGG CGAGCTGATC
CGCATCGACA CCCCCACCGA TGCCAGCATC TCGATCCACC GGCAGTGGCT GCTCATCGAA
CTGCGCACCG ATTGGAGCTA CCGATTCCAG AATTATCCCG CCGGATCGCT GCTGGCCGCA
GACTACGAGG AGTTCCTCGA CGGCAGGGGC GAAGTGCAGG TGGTGTTCAC CCCTGACGCG
CACACCTGCC TGCACCACTA CGCCTGGACC CGCGACCGCC TGGTCGTCGT CACGCTGGCT
GACGTGGCCA GCCGGGTGCA GGTCTACACA CCCGGGACGT GGACTGCCGA ACCGGTGGCC
GACCTGCCGC AGAACACGAA CACCACGATC GCCGCGGCGG ACCCCCTCGG TGACGAGATC
TTCCTGGACT CTTCCGGTTT CGACACCCCG TCGCGACTCC TGCACGGCAC GGCCGGCGGA
CCGCTGAGCG AGATCAAGCA GGCGCCGTCG TTCTTCGACT CCGCCGACCT CGAGGTGTCG
CAGTACTTCG CCACCTCCGA CGACGGAACC CGGGTCCCGT ACTTCGTGGT CGGCCACCGG
CACCGCCAAG GTCCCGGACC CACGCTGCTG GGCGGGTACG GCGGATTCGA GGTGGCCCGC
ACCCCGGGCT ACGACGGCGT GCTGGGCCGG TTGTGGCTGG CCCGCGGCGG CACCTACGTG
CTGGCCAACA TCCGCGGCGG CGGCGAATAC GGACCGACAT GGCACACCCA GGCCATGCGA
GAGGGGCGGC ACCTGGTGGC CGAGGATTTC GCTTCTGTCG CAAGAGATCT GGCGGCGCGT
GGGATCACCA CCGTCGAGCA GCTGGGTGCG CAGGGCGGCA GCAACGGCGG CCTGTTGATG
GGCATCATGC TCACCAAATA CCCCGAGCTG TTCGGAGCGC TGGTGTGCAG TGTGCCGCTG
CTCGACATGA AACGCTTCCA CCTGCTGCTG GCCGGCGCGT CCTGGGTGGC CGAGTACGGA
GATCCGGACA ACCCCGACGA CTGGGCGTTC ATCTCCGAAT ATTCGCCGTA CCAGAACATC
TCCGCCGACC GGCGCTATCC ACCGGTGCTG ATCACCACCT CCACCCGCGA CGACCGGGTG
CACCCTGGTC ACGCGAGGAA GATGACGGCG GCGCTGGAGG CGGCCGGGCA TCCGGTGCGG
TACTACGAGA ACATCGAGGG CGGTCACGCG GGAGCGTCCG ACAACCCTCA GATCGCGTTC
CGGTCCGCGC TCATCTACGA GTTCCTGCAC CGCACGCTGA ACGGAAACTG A
 
Protein sequence
MVGVTDTDSK QTDPYLWLEE VTGDDALAWV RKHNDPTVAD LGGERFEQMR ADALEVLDTD 
ARIPYVRRRG AHLYNFWRDA TNPKGLWRRT TLESYLTEKP AWDVIIDVDE LARADGQNWV
WAGADVIEPD HSLALISLSR GGSDAAVVRE FDMRTREFVT GGFELSEAKS QVSWEDADTL
LVGTDFGEGS LTESGYPRLV KRWRRGQPLA EAETVFSGSE SDVVVAGSRD RTDGFERTLV
SRALDFFNEQ VYELRDGELI RIDTPTDASI SIHRQWLLIE LRTDWSYRFQ NYPAGSLLAA
DYEEFLDGRG EVQVVFTPDA HTCLHHYAWT RDRLVVVTLA DVASRVQVYT PGTWTAEPVA
DLPQNTNTTI AAADPLGDEI FLDSSGFDTP SRLLHGTAGG PLSEIKQAPS FFDSADLEVS
QYFATSDDGT RVPYFVVGHR HRQGPGPTLL GGYGGFEVAR TPGYDGVLGR LWLARGGTYV
LANIRGGGEY GPTWHTQAMR EGRHLVAEDF ASVARDLAAR GITTVEQLGA QGGSNGGLLM
GIMLTKYPEL FGALVCSVPL LDMKRFHLLL AGASWVAEYG DPDNPDDWAF ISEYSPYQNI
SADRRYPPVL ITTSTRDDRV HPGHARKMTA ALEAAGHPVR YYENIEGGHA GASDNPQIAF
RSALIYEFLH RTLNGN