Gene Mvan_4095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4095 
Symbol 
ID4648703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4386633 
End bp4388495 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content67% 
IMG OID639807560 
Productpeptidase S9 prolyl oligopeptidase 
Protein accessionYP_954878 
Protein GI120405049 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAGAAC AGGTCCGTCG AATCTACGGC GCGTCGATGT CGCCCGACGC CACGGCGTTC 
GCCCATCTCG TCGACGACGG TGGATATCCG AGGGCGGTGC AGCGGTTCCT CCGCGGGTGG
CGCGCGAGCT CCTCGCGCGA TGTCGAGCTG CCCGTCGAAG GCCCCGTGAC GCGGGTGATG
CACTCGGCAG ACGGTCACTG GTTGGCATGC CAGGTCGCGC CAGAAGGTGG CAGCCGCAGC
CAGATCTGGG TGGTGACAAC GGATCCCGAC GATCGTGATG CGCGAAGGAT CGACAGGTGG
CCGTCGGGGG TGGAGGGCAC CGCGGAGCTG ATCAGCTGGG ACGGCACGCT TGTCGCGGCC
ATCCTGACCG GTGAGGACGG GGTCGGCACG TCATGCCTGC TCGACCCGTC CGGGACGACG
GTCATCGTGC TGGACCGCAG ATCCGGTGGC CGACTCGTCG ACGCCTGGGG CGGCGCTTCG
CTCGTGCGGG TGGGCCCACG CGGCTACCGC GATCTGATCA TGTTGCGGGC GCTCACCGAA
ACCGCGCTGC TGCCTTACGA TCCCGGGTCG ACGACCGACA CCGGCATCAT CCTCGACGAC
CACAGTCCGC GCCGGTTGCG CTCGGGGCCG GAGGGTGAGA CCACCGACCT CTATTACCCG
GTCAAGGACT ACGGTCTCGA CAGCACCGAG GGCTATGTGC GGGCCTTGAT CCGCAGCGAG
AACGGCGCCC AGCACGCGCG CCTGCTCGAA GTGACGGTGA CCGCCGACGG CGTGTCCTAC
CAGGTGGTGG CCGAGCGACC CGGCTACGAA CTCGACGAGT TCACGGTCAG CGATGACCTG
TCGACGGTGG CGATGTTGTG GAATCTGCAC GGCGCCAGTG AATTACAGAT CCTGGAGTAC
GCCGATCAGA CGCTGCACGA CCCGATCCCG CTGCCGGGCA TGGTGGCCGG CGAGCTGAGC
ATCAGCGCGG GCGGGACGAT GCTGGCCATG ACGGTCGAAG GTCCGTCGAA GCCGCCCACT
GTCGAACTGG TCGATCCGCG GACCCGCGAA TGGGAGTTGG TCGACCGCGA ACCCAGCTGC
GGCCCGGTGT CGGACGACCC CACCCTGGAG ACGATCATCG CCCGCGACGG CCTGACGTTC
AGCGGCTGGC TGTTCCGGCC TCCCGAAGGG GTGGAGACCA TCGGTGCGAT GTTGTTCCTG
CACGGTGGGC CCGAGGGGCA GGGCAGGCCC GGGTACAACG AGTTCTTTCC CGCACTGCTG
GACGAAGGGA TCTGCGTCTT CCTGCCCAAC GTGCGCGGAT CCGGCGGATT CGGGCGGGCC
TTCATGCACG CCGACGACCG CGAGCGCCGC TTCGCGGCCA TCGATGACGT CGCCGACGCC
GCGCGTTTCC TCGTGGGCAA CGGGCACGCG CCTGCCGGGC GGGTGGCCTG CTGCGGCTGG
TCGTACGGCG GCTACCTGAC ACAGGCGGCG CTGACCTTTC ACCCCGACGA GTTCGCCGCG
GGCATCAGCA TCTGCGGAAT GAGCGACCTG AACAGCTGGT ACCGCAACAC CGAGCAGTGG
ATCGCGGCGG CGGCGTACCC GAAGTACGGG CACCCGGTCA GCGATCAGGA TCTACTCGAA
CGGTTGTCTC CACTGCCGCG GGCCGACAAG GTGACTGCAC CGCTGCTGTT GGTGCACGGG
CTCAACGACA CCAATGTGCC GCCCGGTGAA TCCCAACAGA TGTACGACGC GTTGACCGAA
TTGGGCCGCC GGGTCGAGCT GTTGACGTTC GAGGATGACG GGCACGAGAT CGACAAACGC
GAGAATCGAG CCGTATTGCG GAAAGCCATG ACGGCGTGGC TCGTCGAGGC TTTCGCGGAT
TGA
 
Protein sequence
MAEQVRRIYG ASMSPDATAF AHLVDDGGYP RAVQRFLRGW RASSSRDVEL PVEGPVTRVM 
HSADGHWLAC QVAPEGGSRS QIWVVTTDPD DRDARRIDRW PSGVEGTAEL ISWDGTLVAA
ILTGEDGVGT SCLLDPSGTT VIVLDRRSGG RLVDAWGGAS LVRVGPRGYR DLIMLRALTE
TALLPYDPGS TTDTGIILDD HSPRRLRSGP EGETTDLYYP VKDYGLDSTE GYVRALIRSE
NGAQHARLLE VTVTADGVSY QVVAERPGYE LDEFTVSDDL STVAMLWNLH GASELQILEY
ADQTLHDPIP LPGMVAGELS ISAGGTMLAM TVEGPSKPPT VELVDPRTRE WELVDREPSC
GPVSDDPTLE TIIARDGLTF SGWLFRPPEG VETIGAMLFL HGGPEGQGRP GYNEFFPALL
DEGICVFLPN VRGSGGFGRA FMHADDRERR FAAIDDVADA ARFLVGNGHA PAGRVACCGW
SYGGYLTQAA LTFHPDEFAA GISICGMSDL NSWYRNTEQW IAAAAYPKYG HPVSDQDLLE
RLSPLPRADK VTAPLLLVHG LNDTNVPPGE SQQMYDALTE LGRRVELLTF EDDGHEIDKR
ENRAVLRKAM TAWLVEAFAD