Gene Mvan_4077 details

Gene Information

Locus tag: Mvan_4077
Symbol:
ID: 4649289
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 4368211
End bp: 4370775
Gene Length: 2565 bp
Protein Length: 854 aa
Translation table: 11
GC content: 66%
IMG OID: 639807542
Product: peptidase S15
Protein accession: YP_954860
Protein GI: 120405031
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
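
The coordinate and length fields above are related by simple arithmetic, and a short sketch can make the relationship explicit. It assumes the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above.
print(cds_lengths(4368211, 4370775))  # (2565, 854)
```

Under these assumptions the result agrees with the listed Gene Length (2565 bp) and Protein Length (854 aa).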


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.122952
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGCAG CGAAGTATGT CGGTCGGGTC GGTGGGTTGG CGGTCGCCCT GGGGGTGGGT 
GTCGCGATTC TCACGGGGCA GGGTGTCGCC TACGCCGACG ACACGGGAAC TGCGAGCTCG
GAGACCACAG CGACCGAGAA CAGCTCTGCC GAGGGCCCGG CCAGGCAAGG CGCCGCTTCG
GACAGCGATG TCGGGACCGG CGCGGACGAC GAAGCGTCCG ATGTCGGGAC CGTCGCGGAC
GACGAGGCGT CCGATGTCGG GACCGTCGCG GACGACGAGG CGTCCGATGT CGGGACCGTC
GCGGACGACG GGGCGTCCGA TGTCGAGACC GCCGAGGACG ACGAGGCGTC CGACGTCGAA
GAGCAGCAGT CGGGCGGCGG CACGGGTGAA GATTCGACGA CACCCGCAGA ACCTGTCGAC
ACCGCCGACG ACACCGCGTC GCTCTCGGCG CCGGTAGTGA GTACCACGGT CGTGTCACCG
CCGGCGTCGG CCGACCTCGC GCTCGACCCG ATCGCCGACG ACAGCGGAAC GGCCCCGATC
GGCGGATCGG CCGCCGATCT ACTGCTGCTG GCCGGAACCC GACGGGAACT GCAGGCGCAG
CAGGAGGCGG CGCTGGAGAT TCCGCCCTAC TCCGTCGGTG TGACCGCGGG GGTGATCGGC
GGTTGCGTCG TCGGGCCCAC CTGCACCACC GCGGACGGAT CGACGTTCAC CCTCATCAAG
GACCCGGACA AGGGCGGCAA GCTCGCCCTG GATCCGACGA CAGGCGCTTT CAGGTTTCTA
CCCTTCGCGT CGGAGAAGAA CCCCGACGGT TCACAGAATG CCGACGGGCC GTCCGGGCAG
GAGACGTTCT CGGTTCTCGT TGCGCAGAAC ACGCAGTTCA CGACGTTCCT GATCGGGCTG
CCGATCGTGG GATCGTCGAT TCTGGCGCCG GTCATCATGA CCCTGCAACA GATTCCGGTC
ATCAGCACCG TCCTCGCACC GTTGATCGGC ACCGCGACGC GACAGGACAT CCTGGTCGAC
ATCGACTTCT TGCGGACTCC GGCAGAAACG CCGGTGGCCT ACACCACCAT GGTGACGTCG
TGGGACGGCA CGCAGATCAG CACCAATTTC TTCCCGGCGA TCGCCGACGG GAACCCTGAT
GACGGCAATC CCGGATACGA GACCATCCTC TACGGGCCGG GATTGGCGCA GGCGGGCGCC
ACCGATCCGG AGAACCCGTT CGTGTCGACA TTCCGGCTGG GCGGCTACAA CGTGGTCACG
TGGGACCCAC GTGGTGAATT CGCCTCCGGT GGCGTGCTTC AGCTCGACAG CCCGCAGTAC
GAGGGGCAGG ACGTCTCCCA GCTCATATCC TGGGCGGCCG GTCTCGACGG CGTCGAACTC
GACGGAGTCG GTGATCCGAC GCTGGGCATG GTCGGTGTCT CCTACGGCGG CGGCATCCAG
TTCGTGACCG CCGCCGGGGA CAACCGGGTC GACGCGATCG CGCCGGGCTG GGCGTGGAAC
ACCCTGCCGG ATTCGCTCTA CCCGGACCGG TCCTTCAAGA CGGCCTACTC GTCCCTGCTG
TTGCTGGGTC TGGTGACCAC CGGCGCCAGG ATCAACCCGC AGATCTACGG CGGCATCATC
ACCGGTGCGA TTCTCGGTGT CCTCACCCCC GGTCAGATTC AACTGCTCCA GACCAGTGGC
CCGGGCCAGA CGGTTCGGGG GATTTCGGCA CCGACCCTGA TCATCCAGGG CACGGTCGAC
GTGTTGTTCC CGTTGCAGCA GTCCATAGTC AACGAGGGGT TGCTCGCGCT GAACGAGGAC
GTCAAGATGC TCTGGTTCTG CGGCGGGCAC GGATCGTGCC TTCCCGGACA AGGCAATGGC
GAGGCCGATT CGGCATGGGT GATGCGCGAG ACGCTGGCGT GGATGTACCG CTACGTCAAA
GACGACGGCT CGCAGGACGA CGTCGTCTTC GAATGGACCG ATCAGTACGG GGACCGGTGG
ACCTCCGAGG TCTCCCCGAC CGAGGCCGGT TTCTACGACC CGCCCGGTGT CATTCCGACG
ACCGTGTGGG ACACCGGCAA AGTGCTGCCG ATCATCCCGC TCATCGGTGG TTCCGGTCCC
AACCCGGAGG TTCCGTTGCC GTATTCGTTG GGTGACGGAT CCATCGCCAG TAACGCGGTC
ACCATCGCGC TGGAGAATCC CGTCGCCGAG GCCAACGTCG TCGGCTCGCC CTCGGTGGTG
ATCCATTACA GCGGTTTCGG AACCAGCCGG CACATCTACG CACAGGTGGT CGACCGGAGC
ACCGGACTCG TCGTCGGCAA CATCGTCACC CCGGTTCCGG TGACGCTGAA CGGCCGGGAC
CAGGTCGTCA CGGTGGACCT GAATGACATC GCCTACACCG TGGGGCCCGA CAGCGAACTC
GAACTGCAGA TATTCACCAC CGCGACGTCC TACTTCAACG CGACGCAGTT CGGCTTCATC
AACGTGGAGA GCGTCGAGGT GACGATGCCG ACCACCACCA AGGGGACCAA TCAGGGTCCC
AACCCGAGCC TGCCTGAGAT GCCCGAGGTG CTGGTCGCCG TCTGA
 
Protein sequence
MGAAKYVGRV GGLAVALGVG VAILTGQGVA YADDTGTASS ETTATENSSA EGPARQGAAS 
DSDVGTGADD EASDVGTVAD DEASDVGTVA DDEASDVGTV ADDGASDVET AEDDEASDVE
EQQSGGGTGE DSTTPAEPVD TADDTASLSA PVVSTTVVSP PASADLALDP IADDSGTAPI
GGSAADLLLL AGTRRELQAQ QEAALEIPPY SVGVTAGVIG GCVVGPTCTT ADGSTFTLIK
DPDKGGKLAL DPTTGAFRFL PFASEKNPDG SQNADGPSGQ ETFSVLVAQN TQFTTFLIGL
PIVGSSILAP VIMTLQQIPV ISTVLAPLIG TATRQDILVD IDFLRTPAET PVAYTTMVTS
WDGTQISTNF FPAIADGNPD DGNPGYETIL YGPGLAQAGA TDPENPFVST FRLGGYNVVT
WDPRGEFASG GVLQLDSPQY EGQDVSQLIS WAAGLDGVEL DGVGDPTLGM VGVSYGGGIQ
FVTAAGDNRV DAIAPGWAWN TLPDSLYPDR SFKTAYSSLL LLGLVTTGAR INPQIYGGII
TGAILGVLTP GQIQLLQTSG PGQTVRGISA PTLIIQGTVD VLFPLQQSIV NEGLLALNED
VKMLWFCGGH GSCLPGQGNG EADSAWVMRE TLAWMYRYVK DDGSQDDVVF EWTDQYGDRW
TSEVSPTEAG FYDPPGVIPT TVWDTGKVLP IIPLIGGSGP NPEVPLPYSL GDGSIASNAV
TIALENPVAE ANVVGSPSVV IHYSGFGTSR HIYAQVVDRS TGLVVGNIVT PVPVTLNGRD
QVVTVDLNDI AYTVGPDSEL ELQIFTTATS YFNATQFGFI NVESVEVTMP TTTKGTNQGP
NPSLPEMPEV LVAV
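
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch, not the database's own method, of how the blocks could be joined, the GC content recomputed, and the nucleotide sequence translated for comparison with the listed protein. The helper names `clean`, `gc_percent`, and `translate` are hypothetical. The codon table is the standard genetic code, whose amino-acid assignments are the same as those of translation table 11. Table 11 additionally allows alternative start codons, which is not relevant here because the gene begins with ATG.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence above, copied with their block spacing.
head = clean("ATGGGCGCAG CGAAGTATGT C")
print(translate(head))  # MGAAKYV
```

The translated fragment matches the first seven residues of the protein sequence above. Running `gc_percent` and `translate` on the full cleaned gene sequence would allow the same comparison against the listed GC content and the complete protein.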