Gene Mvan_1041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1041 
Symbol 
ID4645352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1092142 
End bp1094478 
Gene Length2337 bp 
Protein Length778 aa 
Translation table11 
GC content68% 
IMG OID639804542 
Productpeptidase S15 
Protein accessionYP_951885 
Protein GI120402056 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTCGAG TCGGTGGTCT GGCAGTCGCG CTGGGGATCG GTGCAGCGGT CTTCACGGGG 
GCAGGAGTCG CCTCCGCGGA CGACGGCACC GGGGCCTCGG CGTCGGACTC GTCCGCGGAA
ACCTCGCAGT CCTCGCAGCC TGTGGCGTCG TCGGATGGGG CCATGGCCGA CGAGAAGGCC
GGGGACTCTT CGCAATCCGG AGCTGAAGAG CCGGCTGAGG ACGAACCGGC TGAAGAAGAG
CCCGTCGACG CGGAGCCGGC TGAGGAAGAG CCCGTCGACG AAGAACCGGC TGAGGAAGAG
CCCGTCGACG AAGAGCCGGC TGAGGAAGAG CCCGTCGACG AAGAGCCGGC TGGGGAGGAG
CCCGTCGACG CGGAGCCGCC CGTCGCGGAG CCGCTGGAAC CGGAAGAGGC GCCGGCCGTC
GAGCGGCCCG CCGCGCAGGC CGAGACCGAC CAGACCGAGA CGGTGACCAG GGATATCTCC
GACCAAACCG CGCCCGACCA GCCGGTCGCC GACACCGAGG CGCCTGCGTC ACCTGCGCTC
GCCTCGCTGG TGATGTCCGT ACTGGCGGCG GGGCGGGAAG CCACCGACGA GACCCCGCAG
TCGGCGGCTG ACCAGGTCGG CACCAGCCTG GCCGCCGACG AGTATCCGAT CCCGACCGAC
GTCGTGGTCG AGGACTTCAA GCCGCCGCTC GAATGGCTGC AGCACATCCC TGTGCTGGGC
AGGTTCGTCG TCACGCCGCT GGTACACCTG GCGCACGCGC TGCCGTTCGT GGGGGAGTTC
CTGCATCCGT TGATCGGCTG GCCGGTCGAC CACGACGCCG CACCCGGCGA TCCGAAGCCG
CGCACCGTGC GGGTCACGTC GTTCGACGGC GCGAAGATCT ACGTGCACTT CATGCCCGCC
ACCGGGCTGA AGGCCGGCGA GAGCGCGCCG ACGGTGCTGT CGGGTCCGGG GCTGGGTCTG
CCGGGGGCCA CGACGCTGGG CATCGACATC GACGGGTTCC TGCCCAACGA CGTCGTCGGG
GTCGGGATGT TGCGCAAGGC CGGCTACAAC GTGGTGACCT GGGATCCGCG CGGCGAATGG
CATTCCGAGG GGACGATGTT CCTCGACTCG CCCGACTACG AGGGCCGCGA CGTCTCGCAC
ATCATCAGCT GGCTGTCGAC GCTGGACGCG GTGCAGAAGG TGGACGGCGA CCCGAAGATC
GGCATGGTCG GCGCCTCCTA CGGCGGTGGA ATCCAGTTGG CCGCAGCGGC AATCGACCGC
CGCATCGACG CGATCGTGCC GACGATCGCG TGGAACAACC TCACCGACGT GTTGTTCCCG
CGCGAAGCGG TCAACAGCGG CTGGGGCACG CTACTGCCGA CGGTGCTGGC GTTGACGCTG
GCGCGCGAGC ATCCGCGGAT CTTCCCGGTG GCGATCGCCG GTGTGCTGTT CGGCCACGCC
GATCCCGCCG ACATCGATTT GGTGGAGAGC TTCGGCTATC AAGATCAGTT GGCCGACATC
ACGATTCCGA CGCTGTTGAT CCAGGGCACG GTCGACACCC TGTTCACCCT GGACCAGGCG
CACCGCAACG CGCTGGAGCT GATCGCGGCG GGCACGACGA CGAAGGTGCT GTGGTACTGC
GGCGGCCACG GCGCGTGCCT GAGCAGCTTC AACGACGGCG AGAAGGTGTG GGGCGAGACG
CTGGAATGGC TGGACCGCTA CGTCAAGGGC GACGAAACCG TCGATCCCGG AGCGCAATTC
GAGTGGGTGA ACCAGCACGG TGAATGGTTC TCGTCGGAGA CCTATCCGGC CGGGTCGCTC
GGCGCGCCGA TCGAGGCGTC CAGCGACGAC CCCAAGACGA TCCCGTTCGT GCCGTTTATC
GGTGGGTCGG GGCCGAACCC CCTGATCCTG CTGAAGGGTC TGGTACGCAC CCTCGTCGGA
TTGCCGTCGG CGGCGCCGGC GTTGAACGCG GTGAACCTGA CAGTACCGGA TGCGACCGTC
GAGACCCACA TCGTGGGTGC GCCGGAGCTG ACTCTGACCT ACTCCGGAAC CGGGACCGCC
AAACATGTCT ACGCCCAGAT CGTGGACGAC GAAACGGGTT TGGTGCTCGG CAATCAGGCG
ACGCCGATTC CGGTCGAGTT GGACGGGCAG TCGCACACGG CCACCTTCTC GCTGGAGCAG
GTGGCGCACA CGCTGCAGCC GGGCCAGTCG GTGACGGTGC AGATCGTCAC CTCGACGATC
TCGTTCCTGA ACTTCTACTC GTGGGGCAAC GTGACCGTCG AGGGCATGTC GGTCAAGCTG
CCCACAGCCA TCGCGGCAGC GGCGTCATCG GCGTCCGAGG AGACTGTCGC GGCGTAG
 
Protein sequence
MGRVGGLAVA LGIGAAVFTG AGVASADDGT GASASDSSAE TSQSSQPVAS SDGAMADEKA 
GDSSQSGAEE PAEDEPAEEE PVDAEPAEEE PVDEEPAEEE PVDEEPAEEE PVDEEPAGEE
PVDAEPPVAE PLEPEEAPAV ERPAAQAETD QTETVTRDIS DQTAPDQPVA DTEAPASPAL
ASLVMSVLAA GREATDETPQ SAADQVGTSL AADEYPIPTD VVVEDFKPPL EWLQHIPVLG
RFVVTPLVHL AHALPFVGEF LHPLIGWPVD HDAAPGDPKP RTVRVTSFDG AKIYVHFMPA
TGLKAGESAP TVLSGPGLGL PGATTLGIDI DGFLPNDVVG VGMLRKAGYN VVTWDPRGEW
HSEGTMFLDS PDYEGRDVSH IISWLSTLDA VQKVDGDPKI GMVGASYGGG IQLAAAAIDR
RIDAIVPTIA WNNLTDVLFP REAVNSGWGT LLPTVLALTL AREHPRIFPV AIAGVLFGHA
DPADIDLVES FGYQDQLADI TIPTLLIQGT VDTLFTLDQA HRNALELIAA GTTTKVLWYC
GGHGACLSSF NDGEKVWGET LEWLDRYVKG DETVDPGAQF EWVNQHGEWF SSETYPAGSL
GAPIEASSDD PKTIPFVPFI GGSGPNPLIL LKGLVRTLVG LPSAAPALNA VNLTVPDATV
ETHIVGAPEL TLTYSGTGTA KHVYAQIVDD ETGLVLGNQA TPIPVELDGQ SHTATFSLEQ
VAHTLQPGQS VTVQIVTSTI SFLNFYSWGN VTVEGMSVKL PTAIAAAASS ASEETVAA