Gene Mvan_2900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_2900 
Symbol 
ID4649106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp3073224 
End bp3076235 
Gene Length3012 bp 
Protein Length1003 aa 
Translation table11 
GC content67% 
IMG OID639806381 
ProductMMPL domain-containing protein 
Protein accessionYP_953712 
Protein GI120403883 
COG category[R] General function prediction only 
COG ID[COG2409] Predicted drug exporters of the RND superfamily 
TIGRFAM ID[TIGR00833] Transport protein
[TIGR03057] X-X-X-Leu-X-X-Gly heptad repeats 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.689966 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0696558 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTACTGA CGGCGACCGT CGAGGCGCGC AACCGTCGGG TTCCTGAACG CACCGGCGAA 
TTCAGCCCAC GCCTGGCGGC GCTGGGCCGG TTCACCCTCA GACACAAGGC ATTGGTGATC
GGGGCATGGC TCGGTGTCGC CGTGATCCTC GCCGTCGTCT TCCCGCAACT CGAGACAGTC
GTCCGTCAGC AGTCGGTACA GCTGTTGCCC AACGACGTGG CGTCGTTCAT CGCGGTCGAG
GAGATGGCCG CAGCGTTCGA CGAGCACGGC GCCAAGACGT CGATCGTCGT GGCGATGGAA
GATCCCGCCG GGCTCACTCC GCAGACGCGG CAACGTTATG ACGCCCTCGT GGCGGCCCTG
CGCGCCGACG CCACCAATGT CCTGCTGGTC CAGGATTTCC TCTCCGACCC GACCACTCGC
TCTCAGGTGG TCAGCGAGGA CGGAAAGGCC TGGTTCCTGC CGGTAGGGAT CGTCGGTACG
CTCGGCGACC CGCAGGCCGC CGCATCGGTG GAAGCCGTTC GCGCCAGCGC CGATTCGGCA
TTCGCCGGGT CGTCGAGCAC CGTCCATGTG ACGGGCCCGG CGGCGACGTT CCACGATCAG
ATCGCCACTG CCGAGCACGA CGCGCTGCTC GTCAAGGTCG CCAGCGCCGC CCTGATCGCG
ATCATCTTGC TGCTCGTCTA CCGGTCGGTG GTCACCGCAC TCCTGCCCCT ACTGGTCGTC
GGCGTCAGCG TGGCCGTGGC GCGCGGAGTC CTGTCCGGAC TCGGTGAGGC CGGCATGCCG
GTCTCCCAAT TCACCGTCAT CTTCCTGGTC GGCATCCTGC TGGGCGCAGG CACCGACTAC
AGCGTCTTCT TCATCAGCCG ATACCACGAA CAGCGCCGAC TCGGCACCGA CCCCGAAGAG
GCCATCATCT ACGCGTGCGG AAGCATCGGG CGCGTCATCC TGGCGTCGGC AGCCACCGTG
GCTCTGGCGC TGGCATCCAT GGTGTTCGCC CGGCTCAGCG TCTTCCAGGG CGTCGGCCCG
GCATGCGCCA TCGCCGTTCT CATCGGATTC CTGGCGACCG TCACGCTGCT TCCGCCGGTA
ATGGCGCTGG CCGCCAAGCG TGGTATCGCC GAGCCCCGCG CCGATCTGTC CCGCCGGTAC
TGGAACCGGA TCGCCGTTAC CGTGGTGCGG CGGCCAAAGC CCTTGCTGGC CGGCAGCCTG
GTGGTCCTGC TCGCGCTGAC CGGTGTCGCG GCCACCATGA CCATCAGCTA CGACGACCGC
CAAGGTCAGC CCGCCGCCAC CGACAGCAAT CAGGGCTACA AGTTGTTGGA CCGACACTTC
GCCAAAGATT CGGTCATCAC CCAATTCCTG CTCGTTCAGT CCGACACCGA TATGCGCACC
GCCAAGGCAC TCGCCGACCT CGACCAGCTC GCCTCGCGCA TCGCCCAGAT GCCGGGCATC
ACCCGCGTCT CCGGGGTCAC CCGTCCCACC GGCGACCGCC TCGAGCAAGC TCAATTGTCC
TGGCAGAACG GGCAGATCGG CGACAAGATG GCCGGCGCGG TCGCCGAGGG CCGAGCCCGC
GAACACGACC TCACCAAGCT CACCGACGGA GCCGACCAGC TCGCAGCCGG GCTCGCCCAA
CTCGACACCA CGCTGCGTCG CGCCCTCACC CCACTGTCCG GCCTGTTGTC ACAAGTTCAA
GACACGGGCC GCCAGCTGCA GAATGCCCGT CCCGTGATCG AGCAACTCAA CACCACCGCG
CCCACCGTCG ATCAAGCCCT ACGCAGCGGC CCCGGGCTAC GACCCCTGGC CACGCAAGCC
TCTGCCGCCA TCGCGGCCGT CGAACCCCTC ATCGCGGGCC TCAACGCGTC ACCGTGGTGC
GCCACCACAC CACAATGCGC CCAACTGCGC GACCAAACCG GGATACTCGT GACGCTACGC
CGAGGCGGAT TCTTCGATCA GGTCGCCGAC CTCGGTGACC ACCTCGGGCC CGACACCACG
CTGTCCGGCA CCCTCGCCAG CCTCCAAACC GCCGTCACCA CAATGCAACA AGCGCTGGGG
GTGACCGGCG ATCCGGCTGA CCTCGCGGGC AACATGCGTC GCCTCCAGGA CGGCGTGTCC
CAACTCGCCT CGGGCGGTAG GGCGCTGGCC AGCGGTGTGC ACGCGTTGGC CGACAGCAAC
ATCCAAATGC TGGGCGGCAT GAGCCAGATC GCCGCCCAGT TGCAGAACTC CGCGCGGGAC
ACCGCCGGAT CCGATGCCGC CGCGGGCTTC TACCTCCCCC CGGACAGCTT CGAAAACCGC
CAATTCAGCG ACGTGGCACG TCAATTCGTG TCCGCCGACG GACGCACCGC CCGGTTCGCC
ATCACCTCGT CCCACGACCC CTTCTCCGCC GAGGCGATGG AGCTCAACGG CCGCATCATC
GACACCGCCA ACGCCGCCAC ACCCAACACC TCTCTGGCCG GTACGTCGAT CTCCATCGTC
GGGTTTCCGG CCCTGAACTC CGATCTGCAA CAGCTGCTGT CGACCGACTT CGCCCGTCTG
GGCGCGGCGA CGCTGCTGGT CGTCGGCATC GTCCTGGTGC TGCTGCTGCG CGCCATCGTC
GCCCCGCTCT ATCTGCTGGG CACCGTGGTG CTGAACTACG CCGCCGCCCT GGGCATCGGG
GTATTGGTGT TCCAGTACGG ATTCGGCCAA GCCATCGCCT GGCCGGTGCC ACTGTTGGCC
TTCATCTTGC TCGTCGCGGT CGGCGCCGAC TACAACATGC TGCTGATCTC CCGCCTGCGC
GAGGAATCCG GCCGCAGCGT GCGCGTCGGC GTGCTGCGCA CCGTCGCCAG CACCGGATCC
GTCATCACCT CAGCCGGAAT CATCTTCGCG GTCAGCATGT TCGGTCTGAT GACCGGATCG
GTGCACATCA TGGTCCAAGC CGGATTCATC ATCGGCTGCG GGCTACTGCT GGACACGTTC
GTCGTCCGTA CCCTCACCGT GCCGGCCATC GCCACACTGT TGCGGGAGAA GAGTTGGTGG
CCGCAACGGT GA
 
Protein sequence
MVLTATVEAR NRRVPERTGE FSPRLAALGR FTLRHKALVI GAWLGVAVIL AVVFPQLETV 
VRQQSVQLLP NDVASFIAVE EMAAAFDEHG AKTSIVVAME DPAGLTPQTR QRYDALVAAL
RADATNVLLV QDFLSDPTTR SQVVSEDGKA WFLPVGIVGT LGDPQAAASV EAVRASADSA
FAGSSSTVHV TGPAATFHDQ IATAEHDALL VKVASAALIA IILLLVYRSV VTALLPLLVV
GVSVAVARGV LSGLGEAGMP VSQFTVIFLV GILLGAGTDY SVFFISRYHE QRRLGTDPEE
AIIYACGSIG RVILASAATV ALALASMVFA RLSVFQGVGP ACAIAVLIGF LATVTLLPPV
MALAAKRGIA EPRADLSRRY WNRIAVTVVR RPKPLLAGSL VVLLALTGVA ATMTISYDDR
QGQPAATDSN QGYKLLDRHF AKDSVITQFL LVQSDTDMRT AKALADLDQL ASRIAQMPGI
TRVSGVTRPT GDRLEQAQLS WQNGQIGDKM AGAVAEGRAR EHDLTKLTDG ADQLAAGLAQ
LDTTLRRALT PLSGLLSQVQ DTGRQLQNAR PVIEQLNTTA PTVDQALRSG PGLRPLATQA
SAAIAAVEPL IAGLNASPWC ATTPQCAQLR DQTGILVTLR RGGFFDQVAD LGDHLGPDTT
LSGTLASLQT AVTTMQQALG VTGDPADLAG NMRRLQDGVS QLASGGRALA SGVHALADSN
IQMLGGMSQI AAQLQNSARD TAGSDAAAGF YLPPDSFENR QFSDVARQFV SADGRTARFA
ITSSHDPFSA EAMELNGRII DTANAATPNT SLAGTSISIV GFPALNSDLQ QLLSTDFARL
GAATLLVVGI VLVLLLRAIV APLYLLGTVV LNYAAALGIG VLVFQYGFGQ AIAWPVPLLA
FILLVAVGAD YNMLLISRLR EESGRSVRVG VLRTVASTGS VITSAGIIFA VSMFGLMTGS
VHIMVQAGFI IGCGLLLDTF VVRTLTVPAI ATLLREKSWW PQR