Gene Mvan_3189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3189 
Symbol 
ID4644179 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp3391765 
End bp3394671 
Gene Length2907 bp 
Protein Length968 aa 
Translation table11 
GC content65% 
IMG OID639806666 
Producttransport protein 
Protein accessionYP_953997 
Protein GI120404168 
COG category[R] General function prediction only 
COG ID[COG2409] Predicted drug exporters of the RND superfamily 
TIGRFAM ID[TIGR00833] Transport protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.172033 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCAC CAGTGTCCGA CGCGCGCACC GACGAATTCC CGGCGGCGCG CCAACCGCAC 
CGGCCGTTCA TCCCGCGGAT GATCCGGCTC TTCGCGGTGC CGATCATCCT CGGCTGGATC
GCGCTGATCG TCATTCTCAA CGTCACCGTG CCGCAGCTGG AGGCGGTCGG CGAGGCGCGC
GCGGTGTCGA TGAGCCCGAA CGAGGCGCCG TCGCTGATCT CGATGAAGAA GGTCGGCGAA
CTGTTCCGGG AAGGTGACTC GGACAGCTCG GTGATGATCG TGTTCGAGGG CGACCAGCCC
CTCGGTGACG AGGCGCACGC CTGGTACGAC GAGCTGGTCG AACGGCTGCG GGCCGACACC
AAGCACGTGC AGTCCGTCCA GGACTTCTGG AGTGATCCGC TCACCGCGTC GGGTTCGCAG
AGCAACGACG GCAAGGCCGC CTACGTCCAG GTCAAGCTCG CAGGCAACCA GGGTGAGTCG
CTGGCCAACG AATCGGTGCA GGCCGCGCAG GAGATCGTCC GCAGCCTCGA GCCGCCGCCC
GGGGTGCGGG CGTTCGTGAC GGGGCCGGCC GCGCTCGCCG CCGATCAGCA CATCGCCAGC
GACCGCAGCG TCCGGGTCAT CGAGTTGGTG ACGTTCGCCG TGATCATCGT CATGCTGCTG
CTGGTCTACC GCTCGATCGT GACCGTGCTC CTGACCCTGG TGATGGTGGT CCTGTCGCTG
GCCACCGCCC GCGGCGTGGT CGCGTTCCTG GGCTGGCACG AGATCATCGG CCTGTCGCTG
TTCGCGACGA ACCTGTTGGT GACGTTGGCC ATCGCCGCGG CGACGGACTA CGCGATCTTC
CTGATCGGCC GGTATCAGGA GGCGCGCACC AACGGTGAGG ACAAAGAGTC CGCGTACTAC
ACGATGTTCC ACGGCACCGC GCACGTGGTG CTGGGCTCGG GCCTGACCAT CGCGGGTGCG
ACCTACTGCC TGAGCTTCAC CCGGCTGCCC TACTTCCAGA CCCTCGGTGT CCCGCTGGCG
ATCGGCATGT TCGTGGTCGT GATGGCCGGC GTCATCCTGA TGGTCGCCAT GATCAGTGTG
GCGACCCGCT TCGGGAAGCT CCTGGAACCC AAGCGCGCGA TGCGCATTCG CGGCTGGCGC
AAGATCGGCG CCGCCGTGGT CCGCTGGCCG GGCCCGATTC TCGTCGCCAC GTTGGCCATC
ACGCTCGTCG GGCTGCTCGC CCTTCCGGGC TACAAGACCA ATTACAACGA CCGCACCTAC
CTGCCGGCCG ATCTGCCCGC CAACGAGGGC TACGCCGTCG CCGATCGCCA TTTCGACCAG
GCCCGGATGA ACCCGGAGCT GTTGATGATC GAAAGCGATC ACGACCTGCG GAACTCGGCG
GACTTCCTCG TCATCGACAA GATCGCCAAG GCCATCTTCA AGGTCGAGGG AATCGCCCGC
GTGCAGGCGA TCACCCGGCC CGACGGCAAG CCGATCAAGC ACACGTCGAT CCCGTTCCAG
ATGAGCATGC AGGGCACCAC GCAGCGGCTC AACGAGAAGT ACATGCAGGA CCGGATGGCG
GACATGCTGG TCCAGGCGGA CGAGATGGCG AACACCATCG CGACCATGGA GAAGATGTCG
AATCTCACCG CGCAGATGGC CGACATCACG CATTCGATGG TGTCGAAGAT GGAGAACATG
CTCACCGACA TCGAGGACCT GCGCGACAGC ATCGCCAACT TCGATGACTT CTTCCGCCCG
ATCCGCAACT ACTTCTACTG GGAGCCGCAC TGCTACAACA TCCCGGTGTG CTGGGCGCTC
CGGTCGGTGT TCGACACCCT CGACGGCATC AACGTGATGA CCGACGACTT CCGCGAAATC
CTGCCGGACA TGAAGCGCCT GGATAGCCTG ATGCCGCAGA TGGTCGCGCT CATGCCCGAG
ATGATCGAGA CCATGAAAAC CATGCGGACC ATGATGCTGA CGATGTATCA GTCCCAGAAG
GGTCAGCAGG ATCAGATGGC GGCAATGTCG GAGGACGCCG ACGCCATGGG TGAGGCGTTC
GACGACTCGA TGAACGACGA CTCGTTCTAC CTGCCGCCGG AGATCTTCGA GAACAAGGAC
TTTCAGCGCG GCCTGGAGCA GTTCCTGTCC CCCGACGGGC ATGCGGTGCG CTTCATCATC
TCCCACGAGG GCGATCCGCT CAGCCCCGAA GGCGTGGCCA AGATCGACAA GATCAAGACC
GCCGCCAAGG AGGCGATCAA GGGCACGCCG CTGGAGGGCT CGAAGATCTA TCTCGGCGGC
ACCGCAGCGA CCTTCAAGGA CATGCAGGAC GGCAACAACT ACGACCTGCT GATCGCCGGC
ATCTCGGCGC TGGGCCTGAT CTTCATCATC ATGTTGATCC TGACGCGGGC CATCGTCGCG
GCGGCGGTGA TCGTCGGCAC GGTGGTGTTG TCGTTGGCCG CCTCGTTCGG CCTGTCGGTG
CTGTTCTGGC AGCACATCCT TGGCACCGAG CTGCACTGGA TGGTGTTGGC GATGGCGGTG
ATCGTGCTCC TTGCGGTGGG CGCGGACTAC AACCTCTTGC TGGTCTCGCG GCTCAAGGAG
GAGATACACG CGGGCATCGG CACCGGCATC ATCCGCGCGA TGGGCGGCAG CGGTTCGGTG
GTGACGGCCG CGGGGTTGGT CTTCGCATTG ACGATGATGT CGATGGCGGT CAGCGAGCTG
ACCGTGATCG GTCAGGTCGG CACCACGATC GGCCTCGGCC TGTTGTTCGA CACGTTGGTG
ATCCGCGCGT TCATGACCCC GTCGATCGCG GCACTGTTGG GCCCGTGGTT CTGGTGGCCG
CAACGGGTGC GTACTCGCCC CGTGCCGGCG CCGTGGCCGA GACCCGGTGG GCTGCAATCA
GATCCATCAG AAGGAGTGAA GGTATGA
 
Protein sequence
MSAPVSDART DEFPAARQPH RPFIPRMIRL FAVPIILGWI ALIVILNVTV PQLEAVGEAR 
AVSMSPNEAP SLISMKKVGE LFREGDSDSS VMIVFEGDQP LGDEAHAWYD ELVERLRADT
KHVQSVQDFW SDPLTASGSQ SNDGKAAYVQ VKLAGNQGES LANESVQAAQ EIVRSLEPPP
GVRAFVTGPA ALAADQHIAS DRSVRVIELV TFAVIIVMLL LVYRSIVTVL LTLVMVVLSL
ATARGVVAFL GWHEIIGLSL FATNLLVTLA IAAATDYAIF LIGRYQEART NGEDKESAYY
TMFHGTAHVV LGSGLTIAGA TYCLSFTRLP YFQTLGVPLA IGMFVVVMAG VILMVAMISV
ATRFGKLLEP KRAMRIRGWR KIGAAVVRWP GPILVATLAI TLVGLLALPG YKTNYNDRTY
LPADLPANEG YAVADRHFDQ ARMNPELLMI ESDHDLRNSA DFLVIDKIAK AIFKVEGIAR
VQAITRPDGK PIKHTSIPFQ MSMQGTTQRL NEKYMQDRMA DMLVQADEMA NTIATMEKMS
NLTAQMADIT HSMVSKMENM LTDIEDLRDS IANFDDFFRP IRNYFYWEPH CYNIPVCWAL
RSVFDTLDGI NVMTDDFREI LPDMKRLDSL MPQMVALMPE MIETMKTMRT MMLTMYQSQK
GQQDQMAAMS EDADAMGEAF DDSMNDDSFY LPPEIFENKD FQRGLEQFLS PDGHAVRFII
SHEGDPLSPE GVAKIDKIKT AAKEAIKGTP LEGSKIYLGG TAATFKDMQD GNNYDLLIAG
ISALGLIFII MLILTRAIVA AAVIVGTVVL SLAASFGLSV LFWQHILGTE LHWMVLAMAV
IVLLAVGADY NLLLVSRLKE EIHAGIGTGI IRAMGGSGSV VTAAGLVFAL TMMSMAVSEL
TVIGQVGTTI GLGLLFDTLV IRAFMTPSIA ALLGPWFWWP QRVRTRPVPA PWPRPGGLQS
DPSEGVKV