Gene Mvan_1057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1057 
Symbol 
ID4645368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1110322 
End bp1113213 
Gene Length2892 bp 
Protein Length963 aa 
Translation table11 
GC content65% 
IMG OID639804558 
Producttransport protein 
Protein accessionYP_951901 
Protein GI120402072 
COG category[R] General function prediction only 
COG ID[COG2409] Predicted drug exporters of the RND superfamily 
TIGRFAM ID[TIGR00833] Transport protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACGT CTACAGACGA CGCACCCACC GACGTCCTGC CGCCGGCGCG GCACGCGGCC 
CCCTCCCGGC CGAAGCTGCC GCGCTTCATC CGCACGTTCG CTGTGCCGAT CGTCCTGGCC
TGGGTCGCCA TCGTCGCGCT GCTCAACACC GTCGTGCCGC AGCTCGAGGA GGTCGGCAAG
CTGCGCGCCG TGTCGATGAG CCCCAACGAT GCGCCCGCGC TGATCGCGAC CAAGCACGTC
GGCGACAAGT TCGACGAGTA CAACACCTCC AGCTCGGTGA TGATCGTCCT CGAAGGCGAG
GAGGCGCTCG GTCCGGACGC GCACGCGTTC TACGACGAGG TGGTCCGCCA ACTCGACGCC
GACACCGAGC ATGTGCAGCA CGTGCAGGAC TTCTGGGGTG ACACCCTGAC CGCGGCCGGC
GCCCAGAGCA TCGACGGCAA GGCCGCCTAC GTTCAGGTGT ACATCGCCGG TGACCAGGGC
GAGACGCTGG CCAACGAATC GGTGCACGCC GTGCGGGCCA TCGTCGACAG CACTCAGGCG
CCCCCGGGGG TCCGCGCGTA TGTCACCGGC CCCGCGGCGC TGACCACCGA TCAGAACATC
GTCGGCGACG CCAGCATGAA GACGATCGAA TCGGTGACGA TCGGCATCAT CATCGTGATG
CTGCTGATCA TCTACCGATC GGTCATCACG ACGGTCGTCA CCATGTCGAT GGTTTTCGTC
GGCCTGCTGT CGGCCCGCGG CATCGTCTCC TTCCTCGGGT TCTACGAGGT TTTCGGGCTC
ACCACCTTCG CCACCAGCAT GGTGGTGACG CTGGCCATCG CCGCCGCCAC CGACTACGCG
ATCTTTCTGA TCGGGCGATA CCAAGGGGCC CGACGATCGG GGATGGACCG AGAGTCGGCC
TATTACGACA TGTTTCACGG CACCGCCCAC GTGGTTGCGG CCTCGGGACT GACGATCGCC
GGTGCGACCG CCTGCCTGCA CTTCACCCGG TTGCCGTACT TCCAGAGCAT GGGATTCCCG
CTGGCCGTCG GCATGATCAT CGTGGTGGCC GCGGCGCTTA CCCTGGGTCC GGCGCTGATC
TCGATCGTGA CGCGGTTCGG CAAGGTGCTG GAGCCCAAGG GAAACGGGCG GGCACGAGGC
TGGCGCAAAC TCGGTTCGGC GACGGTCCGC TGGCCCGGTG CGGTGCTGGT GATGGCCACC
GTGCTGTGCC TGGTCGGCCT GCTGGCCCTG CCCGGGTACC ACACCAATTA CAACGACCGC
ATCTATCTGC CCGACGGGGT GCCCGCCAAC GTGGGTTACG CCGCGGCCGA CCGGCACTTC
TCCGACGCCA AGATGAATCC CGACCTGGTG ATGGTCGAAT CGGATCACGA TATGCGCAAC
CCGGCGGACT TCCTGGTGAT CGAGAAGATC GCCAAGGCGC TGACCCGGGT GCACGGAATC
GCTTCGGTCA CCACGATCAC CCGTCCGGAC GGGAAGCCGA TCAAGCACGC GTCGCTGGCC
TACACCATCA GCCAGAGCGG CAACGGGCAG ATCATGAACA ACGACTTCCA GCAGACCGTG
CTGGAGAACA CGCTGCAGCA GGCCAACGAG ATGCAGGTGA CCATCGACTC GATGGAGGAG
ATGCAGCGCA TCACCCTGGA GCTGTCCGAG GTCACCCGCG AGATGGCCGA CAAGATGAAG
GACACGTCGG CGAACCTCAA CGAAGTCCGG GACCACCTGG CCGATTTCGA CGATCAGTTC
CGCCCGCTGC GCAACTACTT CTACTGGGAG CCGCACTGCT ACAACATCCC GATGTGCTGG
GCACTGCGGT CGGTGTTCGA CAGCCTCGAC GGCATCAGCA CGATGTCCGA CGATTTCACC
GAGCTGGTGC CCAGCATCGA GCGGATGGCG CAGCTGACGC CGCAGATGGC GGCCATCATG
CCCGCGATGA TTCAGACGAT GAAGAACCAG AAGCAGATCA TGCTGAATCA GTACCAGGCG
CAGAAGATGC AGCAGGATCA GAACATCGCC ATGCAGGAGG ACAGCACGGC GATGGGCGAG
GCGTTCGACA CCGCGCGCAA CGACGACACG TTCTATCTAC CGCCGGAGGC GTTCCAGACC
GCCGACTTCC AGCGGGGCAT CAAGCTGTTC ATGTCCCCGG ACGGGAAAGC GGTGCGCTTC
ACCGTGTTCC ATCAAGGTGA CCCCTTGACC GAAGCCGGCA CCGCCCGCAT CGATCCGCTG
CGCATCGCGG CCGCGGATGC CATCAAGGGC ACGCCGCTGG AGGGTTCCAC GATCTACGTC
GGCGGCAGCG CCGCGATGTA CAAGGACATG CAGCAGGGTG CCGATTACGA CCTGCTGATC
GCCGCCGTCG CATCGCTGAT CCTGATCTTC CTGATCATGG TGATCCTCAC CCGGGCCGTC
GCGGCGGCTG CCGTCATCGT CGGCACCGTG GTGCTGAGCC TGGGCACGTC GTTCGGTCTG
TCGGTTCTGC TGTGGCAGCA CGTCGTCGGG ATCCCGCTGG GCTGGATGGT GCTGCCCATG
TCGGTGATCG TGCTGCTCGC CGTCGGCGCG GATTACAACC TGCTGCTGGT GTCGCGCATG
AAGGAGGAGC TCCACGCCGG AGTGAACACC GGAATCATCC GGTCCATGGC CGGCACCGGA
TCCGTGGTCA CCGCAGCGGG ATTCGTGTTC GCCTTCACGA TGATCGGCAT GATCGTCAGC
GACATGATCG TCATCGGTCA GGTGGGCACC ACCATCGGTC TCGGCCTGCT GTTCGACACG
CTGATCGTCC GGTCACTGAT GACACCGTCC ATCGCGGCAC TGATGGGCAA ATGGTTCTGG
TGGCCGACGC ACGTGCGGCG CCGTCCGAAG CCGAAACCCT GGCCGCGGGT CGCCGAGGAG
GTGACGGCAT GA
 
Protein sequence
MTTSTDDAPT DVLPPARHAA PSRPKLPRFI RTFAVPIVLA WVAIVALLNT VVPQLEEVGK 
LRAVSMSPND APALIATKHV GDKFDEYNTS SSVMIVLEGE EALGPDAHAF YDEVVRQLDA
DTEHVQHVQD FWGDTLTAAG AQSIDGKAAY VQVYIAGDQG ETLANESVHA VRAIVDSTQA
PPGVRAYVTG PAALTTDQNI VGDASMKTIE SVTIGIIIVM LLIIYRSVIT TVVTMSMVFV
GLLSARGIVS FLGFYEVFGL TTFATSMVVT LAIAAATDYA IFLIGRYQGA RRSGMDRESA
YYDMFHGTAH VVAASGLTIA GATACLHFTR LPYFQSMGFP LAVGMIIVVA AALTLGPALI
SIVTRFGKVL EPKGNGRARG WRKLGSATVR WPGAVLVMAT VLCLVGLLAL PGYHTNYNDR
IYLPDGVPAN VGYAAADRHF SDAKMNPDLV MVESDHDMRN PADFLVIEKI AKALTRVHGI
ASVTTITRPD GKPIKHASLA YTISQSGNGQ IMNNDFQQTV LENTLQQANE MQVTIDSMEE
MQRITLELSE VTREMADKMK DTSANLNEVR DHLADFDDQF RPLRNYFYWE PHCYNIPMCW
ALRSVFDSLD GISTMSDDFT ELVPSIERMA QLTPQMAAIM PAMIQTMKNQ KQIMLNQYQA
QKMQQDQNIA MQEDSTAMGE AFDTARNDDT FYLPPEAFQT ADFQRGIKLF MSPDGKAVRF
TVFHQGDPLT EAGTARIDPL RIAAADAIKG TPLEGSTIYV GGSAAMYKDM QQGADYDLLI
AAVASLILIF LIMVILTRAV AAAAVIVGTV VLSLGTSFGL SVLLWQHVVG IPLGWMVLPM
SVIVLLAVGA DYNLLLVSRM KEELHAGVNT GIIRSMAGTG SVVTAAGFVF AFTMIGMIVS
DMIVIGQVGT TIGLGLLFDT LIVRSLMTPS IAALMGKWFW WPTHVRRRPK PKPWPRVAEE
VTA