Gene Mvan_4474 details


Gene Information

Locus tag: Mvan_4474
Symbol:
ID: 4649090
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 4805925
End bp: 4807808
Gene Length: 1884 bp
Protein Length: 627 aa
Translation table: 11
GC content: 71%
IMG OID: 639807944
Product: EmrB/QacA family drug resistance transporter
Protein accession: YP_955255
Protein GI: 120405426
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
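
The coordinate and length fields above agree with one another, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4805925, 4807808)
print(length)                     # 1884
print(protein_length_aa(length))  # 627
```

Both values match the Gene Length and Protein Length fields reported for Mvan_4474.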


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.137997
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid covering clones (Num covering fosmid clones): 18
Fosmid unclonability p-value: 0.105025
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGAGA CCGTGACGGC CCGTGTCCGG GGCGCCGACC CCTGGCCCGC CCTGTGGGCG 
CTGCTGATCG GGTTCTTCAT GATCCTGGTC GACGCCACGA TCGTGCCCGT CGCCAATCCC
GCGATCATGG CGCAGCTGGG CGCTGACTAC GACGCGGTGA TCTGGGTGAC GAGCGCCTAC
CTCCTCGCCT ACGCCGTTCC GCTGCTGGTG TCGGGCCGGC TCGGGGACAG ATACGGGCCC
AAGACGGTGT ACCTCGTCGG CCTGGCGGTG TTCACCGCCG CATCGCTGTG GTGTGGCCTT
TCCGACAGCA TCGGCATGCT GGTCGCCGCC CGTGTGCTCC AGGGTGTGGG GGCCGCGCTG
CTCACCCCGC AGACGCTGAC GGTGATCACC CGTACGTTCC CGGCGCACAA CCGTGGAGTG
GCGATGAGCG TGTGGGGCGC CACCGCGGGC GTGGCGACGC TGGCCGGGCC GCTGGCCGGC
GGTGTACTGG TGGACAGCCT GGGCTGGCAG TGGATCTTCA TCGTCAACCT GCCCGTCGGG
GTGCTCGGCT TCGCGCTGGC GGTCTGGCTG GTGCCGTCGC TGCCCACTGC GCGGGCGCAG
CGTTTCGACC TTCCGGGTGT GGTGCTGTCC GGCGTCGCGT TGTTCCTGAT CGTGTTCGCG
TTGCAGGAGG GCCAGTCGCA CGATTGGGCA CCGTGGATCT GGGCGACGAT CGGGATCGGC
ATCGCGGTGA TGGCGGCCTT CCTGTACTGG CAGTCGGTCA ACACCGGCGA GCCGCTGATC
CCGCTGATCA TCTTCGGAGA CCGCAACTTC TCGATGTCGA ACCTCGGGGT GGCGACGATC
GGGTTCGTCG CGACCGGGAT GATCCTGCCG CTGATGTTCT ACGCCCAGTC CGTCTGCGGG
CTCACCCCGA CGCAGGCCGC GCTGCTGACC GCGCCGATGG CGGTGGCGAC CGGGGTGTTG
GCGCCGTTGG TCGGAAAGCT CGTCGACCGT TCACATCCCC GGCCGGTGGT CGGGTTCGGT
TTCGCGCTGA TGGCGATCGG CCTGACGTGG CTCTCGATCG AGATGACGCC GTCCACGCCG
ATCTGGCGTC TGGTGGTGCC GTTGACGGCG ATGGGTGTGG CGATGGCGTT CATCTGGTCG
CCGCTGGCGG CCACCGCCAC CAGGAATCTG CCGCCGCAGC TGGCGGGGGC GGGCTCCGGC
GTCTACAACA CCACCAGGCA GGTGGGGTCG GTGCTCGGCA GCGCGGCCAT GGCGGCACTG
CTGGCCTCGC AGCTCTCGGC GAAGATCCCG GGTGAGGCGC CGGTACCGGT GGAGGGTCAG
GCCGGGCAGC TGCCGGCATT CCTGCACGCG CCGTTCGCCG CGGCGATGTC GCAGGCGATG
CTGCTGCCGG CCTTCGTGGC GTTGTTCGGC GTCATCGCCG CGCTGTTCCT GGTGGGCCTG
GGGGACCGGC TCCCGGCACC CCCGCGGCGG CCGCAGGAGA CGGCCGAGCC GGTGCCGGCG
TTCGACCCCG ACGATGAGCT CTACTGGGCC GACGGGGAGG ACGAGTACGT CGAGTACGAG
GTGCCCTGGG ACGACGGCGC GGAACCGCGC AGGCCGGCGC AGGAGGCCGA CACCGTCGAC
GCGGCGGCCG ACACCGACGT GCTCGACCCG GTCACCGAAC CCATGCACGA CGGCGGCGGG
TATCCGGCGC CGGAGTCAGC GCCGCGGCGG GATCACGCCG ATCCGACATC GCATCCCGAC
CGGGAATGGC GCAGCATCCT CGATCAACTG CTCGACCCGC CGAAGCCGTC GGACGACCCG
ACAGGGCAGG GCCGCAACGG CTTTCACGTC ACCCCCAGCG AGGACGGTCG CGGCGCCTCG
CGGGGCAGGC ATTCGCTCGA CTAG
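
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be joined before a figure such as the GC content field can be recomputed. The following is a minimal sketch under that assumption. The helper names are hypothetical, and the usage line covers only the first two blocks of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence (12 of 20 bases are G or C).
first_blocks = clean_sequence("ATGTTCGAGA CCGTGACGGC")
print(first_blocks)               # ATGTTCGAGACCGTGACGGC
print(gc_fraction(first_blocks))  # 0.6
```

Applying `gc_fraction` to the full cleaned sequence gives a value that can be compared with the GC content field.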
 
Protein sequence
MFETVTARVR GADPWPALWA LLIGFFMILV DATIVPVANP AIMAQLGADY DAVIWVTSAY 
LLAYAVPLLV SGRLGDRYGP KTVYLVGLAV FTAASLWCGL SDSIGMLVAA RVLQGVGAAL
LTPQTLTVIT RTFPAHNRGV AMSVWGATAG VATLAGPLAG GVLVDSLGWQ WIFIVNLPVG
VLGFALAVWL VPSLPTARAQ RFDLPGVVLS GVALFLIVFA LQEGQSHDWA PWIWATIGIG
IAVMAAFLYW QSVNTGEPLI PLIIFGDRNF SMSNLGVATI GFVATGMILP LMFYAQSVCG
LTPTQAALLT APMAVATGVL APLVGKLVDR SHPRPVVGFG FALMAIGLTW LSIEMTPSTP
IWRLVVPLTA MGVAMAFIWS PLAATATRNL PPQLAGAGSG VYNTTRQVGS VLGSAAMAAL
LASQLSAKIP GEAPVPVEGQ AGQLPAFLHA PFAAAMSQAM LLPAFVALFG VIAALFLVGL
GDRLPAPPRR PQETAEPVPA FDPDDELYWA DGEDEYVEYE VPWDDGAEPR RPAQEADTVD
AAADTDVLDP VTEPMHDGGG YPAPESAPRR DHADPTSHPD REWRSILDQL LDPPKPSDDP
TGQGRNGFHV TPSEDGRGAS RGRHSLD