Gene Mvan_3349 details

Gene Information

Locus tag: Mvan_3349
Symbol:
ID: 4644396
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 3564000
End bp: 3565421
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 72%
IMG OID: 639806827
Product: major facilitator superfamily transporter
Protein accession: YP_954152
Protein GI: 120404323
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
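
The coordinate and length fields in this record are linked by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(3564000, 3565421)
print(length)                     # 1422
print(protein_length_aa(length))  # 473
```

Both results agree with the Gene Length and Protein Length fields of the record.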


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.552726
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCCACA CTGAAGCCGG TAGCGCGGCC GTCGCCACCG GGAGCTGGCG TGAACTCCTC 
GGCCCGAAGT ACCTCGGGGC CTCGACCGTG CTCGCCGGCG GGGTGGCGCT GTATGCGACC
AACGAGTTCC TCACGATCAG CCTGATGCCG AGTGCCGTGG CCGACATCGG CGGTCACCGC
TTCTACGCGT GGGTGACGAC GGTGTATCTC GTCGGGTCGG TGATGGCCGC CACCACGGTT
CACTCGGTGT TGACGCGACT CGGGCCGCGG TGGGCATACC TGTTGGGGCT CACGGTGTTC
GGTCTCGGCA GTCTGGTCTG TGCGGTGGCG CCGAGCATGG AGATACTGCT CGCCGGCCGC
ACCGTGCAGG GCGCCGCGGG GGGCTTGCTG GCCGGCCTCG GGTACGCGGT GATCAACACG
GTGCTGCCCA GCGTGCTGTG GAAGAAGGCG TCCGCTCTGG TGTCGGCCAT GTGGGGTGTC
GGCACCCTGG TCGGTCCCGC CGCGGGTGGG TTGTTCGCTC AGTACAGTTC GTGGCGTTGG
GCTTTCGGCA TTCTCGTGGT GATGACGACC GCGATGTCGG CGCTGGTGCC GATGGCGCTG
CCCGCGCGCG CCCGCGCCGC GCATCTCGGC TCCGGGGCGC CGCCCGGCCG CATCCCGTTC
TGGTCGCTGC TGTTGCTCGG CGCGGCCGCG TTGCTGGTCA GCGTCGCCGG GATTCCGCAC
GACGTGCGCG CCACCGCCGG GCTGGTCGTG GCGGGATTCG TGCTGGTGGC CGTGTTCGTG
GTGGTCGACC GCCGGGTCGC CGCGTCGGTG CTGCCGCCGA GTGCATTCGG TCGCGGGCCG
TTGAAGTGGA TCTATCTGAC GCTGGGCGGG TTGATGGCGG CCACCATGGT GGACATGTAC
GTGCCGCTGT TCGGTCAGCG GCTCGCGCAC CTGACCCCGG TCGCGGCCGG CTTCCTGGGG
GCCGGGCTCG CGATCGGCTG GACGGTCGGC GAGATCGGGA GCGCGTCGCT GACCCGTCAC
CGGGTGGTGG TGCGGACCGT GGCGGTCGCG CCGGCGGTGA TGGCGACCGG CCTGACGATC
GGCGCGCTGA CCCAGCACCG CGACGCCGGG CCCATGCTGG TCGCGCTCTG GGCTGTGGGC
CTGATCATCA CCGGCGCCGG CGTGGGCATC GCATGGCCGC ACCTGTCGGC GTGGGCGATG
AGCAAGGTGG ACGACCCGGC GGAGGGGCCG GCCGCTGCCG CGGCCATCAA CACGGTGCAG
GTGATCTCGG CGGCATTCGG CGCGGCGCTG GCCGGGGTGA TCGTCAACCT CTCGCACACC
GGGGACGTCG CGGCGGCGCG CTGGTTGTTC GCCTCGTTCG CGGTGCTCGC CGTGCTCGCG
ACAGTGGCCT CGACCCGAAG CGGTCGCTTG CGATCACGTT GA
 
Protein sequence
MTHTEAGSAA VATGSWRELL GPKYLGASTV LAGGVALYAT NEFLTISLMP SAVADIGGHR 
FYAWVTTVYL VGSVMAATTV HSVLTRLGPR WAYLLGLTVF GLGSLVCAVA PSMEILLAGR
TVQGAAGGLL AGLGYAVINT VLPSVLWKKA SALVSAMWGV GTLVGPAAGG LFAQYSSWRW
AFGILVVMTT AMSALVPMAL PARARAAHLG SGAPPGRIPF WSLLLLGAAA LLVSVAGIPH
DVRATAGLVV AGFVLVAVFV VVDRRVAASV LPPSAFGRGP LKWIYLTLGG LMAATMVDMY
VPLFGQRLAH LTPVAAGFLG AGLAIGWTVG EIGSASLTRH RVVVRTVAVA PAVMATGLTI
GALTQHRDAG PMLVALWAVG LIITGAGVGI AWPHLSAWAM SKVDDPAEGP AAAAAINTVQ
VISAAFGAAL AGVIVNLSHT GDVAAARWLF ASFAVLAVLA TVASTRSGRL RSR
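
The gene and protein sequences above can be related to each other with a short translation routine. The sketch below is a minimal, dependency-free illustration that assumes NCBI translation table 11 (bacterial, archaeal and plant plastid code), in which the codon-to-amino-acid assignments match the standard code and an alternative initiator such as GTG at the first position is read as methionine. The names `translate` and `gc_fraction` are hypothetical helpers, not functions from any existing library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed in this record.
print(translate("GTGACCCACA CTGAAGCCGG TAGCGCGGCC"))  # MTHTEAGSAA
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` is one way to compare against the GC content field of the record.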