Gene Vapar_4300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_4300 
Symbol 
ID7970487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp4543316 
End bp4544557 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content65% 
IMG OID644794886 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002946178 
Protein GI239817268 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCCCT CTCCTCATTC ACCTTCGGAA CTGCGGCAGC ATTGGCGCTA CGTCCTGGGT 
TGCTTCCTCG GCTTGGCCAG TGGCGTCTCA TCCCTGTACT TCTACAGTTC AGGTCTGTTC
CTGAAGCCGC TCGCCGCCGA GTTCGGATGG ACCCGAGGCA CCGCGTCCCT GGGCTCGCTG
ATCAGCAGCG TGCTGCTCGG ATGCGCGGCG CCGTTCATCG GACAGGTGAT CGACCGGTAC
GGCGCGCGAC GCGTCACCCT GCTCTCCCTG GTGGGCCTGT CGACCTCGTT CCTGCTTCTG
GGCGTTTGGA CACAGGGGTT GGCCAGTTTT CTGGCGCTGG TCACGGTCCT GACACTGCTC
GGTGGCGCGA CCTCGCCCTT GTCCTTCACG AAGATCCTCG TGGGCAGGTT CTCTCGGCAA
CGCGGACTGG CCCTGGGCAT TGCCATCACC GGAACGGGCG TTGGGGCCAT CCTGATTCCG
CTGATCGTCA CGCCGGCCAT CGCCAATTTC GGATGGCGAG CCACCTACCT GGGCCTTGCG
GCGGTCGTCC TTGCACTTTT GCCATTGATT GCCTACTTCC TGCGGGGCGC GGATGCGGCC
CCTGCAGAGA AGGGCCGCCG TGCCCCAGCT GGAAGTAACC TGAAGGCGCT GGCCGATCCG
CGCTTCTTCC GCATTGCCGC TGTCTTCCTG CTGTGCTCGG TCGGCATCTT CGGCACGATC
GTGCACGTCG TTCCAATGCT CACCGACCTC GGGCTTTCCC CGAACCGCGC CGCCGGTCTC
GCCAGCATCC TTGGCGTGGC GGTCATCGTC GGACGAATCG TCACGGGCTT TCTGCTGGAC
ATGTTCGATG CCGCGCGCTT GTCGGCGACG CTCTTCATGC TGTCTGCCAC CGGCATGCTG
CTGCTGGCAA CGGGCCAACC GGCGCTGGTG CTTCCCGGGT TGCTCATGAC TGGCTTCGCG
GTCGGCGCGG AGTTCGACCT CGCAGCCTAT CTGGTGAGCC GGAAATTCCC GCTGAACCTC
TACAGCACCC TCTTCGGGGG TGTCTATGCC ACGGTGGCCA TTGGCGCCGG CATCGGGCCA
TTCCTCGCGG GACGGATCTT CGACATTTCC GGCAGCTATG TCTCGTGGCT CTGTCTGGCA
GCGGGATTGC TGCTTGCGGC CGCAATGCTC TGCCTCATGG AACGTCCGGC CGCCAAGCAC
GCGAGCGGCA CGGTCGCAGC GGCGGGTTCG AGCGAGCGCT GA
 
Protein sequence
MTPSPHSPSE LRQHWRYVLG CFLGLASGVS SLYFYSSGLF LKPLAAEFGW TRGTASLGSL 
ISSVLLGCAA PFIGQVIDRY GARRVTLLSL VGLSTSFLLL GVWTQGLASF LALVTVLTLL
GGATSPLSFT KILVGRFSRQ RGLALGIAIT GTGVGAILIP LIVTPAIANF GWRATYLGLA
AVVLALLPLI AYFLRGADAA PAEKGRRAPA GSNLKALADP RFFRIAAVFL LCSVGIFGTI
VHVVPMLTDL GLSPNRAAGL ASILGVAVIV GRIVTGFLLD MFDAARLSAT LFMLSATGML
LLATGQPALV LPGLLMTGFA VGAEFDLAAY LVSRKFPLNL YSTLFGGVYA TVAIGAGIGP
FLAGRIFDIS GSYVSWLCLA AGLLLAAAML CLMERPAAKH ASGTVAAAGS SER