Gene Vapar_4007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_4007 
Symbol 
ID7974572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp4249898 
End bp4251157 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content72% 
IMG OID644794593 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002945887 
Protein GI239816977 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAACGCCT CCCCACCCGC GCTGGACGGC GCGGACACCG ACATTCCCGT TGTTTCCCCC 
GCCGCACGGC CCGTGAATTT CCTGCGCGCC GTGCTTGCGC TCGGCGTCGG CGGCTTCGCC
ATCGGCACCG GCGAGTTCGT GATCATGGGC CTCTTGCCTG AAGTGGCCAA GGACATCGAC
GTCAGCATTC CCCAGGCCGG CCACGTCATC AGCGCCTATG CGCTCGGCGT GGTCATCGGC
GCCCCGGTGC TCGCGGTGCT GGCCGCCGGC TGGCGCCGCC GCGCGCTGCT GATCGCGCTC
ATGGCCGTGT TTGCCGCCGG CAACTTCGCG AGCGCGATGG CGCCGGGCTA CCTGTCGCTC
AACCTGCTGC GCTTTGCCAC CGGGCTGCCG CACGGCACCT ACTTCGGCGT GGCCGCGCTG
GTGGCCGCCA CGCTCGCGCC GCCCGGGCGC CGGGCGCGCG CCGTGGGCCT CGTGATGCTG
GGGCTCACCG GCGCCACGCT GGTCGGCGTG CCGATCGCGG CCTGGCTCGG CCAGCTGTTC
GGCTGGCGCG CCGCCTTCGT GTTCGTCGGC CTCATCGCGC TGGTGGCCGT GGCGCTGCTG
CGCCGCGACA TCCCCGACCT GGCGCCGCCC GCAGGCGCCA GCCCCTGGCG CGAGCTCGGC
GCGCTCAAGC GCAAGCAGGT GTGGTTCACG CTCGGCATCG GTGCCATCGG CTTCGGCGGC
ATGTTCTCGG TGTTCAGCTA CATCAAGCCC ACGCTGATCG AAGTTGCGGG CCTGCCGCTG
GGTGGCGTGC CGTTCGTGCT CGCGCTGTTC GGCCTGGGCA TGGTCACCGG CAACCTCGTC
GGCTCGCGCC TGGCCGACAA GTCGCTGATG CGCACCATCG GCGGCCTGCT CGTCTATGCG
GCGCTGGTGC TCGCGATGTT CTCGTTCGCG GCGCACCATG TGGTCGCCGC GGCGGTCAAC
GTGTTTCTCA TCGGCACCAC CGTGGCCATC GGACCGGCGC TGCAGATCCG CCTGATGGAC
GTGGCCGGCG ACGCGCAGAC GCTCGCCGCG GCGCTCAACC ACTCGGCCTT CAACATGGCC
AATGCGCTGG GCGCCTGGCT CGGCGGCGTG GCCATTGCGG CCGGGCTGGG CTGGACCTCG
ACCGGGTGGG TGGGTGCGCT GCTCGCGCTC GCGGGCATGG GGGTGTTCGG CTGGGCGATC
GCGAGCGCGC GTGCCGAGGC GCGGCCGGGC AAGATCGCCT GCGAAGGTTC GGCGGGCTAG
 
Protein sequence
MNASPPALDG ADTDIPVVSP AARPVNFLRA VLALGVGGFA IGTGEFVIMG LLPEVAKDID 
VSIPQAGHVI SAYALGVVIG APVLAVLAAG WRRRALLIAL MAVFAAGNFA SAMAPGYLSL
NLLRFATGLP HGTYFGVAAL VAATLAPPGR RARAVGLVML GLTGATLVGV PIAAWLGQLF
GWRAAFVFVG LIALVAVALL RRDIPDLAPP AGASPWRELG ALKRKQVWFT LGIGAIGFGG
MFSVFSYIKP TLIEVAGLPL GGVPFVLALF GLGMVTGNLV GSRLADKSLM RTIGGLLVYA
ALVLAMFSFA AHHVVAAAVN VFLIGTTVAI GPALQIRLMD VAGDAQTLAA ALNHSAFNMA
NALGAWLGGV AIAAGLGWTS TGWVGALLAL AGMGVFGWAI ASARAEARPG KIACEGSAG