Gene Rpal_3539 details

Gene Information

Locus tag: Rpal_3539
Symbol:
ID: 6411213
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 3789172
End bp: 3790419
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 71%
IMG OID: 642713417
Product: major facilitator superfamily MFS_1
Protein accession: YP_001992514
Protein GI: 192291909
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
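
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3789172, 3790419)
print(length)                  # 1248
print(protein_length(length))  # 415
```

Under those assumptions both values agree with the Gene Length and Protein Length fields of the record.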


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGTCA GCGAGCCGGC CCTGCGCGTC GCGGAAGAAT CTCGACCCGA ACCGCCGCCG 
AAGCCCGGCC CGGGCGGCGC CTCGGTGCCG GGGCCCGCAA TGGGCGCCGG GCTGACGCTG
GCGATGGCGG CGGCCGCGGG CATCAGCGTT GCCAATATCT ATTACAACCA GCCTATGCTG
GGGGTGATCG AGCGTGACCT CGGCAATCCG GCGCTGACCG GGATGATCCC GACGGCGACT
CAACTCGGCT ACGCGGTCGG TCTATTCCTG CTGGTGCCGC TCGGCGACCT GACCGACCGG
CGGCGGCTGA TTGCCGGCCA GTTCGTGCTG CTGGCGGTCG CGGCTGCGCT GGTGGCGCTG
GCGCCGTCGG CCTGGTTGAT CATTGCCGCG TCGCTGGCGC TCGGCGCCTG CGCGACCGTG
GCGCAGCAGG TGGTGCCGTT CGCCGCGGCG CTGGCGGCAC CGGAGCGGCG CGGCAAGACC
ATCGGCCTGG TGATGGCCGG GCTGTTGTGC GGCATCCTGC TCAGCCGGAC GGTGGCCGGC
TTTGTTGCCG GCCATCTCGG CTGGCGCGAG ATGTTCTGGC TGGCGGTGCC GGCGGCGCTC
GCGGCCGCCG CGCTGATGGC GTGGCTGCTG CCGCGCCATC ACGGTCACCT CGATATCAGC
TATGGCGCCG CGCTGAAGTC GCTCGCGTCG CTGTGGCGCG AGCAGCGGGA TCTCCGGCGG
GGGACCGCGG TGCAGGCGGC GCTGTTCGCC TCGTTCAGCG TGTTCTGGAC GGTGCTGGCG
CTGCATCTGC AGGAGCCGAA GTTCGGGCTC GGGGCCGAGG CGGCGGGCCT GTTCGGCCTG
GTTGGCGTGG TTGGCGTGTT GGCGGCGCCG ATCTCCGGCC GGATCGCCGA CCGAAGTGGA
CCGGGACCGG TGATCGCGAT CGGCGCGGCT CTGGTGCTGG CGTCGTGGGT GTTGTTCGGT
CTGTGGGGCA GCGTCGTTGG ACTGCTGATC GGCGTCGTGG TGCTGGATTT CGGTCTGCAG
AGCGCGCTGA TCTCCAACCA GCACATCGTC TACGCGTTGG TGCCGGAAGC GCGAAACCGC
CTCAACACCG TGTTCATGAC CGGGATGTTC ATCGGCGGAT CGGTCGGTTC TGCCGGCGCG
GCCTTCGCCT GGGCGCACGG CGGCTGGACG GTGGTCAGCC TCTATGGCGG CGCGCTGGCG
GCAATCGCCT TGCTACTCGA ACTGACGGCG CGTTGGTCCC GCCGTTAG
 
Protein sequence
MSVSEPALRV AEESRPEPPP KPGPGGASVP GPAMGAGLTL AMAAAAGISV ANIYYNQPML 
GVIERDLGNP ALTGMIPTAT QLGYAVGLFL LVPLGDLTDR RRLIAGQFVL LAVAAALVAL
APSAWLIIAA SLALGACATV AQQVVPFAAA LAAPERRGKT IGLVMAGLLC GILLSRTVAG
FVAGHLGWRE MFWLAVPAAL AAAALMAWLL PRHHGHLDIS YGAALKSLAS LWREQRDLRR
GTAVQAALFA SFSVFWTVLA LHLQEPKFGL GAEAAGLFGL VGVVGVLAAP ISGRIADRSG
PGPVIAIGAA LVLASWVLFG LWGSVVGLLI GVVVLDFGLQ SALISNQHIV YALVPEARNR
LNTVFMTGMF IGGSVGSAGA AFAWAHGGWT VVSLYGGALA AIALLLELTA RWSRR
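
Both sequences are printed in blocks of ten characters separated by spaces and line breaks, so they must be joined before use. The following is a minimal sketch of that cleanup plus a GC-fraction calculation for the nucleotide sequence. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the code assumes the sequence contains only the unambiguous bases A, C, G and T.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First two blocks of the gene sequence, as formatted in the record.
example = clean_sequence("ATGAGCGTCA GCGAGCCGGC")
print(len(example))  # 20
print(gc_fraction(example))
```

Applying the same two calls to the full gene sequence gives a value that can be compared with the GC content field of the record.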