Gene Rpal_5003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_5003 
Symbol 
ID6412695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5387398 
End bp5388663 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content70% 
IMG OID642714886 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_001993967 
Protein GI192293362 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.408823 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCGCCCA GCCCTCCCCA GACCCTGGCG CGGCCGCTCG ACGCGCTGAA CTTCTTCCTC 
GCCGACGTGC GCGACGGCCT CGGGCCATAT CTGGCGATCT ACCTGCTGGC CGTGCAGAAC
TGGAACGAGG CGTCGATCGG GCTGGTGATG TCGATCGCCG CGGCGGCCGG CATCGCAGCT
CAGACCCCAG CCGGGGCACT GATCGACCGC TCCACCGCCA AGCGCGCGCT GATCATCGCG
GCCGCTCTGG TCGTGACCGC CGCATCGGTG GTGCTGCCGT GGCTGGACAG TTTCGTGCTG
GTGGCGGCGA CCCAGGCGCT GGCAGCGGCT GCTGGCGCGA TCTTTGCGCC CGCGGTGGCG
GCGCTAACGC TGGGCATCGT CGGGCCGCGC GCCTTCGCCC GTCGGACCGG CCGCAACGAA
GCCTTCAACC ACGCCGGCAA CGCGGTGGCG GCGATGCTCA CCGGCGCGTT CGCCTATGGG
TTCGGCCCCG GCGTGGTGTT CTGGCTGATG GCCGCGATGG CGCTCGCCAG CATCTTCGCC
ACGCTGGCGA TCCCAGCCGC GGCGATCGAC GATCACGTCG CCCGCGGGCT CGGCGACGAT
CACGAGCGCG GCGCGCATCA CGACCAGCCG TCCGGCTTTA AGGTGCTGCT GACGTGCCGG
CCGCTCTTGA TCTTCGCCGG CGCCACCGTG CTGTTTCACT TCGCCAATGC GGCGATGCTG
CCGCTGGTCG GACAGAAGCT GGCGCTGGTG AACAAGAACC TCGGCACCAC GCTGATGTCG
GTGTGTATCG TCGCCGCGCA GCTCGTGATG GTGCCGGTGG CGGCGCTGGT CGGGCACAAG
GCCGACGTCT GGGGCCGCAA ACCGATCTTC GCCGTCGCGC TCGGCGTGCT GGCGCTGCGC
GGCGCGCTAT ACCCCCTGTC CGACAATCCG TATTGGCTGG TCGGCGTGCA ACTGCTCGAC
GGCGTCGGCG CCGGCATTTT CGGCGCGCTG TTTCCGCTGG TGGTGGCCGA CCTCACCCAC
GGCACCGGGC ATTTCAACAT CAGCCAGGGC GCGATCGCGA CGGCCGCAGG CCTCGGCGCC
GCGCTGTCGA CCGGCTTCGC CGGACTGATC GTGGTCAGCG CGGGCTACAG CGCCGCGTTC
CTGGCGCTTG CCGGCATCGC TGCTGCGGCG CTGGTGTTGT TCCTGGTGCT GATGCCGGAG
ACCCGACAGC AGCAATCGGC GGCACCACCT TCCGCAGCGC CGGAGGCGGT CTCATCCACC
GTCTAA
 
Protein sequence
MPPSPPQTLA RPLDALNFFL ADVRDGLGPY LAIYLLAVQN WNEASIGLVM SIAAAAGIAA 
QTPAGALIDR STAKRALIIA AALVVTAASV VLPWLDSFVL VAATQALAAA AGAIFAPAVA
ALTLGIVGPR AFARRTGRNE AFNHAGNAVA AMLTGAFAYG FGPGVVFWLM AAMALASIFA
TLAIPAAAID DHVARGLGDD HERGAHHDQP SGFKVLLTCR PLLIFAGATV LFHFANAAML
PLVGQKLALV NKNLGTTLMS VCIVAAQLVM VPVAALVGHK ADVWGRKPIF AVALGVLALR
GALYPLSDNP YWLVGVQLLD GVGAGIFGAL FPLVVADLTH GTGHFNISQG AIATAAGLGA
ALSTGFAGLI VVSAGYSAAF LALAGIAAAA LVLFLVLMPE TRQQQSAAPP SAAPEAVSST
V