Gene RPD_4386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4386 
Symbol 
ID4024911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4852104 
End bp4853321 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content69% 
IMG OID637964596 
Productmajor facilitator transporter 
Protein accessionYP_571504 
Protein GI91978845 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.185131 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCCA TCCCGCCGGA TGACGAGACC TCGATCCGCT ATGGCGGCTG GCGCATCGTT 
GCGGTGTGCT TTGCGGTCGC AACCTTCGGC TGGGCTTTCG GGTTCTATGG CCAGAGCGTC
TATCTCGCCG AGCTCACGCG CCTGCATGGC TGGCCGTCGT CGCTGATCGC CACCGCGACG
ACTTTCTTTT ATCTCGGCGG CGCGCTGCTG GTCGCCTTTG TGGGCGACGC GATCCGGATG
ATCGGCGCGC GCGCGTGCCT GCTCGGCGGC ATCGCCGCGA TGGCGCTCGG CACCGCGCTG
CTCGGCCGGA TCGATGCGCT GTGGCAGCTT TATGCCGTCT ACGTGCTGCT CGCGATCGGC
TGGGCCGGCA CCAGTCTCGG CGCGGTTACC AGCACGCTCG GCCTGTGGTT CGACCAGCGC
CGTGGCATGG CGATCAGCCT GGCGTTGAAC GGCGCGAGTT TCGGGGGCAT TGCCGGCGTG
CCGCTGTTGG TGGCGGCGAT CGAACATCTC GGTTTCGCCG GCGCCACGCT CGCGGCGGCG
GTCGTGTCCG TCGTCGTGCT GATGCCGATC GTGGCGATCT TCGTCGGCCG CCCGCCGCAG
CGCGCCGCTG CTCACGCTGC CGGGCCGGGT GCGGTGCAGG CCCTGTCGTC GGGCGCGATC
CGCCGGCATG CGTTCCGCGA CACCGCGTTC CTCACCGTCA CGATCGCCTT CGCGCTGGTG
CTGTTCGCGC AGGTCGGGTT CATCGTGCAC CTGATCGCCT ATCTCGATCC GCTGGTCGGC
CGCGAGCGCG CCGCGGTCGC GGTGTCGTTG CTGACGACGA TGGCGGTGGT CGGCCGCGTG
TCGCTGTCGA CCGTGATCGA TCGCCTCGAC CAGCGGCTGG TCTCGGCGAT CTCGTTTGCG
AGCCAGGCGG CGGCGCTGGC GATCGTGATC CTGTCGCGCG ACGCCACGCT GCTGCTGGTC
GCTTGCGCGC TGTTCGGCTT CTCGGTCGGC AATCTGATCA CGCTGCCGGC GCTGATCGTG
CAGCGCGAAT TCGCTCCCGG CTCGTTCGGC GTGCTGGTCA GCCTCAACAC CGCGATCAAT
CAGGTGACCT ACGCGTTCGG CCCGGGGGTG GTCGGCCTCC TCCGCGACGC TTCCGGCAGC
TACACGGCGC CGTTCCTCGG CTGCATCGCG CTACAACTGA TCGCCGCCAT GCTGGTGATG
GTGCGGGGGC GGAGCTAG
 
Protein sequence
MAAIPPDDET SIRYGGWRIV AVCFAVATFG WAFGFYGQSV YLAELTRLHG WPSSLIATAT 
TFFYLGGALL VAFVGDAIRM IGARACLLGG IAAMALGTAL LGRIDALWQL YAVYVLLAIG
WAGTSLGAVT STLGLWFDQR RGMAISLALN GASFGGIAGV PLLVAAIEHL GFAGATLAAA
VVSVVVLMPI VAIFVGRPPQ RAAAHAAGPG AVQALSSGAI RRHAFRDTAF LTVTIAFALV
LFAQVGFIVH LIAYLDPLVG RERAAVAVSL LTTMAVVGRV SLSTVIDRLD QRLVSAISFA
SQAAALAIVI LSRDATLLLV ACALFGFSVG NLITLPALIV QREFAPGSFG VLVSLNTAIN
QVTYAFGPGV VGLLRDASGS YTAPFLGCIA LQLIAAMLVM VRGRS