Gene RPB_4678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4678 
Symbol 
ID3912496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp5291396 
End bp5292610 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content69% 
IMG OID637886583 
Productmajor facilitator transporter 
Protein accessionYP_488272 
Protein GI86751776 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGAACC TTCCGCTGCG CGACGAGAGT TCGATTCGCT ATGAAGGCTG GCGGATCGTC 
GCGATGTGTT TTGCGGTCGC GACCTTCGGC TGGGCGCTCG GCTTCTACGG CCAGAGCGTC
TATCTCGCCG AGCTGACGCG GCTGCACGGC TGGCCGTCGT CGCTGATCGC GACCGCGACG
ACGTTCTTCT ATCTCGGCGG CGCGCTGCTG GTCGCCTTCG TCGGCGACGT CATCCGCGTG
ATCGGGCCGC GCGCCTGTCT GCTCGGCGGC ATCGCCGCGA TGGCGCTCGG CACCGCGCTG
CTCGGCCGGA TCGATGCGGT CTGGCAGCTC TACGCCGTCT ATGTGCTGCT CGCGGTCGGT
TGGGCCGGCA CCAGCCTCGG CGCCATCACC AGCACGCTCG GGCTGTGGTT CGACCGGCGC
CGCGGCATGG CGATCAGCCT GGCGCTGAAC GGCGCCAGCT TCGGCGGCAT CGCCGGCGTG
CCGTTGCTGG TCGCGGCGAT CGGACATTTC GGATTTGCCG ACGCGACGCT GGCGGCGGCG
ATCGCCGGGG TATTGCTGAT GCCGGTCGTC GCGATCGTCG TCGGCCGCCC GCCGCTGCGC
ATCGCGGAGC ATCCCGCCGG GCCGGGTGCG GTGCAGGCGC TGTCGTCGGG CGCGATCCGC
CGCGATGCGT TCCGCGACAT TGCGTTCCTC ACCGTCACCA TCGCGTTCGC GCTGGTGCTG
TTCGCGCAGG TCGGCTTCAT CGTGCACTTG ATCGCCTATC TCGACCCGTT GATCGGCCGC
GAGCGCGCCG CAGCCGCGGT GGCGCTGCTG ACCACGATGG CGGTGGTCGG CCGCGTCTCG
TTGTCGACCG TGATCGACCG GCTCGACCAG CGGCTGGTGT CGGCGATCTC GTTCCTGAGC
CAGGCGGTGG CGCTGGCGAT CGTGATCCTG TCGCGCGACG GCACGCTGCT ATTGATCGCC
TGTGCGCTGT TCGGCTTCTC GGTCGGCAAT CTGATCACGT TGCCGGCGCT GATCGTGCAG
CGCGAATTCC CGGCCGCCTC GTTCGGCGTC CTGATCAGCC TCGTCACCGC GATCAATCAG
GTGACCTATG CGTTCGGCCC CGGCGTGATC GGCCTCGTCC GCGACCTCTC CGGCAGTTAC
ACGCTACCGT TCGCCGGCTG CATCGTGCTG CAACTGATCG CCGCGGCGCT GGTGATGATG
CGGGGACGAA GCTGA
 
Protein sequence
MVNLPLRDES SIRYEGWRIV AMCFAVATFG WALGFYGQSV YLAELTRLHG WPSSLIATAT 
TFFYLGGALL VAFVGDVIRV IGPRACLLGG IAAMALGTAL LGRIDAVWQL YAVYVLLAVG
WAGTSLGAIT STLGLWFDRR RGMAISLALN GASFGGIAGV PLLVAAIGHF GFADATLAAA
IAGVLLMPVV AIVVGRPPLR IAEHPAGPGA VQALSSGAIR RDAFRDIAFL TVTIAFALVL
FAQVGFIVHL IAYLDPLIGR ERAAAAVALL TTMAVVGRVS LSTVIDRLDQ RLVSAISFLS
QAVALAIVIL SRDGTLLLIA CALFGFSVGN LITLPALIVQ REFPAASFGV LISLVTAINQ
VTYAFGPGVI GLVRDLSGSY TLPFAGCIVL QLIAAALVMM RGRS