Gene RPB_4634 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4634 
Symbol 
ID3912451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp5237067 
End bp5238482 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content67% 
IMG OID637886538 
Productmajor facilitator transporter 
Protein accessionYP_488228 
Protein GI86751732 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCAGA CCCTCGCCGT CACCACGGCT TCTTCGCGCC GCTGGCGGGT GCTGGCGATC 
GTGGTCGCCG CGCAGTTCAT GTTCGGCGTC GATTCCTTCA TCGTCAACGT CGCGATTCCG
ACGATCTCCG TCGAGCTGAA CGCCTCGTCG TCACAGCTCG AGGCGGTGAT CGCGATCTAT
CTGATCGGCT ATGCGACACT GATCGTCACC GGCGGCCGGC TCGGCGATAT CTACGGCACC
AAGACGGTAT TCCTCGCCGG TGTGATCGGC TTCACGTTGA CTTCGCTGTG GTGCGGGCTG
GCGCGCTCCG GCACCGAACT GATCCTGGCG CGGCTCGCGC AGGGCACCAC CGCGGCGCTG
ATGGTGCCGC AGGTGCTGGC GACGCTGCAC GTGCTGTTTC CGGACGCCGC CCGCGCCAAG
GCTTTCGCGA TCTACGGAAT TGTGCTCGGG CTCGCCGGCG CCGCCGGCTT CGCGCTCGGC
GGGCTATTGG TGACGCTCGA TCTCGGCGGC TACGGCTGGC GCGCGATCTT CTTCGTCAAT
GGTCCGGTCG GGCTGATCAT CTTCGCCGCG GCGGCGTGGG TGGTGCCGCA GGCGCCGCGC
CGGCCGGGCA CGAGGCTCGA TCTGCCGGGC GCGGTGATCC TGTTCACCGG CTTGCTGTGC
GTGATCGGCC CGCTGCTGTT CGGCCGCGAA GTCGACTGGG CGGGCTGGAT CTGGGTGGTG
ATGGCCGCCG GCGTCGCCAT CGTGCTGATT TTCCTGCGCT ACGAACGCGG TGTTGCGGCG
CGCGGCGGGA TGCCGCTGAT CGATCTGGCG CTACTCGCCG ACTCCGCTTT CATGCGCGGC
CTCGGCGCGG TGTTCTGCTT CTTCTTCGCC AACCAGTCAT TCTATCTGGT GGTGACGCTT
TACATGCAGA TGGTGCTCGA GATCCCGCCG CTTCCAGCCG GCCTGGTGTT CCTGCCATTG
GCGTTGGCCT TCGTGATTGC GGCCCGGCAT TCCGGCGCGC GGGCACGGCG CCGTGGCACC
CTGGTGCTGA TCGAGGGCTG CCTGCTGCAG ATCGCAGGTC TCGCTCTGGT CGCGCTGACG
GTCGCCATGA TCGAGGCGCC GACGCCGTTC GTGCTGGCGC TGGTGCTGAT CGTGGTCGGC
TACGGTCAGG GCCTGGTGAT GGCGCCGTTG TCCGGCGCAG TGTTGTCGAC GGTACAGGCG
GCCAGCGCCG GCTCCGGCTC CGGCCTGTAC GGCACCGCCA CGCAGATTTC CTCGGCCGCC
GGTATCGCTG CGATCGGCTC GCTGTATTTT TCGCTCGACC ATGCGATCTC CGGGCAATTC
GCGTTCCTGG TCGCATTGTC GGTGATCGCA GGCTCGGTCG CCGGCAGCAT CATCCTGCTG
CACTGGATGC GGCGGGCGGC CTTGGTCCCA ACGTAG
 
Protein sequence
MDQTLAVTTA SSRRWRVLAI VVAAQFMFGV DSFIVNVAIP TISVELNASS SQLEAVIAIY 
LIGYATLIVT GGRLGDIYGT KTVFLAGVIG FTLTSLWCGL ARSGTELILA RLAQGTTAAL
MVPQVLATLH VLFPDAARAK AFAIYGIVLG LAGAAGFALG GLLVTLDLGG YGWRAIFFVN
GPVGLIIFAA AAWVVPQAPR RPGTRLDLPG AVILFTGLLC VIGPLLFGRE VDWAGWIWVV
MAAGVAIVLI FLRYERGVAA RGGMPLIDLA LLADSAFMRG LGAVFCFFFA NQSFYLVVTL
YMQMVLEIPP LPAGLVFLPL ALAFVIAARH SGARARRRGT LVLIEGCLLQ IAGLALVALT
VAMIEAPTPF VLALVLIVVG YGQGLVMAPL SGAVLSTVQA ASAGSGSGLY GTATQISSAA
GIAAIGSLYF SLDHAISGQF AFLVALSVIA GSVAGSIILL HWMRRAALVP T