Gene RPD_1958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1958 
Symbol 
ID4022440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2197957 
End bp2199549 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content69% 
IMG OID637962151 
Productmajor facilitator transporter 
Protein accessionYP_569094 
Protein GI91976435 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.452075 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATGT CCGAGCATCA AAGCCGCGAA CCTGCCGGCC GGCCCTTGTC GCCCGACGAA 
GCGGCGGCGG AACTCTCGCA CAGCTCCACG GACGTGATCG ATCTCGGCCA TGCGCCGCCG
CTGGCGCCGT CCGCGCCGCT GACCACGGAC GAGGTCCGCA CCATCCTGTT GAGCCTGTTG
CTGGCGATGT TCCTCGCCGC GCTCGACCAG ACCATCGTGG CGACCGCGTT GCCGACGATC
GGGCGGCAGT TCGGCGACGT CGAGAATCTG TCCTGGGTGA TCACCGCCTA TCTGTTGTCC
TCGACCGCGG TGGCGCCGGT GTTCGGCAGC CTCTGCGATA TCTACGGCCG CCGCGCCACG
ATCATCGCGG CGCTCAGCCT GTTCATCGCC GGCTCGGTGA TGTGCGCGCT GGCGCCGAGC
GTTCTGGTGC TGATCCTCGG CCGCGCGTTG CAGGGGCTCG GCGGCGGCGG GATCATGCCG
GTGGTGCAGA CGGTGATCTC CGACGTGGTC AGCCCACGCG AGCGCGGCAA GTATCAGGCG
TATTTCTCCG GCGTCTGGGT GGCGGCGGGA ATCGGCGGCC CGGTGCTCGG CGGGGCCTTC
GCCGAGCATC TGCACTGGTC GATGATCTTC TGGATCAATC TGCCGCTGTC GATCGGCGCG
CTGGCGCTGC TGCTGCCGAA GATGGCGAAG ATTCCGGTGT ATCACCGCCG TCGCAAGGTC
GACTGGCTCG GCGGCGTGCT GCTGATGGCC TCGGCGCTGG CGGTGATGCT GGTGCTGACC
TGGGGCGGCA CGCGGTTTTC GTGGCTGTCG CCGGTGATCC TGGCGCTCGC CGGCGGCGCG
GTGCTGTTCG CGGCGAGCTT CATCTGGCAC GCGCTGCGCG AGCCGGAGCC GTTCCTGCCG
CTGCAATTGA TGGGCGGCAC GGTGGTGCCG TGGGCGATGG CGGCGGGCGG CTTCGCGATG
GGCGCGATGA TCGGGCTCAC CGTGCACATC CCGCTGTATT ACGAGGCGGT GTATCACCTC
AGCGCCAGCG CCTCGGGTCT GGCGCTGATC CCGATCGCCG CGGTCTCGGT GTTCGGCGCG
GCGTTCACCG GCCGCGCCAT GACGCATCTC GATCATTACA AGCGGATCGC GATCATCGGC
ACCGGCTTCT CGGCGCTGAT GGCGGCGGCG ATCGCGCTGC TGACGCCGTT GCCGCTGTGG
GCGTTCCTGA CGCTGCTGTC GCTGTTCTCG CTCGGCCTCG GTACGGTGTT TCCGGTCAGC
ATGGTGTCGA TCCAGAACGC GGTGCCGCGA CCGCAGATCG GCACCGCCAC CGGCGCGATG
AACTTCTTCC GCGCGCTGAT GTCGTCGTTC ACGGTGGCGG CGTTCACCGC GGTGCTGCTG
ATCACGTTCG GCGGCGAGAT CCAGCTCGGC GGCGCAGAGC ATCGCCACGC GGTCGGCAGC
GTCGCCTCCG CCGACATGGT GGCGGCGTTC CGCTGGGTGT TCGGCGCGGC GGCATTGATG
CTGGCCGGCT CGGCGATCTG CGTCGCGATC ATGGAGGAGC GCCGGCTCGC CGGCCCGGAC
AACACGCCGC CGCCGCTGGA GCTGGCGGAG TAG
 
Protein sequence
MSMSEHQSRE PAGRPLSPDE AAAELSHSST DVIDLGHAPP LAPSAPLTTD EVRTILLSLL 
LAMFLAALDQ TIVATALPTI GRQFGDVENL SWVITAYLLS STAVAPVFGS LCDIYGRRAT
IIAALSLFIA GSVMCALAPS VLVLILGRAL QGLGGGGIMP VVQTVISDVV SPRERGKYQA
YFSGVWVAAG IGGPVLGGAF AEHLHWSMIF WINLPLSIGA LALLLPKMAK IPVYHRRRKV
DWLGGVLLMA SALAVMLVLT WGGTRFSWLS PVILALAGGA VLFAASFIWH ALREPEPFLP
LQLMGGTVVP WAMAAGGFAM GAMIGLTVHI PLYYEAVYHL SASASGLALI PIAAVSVFGA
AFTGRAMTHL DHYKRIAIIG TGFSALMAAA IALLTPLPLW AFLTLLSLFS LGLGTVFPVS
MVSIQNAVPR PQIGTATGAM NFFRALMSSF TVAAFTAVLL ITFGGEIQLG GAEHRHAVGS
VASADMVAAF RWVFGAAALM LAGSAICVAI MEERRLAGPD NTPPPLELAE