Gene RPB_3496 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3496 
Symbol 
ID3911298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4000359 
End bp4001954 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content69% 
IMG OID637885398 
ProductMFS transporter 
Protein accessionYP_487102 
Protein GI86750606 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.597192 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATGT CCGAACACCG ACGCCGTGAG CCGTCCGGCA GGCAATTGTT GTCCGACGAA 
GCGGCGGCGG AACTGTCGCA CATTCCCACC GAAGTGATCG ATCTCGGCGA CGCGCCGCCG
CTGGCGCCGG CGAATACGCT GTCGCAGGGC GAAGTCCGCG CCATCCTGAT GAGTCTGCTG
CTGGCGATGT TCCTCGCCGC GCTCGACCAG ACCATCGTCG CCACCGCGCT GCCGACGATC
GGGCGGCAGT TCGGCGACGT CGAGAATCTG TCCTGGGTGA TCACCGCTTA TCTGTTGTCG
TCGACCGCGG TGGCGCCGGT GTTCGGCAGC CTCGCCGACA TTTACGGCCG CCGCGCCACC
ATCATCGCGT CGATCAGCCT GTTCATCGCC GGTTCAGTGA TGTGCGCGCT GGCGCCGAGC
CTGCTGGTGC TGATTCTCGG TCGCGCGCTG CAGGGGCTGG GCGGCGGCGG CATCATGCCG
ATCGTGCAGA CGGTGATCTC GGACGTGGTG ACGCCGCGCG AGCGCGGCAA GTATCAGGCC
TATTTCTCCG CCGTCTGGGT CTCGGCCGGG ATCGGCGGGC CGATCCTCGG CGGCGCCTTC
GCCGAGCATC TGCACTGGTC GATGATCTTC TGGATCAATC TGCCGCTGGC GATCGGCGCG
CTGGCGTTGC TGCTGCCGAA GATGGCCAAG ATTCCGACGT ATCACCGCCG CCGCAAGGTC
GACTGGCTGG GCGGCGTGCT GCTGATGGCC TCGGCGATGG CGGTGATGCT GGCGCTGACC
TGGGGCGGCA CGCGGTTCTC CTGGCTGTCG CCGACGATCC TGGCGCTGAG CGGCGGCGCA
GTGCTGCTCG CGGTGTGCTT CGTCTGGCAC GCGCTGCGGG CGCCCGAGCC GTTCCTGCCG
CTGCAATTGA TGGGCGGCAC GGTGGTGCCG TGGGCGATGG CGGCGGGCGG CTTCGCGATG
GGAGCGATGA TCGGCCTGAC GGTGCACATG CCGCTGTACT ACGAGGCGGT GTATCACCTG
ACCGCGAGCC AGTCCGGCCT GGCGCTGATC CCGATCGCCG CGGTGTCGGT GCTCGGCGCG
GCGTTCACCG GCCGCGCTAT GGTGAAGGTC GAGCGCTACA AGCGGATCGC GATCCTCGGC
ACCGGCTTCT CGGCGCTGAT GGCGGCGGCG ATCGCGGTGA CGACGCCGCT GCCGCTGTGG
CTGTTCCTGG CGCTGCTGTC GCTGTTCTCG ATCGGGCTCG GCACGGTGTT CCCGGTCACC
GTGGTGTCGA TCCAGAACGC GGTGGCGCGG CCGCAGATCG GCACCGCGAC CGGGGCGATG
AACTTCTTCC GCGCGCTGAT GGCGTCGTTC ACGGTGGCGG CGTTCACCGC GGTGCTGCTG
ATCGCGTTCG GCGGCGAGAT CCAGCTCGGC GGCGCCGAGC ATCGCCACGC CATCGGCACG
GTCGCTGCCG CCGACATGGT GGCGGCGTTC CGCTGGGTGT TCGCCGCGGC CGCCGCGATG
CTGGCCGCGT CGGCGATCTG CGTGGCGATC ATGGAAGAGC GCACGCTCGC CGGTCCGGAG
CAGACTTCGC CGCCGCCGCT GGAGATGGCC GAGTAG
 
Protein sequence
MSMSEHRRRE PSGRQLLSDE AAAELSHIPT EVIDLGDAPP LAPANTLSQG EVRAILMSLL 
LAMFLAALDQ TIVATALPTI GRQFGDVENL SWVITAYLLS STAVAPVFGS LADIYGRRAT
IIASISLFIA GSVMCALAPS LLVLILGRAL QGLGGGGIMP IVQTVISDVV TPRERGKYQA
YFSAVWVSAG IGGPILGGAF AEHLHWSMIF WINLPLAIGA LALLLPKMAK IPTYHRRRKV
DWLGGVLLMA SAMAVMLALT WGGTRFSWLS PTILALSGGA VLLAVCFVWH ALRAPEPFLP
LQLMGGTVVP WAMAAGGFAM GAMIGLTVHM PLYYEAVYHL TASQSGLALI PIAAVSVLGA
AFTGRAMVKV ERYKRIAILG TGFSALMAAA IAVTTPLPLW LFLALLSLFS IGLGTVFPVT
VVSIQNAVAR PQIGTATGAM NFFRALMASF TVAAFTAVLL IAFGGEIQLG GAEHRHAIGT
VAAADMVAAF RWVFAAAAAM LAASAICVAI MEERTLAGPE QTSPPPLEMA E