Gene RPB_1981 details


Gene Information

Locus tag: RPB_1981
Symbol:
ID: 3909486
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2250567
End bp: 2252153
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 64%
IMG OID: 637883875
Product: EmrB/QacA family drug resistance transporter
Protein accession: YP_485600
Protein GI: 86749104
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
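
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent, which a short calculation can confirm. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(2250567, 2252153)
print(length, protein_length(length))  # 1587 528
```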


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0426587
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGCAT CCACCACAGC GACCGCCGGA GCGGTGCCGC AGGCGCCGTC CGAACACGTG 
CCGGCCAGCC GGCTGATCGC GTTCCTGATC ATGGTGTTCG GGATGTTCAT GTCGATCCTG
GACATCCAGA TCGTCTCGGC ATCGCTGTCC GAAATCCAGG CCGGCCTGTC GGCGTCATCG
TCCGAAGTCT CCTGGGTGCA GACCTCGTAT CTGATCGCCG AAGTGATCGC GATCCCGCTG
TCCGGCTTCC TGTCGCGCGC GCTCGGCACG CGCAATCTGT TCGCGATCTC GGCGGCCGGC
TTCACCTTCG CCAGCCTGAT GTGCGGTCTC ACCTCGTCGA TCACCGAGAT GATCGTGTGG
CGGGCGATCC AGGGCTTCCT CGGCGCCGGC ATGATCCCGA CCGTGTTCGC CTCGGCCTAC
ACGGTGTTTC CGCGCAGCAA ATTCAACCTG GTCGGGCCGA TCATCGGCCT GGTGGCGACG
CTGGCGCCGA CCATCGGCCC GACGGTCGGC GGCTACATCA CCGACCTGAT GTCGTGGCAC
TGGCTGTTCT TCATCAACGT CGTCCCCGGC ATCGGCATCA CCATCGGCGT GCTGATGCTG
GTCGATTTCG ACGAGCCGCA TTACGAACTG CTCGACCATT TCGACTGGTG GGGGCTGCTG
TTCATGGCGG GCTTCCTCGG CTCGCTCGAA TACGTGCTCG AGGAAGGCCA TCGCAACGAC
TGGTTCAGCG ACGAGTCGAT CTTCATCTTC GCGATCGTCT GTGCGGTCTG CGCCGTGGGG
TTCTTCTGGC GGGTGCTGAC CGCGCGCGAG CCGATCGTCG ACATCCGTGC CTTCACCAAC
CGCAATTTCG CGTTCGGTTG CCTGTTCTCG TTCTGCGTCG GTATCGGCCT GTACGGCCTG
ACCTACATCT ATCCGCGCTA CCTCGCCGAA GTCCGCGGCT ACAGCGCGCT GATGATCGGC
GAGACGATGT TCGTCTCCGG CATCGCGATG TTTCTGTCCG CGCCGCTGGT CGGCCGGCTG
ATGACGCTGG TCGACATGCG CATCCTGATC GCGATCGGCC TTTTGCTGTT CGCCGCCGGC
ACCTGGCTGA TGACCGGGAT CACCCGCGAC TATGATTTCT ACGAGCTGCT GTGGCCGCAG
ATCTTCCGCG GCGTCGGCAT GATGATGGCG ATGGTGCCGG TCAACAACAT CGCGCTCGGC
ACGCTGCCGC CGGAGCGCGT CAAGAACGCC TCCGGCCTGT TCAACCTGAC GCGCAATCTC
GGCGGCGCGC TCGGGCTGGC GCTGATCAAC ACCATCCTCG ACGGCCGCAC CGATTTCCAC
ATCTCGCGGC TGCACGACAA GGTGAATTGG GGCAACGCCC AGGCGGTCGA CTTCCTCAAC
ATGCTGACGC AGAAATTCCA GGGCATGGGC GACGCCTCGC TGATGGCGCT GAAGCAGTTC
AACCAAATCG TCCACCGCCA GGCCGTCACC ATGAGCTTCG GCGACGCGTT CTTCCTGCTG
ACGATCTTCT ACGTTGGGCT CAGCACGCTG GTGGTGCTGG TGGCGAAACC GGCGAACCCG
GCGGCAGCGG GGGGCGGGGG GCATTAG
 
Protein sequence
MSASTTATAG AVPQAPSEHV PASRLIAFLI MVFGMFMSIL DIQIVSASLS EIQAGLSASS 
SEVSWVQTSY LIAEVIAIPL SGFLSRALGT RNLFAISAAG FTFASLMCGL TSSITEMIVW
RAIQGFLGAG MIPTVFASAY TVFPRSKFNL VGPIIGLVAT LAPTIGPTVG GYITDLMSWH
WLFFINVVPG IGITIGVLML VDFDEPHYEL LDHFDWWGLL FMAGFLGSLE YVLEEGHRND
WFSDESIFIF AIVCAVCAVG FFWRVLTARE PIVDIRAFTN RNFAFGCLFS FCVGIGLYGL
TYIYPRYLAE VRGYSALMIG ETMFVSGIAM FLSAPLVGRL MTLVDMRILI AIGLLLFAAG
TWLMTGITRD YDFYELLWPQ IFRGVGMMMA MVPVNNIALG TLPPERVKNA SGLFNLTRNL
GGALGLALIN TILDGRTDFH ISRLHDKVNW GNAQAVDFLN MLTQKFQGMG DASLMALKQF
NQIVHRQAVT MSFGDAFFLL TIFYVGLSTL VVLVAKPANP AAAGGGGH
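
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method: `translate`, `gc_percent`, and `clean` are hypothetical helper names. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons, and this gene starts with ATG). Only the first three and last three sequence blocks are used here to keep the example short.

```python
# Codon table in the conventional TCAG ordering; amino-acid assignments
# are those of the standard code, which table 11 also uses.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES), AMINO))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First and last blocks of the gene sequence shown above.
print(translate("ATGTCGGCAT CCACCACAGC GACCGCCGGA"))  # MSASTTATAG
print(translate("GCGGCAGCGG GGGGCGGGGG GCATTAG"))     # AAAGGGGH
```

Passing the full gene sequence to `translate` should reproduce the 528-residue protein, and `gc_percent` on the same input can be compared with the GC content listed in the record.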