Gene RPB_2009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2009 
Symbol 
ID3909515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2285847 
End bp2287136 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content68% 
IMG OID637883903 
ProductBcr/CflA subfamily drug resistance transporter 
Protein accessionYP_485628 
Protein GI86749132 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00710] drug resistance transporter, Bcr/CflA subfamily
[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.175256 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.467375 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGGCA AGACCGGTAG AACCTTCGCG ATCGACCCCG GCCGGATCGA CCGGCCGCCG 
TCGAAACTGC TGCTGCTCCT GCTGGTGCTG ATGAACGGCG CGGCGCCGAT CGCGCTGTAC
ATCTTCGTGC CGGCGCTGCC GGTGCTCGCC AGCGATCTCG GCGGCGACAT CTCGGTCGCG
CAGATGACGG TGTCGCTCTA CATGGTCGGG CTGGCGTGCT CGCAACTGAT CATGGGCCCG
CTGTCGGACC GGTTTGGCCG CCGCCCGGTG CTGCTCGGCG GCTTGACGCT GATGGTCGCC
GCCAGCGTCG GCTGCATCTT CGCCGAGACC CTGCCGCAAC TGATCGCCGC GCGGTTCCTG
CAGGCGCTCG GCGGCGCTTC CGGCATGGTG ATCAGCCGCG CCATCATCCG CGATCTGTAC
AGCCGCGACC GCGTCGGCGG CATGCTCAGT CTCGTCATCG CCGTGATGAT GATCGCGCAG
ATGCTGAGCC CGCTGTTCGG CGGCGTGATC GAAACCGCGC TCGGCTGGCG CGCGATCTTC
TACGTCGTCA CGGCGGGCGC GATCGCCGTC ACGGCGACGA TCGCGCTGGC GCTGCCCGAG
ACGCGCCGCC GCCTCGGTCC CGCCGCACCG GGCGGCTTCC GCGGCGATGT CGGCGGCCTG
TTCAGGAGCC GCGCCTTCAT CGGCTATGTG CTTTGCCAGG TGCTGGCCTC GGCGATCATC
TTCACCTTCG CCGGCGGCGG GCCGTACGTC GTGGTGACGC AGATGGGCCG CAGTTCGGCC
GAATACGGCG CCTGGTTCGC CAGCTCCGGC TTCGCCTATC TGCTCGGCAA TCTGTTCTGC
GTGCGGTTCG CGCCGCGCTA TTCGCTCGAC AAACTGATCT GGTTCGGACT GGCGATGCAG
ATCGGCGGCG CGGCGCTGAA TCTGGCGTGG GGCGTGCTCG GCTGGAATCA GGTGCCGAGC
TGGCTGTTCG CCACCCACAT GATCATCATG TTCGGCAACG CCTTCGTGAT GGCCAATGCA
ACCGCCGGCG CGATCAGCAT AAGGCCGCAG GTGGCCGGCA CCGCTTCCGG GCTGATGGGC
TTCACCCAAT TCGGGATCGG CGCGCTGTGC TCGCAGTTCG GCGCGTATCT CGGCGGCCAT
TTCGCCACGC CGCTGCCGCT CAACATCGCC GTCGCCGCCC TCGCGCTGGC CTGCGCCGCC
TCGATGATCT TCCTGGTGCC ACGCAGCAAC CTCATCGCGA GCGAAGAGCT GATCGAGAAG
GCAGAAGGCG AAGAGCCGCC GGTGATGTGA
 
Protein sequence
MHGKTGRTFA IDPGRIDRPP SKLLLLLLVL MNGAAPIALY IFVPALPVLA SDLGGDISVA 
QMTVSLYMVG LACSQLIMGP LSDRFGRRPV LLGGLTLMVA ASVGCIFAET LPQLIAARFL
QALGGASGMV ISRAIIRDLY SRDRVGGMLS LVIAVMMIAQ MLSPLFGGVI ETALGWRAIF
YVVTAGAIAV TATIALALPE TRRRLGPAAP GGFRGDVGGL FRSRAFIGYV LCQVLASAII
FTFAGGGPYV VVTQMGRSSA EYGAWFASSG FAYLLGNLFC VRFAPRYSLD KLIWFGLAMQ
IGGAALNLAW GVLGWNQVPS WLFATHMIIM FGNAFVMANA TAGAISIRPQ VAGTASGLMG
FTQFGIGALC SQFGAYLGGH FATPLPLNIA VAALALACAA SMIFLVPRSN LIASEELIEK
AEGEEPPVM