Gene RPB_2901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2901 
Symbol 
ID3910697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3304468 
End bp3307584 
Gene Length3117 bp 
Protein Length1038 aa 
Translation table11 
GC content67% 
IMG OID637884804 
Productacriflavin resistance protein 
Protein accessionYP_486514 
Protein GI86750018 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0661735 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTCT CCGAGCCGTT CATCCGGCGC CCGATCGCAA CCTCGCTGCT CGGCGTGGCG 
TTGCTGCTCG GCGGCGTGCT CGGCTATTTT TCGCTACCGG TGTCGGCGCT GCCTCAGGTC
GATTTTCCGA CCGTGCAGGT CTCGACCCGG CTGCCGGGCG CGAGCCCCGA CGTGGTGGCG
TCGCTGATCA CCGCGCCGCT GGAGCGGCAG CTCGGCCAGA TTCCGTCGCT GACGGCGATG
ACCTCGACCT CATCGTTCGG CGTCAGCCAG GTGTCGCTGC AGTTCGACCT CAACCGCGAT
ATCGACGGCG CCACCCAGGA CGTTCAGGCG GCGATCAACG CCGCCGCCGG CATCCTGCCG
AAGAACCTGC CGTACCCGCC GACCTACGCC AAGGTGAACC CGGCCGACGC CCCGGTGATG
ACGCTGGCGC TGACCTCGAC CACCGCCTCG CTGCGGGCGA TGAGCGACAT CGCCGATACG
ATGCTGGCGC AGCGGCTGGC GCAGATCAGC GGCGTCGGCC GGGTCGCGGT GCTCGGCGGA
TTGAAGCCGG CGGTGCGGGT CCAGGCCGAT CTCGCCCGGC TCGCCGCCTA TGGCATCTCG
ATGGAAGACC TGCGCACGGC GATCGCCGGC GCCAATGTCT CCGGCCCGAA GGGCTCGCTC
GACGGCGCGC AACAGGCCTA CACCATCGCC GCCAACGACC AGATCGCCAC GGCAGAGGCT
TACAAGCCGA TCATCGTTGC CTATCGCAAC GGCTCGCCGG TGACGATCGG CGACGTCGCG
AATATTCTCG ATGGATTGGA GAACGACCGC ACCGGCGCCT GGTACCAAGG CACGCCCTCG
GTGATCCTCG ACATCCAGCG CCAGCCGGGC GCCAATGTCA TCGAAGTGGT CCGCGGCATC
CGCGCCGAGA TCCCCAAGCT GCAACGCATC ATCCCGGCCG GCGTCAAGCT CACCGTGGTC
AGCGACCGCA CCGAGACGAT CCGCGCCTCG GTCAACGACG TGCAGTTCAC GCTGCTGCTC
AGCGTCGGGC TGGTGACGCT GGTGGTGCTG CTGTTCCTGC GCTCGATGCG GGCGACGATC
ATCGCCGGCG TGGCGCTGCC GCTGTCGCTG ATCACCTCGT TCGGGGTGAT GTATTTCGCC
GGCTTCAGCC TCGACAATCT GTCGCTGATG GCGCTGACGA TCGGCACCGG CTTCGTCGTC
GACGACGCCA TCGTGATGAT CGAGAACATC GTCCGCCATA TGGAGGAGGG CGAAAACCCG
ATGCAGGCGG CGCTGTCCGG CGCCTCCGAA ATCGGCTTCA CCGTGATCTC GCTGACGATG
TCGCTGATCG CGGTATTCAT CCCGCTGTTG TTCATGTCGG GCCTGGTCGG CCGGATGTTC
CGCGAATTCG CGCTGACGCT GACGATCGCC GTGGTGACGT CCGCCATCGT GTCGCTGACG
CTGACGCCGA TGATGTGCTC GCGGCTGCTC AAACACGCCC ACGAGGAGCG CCAGGTGCCC
GGCCTCGCGG CCATCACCCG CTGGATCGAT CGCGGTGCCG AGGCCTATCA CCGCAGCCTG
CTCTGGGTAC TGAAGCATCA GCGCGCCACG CTGGTGGTGA CGTTCCTCAC CATCGTGGCG
ACGCTGGCGC TCTATGTGGT CGCGCCGAAG GGCTTCCTGC CGCTGCAGGA CACCGCCTCG
ATCACCGCCG TGACGGAGGC CGGGCAGGAC GTCTCCTTCG CCGAGATGCA GGCGCGGCAG
ACCGAGGCGG CCGACGCGAT CAAGGCCGAT CCGGACGTGA TCGGCGTGGT GTCGGTGATC
GGCGCCGGCA CGGTCAATCC GACCACCAAT GTCGGCCGGC TGGTGATGAC GCTGAAGCCG
CGCGGTGACC GCAAGGCCGG CGTCGCCGAG GTGATCGAGC GGCTGAAGCA GCGGGTGGCG
TCGATCCCCG GCATGACGGT GTACTTCCAG GCGGTGCAGG ACGTGCAGAT CTCGACCCAG
TCGAGCCGCT CGCAATATCA GTACACGCTG ACCGCCACGG ATGCGGCGCT GGTCTCGGAA
TGGGCGGCGA AGCTGGTCGC CGAGCTGCGC GACGATCCCT TGTTCCGCGA CGTCTCGTCG
GAAGCGCAGG AGCGGGGCCT GCGCGCCGCG CTCGACGTCA ATCGGCAGCG CGCCGGCCAG
CTCGGCGTCA GCCTCCAGGC GATCAACGAC ACGCTGAACG ACGCCTTCGC GCAACGTCAG
ATCTCGACGA TCTACGGCCA GGCCAACCAG TACCGCGTGG TGCTGGAGGC GATGCCGATT
TACCAGAAGG ACCCGTCGAT CCTGTCGAAG CTGTACGTGC CCGGCACCGA CGGCGCGCAG
GTGCCGATCT CGGCGGTGGC GGAATTGAAG CGGACCACCG CGCCGCTGTC GATCTCGCAC
CAGGCGCAGT TTCCGTCGGT GGCGCTGAGC TTCAATCTGG CGCCGGGGGC GTCGCTCGGC
GAGGCGGTCG ACGAGATCAA GGTGATCGAG ACCCGGATCG GGATGCCCGG CAGCATCGTC
GGCGTGTTCT ACGGCGACGC CGCCGAGTTC TCGAAATCGC TGTCCGGCCA GCCCTGGCTG
ATCCTCGCAG CACTGGTGAC GATCTACATC GTGCTCGGCG TGCTGTACGA GAGCTTCATC
CACCCGATCA CCATCCTGTC GACGCTGCCC TCTGCGGGCG TCGGCGCGAT CCTGGCCTTG
ATGCTGTTCG GGCAGGACCT CTCGGTGATC GGGCTGATCG GCATCATCCT GTTGATGGGT
ATCGTCAAGA AGAACGCGAT CATGATGATC GACTTCGCGC TGGAGGCGGA GCGGCACAAG
GGGATGTCGT CCTACGACGC GATCGTGCAG GCCTGCCGGC TGCGGTTCCG TCCGATCATG
ATGACGACGC TGGCGGCGCT GTTCGGCGCG CTGCCGCTCG CGGTCGAAAG CGGCACCGGC
TCCGAGCTGC GGTTTCCGCT CGGCATCTCG ATCATCGGCG GCCTGCTGCT CAGCCAGTTG
TTGACGCTGT ACACCACGCC GGTGATCTAT CTGGCGCTCG ACCGGATCAA CCGGAAGTTC
GAGCGGGCGC TGCCGCCGGC GCCGCCGCTG GCCGGCCCGG CGGAGGAGGC GCGCTGA
 
Protein sequence
MSVSEPFIRR PIATSLLGVA LLLGGVLGYF SLPVSALPQV DFPTVQVSTR LPGASPDVVA 
SLITAPLERQ LGQIPSLTAM TSTSSFGVSQ VSLQFDLNRD IDGATQDVQA AINAAAGILP
KNLPYPPTYA KVNPADAPVM TLALTSTTAS LRAMSDIADT MLAQRLAQIS GVGRVAVLGG
LKPAVRVQAD LARLAAYGIS MEDLRTAIAG ANVSGPKGSL DGAQQAYTIA ANDQIATAEA
YKPIIVAYRN GSPVTIGDVA NILDGLENDR TGAWYQGTPS VILDIQRQPG ANVIEVVRGI
RAEIPKLQRI IPAGVKLTVV SDRTETIRAS VNDVQFTLLL SVGLVTLVVL LFLRSMRATI
IAGVALPLSL ITSFGVMYFA GFSLDNLSLM ALTIGTGFVV DDAIVMIENI VRHMEEGENP
MQAALSGASE IGFTVISLTM SLIAVFIPLL FMSGLVGRMF REFALTLTIA VVTSAIVSLT
LTPMMCSRLL KHAHEERQVP GLAAITRWID RGAEAYHRSL LWVLKHQRAT LVVTFLTIVA
TLALYVVAPK GFLPLQDTAS ITAVTEAGQD VSFAEMQARQ TEAADAIKAD PDVIGVVSVI
GAGTVNPTTN VGRLVMTLKP RGDRKAGVAE VIERLKQRVA SIPGMTVYFQ AVQDVQISTQ
SSRSQYQYTL TATDAALVSE WAAKLVAELR DDPLFRDVSS EAQERGLRAA LDVNRQRAGQ
LGVSLQAIND TLNDAFAQRQ ISTIYGQANQ YRVVLEAMPI YQKDPSILSK LYVPGTDGAQ
VPISAVAELK RTTAPLSISH QAQFPSVALS FNLAPGASLG EAVDEIKVIE TRIGMPGSIV
GVFYGDAAEF SKSLSGQPWL ILAALVTIYI VLGVLYESFI HPITILSTLP SAGVGAILAL
MLFGQDLSVI GLIGIILLMG IVKKNAIMMI DFALEAERHK GMSSYDAIVQ ACRLRFRPIM
MTTLAALFGA LPLAVESGTG SELRFPLGIS IIGGLLLSQL LTLYTTPVIY LALDRINRKF
ERALPPAPPL AGPAEEAR