Gene RPD_2039 details


Gene Information

Locus tag: RPD_2039
Symbol:
ID: 4022521
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2285822
End bp: 2288878
Gene Length: 3057 bp
Protein Length: 1018 aa
Translation table: 11
GC content: 62%
IMG OID: 637962232
Product: acriflavin resistance protein
Protein accession: YP_569175
Protein GI: 91976516
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID:
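
The Start bp, End bp, Gene Length and Protein Length fields in this record are tied together by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon. The function names are illustrative only and do not come from any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming a terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(2285822, 2288878)
print(length, protein_length(length))  # prints: 3057 1018
```

Both values agree with the Gene Length (3057 bp) and Protein Length (1018 aa) listed in the record.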


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0940765
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGTCTT TCAACCTCTC CGACTGGGCT CTGCAACATC GTTCGCTGGT CTGGTACTTC 
ATGATCGCGT TCATGTTCGC CGGGCTGTTC TCCTACCTCG AGCTCGGACG CGAAGAAGAC
CCTGCCTTCA CCATCAAGAC CATGGTGATC CAGGCCAAGT GGCCTGGCGC CTCGGCTGAA
GAAACCACGC GACAGGTCAC CGACCGGATC GAGAAGAAGC TCGAGGAACT GGAGTCGCTG
GACTACACCA AGAGCATCAC CACGCCGGGG CAGACGACGG TGTTCGTCAA TCTGCGCGAC
ACCACCAAGG CGCGCGACGT CACGCCCACC TGGGTCCGCG TCCGCAATAT GATCAACGAC
ATCAAGGGTG ACTTTCCGGA GGGCGTGATC GGACCGGGTT TCAACGACCG CTTCGGTGAC
GTGTTCGGCA ATATCTACGC CTTCACCAGC GACGGCCTGA CCCAGCGGCA GTTGCGAGAC
AAGGTCGAGG AAGTCCGCGC CCAGGTCCTG CAGGTGCCCA ATGTCGGCCG CGTCGACATC
GTCGGCGCGC AGGACGAGGT GATTTTTCTC GAGTTCTCGA CCCGCAAAGT CGCGGCCCTT
GGTCTCGACC AGCGTTCGAT CCTGACATCG CTCCAGGCGC AGAATGCGAT CACTCCGTCC
GGTGTGCTGC AAGCCGGGCC GGAGCGGATC AGCGTGCGGG TCAGTGGGCA GTTCACCTCG
GAGGAGAGCC TCAAAGCGAT CAATTTGCGC GTCAATGATC GGTTCTTTCC GCTCACCGAC
GTCGCCACCA TCCGCCGCGG CTATTCCGAT CCGCCGACGT CGCTGTTCAG GTTCAAGGGT
GAGCCTGCGA TCGGCTTGAC GATCGGCATG AAGGCTGGCG CCAATCTGCT CGAATTCGGC
CAAGCACTGA AAAAGGAGAT GACGCGGATC TCAGCCGACC TGCCGGTCGG CGCCGAAGTT
CATCTGGTAT CCGATCAGCC GCAGATCGTC GACGACGCAG TTTCGGGCTT CACCCGGGCT
CTATTCGAAG CGGTCGTCAT CGTGCTGGCG ATCAGTTTCA TCAGCCTCGG AATGCGTGCC
GGCCTCGTTG TCGCGATCTC GATTCCACTG GTGCTGGCGA TCACCTTCAT GGTGATGTCC
TATTCCGGAA TCTCGTTGCA GCGGATTTCG CTGGGGGCGC TGATCATCGC TCTCGGCCTT
CTGGTCGATG ACGCAATGAT TGCGGTCGAA ATGATGGTGG CCCGGCTCGA GGTCGGCGAC
ACGCTCGCCA AGGCCGCAAC CTACGTCTAC ACCTCGACTG CCTTTCCGAT GCTGACAGGC
ACGCTGGTGA CGGTCGCGGG CTTCATCCCG ATCGGCCTCA ACAGCAGCGC GGCGGGCGAA
TTCACCTTCA CGCTGTTCGT CGTGATCGCG GTGTCGCTGC TGACATCGTG GATCGTCGCG
GTGCTGTTCA CGCCGCTGCT CGGCGTCACC ATCCTGCCGG ACAAGATGAA GAGCCACCAC
GAGAACAAGG GCTGGTTTTC CACCCGCTTC AGTCGCGTGC TGATTTTTTG CATGCGGCGG
CGCTGGTTGA CGATCACGGT GACGCTGGCA GCGTTCGCGC TGTCGATCGT CGGGATGCGG
TTCGTCCAGC AGCAGTTCTT CCCATCTTCC GATCGCAAGG AACTCATCGT CGACTGGAAT
CTCCCCAAGA ACAGCTCGAT TGCCGAGACC AGCGCGCAGA TGGCGCAGTT CGAGCGCGAG
GCGTTGCAGG GTAAGGACGG CATCGATCAC TGGTCGACCT ATGTCGGTCA GGGGGCGCCG
CGTTTCGTGT TGTCGTTCGA TGTTCAACCC GCGGATTTCT CGTTTGGGCA GATGGTGATC
GTGACCAGAA GCCTGGCTGA CCGGGACCGG CTGAGGGGCG AGTTGCAGGG CTATCTGAAG
AAGACGTTCC CCGGGACCGA CGCGCTGGTG AATCTGCTCG ACATCGGCCC GCCAGTCGGG
CGGCCGGTCC AGTATCGTCT CAGCGGCCCG GACATTGCAA AAGTACGTGC CCTGTCGCGC
GAGCTCGCTG GCATCGTTGC CGGCAACCTG CATCTCGGCG ACGTGGTGTT CGACTGGATG
GAGCCCGCGC GGGTCGTCAA GGTCGACGTG CTGCAAGACA AGGCGCGGCA GCTCGGCGTG
ACCTCTGAAG ACATCGCGTC TACGCTCAAC AGCATCGTCG ACGGCGTATC TATCACTCAA
GTTCGCGACG ACATCTATCT GGTCAAGGTG CTGGGCCGCG CCAACGCTGC AGAGCGCGGT
TCGATTGAAA CGCTGCGCAA TCTGCAATTG TCGGGCAGCA GCGGGCAGTC TGTTCCGCTC
GCGGCGGTGG CGACGTTCCG CTACGAGCTC GAGCAGCCGA CGATCTGGCG GCGGTCGCGT
CTGCCGACGA TCACCATCAA GGCAAGCATT CGGGATGGCG TTCAGCCGGC GACTGTCGTC
CAGCAGCTGA AGACGCCAAT CGCCGAATTT TCCTCGAAGC TGCCGGTCGG CTATTCCGTC
GCGGTCGGCG GCAGCGTCGA GCAGAGTGGA AAGTCCCAGG CGCCGATCGC CGCGGTCGTG
CCGATCATGC TGTTCGCGAT GGCGACCATC CTGATGGTGC AGCTGCAGAG CTTCAGCCGG
CTGTTTCTGG TGTTCGCCGT CGCACCGCTG GCTTTGATCG GCGTCGTCGC GGCATTGTTG
CCGAGCGGGG CGCCGCTCGG CTTCGTCGCA ATCCTCGGCG TGCTGGCGCT GATCGGAATT
CTGATCCGCA ATTCGGTCAT TCTGATCGTG CAGATCGAAC ATTTGCGCAG TGAGGGTAAG
CCGCCATGGG AAGCGGTGGT CGAGGCGACC GAACATCGTA TGCGACCGAT CCTGTTGACC
GCCGCCGCGG CCAGCCTGGC GCTGATCCCG ATCGCGCGCG AGGTGTTCTG GGGGCCGATG
GCCTACGCGA TGATGGGTGG CATCATCGTC GGAACGGTTC TGACGCTGCT GTTCCTGCCC
GCGCTCTACG TCGCGTGGTT CCGTATCAAG ATGCCCGAAG AAGGCGCGCC TGCATGA
 
Protein sequence
MKSFNLSDWA LQHRSLVWYF MIAFMFAGLF SYLELGREED PAFTIKTMVI QAKWPGASAE 
ETTRQVTDRI EKKLEELESL DYTKSITTPG QTTVFVNLRD TTKARDVTPT WVRVRNMIND
IKGDFPEGVI GPGFNDRFGD VFGNIYAFTS DGLTQRQLRD KVEEVRAQVL QVPNVGRVDI
VGAQDEVIFL EFSTRKVAAL GLDQRSILTS LQAQNAITPS GVLQAGPERI SVRVSGQFTS
EESLKAINLR VNDRFFPLTD VATIRRGYSD PPTSLFRFKG EPAIGLTIGM KAGANLLEFG
QALKKEMTRI SADLPVGAEV HLVSDQPQIV DDAVSGFTRA LFEAVVIVLA ISFISLGMRA
GLVVAISIPL VLAITFMVMS YSGISLQRIS LGALIIALGL LVDDAMIAVE MMVARLEVGD
TLAKAATYVY TSTAFPMLTG TLVTVAGFIP IGLNSSAAGE FTFTLFVVIA VSLLTSWIVA
VLFTPLLGVT ILPDKMKSHH ENKGWFSTRF SRVLIFCMRR RWLTITVTLA AFALSIVGMR
FVQQQFFPSS DRKELIVDWN LPKNSSIAET SAQMAQFERE ALQGKDGIDH WSTYVGQGAP
RFVLSFDVQP ADFSFGQMVI VTRSLADRDR LRGELQGYLK KTFPGTDALV NLLDIGPPVG
RPVQYRLSGP DIAKVRALSR ELAGIVAGNL HLGDVVFDWM EPARVVKVDV LQDKARQLGV
TSEDIASTLN SIVDGVSITQ VRDDIYLVKV LGRANAAERG SIETLRNLQL SGSSGQSVPL
AAVATFRYEL EQPTIWRRSR LPTITIKASI RDGVQPATVV QQLKTPIAEF SSKLPVGYSV
AVGGSVEQSG KSQAPIAAVV PIMLFAMATI LMVQLQSFSR LFLVFAVAPL ALIGVVAALL
PSGAPLGFVA ILGVLALIGI LIRNSVILIV QIEHLRSEGK PPWEAVVEAT EHRMRPILLT
AAAASLALIP IAREVFWGPM AYAMMGGIIV GTVLTLLFLP ALYVAWFRIK MPEEGAPA
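
The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch of both calculations for sequences printed in space-separated blocks of 10, as they are here. It assumes the sequence is an in-frame CDS whose first codon is a valid start codon (read as Met, which is how GTG at the start of this gene yields M) and that table 11 shares the standard codon-to-residue assignments. The helper names `clean`, `gc_percent` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate an in-frame CDS; any trailing partial codon is ignored."""
    seq = clean(seq)
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3)]
    if residues:
        residues[0] = "M"  # assumption: the first codon is a start codon, read as Met
    protein = "".join(residues)
    # Drop a single terminal stop symbol if present.
    return protein[:-1] if protein.endswith("*") else protein


# First three 10-base blocks of the gene sequence above.
print(translate("GTGAAGTCTT TCAACCTCTC CGACTGGGCT"))  # prints: MKSFNLSDWA
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared against the GC content field.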