Gene RPD_4073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4073 
Symbol 
ID4024590 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4525436 
End bp4528576 
Gene Length3141 bp 
Protein Length1046 aa 
Translation table11 
GC content65% 
IMG OID637964276 
Productacriflavin resistance protein 
Protein accessionYP_571193 
Protein GI91978534 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGTGGT TCAATCTCTC GGCGTGGGCA GTCAAGCACC CGGCGCTGAT CCTGTTCCTG 
ATCTTCGCGC TGGGCCTCTC CGGCATCTAC TCCTATCAGC GGCTCGGCCG CGCCGAGGAC
CCGTCCTTCA CCGTCAAGGT GGCGGTGATC TCGGTGATCT GGCCGGGCGC CACCGCGGCG
GAAATGCAGG CGCAGGTCGC TGATCCGATC GAGAAGAAAC TGCAGGAGCT GCCCTATTTC
GAAAAGGTGC AGACCTATTC GAAGGCCTCC TTCACCGCGA TGCAGGTGAC CTTCCGCGAC
TCGACGCCGC CCGCCGAGGT GCCTCATCTG TTGTATCTGC TGCGCAAGAA GCTGTGGGAC
GTCGCGCCGC AACTTCCGAG CAATCTGATC GGCCCGAACA TCAACGACGA ATACAGCGAC
GTCGATTCGA TCCTCTACAT GATGACCGGA GACGGCGCCA ACTACGCGCA GCTGAAGAAG
GCGGCCGAAG GCCTGCGCCA GCGGCTGCTG AAGGTCGAGA ACGTCACCAA GGTCAATATC
TATGGCGTCC AGGACGAGCG GATCTATGTC GAGTTCTCGC ACGCGAAACT CGCCACCCTC
GGCCTGACGC CGCAGGCGCT GTTCGACTCG CTCGCCAAGC AGAACGCGGT GACGCCGGCG
GGCACCGTGG AAACCTCGTC GCAGCGCGTG CCGCTGCGCG TCACCGGTGC GCTCGACGGC
GTCAAGGCCG TTGCCGAGAC GCCGGTGGAA AGCAACGGCC GCGTGTTCCG GCTCGGCGAC
ATCGCGACCG TGTCCCATGG CTACGTCGAT CCGACCGACT ATCTGGTGCG GCAGAAGGGT
AAACCTGCGA TCGGCATCGG TGTGGTCACC GCCACGGGCG CCAATATCCT CGATCTCGGC
GAGCACGTGA AAGCCGCGAC CGCCGAGTTC ATGGGCGACG TGCCGCAGGG CATCGAGATC
GAGCAGATCG CCGACCAGCC GCTTGTGGTG AAACACGCGG TCGGCGAGTT CATGAGTTCG
TTCCTCGAGG CGCTGGTGAT CGTGCTGTTC GTGTCGTTCC TGGCACTCGG CTGGCGCACC
GGCGTCGTGG TCGCGCTGTC GGTGCCGCTG GTGCTGGCGA TCGTCTTCAT CGTGATGAAC
GTGATGTCGC TCGACCTGCA CCGCGTCACG CTCGGGGCGC TGATCATCGC GCTCGGCCTT
CTCGTCGACG ACGCCATCAT CGCGGTCGAA ATGATGGTGG TGAAAATGGA GCAGGGCTGG
GATCGCGCCC GCGCCGCGTC GTTCGCCTGG GAGTCGACCG CATTTCCGAT GCTGACCGGA
ACGCTGGTCA CGGCCGTGGG CTTTCTGCCG ATCGGCCTCG CCAACTCCTC GGTCGGCGAA
TACGCCGGCG GCATCTTCTG GATCGTGGCG ATCGCGCTGA TCGCGTCATG GTTCGTCGCG
GTGATCTTCA CGCCCTATAT CGGCGTCAAG CTGCTTCCGG ATTTCGCCGG CAAGAAGGGC
CACAATCCGG ACGAGGTGTA TCACACCCGG ATCTATCGCG CGCTGCGCGC CGGCGTGGCC
TGGTGCGTAC GCTGGCGCGG GACGGTCGTG CTCGCCACCG TCGGCATCTT CATCGCGTCG
ATCATCGGCT TCGGCCATGT CCAGCAGCAG TTCTTTCCAC TGTCCGAGCG GCCTGAACTG
TTTCTGCAAC TGCGGCTGCC GGAAGGCACC GCGTTCAACG TCACCATGAA CACCGTGAAA
CAGGCCGAGA CGCTGTTGAA AGACGACGGC GACATCGCGA CCTACACGGC CTACGTCGGC
AAGGGTTCGC CGCGGTTCTG GATGGGCCTC AACCCACAAC TCCCGACCGA ATCCTTCGCC
GAGATCGTGA TCGTCGCCAA GGACGTCGCG GCGCGCGAGC GGATCAAGGC GCGACTCGAG
CAGGCGGCGC ATGATGGCCG GCTCGCCGAG GCGCGGGTCC GCGTCGATCG TTTCAACTTC
GGCCCGCCGG TCGGATTCCC GGTGCAGTTC CGGGTGATCG GCTCCGACAC GGCCAAGGTC
CGTGAGATCG CCTACAAGGT GCGCGACATC GTCAAAGTCA ATCCTAACGT CATCGATCCG
CATCTCGATT GGAACGAGCA GTCGCCTTAT CTCAAGCTGG TCGTGGATCA GGACCGCGCC
CGCGCCCTCG GGCTGACGCC ACAGGACGTC TCGCAGGCGC TCGCGATGCT GATCTCCGGT
GCGCAGGTCA CCGCGGTCCG CGATGGCGTC GAGAAGATCG GCGTCGTCGC CCGCGCGGTG
GCGTCCGAGC GGCTCGATCT CGGCCGCATC GGTGAACTCA CCATCACCGC GCGCAACGGC
GTCGCGGTAC CACTGTCGCA GATCGCCAAG GTCGAGTACG CCCATGAGGA GCCGATCCTG
TGGCGTCGCA ATCGCGATAT GGCGATCACT GTGCGGGCCG ATGTCGCCGA AGGCGTCCAG
GCGCCGGACG TCACGAACGC GATCTGGCCG CAGCTCAAGG AAATCCGTGA CAGCCTGCCG
TCCGCCTATC GGATCGAGAT CGGCGGCGCG ATCGAGGAAG CCGCCAAGGG CAACGCTTCG
TTGTTCATCC TGTTTCCGGT GATGGTGATC GCGATGCTGA CGCTGCTGAT GATCCAGTTG
CAGAGCTTCC CGCGGCTGCT GCTGGTGTTT CTCACGGCGC CGCTCGGGGT GATCGGCGCC
TCGCTCGGCC TCAACGTCGC CAACGCGCCG TTCGGTTTCG TCGCGTTGCT CGGGTTGATC
GCGCTGGCCG GCATGATCAT GCGCAACACC GTGATTCTCG TCGATCAGAT CGAGACCGAT
GTCGCGCAGG GCGCGACCCG CCGCGAGGCG ATCGTCGAAG CTACGGTGCG CCGCGCCCGG
CCGGTGGTGC TCACGGCGCT CGCTGCGATC CTCGCGATGA TTCCGCTGTC GCGCTCGGCG
TTCTGGGGGC CCATGGCGAT TACCATCATG GGCGGTCTGT TTGTCGCCAC CTTCCTGACG
CTGTTCTATC TGCCCGGCCT GTACGCGCTG TGGTTCCGCA AGAGTCTCGA CGAGCGCGGC
GGTCGTGGCG AAGCCGACCT CGCACCGCAA CATGCCGATC AGGCGCAGCG CGCGTTTCCG
CTTGCCGACG CCGCTGAATA A
 
Protein sequence
MRWFNLSAWA VKHPALILFL IFALGLSGIY SYQRLGRAED PSFTVKVAVI SVIWPGATAA 
EMQAQVADPI EKKLQELPYF EKVQTYSKAS FTAMQVTFRD STPPAEVPHL LYLLRKKLWD
VAPQLPSNLI GPNINDEYSD VDSILYMMTG DGANYAQLKK AAEGLRQRLL KVENVTKVNI
YGVQDERIYV EFSHAKLATL GLTPQALFDS LAKQNAVTPA GTVETSSQRV PLRVTGALDG
VKAVAETPVE SNGRVFRLGD IATVSHGYVD PTDYLVRQKG KPAIGIGVVT ATGANILDLG
EHVKAATAEF MGDVPQGIEI EQIADQPLVV KHAVGEFMSS FLEALVIVLF VSFLALGWRT
GVVVALSVPL VLAIVFIVMN VMSLDLHRVT LGALIIALGL LVDDAIIAVE MMVVKMEQGW
DRARAASFAW ESTAFPMLTG TLVTAVGFLP IGLANSSVGE YAGGIFWIVA IALIASWFVA
VIFTPYIGVK LLPDFAGKKG HNPDEVYHTR IYRALRAGVA WCVRWRGTVV LATVGIFIAS
IIGFGHVQQQ FFPLSERPEL FLQLRLPEGT AFNVTMNTVK QAETLLKDDG DIATYTAYVG
KGSPRFWMGL NPQLPTESFA EIVIVAKDVA ARERIKARLE QAAHDGRLAE ARVRVDRFNF
GPPVGFPVQF RVIGSDTAKV REIAYKVRDI VKVNPNVIDP HLDWNEQSPY LKLVVDQDRA
RALGLTPQDV SQALAMLISG AQVTAVRDGV EKIGVVARAV ASERLDLGRI GELTITARNG
VAVPLSQIAK VEYAHEEPIL WRRNRDMAIT VRADVAEGVQ APDVTNAIWP QLKEIRDSLP
SAYRIEIGGA IEEAAKGNAS LFILFPVMVI AMLTLLMIQL QSFPRLLLVF LTAPLGVIGA
SLGLNVANAP FGFVALLGLI ALAGMIMRNT VILVDQIETD VAQGATRREA IVEATVRRAR
PVVLTALAAI LAMIPLSRSA FWGPMAITIM GGLFVATFLT LFYLPGLYAL WFRKSLDERG
GRGEADLAPQ HADQAQRAFP LADAAE