Gene RPD_4199 details


Gene Information

Locus tag: RPD_4199
Symbol:
ID: 4024720
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4665800
End bp: 4669027
Gene Length: 3228 bp
Protein Length: 1075 aa
Translation table: 11
GC content: 67%
IMG OID: 637964405
Product: acriflavin resistance protein
Protein accession: YP_571317
Protein GI: 91978658
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID:
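
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4665800, 4669027)
print(length)                  # 3228
print(protein_length(length))  # 1075
```

Both values agree with the Gene Length and Protein Length fields of the record.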


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGCTCG GAATTTCAGG CGGCCTGACC AAGGCCAGCA TCCAGTCGCC GCTGACGCCG 
CTGTTCCTGC TGGCGTCGCT GGTCGTCGGG CTGATCGCGC TCGCCGCGAT CCCGCGCGAG
GAGGAGCCGC AGATCAGCGT GCCGATGGTC GACATCCGCA TCAACGCCGA CGGACTGCAC
GGCCCCGACG CGGTCGAGCT GGTGACCAAG CCGCTCGAGG CGATCGTCAA GGGCATCGAC
GGCGTCGAGC ACGTCTACAG CCAGAGCGAA GACGACCGGG TGATGGTCAC CGCCCGCTTT
CTGGTCGGTA CCAAGTCCGA GGACGCCATC CTGCGGGTGC ACGAGAAGAT CCGCGCCAAT
TTCGACCGCA TTCCCGTCGG CATTCCCGAG CCGCTGATCG TCGGCCGCGG CATCAACGAC
GTCGCCGTCG CCATCCTGAC GCTGTCGCCG AAGCCGGAAG CAGCCAGCCG CTGGACCGAC
AAGGACCTGT ACGAACTCGC CGACAAGCTG CGCTCCGAGC TGATGAAGGT CGACAGCATC
GGCCTCACCT ACATCGCCGG CGGCGGCGCG CAGCAGATCC GGGTCGAGCC CGATCCGGAG
AAGCTGTCGC TGTACGGCGT GACGCTGCAA CAGCTCGTCG CCAAGGTGAA GGACGCCAAC
CGCTCGTTCC TGGCCGGGCA GGTGCGCGAC GGCGGCGCGA TGCGCAGCGT CTCGGCCGGG
CAGACGCTGA TGGGCATTCC CGACATCGGC CTGCTGCTGA TTTCGACCCG CGACAATCGC
CCGGTCTACG TCAAGGACGT CGCCGCGGTG ATCATCGGCC CGAACGCGGC CGAGCATCGG
GTGTGGAACG ACGCCCGCAG CGGCGGCGGC GACTGGACCC GCGTGCCAGC GGTCAGCGTC
GCGCTCGCCA AGCGCGCCGG CGCCAACGCC GTGGTGGTGT CGCAGGAGAT CGAGCACCGC
CTTGCCGGCC TGAAGGGAAC GCTCCTTCCC GACGACGTCG TCGTCACCGT CACCCGCGAC
TATGGCGACA CCGCCAACGA GAAGGCCAAC GAATTGTTGC TGCATCTCGG CATCGCCACG
ATCTCGATCG TGGTGCTGAT CGCGATCGCG ATCGGCTGGC GCGAAGCGGT GGTCACCGCG
GTGGTGATCC CGACCACCAT CCTGCTGACC TTGTTCGCCG CCAATCTGAT GGGCTACACC
ATCAATCGCG TCAGCCTGTT CGCGCTGATT TTCTCCATCG GCATCCTGGT CGACGACGCC
ATCGTGGTGG TCGAGAACAT CGCCCGGCAC TGGGGGATGA AGGACGGCCG ACCGCGGCTG
CAGGCCACCA TCGAGGCGGT GGCCGAGGTC GGCAATCCGA CCGTGATCGC GACGCTCACC
GTGGTCGCGG CGCTGTTGCC GATGCTGTTC GTGTCCGGGC TGATGGGGCC ATATATGGCG
CCGATCCCGG CCAACGCCTC GGCGGCGATG CTGTTCTCGT TCTTCGTCGC GATGGTGGTG
GCGCCTTGGC TGATGCTGCG GCTGGCGCCG AAGCAGGGTG CTGCGGTCGC GGCGCATGAC
GCGCATGACG AGGGCCGGCT CGGCCGGCTG TATCGCCGCA TTGCGTCGCC GATCGTCGCC
AGCAAGCGCG CGTCGTGGAT CTTCCTGCTC GGCGTCGGCG TCGCCACGCT GCTGTCGATG
GTGTTGTTCT ACACCAAGTC GGTGACGGTG AAGCTGCTGC CGTTCGACAA CAAGAGCGAG
ATCGCGGTGA TCGTCGATCT GCCGGAAGGC GCGACGCTGG AGGACACCGA GCGCACGCTG
TTCGCCGCCG CCGACATCGC CCGCGGCCTG CCGGAGATCA CTTCGGTGCA GAGCTATGCC
GGCACGCCGG CGCCGTTCAA CTTCAACGGC CTGGTCCGGC ATTATTACTT GCGCGAACGG
CCCGAGCTCG GCGAGTTGCA GGTCAATCTC GCCGCGCGCG GCGACCGGTC GCGCGCCAGC
CACGCGATCG CGCTCGATCT GCGCGAGCGG TTGAAGGCGC TCGCCATTCC CGCCGGCGCC
AGCGTCAAGG TGGTCGAGGT GCCGCCCGGC CCGCCGGTGC TGTCGACGCT GCTCGCGGAA
ATCTACGGCC CCGACGCCGA GACCCGCCGC GCGGTGACTG CCGAGGTGAA GAAGATCTTC
AAGGACGTGC CGTTCATCGT CGACGTCGAC GATTCGATCG GCGAGAAGCG GCCGCGGCTG
CGGCTGTCGA TCGATCAGGA TCGGCTGGAG TTCTTCGGCG TCGAACAGAA GGACGTCTAC
GACACCATCC AGACGCTGTT CGGCGGCGTG TCGGTCGGCT ACTCGCATCG CGGCGAGGGC
CGCAATCCGA TCGCCATCCA TGTCGGCCTG CCGAAACACG ATCTCGCCTG GAACGAGGCG
CTGGCCTCGA CGCCGGTGCC GGCCAACACG CTGCCCGGCA GCAAGACGGT GGTCGAGCTG
GGCCAGTTGG TGAAGGCGAC CCGCGAGGTC GGCTCGCCGA TGATCTTCCG CCGCGACGGC
CGCTTCGCCG ACATGGTGAT GGCCGAACTC GCCGGCAAGT TCGAGGCGCC GCTCTACGGG
ATGCTCGAGG TCGACAAGCG GATCGAGGCG CACGATTGGG GCAAGCTGCC GAAGCCGGCG
ATCAGCCTGC ACGGCCAGCC GACCGACGAG TCGCGCCCGA CGCTGCTGTG GGACGGCGAA
TGGGAGATCA CCTGGGTCAC GTTCCGCGAC ATGGGCGCTG CGTTCGGCGC GGCGATCCTC
GGCATCTACG TGCTGGTGGT GGCGCAGTTC AGAAGCTTCA AGCTGCCGCT GGTGATCCTG
ACGCCGATCC CGCTGACGCT GATCGGCATC CTGATCGGCC ACTGGCTGTT CGGCGCGCCG
TTCAGCGCCA CCTCGATGAT CGGCTTCATC GCGCTCGCCG GCATCATCGT GCGCAACTCG
ATCCTGCTGG TCGATTTCAT CCGCCACTCC GGCGGCGAGA GCAAGACGCT GCGCGAGGTG
GTGCTGCGGG CCGGCGCCGT GCGCTTCAAG CCGATCCTGC TCACCGCGCT GGCGGCGATG
ATCGGCGCGG CGACGATCCT GCTCGATCCG ATCTTTCAGG GGCTGGCGAT CTCGCTGCTG
TTCGGACTCG CCTCGTCGAC GCTGCTGACC GTGCTGGTGA TCCCGGCGAT CTACATCGTG
CTGCGCGACA ATACGCCGAA GCCGCCGCCG AATTTGACGG CAAAGTGA
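
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be computed. A minimal sketch, with hypothetical helper names, assuming the input contains only the letters A, C, G, T and whitespace:

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty input)."""
    seq = clean_sequence(blocked)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, used here only as a small sample.
sample = "GTGAAGCTCG GAATTTCAGG"
print(len(clean_sequence(sample)))  # 20
```

Applied to the full gene sequence, `len(clean_sequence(...))` should equal the Gene Length field and `gc_fraction(...)` should round to the GC content field, provided the sequence was copied without loss.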
 
Protein sequence
MKLGISGGLT KASIQSPLTP LFLLASLVVG LIALAAIPRE EEPQISVPMV DIRINADGLH 
GPDAVELVTK PLEAIVKGID GVEHVYSQSE DDRVMVTARF LVGTKSEDAI LRVHEKIRAN
FDRIPVGIPE PLIVGRGIND VAVAILTLSP KPEAASRWTD KDLYELADKL RSELMKVDSI
GLTYIAGGGA QQIRVEPDPE KLSLYGVTLQ QLVAKVKDAN RSFLAGQVRD GGAMRSVSAG
QTLMGIPDIG LLLISTRDNR PVYVKDVAAV IIGPNAAEHR VWNDARSGGG DWTRVPAVSV
ALAKRAGANA VVVSQEIEHR LAGLKGTLLP DDVVVTVTRD YGDTANEKAN ELLLHLGIAT
ISIVVLIAIA IGWREAVVTA VVIPTTILLT LFAANLMGYT INRVSLFALI FSIGILVDDA
IVVVENIARH WGMKDGRPRL QATIEAVAEV GNPTVIATLT VVAALLPMLF VSGLMGPYMA
PIPANASAAM LFSFFVAMVV APWLMLRLAP KQGAAVAAHD AHDEGRLGRL YRRIASPIVA
SKRASWIFLL GVGVATLLSM VLFYTKSVTV KLLPFDNKSE IAVIVDLPEG ATLEDTERTL
FAAADIARGL PEITSVQSYA GTPAPFNFNG LVRHYYLRER PELGELQVNL AARGDRSRAS
HAIALDLRER LKALAIPAGA SVKVVEVPPG PPVLSTLLAE IYGPDAETRR AVTAEVKKIF
KDVPFIVDVD DSIGEKRPRL RLSIDQDRLE FFGVEQKDVY DTIQTLFGGV SVGYSHRGEG
RNPIAIHVGL PKHDLAWNEA LASTPVPANT LPGSKTVVEL GQLVKATREV GSPMIFRRDG
RFADMVMAEL AGKFEAPLYG MLEVDKRIEA HDWGKLPKPA ISLHGQPTDE SRPTLLWDGE
WEITWVTFRD MGAAFGAAIL GIYVLVVAQF RSFKLPLVIL TPIPLTLIGI LIGHWLFGAP
FSATSMIGFI ALAGIIVRNS ILLVDFIRHS GGESKTLREV VLRAGAVRFK PILLTALAAM
IGAATILLDP IFQGLAISLL FGLASSTLLT VLVIPAIYIV LRDNTPKPPP NLTAK
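
The protein sequence begins with M although the gene sequence begins with GTG, which normally encodes valine. Under translation table 11, GTG can serve as an alternative start codon and is then read as methionine. The sketch below illustrates this on the first line of the gene sequence. It is a simplified stand-in, not the database's own translation procedure: the names are hypothetical, and only ATG, GTG and TTG are treated as start codons, which is a subset of the starts that table 11 allows.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in T, C, A, G order
# (table 11 uses the same assignments as the standard code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a reduced set of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(blocked_dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = ("GTGAAGCTCG GAATTTCAGG CGGCCTGACC "
              "AAGGCCAGCA TCCAGTCGCC GCTGACGCCG")
print(translate(first_line))  # MKLGISGGLTKASIQSPLTP
```

The output matches the first 20 residues of the protein sequence listed above.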