Gene RPB_4221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4221 
Symbol 
ID3912029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4793862 
End bp4797002 
Gene Length3141 bp 
Protein Length1046 aa 
Translation table11 
GC content65% 
IMG OID637886124 
Productacriflavin resistance protein 
Protein accessionYP_487823 
Protein GI86751327 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.592622 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.234166 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTGGT TCAATCTGTC GGCGTGGGCG GTCGCGCATC CGGCGCTGAT CTTGTTCCTC 
ATCGTGGCGT TGGGCGTCTC CGGGATCTAC TCGTATCAGC GGCTCGGTCG CGCCGAGGAT
CCGTCGTTCA CTGTCAAGGT CGCGGTGATC TCGGTGATCT GGCCGGGCGC CACCGCCAGA
GAAATGCAGG AACAGGTCGC CGACCCGATC GAGAAGAAAC TGCAGGAACT GCCGTATTTC
GAGAAGGTGC AGACCTATTC GAAAGCGTCC TTCGCGGCGA TGCAGGTGAC CTTCCGCGAC
TCGACGCCGC CGCCCGAAGT GCCGCATCTG TTCTATCTGC TGCGCAAGAA GCTGTGGGAC
GTCGCGCCGC AGCTTCCGAG CGATCTGATA GGACCGACCA TCAACGACGA ATACAGCGAC
GTCGATTCCA TCCTGTACAT GATGACCGGC GACGGCGCCG ACTACGCGCA GCTGAAGAAG
ACGGCCGAAG GCCTGCGTCA GCGCCTGCTG AAAGTCGACA ACGTCACCAA GGTCAACGTC
TACGGCACCC AGGACGAGCG GATCTATGTC GAGTTCTCGC ACGCCAAGCT CGCCACGCTC
GGCCTGACGC CGCAAGCGCT GTTCGACTCC CTCGCCAAGC AGAATGCGGT GACGCCGGCG
GGCATCGTGG AGACCTCGTC GCAGCGTGTG CCGCTGCGTG TCACCGGTGC GCTCGACGGC
GTCAAGGCGG TGGCCGAGAC GCCGGTCGAA AGCAACGGCC GACTGTTCAG GCTCGGCGAC
ATCGCCACCG TCTCGCACGG TTATGTCGAT CCGACCGACT ACATGGTGCG GCAGAAGGGC
CAGCCGGCGA TCGGCATCGG CGTGGTCACC GCGCGGGGCG CCAACATTCT CGAACTCGGC
GAGCAGGTGA AGGCCGCGAC CGCGGAGTTC ATGGGCGAGG TGCCGCAGGG CATCGAGATC
GAGCAGATCG CCGATCAGCC GTTGGTGGTC AAGCACGCCG TCGGCGAGTT CATGATGTCG
TTCATCGAGG CGCTGGTGAT CGTCCTGTTC GTGTCGTTCC TGGCGCTCGG TTGGCGCACC
GGCATCGTGG TGGCGCTGTC GGTGCCGCTG GTGCTGGCGA TCGTCTTCAT CGTGATGAAC
GTCATGTCGC TCGACCTCCA CCGCATCACG CTCGGCGCGC TGATCATCGC GCTCGGACTA
TTGGTCGACG ACGCCATCAT CGCGGTCGAG ATGATGGTGG TGAAGATGGA GCAGGGCTGG
GATCGCGCCC GCGCCGCGTC CTTCGCATGG GAATCGACCG CGTTTCCGAT GCTGACCGGC
ACGCTGGTCA CCGCGGTGGG GTTTCTGCCG ATCGGCTTCG CCAATTCCTC GGTCGGCGAA
TATGCCGGAG GCATTTTCTG GATCGTGGCG ATCGCGCTGG TCGCGTCGTG GTTCGTCGCG
GTGATCTTCA CGCCCTATAT CGGCGTCAAG CTGCTGCCGG ATTTCGCCGG CAAGAAGGGC
CACAATCCCG ACGAGGTGTA TCACACGAGG ATCTATCGCG CGTTGCGCGC CGGCGTCGCC
TGGTGCGTGC GCTGGCGCGG CACCGTCGTG CTGGCCACGG TCGGCATCTT CGTCGCATCG
GTGGTCGGCT TCGGCCACGT TCAGCAGCAG TTCTTCCCGC TGTCCGAGCG GCCCGAACTG
TTCCTGCAGT TGCGCCTGCC GGAAGGCACG GCGTTCAACG TCACCATGAA CACCGTCAAG
CAGGCCGAGA CGCTGCTCAA GGACGACGGC GATATCGCCA CCTATACCGC CTATGTCGGC
AAGGGCTCGC CTCGGTTCTG GATGGGGCTC AATCCGCAGC TGCCCACCGA ATCCTTCGCC
GAGATCGTGA TCGTCGCGAA GGACGTCGCG TCGCGCGAAC GGATCAAGGC GCGGCTCGAA
CAGGCGGCGC ATGACGGCCG GCTCGCCGAG GCGCGGGTCC GCGTCGACCG TTTCAACTTC
GGCCCGCCGG TCGGCTTTCC GGTGCAGTTC CGAGTGATCG GTTCGGACGC CGCCAAGGTG
CGCGAGATCG CCTACCAGGT CCGCGACGTG GTCAAGGCCA ATCCCAACGT GATCGATCCG
CATCTCGACT GGAACGAGCA ATCACCCTAT CTGAAACTCG CCGTCGATCA GGACCGCGCC
CGCGCGCTCG GATTGACGCC GCGAGACGTC TCCCAGGCGC TGGCGATGCT TATCTCCGGC
ACGCAGGTGA CGACGGTGCG TGATGGCGTC GAAAAGATCG GCGTGGTCGC CCGCGCGGTG
CCGTCGGAAC GGCTCGATCT CGGCCGGATC GGCGAACTGA CCATCACCGC GCGCAACGGC
GTGGCGGTAC CGCTGTCGCA AGTCGCCAAG GTCGAATATG CGCACGAGGA GCCGATCCTG
TGGCGGCGCA ACCGCGACAT GGCGATCACC GTGCGCGCCG ACGTCGTCGA CGGTGTGCAG
GCGCCGGACG TCACCAATGC GATCTGGCCG CAACTCGAGA AGATCCGCGC CGGCCTGCCG
TCCGCCTACC GGATCGAGAC GGGCGGCGCG ATCGAGGAAT CCGCCAAGGG CAATGCGTCG
ATCTTCATCC TGTTTCCGGT GATGGTGATC GTGATGCTGA CGCTGCTGAT GATCCAGTTG
CAGAGCTTCC CGCGGCTGCT GCTGGTGTTC CTCACCGCGC CGCTCGGCGT CATCGGCGCT
TCGCTCGGCC TCAACGTCGC CAATGCGCCG TTCGGCTTCG TCGCGCTACT CGGGCTGATC
GCGCTGGCCG GCATGATCAT GCGCAACACC GTCATTCTGG TCGATCAGAT CGAGACCGAC
GTCGCGCAGG GTTCGACCCG GCGCGTCGCG ATCGTCGAGG CCACCGTGCG CCGGGCCCGG
CCGGTGGTGC TGACGGCGCT CGCGGCGATC CTGGCGATGA TCCCGCTGTC GCGTTCCGCC
TTCTGGGGGC CGATGGCCAT CACCATCATG GGCGGCCTGT TCGTCGCCAC CTTCCTGACG
CTGTTCTATC TGCCGGGGTT GTACGCGCTG TGGTTCCGCA ACAGCCTCGA TGAACGCGGT
GGTGACAGCG CCGCCGATCC TGCGCCGCAG CATGGGGACC AGGCGCAGCC TGCGTTTCCG
CTTGCCGCAG CGGCCGAATA A
 
Protein sequence
MRWFNLSAWA VAHPALILFL IVALGVSGIY SYQRLGRAED PSFTVKVAVI SVIWPGATAR 
EMQEQVADPI EKKLQELPYF EKVQTYSKAS FAAMQVTFRD STPPPEVPHL FYLLRKKLWD
VAPQLPSDLI GPTINDEYSD VDSILYMMTG DGADYAQLKK TAEGLRQRLL KVDNVTKVNV
YGTQDERIYV EFSHAKLATL GLTPQALFDS LAKQNAVTPA GIVETSSQRV PLRVTGALDG
VKAVAETPVE SNGRLFRLGD IATVSHGYVD PTDYMVRQKG QPAIGIGVVT ARGANILELG
EQVKAATAEF MGEVPQGIEI EQIADQPLVV KHAVGEFMMS FIEALVIVLF VSFLALGWRT
GIVVALSVPL VLAIVFIVMN VMSLDLHRIT LGALIIALGL LVDDAIIAVE MMVVKMEQGW
DRARAASFAW ESTAFPMLTG TLVTAVGFLP IGFANSSVGE YAGGIFWIVA IALVASWFVA
VIFTPYIGVK LLPDFAGKKG HNPDEVYHTR IYRALRAGVA WCVRWRGTVV LATVGIFVAS
VVGFGHVQQQ FFPLSERPEL FLQLRLPEGT AFNVTMNTVK QAETLLKDDG DIATYTAYVG
KGSPRFWMGL NPQLPTESFA EIVIVAKDVA SRERIKARLE QAAHDGRLAE ARVRVDRFNF
GPPVGFPVQF RVIGSDAAKV REIAYQVRDV VKANPNVIDP HLDWNEQSPY LKLAVDQDRA
RALGLTPRDV SQALAMLISG TQVTTVRDGV EKIGVVARAV PSERLDLGRI GELTITARNG
VAVPLSQVAK VEYAHEEPIL WRRNRDMAIT VRADVVDGVQ APDVTNAIWP QLEKIRAGLP
SAYRIETGGA IEESAKGNAS IFILFPVMVI VMLTLLMIQL QSFPRLLLVF LTAPLGVIGA
SLGLNVANAP FGFVALLGLI ALAGMIMRNT VILVDQIETD VAQGSTRRVA IVEATVRRAR
PVVLTALAAI LAMIPLSRSA FWGPMAITIM GGLFVATFLT LFYLPGLYAL WFRNSLDERG
GDSAADPAPQ HGDQAQPAFP LAAAAE