Gene RPD_1006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1006 
Symbol 
ID4021481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1139834 
End bp1143010 
Gene Length3177 bp 
Protein Length1058 aa 
Translation table11 
GC content67% 
IMG OID637961197 
Productacriflavin resistance protein 
Protein accessionYP_568145 
Protein GI91975486 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00411218 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.136084 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCTCA ACGTCTCCTC CTGGTCGATC CGGCATCCGC TGCCCTCGAT CGTGTTTTCG 
ATCATCCTGC TCGCGCTGGG CTGGATCAGC TTCACCAAGC TCGCGGTCAC GCGGCTGCCG
AGCGCCGACA TTCCGGTGAT CTCGGTCGCA GTAGCGCAGT TCGGCGCCGC GCCGGCCGAG
CTGGAAGCAC AGGTCACCAA AACGATCGAG GACGGCGTCT CCGGCGTCGA GGGCGTCCGG
CACATCGCCT CGTCGGTCAC CGACGGCCTC TCGGTCACCA CAATTCAATT CGCGCTCGAA
ACCAATACCG ACCGCGCGCT GAACGACGTC AAGGACGCCA TCACCCGGGT CCGCAGCAAC
CTGCCGCAGA ACGTCACCGA ACCGCTGATC CAGCGCGTCG ACGTGATCGG CCTGCCGATC
GTCACCTACG CCGCGATCTC GCCGGGCAAG ACGCCGGAAC AGCTCTCCTG GTTCGTCGAC
GACGTGGTCA AGCGCGCGCT GCAGGGCGTG CGCGGGGTGG CGCAGGTCGA GCGCATCGGC
GGCGTCGAGC GTGAGATTCT GGTGTCGCTC GATCCCGACC GGCTCAAGGC CGCGGGCCTC
ACCGCGCTCG ACGTGTCGCG GCGGCTGCGC GGCACCAATG TCGATCTCGC CGGCGGCCGC
GCCGAAATCG GCAAGAACGA CCAGGCGATC CGCACGCTGG CCGGCGCCAA GACGCTGAAC
GATCTCGCCG GCACCATGAT CAGCCTGTCG TCGGGCGGCG AAATCCGGCT CGACGACCTC
GGCACAGTCA CCGACACCAT CGCCGACCGC CGCACCTTCG CCCGCGTCAA TGGCGAGCCG
GTGGTCGCGC TCGGCATCAA GCGCTCCAAG GGCGCCAGCG ACGTGGTGGT CGCGAGCGCA
GTGCAGAAGC GGATCGACGC GCTGAAGGCG GCGCATCCCG ATGTCGACCT CAAGCTGATC
GACACCTCGG TCGATTACAC CAAGGGCAAT TATGAGGCCG CGATCTCGAC GCTGTTCGAG
GGCGCGATCC TCGCGGTGAT CGTGGTGTTC CTGTTCCTGC GCGACATCCG CGCCACCGTC
ATCGCCGCGA TCTCGCTGCC GCTGTCGATC TTCCCGGCGT TCTGGGCGAT GGACATGCTG
GGCTTCTCGC TGAACCTGGT CAGCTTCCTC GCCATCACGC TGTCGACGGG TATCCTCGTC
GACGACGCCA TCGTCGAGAT CGAGAACATC GTTCGCCACA TGCGGATGGG CAAATCGCCC
TACCAGGCGG CGATCGAAGC CGCCGACGAG ATCGGCCTCG CGGTGATCGC GATCAGCCTC
ACCATCATCG CGATCTTCGC GCCGGCCAGC TTCATGTCCG GCATCGCCGG GCAATTCTTC
AAGCAGTTCG GCATCACCGT CTCGGTGCAG GTGTTCTTCT CGCTGCTGGC GGCGCGCTTC
GTCACGCCTG TGCTCGCCGC CTATTTCCTC AAGCACGTCC CGCACGAGGA GAAGCCGCCG
GGCCGAATCC TGCGGGGCTA CACCCGGATG GTGACGTGGT CGGTCAAGCA CTATTATCTC
ACGGTGCTGA TCGGGCTCGG GGTGTTCGCG GCGTCGATCT GGAGCATCGT GCTGCTGCCG
CAGGGCTTCC TGCCGGCGCA GGACACCTCG CGCTCGGTGA TGGCGATGGA GCTGCCGCCC
GGCACCCAGA TCGGGACCAC CGAGAAGATC ACCGAAACCG TCGTGACGAT GCTGCGCAAG
CGGCCCGAAG TCCGCAGCGT GTTCGTCGAC GGCGGCCGGG TGCCGCCGGG CATCCACGAG
GTGCGCCGCG CGTCGCTGAT CATCAACTAC ACGTCGAAGG GCGATCGCAA GATCACCCAG
CGCGAACTCG AACTCGCGAT CAGCAAGGAT CTCGATCAGG TCCCCGACAT CCGCTACTGG
TTCCTCGACG AGAACGGCCT GCGCGCGATC TCGCTGGTGG TGACCGGCGC CGACAGCAAC
ATCGTCAACA ACGTCGCCCA GGAACTGGCG GCGCAGATGA AGCGCATCCC GATCCTCTCC
AACGTGATCT CGGAGACTTC GCTCGATCGC CCCGAACTGC GAATCCTGCC GCGCGCCGAT
CTCGCCGCCC GGCTCGGCGT CTCGACCGAA AGCCTGTCGG AGACGATCCG CGTCGCCACC
ATCGGCGACG TCGGCCCGGC GCTCGCCAAA TTCGACGCCG GCGATCGGCT GGTGCCGATC
CGGGTCCAGC TCGAGGACGG CGCCCGCGGC GATCTCAGCG TGCTCGAACA GCTCCAGGTG
CCGATCTATG GCGGCCGCGG TTCGGTGCCG CTGTCGGTGG TCGCCGACGT CAAGTTCGAC
CAGGGACCGA CCAGCATCAA TCGCTACGAC CGCGAACGCC AGGCGACGGT GGCCGCGGAC
CTCGTCGGCA ATGCCGCGCT CGGCGACGCC CAGAAGCGCA TCAACGACCT GCCGGTGATG
AAGTCGCTGC CGAAGGGGGT CCGGGTCAGC CCGTCCGGCG ACGCGGAAAG CCTGAACGAA
CTCTCGGACG GCTTCGCCAC CGCGATCAGC GCCGGGCTGA TGATGGTCTA TGCGGTGCTG
GTGCTGCTGT TCGGCACCTT CCTGCAACCG ATCACCATCC TGTTCTCGCT GCCGCTGTCG
ATCGGCGGCG CGATCGGCGC GCTGCTGATC ACCGGCAAGC AGCTCACCAC GCCGGTGTGG
ATCGGCATCC TGATGCTGAT GGGCATCGTC ACCAAGAACG CGATCATGCT GGTCGAGTTC
GCTCTGGAGT CGATCCGCGA CGGCAAGAAT CGCGAAGAGG CGATGATCGA CGCCGGCCAG
AAGCGCGCCC GGCCGATCGT GATGACGACC ATCGCGATGG TCGCCGGCAT GATCCCGAGC
GCGCTGGCGT TCGGCGCCGG CGGCGAATTC CGCTCGCCGA TGGCGCTCGC CGTGATCGGC
GGCCTGATCT TCTCGACGGT GCTGTCGCTG ATCTTCGTGC CCGCGATGTT CATGATGATG
GACGATGTCG GCCGGGTGTC CTGGAGCCTC GGCAAACGCC TGCTCAGCCA TCATCACGAC
GACGACGGCG ACCCGAAACC GCGCCCGCCG GCAGCGACGC CCACTCCCGC GCCGGCAACA
GCGACGCCGT CGAGCCCGTC GCCGGAGAAG GGCTTCTCGT TCTGGCGGAG CAAATAG
 
Protein sequence
MGLNVSSWSI RHPLPSIVFS IILLALGWIS FTKLAVTRLP SADIPVISVA VAQFGAAPAE 
LEAQVTKTIE DGVSGVEGVR HIASSVTDGL SVTTIQFALE TNTDRALNDV KDAITRVRSN
LPQNVTEPLI QRVDVIGLPI VTYAAISPGK TPEQLSWFVD DVVKRALQGV RGVAQVERIG
GVEREILVSL DPDRLKAAGL TALDVSRRLR GTNVDLAGGR AEIGKNDQAI RTLAGAKTLN
DLAGTMISLS SGGEIRLDDL GTVTDTIADR RTFARVNGEP VVALGIKRSK GASDVVVASA
VQKRIDALKA AHPDVDLKLI DTSVDYTKGN YEAAISTLFE GAILAVIVVF LFLRDIRATV
IAAISLPLSI FPAFWAMDML GFSLNLVSFL AITLSTGILV DDAIVEIENI VRHMRMGKSP
YQAAIEAADE IGLAVIAISL TIIAIFAPAS FMSGIAGQFF KQFGITVSVQ VFFSLLAARF
VTPVLAAYFL KHVPHEEKPP GRILRGYTRM VTWSVKHYYL TVLIGLGVFA ASIWSIVLLP
QGFLPAQDTS RSVMAMELPP GTQIGTTEKI TETVVTMLRK RPEVRSVFVD GGRVPPGIHE
VRRASLIINY TSKGDRKITQ RELELAISKD LDQVPDIRYW FLDENGLRAI SLVVTGADSN
IVNNVAQELA AQMKRIPILS NVISETSLDR PELRILPRAD LAARLGVSTE SLSETIRVAT
IGDVGPALAK FDAGDRLVPI RVQLEDGARG DLSVLEQLQV PIYGGRGSVP LSVVADVKFD
QGPTSINRYD RERQATVAAD LVGNAALGDA QKRINDLPVM KSLPKGVRVS PSGDAESLNE
LSDGFATAIS AGLMMVYAVL VLLFGTFLQP ITILFSLPLS IGGAIGALLI TGKQLTTPVW
IGILMLMGIV TKNAIMLVEF ALESIRDGKN REEAMIDAGQ KRARPIVMTT IAMVAGMIPS
ALAFGAGGEF RSPMALAVIG GLIFSTVLSL IFVPAMFMMM DDVGRVSWSL GKRLLSHHHD
DDGDPKPRPP AATPTPAPAT ATPSSPSPEK GFSFWRSK