Gene RPD_3406 details


Gene Information

Locus tag: RPD_3406
Symbol:
ID: 4023919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 3788171
End bp: 3789430
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 69%
IMG OID: 637963611
Product: secretion protein HlyD
Protein accession: YP_570531
Protein GI: 91977872
COG category: [V] Defense mechanisms
COG ID: [COG1566] Multidrug resistance efflux pump
TIGRFAM ID:
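
The start, end, and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this, but it is the reading that reproduces the listed gene length). The variable names are illustrative only.

```python
# Coordinates as listed in the record above.
start_bp = 3788171
end_bp = 3789430

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# An unspliced CDS is read in triplets; the final codon is a stop codon
# and does not contribute an amino acid to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1260 0 419
```

The result matches the listed gene length of 1260 bp and protein length of 419 aa.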


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0814588
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGGAC GCGATCAGGC TGCCCGAGTT CGTCGCGCCG GGCCGGACAG CGCGCCGGCC 
GAATCCGGCG CCGATGTCCC GCAGCCGCAA GCCCCGTCGC AGGCGGAGAC GCTGCGCTCG
TTGCTGGAGG ATCCCGCGAC GCTGAAGCTC GCCGATCCGC CGGAGCCGGA GCTGGACGAC
GAGGCGGTCG CGCCGGTTGC GAAGCCCGCG GCGGCAGCAA GGAAGCCCGG CAAGAAAAGG
CTGGTCCTGA TCGGCGTCGG CATCGCGGCG CTGGCCGCGG CCGCGTATTA CGGCATCGAT
TACATGCTGG TCGGCCGCTT CATGGTGTCG ACCGACGACG CCTATGTGCG GGCCAACAAC
ACCACGCTCG GCGCCCGCGT CGCCGGCCAT GTCGCCGCCA TCCTGCCGCG TGACAACGCC
GTCGTCAAAG CGGGCGACGT GGTGTTCAAG ATCGATGACG GCGACTACAA GATCGCGGTC
GACGCCGCCC GCGCCAAGAT CGCCACCCAG CAGGCGACGA TCGAACGGAT CGGCCGGCAA
GTCTCCGCGC TGCAGAGCGC CGTCGAGCAG GCGCAAGCGC AGCGCGACTC CGCCGAGGCC
GCGGCCAAGC GCGCTGCGCT CGATTTCGAT CGCCAGCAGG CGCTCAGCAC CAAGGGGTTC
GCCTCGCGCG CAACCTTCGA GGTGTCTCAA GCCGGCCGCG ACCAGGGCGT CGCCTCGGTG
GCCGCCGCCA AGGCCGCGTT CGATGCCGCG CGTGACAATG TCGAGGTCAC CAAGGCGCAG
CAAAACGAAG CGCGCGCCCA GCTCGTCGAG CTGCAGAGCT CGCTCGCCAA GGCCGAGCGC
GATCTCGACT TCACCAATGT CCGTGCGCCG GTCGAGGGCG TGTTCTCCAA CCGGCTCGTC
AATACGGGCG ACTTCATCCA GGCCGGCCAG CGGCTCGCCA ACATCGTGCC GCTCGACGGG
GTCTATGTCG ACGCCAACTT CAAGGAAACC CAACTCGGCC GGCTGAAGCC CGGCCAGAAG
GTCGACATTT CGGTCGACGC TTACTCCAGC CGCAAGATCG AGGGCACGGT CGACAGCCTG
GCGCCCGCGG CCGGTCAGGT GTTCACGCTG CTGCCGCCCG ATAACGCCAC CGGCAACTTC
ACCAAGATCG TGCAGCGCGT GCCGGTGCGG ATCCGCGTCC CGGCCGAGGT GGCGCGCGAG
AATCTGCTGC GCGCCGGCAT GTCGGTCTAC GTCCGCGTCG ACACCAAGCC GGCGAACTGA
 
Protein sequence
MSGRDQAARV RRAGPDSAPA ESGADVPQPQ APSQAETLRS LLEDPATLKL ADPPEPELDD 
EAVAPVAKPA AAARKPGKKR LVLIGVGIAA LAAAAYYGID YMLVGRFMVS TDDAYVRANN
TTLGARVAGH VAAILPRDNA VVKAGDVVFK IDDGDYKIAV DAARAKIATQ QATIERIGRQ
VSALQSAVEQ AQAQRDSAEA AAKRAALDFD RQQALSTKGF ASRATFEVSQ AGRDQGVASV
AAAKAAFDAA RDNVEVTKAQ QNEARAQLVE LQSSLAKAER DLDFTNVRAP VEGVFSNRLV
NTGDFIQAGQ RLANIVPLDG VYVDANFKET QLGRLKPGQK VDISVDAYSS RKIEGTVDSL
APAAGQVFTL LPPDNATGNF TKIVQRVPVR IRVPAEVARE NLLRAGMSVY VRVDTKPAN
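
The sequences above are printed in space-separated blocks of ten characters, which have to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC fraction comparable to the "GC content" field. The helper names `strip_blocks` and `gc_fraction` are hypothetical, and the sample string is only the first two blocks of the gene sequence, so its value is not expected to equal the whole-gene figure.

```python
def strip_blocks(blocked: str) -> str:
    """Join a blocked sequence by removing all whitespace and upper-casing it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence listed above.
sample = "ATGTCCGGAC GCGATCAGGC"
seq = strip_blocks(sample)
print(len(seq), gc_fraction(seq))
```

Applying the same two functions to the full gene sequence should give a value close to the listed GC content, assuming that field is computed over the coding sequence alone.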