Gene RPD_4074 details


Gene Information

Locus tag: RPD_4074
Symbol:
ID: 4024591
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4528593
End bp: 4529699
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 66%
IMG OID: 637964277
Product: secretion protein HlyD
Protein accession: YP_571194
Protein GI: 91978535
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0845] Membrane-fusion protein
TIGRFAM ID: [TIGR01730] RND family efflux transporter, MFP subunit
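
The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4528593
END_BP = 4529699
GENE_LENGTH_BP = 1107
PROTEIN_LENGTH_AA = 368


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 4529699 - 4528593 + 1 gives 1107 bp, and 1107 / 3 - 1 gives 368 aa, which agrees with the listed values.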


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTGCGC ACCCGATTTT CGATGGATTT ATCCGGTTGT TCATCGGCTT GCTGATCGCA 
TTCGCGGCGA TCGGGCTGGC CGGCTGCAAC GACAAGCAGG TCAAGAAGGA GCGCTCTGAT
CGGCCCGTCC TGGTCGAGAC CGTGCATTAT ATCGCGGCCT CTCCCGAACG TAGTTTCGTC
GGAACGGTCC GGCCGCGGAT CGAGACCGAT ATCGGCTTCC GGGTCTCCGG AAAGGTCGCG
AAGCGCCTTG TCGAGGTCGG GCAGACCGTC GAGATCGACC AACCGCTGGC GCTGCTCGAC
CAGACCGATC TCAAACTGCA AACGGAACAA TCCGAGGCCG AGCACCGCGC CGCCAAGGGC
GTGCTGGCGC AGGCGACCGC GTCCGAGAAC CGCGTCAAGG AGCTTCGCGC CAAGGGTTGG
GCGACGGAAG CGCAAATGGA TCAGGCGCAC GCCGCGGCGG ATGAGGCCCG CGCCCGCTTC
GCCCGTGCCG AGCGTTCGGT GGACCTGACC CGCAACGCGC TGTCCTATGC GAGCCTGATT
GCCGATACCC GCGGCGTCAT CACCGCGACG CTGATCGATG CCGGCCAGGT GGTCTCGGCA
GGGCAGCCGG CGTTCCGCGT GGCGCGGTTC GGCGAGAAGG AGGCCGTGGT CGCGATTCCG
GAGACGCTGG TGGGTCGCGC CAAGCAGGGC GAGGCCCGCG TCACGCTGTG GTCCGAGCCC
GGCAAGACCT ACGCAGCGAA GCTGCGCGAG ATTGCGCCGA TGGCAGACCC GGCGACCCGC
ACCTACCTCG CCAAGTTCCT GTTGCCCGAT GCCGACGACC GCGTCTCGCT CGGCATGACC
GCGACATTGA CGCTTGCCGA TTCGGCGACG GATCGCGTGG CGCGGCTGCC CCTTGCAGCA
TTGTTCAATC AGGGCGGCGC GTCGTCGATC TACGTTGTCG ATGGGTCCGG TCGGGTGACG
CTGAAGCCGG TCACCGTGAA GGCCTACGAG ACCGATCATG TGGTGATCAG CGGCGGCGTC
GAGGAGGGCG CCAAGGTGGT CGTGCTCGGC GTGCAAAAGC TCGATCCGGC CGAGAGGGTC
CGGATCGTGT CATCGTTGTC GTTCTAG
 
Protein sequence
MLAHPIFDGF IRLFIGLLIA FAAIGLAGCN DKQVKKERSD RPVLVETVHY IAASPERSFV 
GTVRPRIETD IGFRVSGKVA KRLVEVGQTV EIDQPLALLD QTDLKLQTEQ SEAEHRAAKG
VLAQATASEN RVKELRAKGW ATEAQMDQAH AAADEARARF ARAERSVDLT RNALSYASLI
ADTRGVITAT LIDAGQVVSA GQPAFRVARF GEKEAVVAIP ETLVGRAKQG EARVTLWSEP
GKTYAAKLRE IAPMADPATR TYLAKFLLPD ADDRVSLGMT ATLTLADSAT DRVARLPLAA
LFNQGGASSI YVVDGSGRVT LKPVTVKAYE TDHVVISGGV EEGAKVVVLG VQKLDPAERV
RIVSSLSF
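
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the method used to produce this record. It relies on translation table 11 having the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ in which codons may act as start codons). The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering.
# Amino-acid assignments of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
prefix = "ATGCTTGCGC ACCCGATTTT CGATGGATTT"
print(translate(prefix))  # MLAHPIFDGF, the first 10 residues of the protein
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows the listed protein sequence and GC content to be compared against values computed from the nucleotide sequence.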