Gene RPD_2037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2037 
Symbol 
ID4022519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2283664 
End bp2284755 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content65% 
IMG OID637962230 
Productsecretion protein HlyD 
Protein accessionYP_569173 
Protein GI91976514 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0845] Membrane-fusion protein 
TIGRFAM ID[TIGR01730] RND family efflux transporter, MFP subunit 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.323562 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGAT CCCTCATGCT CCCTTCGCTC CGGAACGTCC TTTGGCTGCC TGTTTGCGTG 
GTGCTGGCGG GCTGCAGTGA AGAGGCTCCC CAGAATCGGC CGGCGGCCTT GGTCAAGACC
GAGCTCGTGC GGTTGCAGCC CCGGCAGACC GTCATTCGTC TGACCGGCGA CGTTCAGGCG
CGCGTCAGCA CTGAATTGTC ATTCCGGGTG AGTGGGCGGG TGATCGAGCG CCTTGTGGAT
GTCGGCGCCC ATGTGAAGGC CGGCGACGTG CTGGCGAGGA TCGATCCGAC CGAGCAACGC
GCCGACCTTG TCGGCGCGCA GGCGGCAGTG GCGGCTGCGG AGGCTCAATT GCGTCTCGCC
AAGGCGACCT TCGAGCGGCA GAAGTCGCTG ATGGCCAGCG GCTTCACCAC CCGGGTGGCA
TTCGATCAGG CGCAAGAGGG GTTGCGGACC GCGGAAGGAT CGCTCGACAC GGCCAAGGCG
CAGTTGGGCA TCGCCACCGA TGCGCTGAGC TATACCGAGC TTCGGGCCAG CGCTGCGGGG
ATCATCACTG CGCGCAACAT CGAGGTCGGT CAGGTCGCGC AATCCGCCCA GTCTGCTTAC
ACGCTCGCCG AGGATAACGC CCGTGACGCC GTGTTCGACG TCAACGAGTC GATCTTCCTG
ACGCCGCTCG AGGGCAGCAC CGTCAAGCTG ACAATGGTGT CGGACCCGTC GATCAGCGCG
ATCGCGCGTC CGCGCGAGAT TTCTCCCACG GTTGACCAGA AGAGTGGGGC GGTCCGGGTC
AAGCTTTCGA TCGAAAATCC GCCGGCGGCA ATGACGCTCG GCAGCATCGT GACCGGCGAG
GGGCGCGGCA GGCCGGTCGA CAAGATCGTG CTGCCCTGGA GTGCCCTGAA CGCCAATCTC
ACTGGGCCGG CCGTCTGGGT CGTCGATCCA AAAACCCGCG CGGTGGCGCT CAAGAATGTC
GTGATCGAGA GCTACGAAAC CAATTCGATC GTGGTCGGCG GCGGTCTCAC CGCCGGCGAG
CGGGTCGTCG TCGACGGCGG CAAGCTGCTC CGGCCGGCAC AAATCGTCAC CTATGACGGG
GAAAACTCAT GA
 
Protein sequence
MSRSLMLPSL RNVLWLPVCV VLAGCSEEAP QNRPAALVKT ELVRLQPRQT VIRLTGDVQA 
RVSTELSFRV SGRVIERLVD VGAHVKAGDV LARIDPTEQR ADLVGAQAAV AAAEAQLRLA
KATFERQKSL MASGFTTRVA FDQAQEGLRT AEGSLDTAKA QLGIATDALS YTELRASAAG
IITARNIEVG QVAQSAQSAY TLAEDNARDA VFDVNESIFL TPLEGSTVKL TMVSDPSISA
IARPREISPT VDQKSGAVRV KLSIENPPAA MTLGSIVTGE GRGRPVDKIV LPWSALNANL
TGPAVWVVDP KTRAVALKNV VIESYETNSI VVGGGLTAGE RVVVDGGKLL RPAQIVTYDG
ENS