Gene RPD_0235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0235 
Symbol 
ID4020693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp273587 
End bp274672 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content69% 
IMG OID637960414 
Producthypothetical protein 
Protein accessionYP_567376 
Protein GI91974717 
COG category[S] Function unknown 
COG ID[COG5330] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCCG CCCTTCTGCT CGCACCCGAA CTGGACGACG TCGTCGGCCA TGGCGATGCC 
GCGCGGCGCG CCGATGCCGT GCGGCGGATC GCCGACCTGT TCGTGCAAGG CGCGATGAAT
TTCAACGCCG AACATGTCGC GGTGTTCGAC GGCGTCCTGG TGCGACTGAT ACCCGACACC
GACGGCGATC TGCGCGGCGA GCTGGCGCGG CGGTTCTGCT CGCTCGGCAA CGCGCCGCCG
ACGCTGATTG AGCAATTGGC GCAGGATGAG GACATCGGCG TTGCCGGGCC GCTGTTGCGA
CGCTCGACGC AGATCACCGA CGACACGCTG GTGGAGCTCG CCGAAGTGCG CGGCCAGACC
CATCTGATGG CGATCTCCGA GCGGCCGGCG ATCTCGCCGC CGGTCACCGA CGTGATCGTC
CGCCGTGGCG ATCGCGACGT GTTGCGGATG GTCGCGGTGA ATGCCGGCGC GGCGTTTTCA
GCCTTTGGCT TCAGCGGCCT GATCCGCCGC GCGGCACAGG ACGGCGTGCT GGCCGTCGCC
GTCGGCGTGC GTGACGACCT GTCGTTGCCG CGGTTGAAAG ATCTGCTGGC GTCCTCGAGC
GAACCGGTGC GCCGGAAGCT GTTCGAAACG GCGTCGCCGA GCGCCCGAAT CGCGATCAAC
CGCGCGCTGC GCGAACTCAC CGGCGAGCCG ATGCAGCCCT CGGTGAAGCG CGACTTTGCG
CCGGCGCAAC GCGCGATCGT GGCGCTGCAT AATGCCGGCG GCCTCACCGA GCAGGCGCTG
CTCAGCTTCG CCCGGGCATT CCAATACGAG GAAACGGTGG CGGCGCTGTC GGCGATGTCC
GGCGTGCGGA TCACGACGCT CGACCCGTTG ATGGCGGGCG AACGCCACGA CCCGATGCTG
ATGCTCGGCA AGGCGCTCGG CCTCGACTGG ACCACGGTGC GCGCGATGAT CGGGCTGCGG
CGCGGGCCGG ACCGGATGCC GTCCTCGCCC GACGTCGAGG AGGCGCGGCA GAATTTCGAG
CGGCTGGCGC CCTCGACCGC ACATCAGGTG GTCGGCTTCT GGAAAATGCG ACAGGCGATG
AACTGA
 
Protein sequence
MSAALLLAPE LDDVVGHGDA ARRADAVRRI ADLFVQGAMN FNAEHVAVFD GVLVRLIPDT 
DGDLRGELAR RFCSLGNAPP TLIEQLAQDE DIGVAGPLLR RSTQITDDTL VELAEVRGQT
HLMAISERPA ISPPVTDVIV RRGDRDVLRM VAVNAGAAFS AFGFSGLIRR AAQDGVLAVA
VGVRDDLSLP RLKDLLASSS EPVRRKLFET ASPSARIAIN RALRELTGEP MQPSVKRDFA
PAQRAIVALH NAGGLTEQAL LSFARAFQYE ETVAALSAMS GVRITTLDPL MAGERHDPML
MLGKALGLDW TTVRAMIGLR RGPDRMPSSP DVEEARQNFE RLAPSTAHQV VGFWKMRQAM
N