Gene RPD_0743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0743 
Symbol 
ID4021216 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp832216 
End bp833667 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content67% 
IMG OID637960932 
ProductHWE histidine kinase 
Protein accessionYP_567882 
Protein GI91975223 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTCGG CGGTCGCGGC GCTGAGCAGG GAAGAACTCG AAGCCAGGCT GGAGGAAGCG 
GAAGATACGC TGCGTGCGAT CCGGGAAGGC GAAATCGACG CGCTGGTGGT CAAGGGCGGC
GGCGCCGAGC AGGTCTTCAC ACTCGAAGGT GGCGGCCAGT CGTACCGCAC ATTTATGGAA
GCGATGGACG TCGGCGCCGC GGCGTTCGAC GGCGAGGGAC AACTCCTCTA CGCCAATCAT
GCGCTATGCG CGCTGCTCGA TCAGCCGCCC GGCTCGCTGA CCGGGACCGC GCTGGTCGGA
CTGCTCGATG CCGTCAACCA GGCCAAATTC CGGCGGTTGC TGACGCAGGC GATGAGCGAG
CGGCAGTCGG GTGAAATCCA TTTGCGCGCG GGCAACGTCG AACGCAGCAT CCTGCTCGCG
GGGACGCCGC TGGAATTCGG AGTCGTCCAC GGCGCCGCGA TCACCTTCAC CGATATCAGC
GAGCGCGAGC AAGCCGCGGC CGCCAGAGAA TCCGAACGTC TGGCGCGGGC GATCCTCGCT
TCCGCCAACG AGGCGGTGAT CGTCTGCGAC CGCGCCGGCA GGGTCACGCA TCTCAATGGC
GCCGCGACCC GGATCTGCGC CGAGTCGCCG GTCGGGCTGT TGTTCTCTGA CGCGATCGAG
CTGACCTTCA CCGAGGCCTC CGGGCTGATG AGCGGCAGCG ACATCGTCAC GATGGCGGCG
TCCGGGATGC CGGTTCAAGG TCTGGAGGCT GCGGTGCGCA CGCCGTCGGC CTCCGTCACC
GACGTGCTGA TCAGCGCCGC GCCGCTGGTC GTATCCGGCG AACGCACCCA GGGCTGCGTC
GTCACGCTGA TCAACCTGTC GCAGCGCAAG GCCGCCGAGC GCCACCAGGC GCTGCTGATG
GGCGAACTCG ACCACCGCGT CAAGAATACC CTGGCGATGG TGATGGCGAT CTGCGCGCGC
ACCGCCGCGC ACGAGCGCAC CATCGAGGAT TTTCAGAGAT CCTTCTCAGG CCGGATCCAG
GCGCTGGCCG CGACCCACAC GTTGTTGTCG AACTCATCGT GGCAAAATCT ACAGATCAAG
GACGTGCTCG GCGCCGAGCT TGCACCATTC GCGTCGCTGT CGAGCGGCCG GATCGTCACC
GAGGGACTCG ACATCACCGT CGACGCCAAG ACCGCGGTGT CGCTCGGTCT GGTGTTTCAC
GAACTGACCA CCAACGCGGT GAAATACGGC GCGCTCTCGG TGCCCGGAGG CAAGATCGCC
GTGCGGCAAG TCGGCCGGTC GGACGACGGC GCGCTGATGA TCGAATGGCA GGAGCACGAT
GGCCCGCTGG TGACGCCGCC GGAGTCGTCC GGATTCGGCC AGGCGCTGAT CTCGCGCAGT
CTCGGCAGCG GCGGCGCCAC ACTGGAGTTT CGCCCCACCG GGGTGATCTG TAAAATCGCG
ATCCCGCGCT AA
 
Protein sequence
MGSAVAALSR EELEARLEEA EDTLRAIREG EIDALVVKGG GAEQVFTLEG GGQSYRTFME 
AMDVGAAAFD GEGQLLYANH ALCALLDQPP GSLTGTALVG LLDAVNQAKF RRLLTQAMSE
RQSGEIHLRA GNVERSILLA GTPLEFGVVH GAAITFTDIS EREQAAAARE SERLARAILA
SANEAVIVCD RAGRVTHLNG AATRICAESP VGLLFSDAIE LTFTEASGLM SGSDIVTMAA
SGMPVQGLEA AVRTPSASVT DVLISAAPLV VSGERTQGCV VTLINLSQRK AAERHQALLM
GELDHRVKNT LAMVMAICAR TAAHERTIED FQRSFSGRIQ ALAATHTLLS NSSWQNLQIK
DVLGAELAPF ASLSSGRIVT EGLDITVDAK TAVSLGLVFH ELTTNAVKYG ALSVPGGKIA
VRQVGRSDDG ALMIEWQEHD GPLVTPPESS GFGQALISRS LGSGGATLEF RPTGVICKIA
IPR