Gene RPB_3436 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3436 
Symbol 
ID3911238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3935569 
End bp3937884 
Gene Length2316 bp 
Protein Length771 aa 
Translation table11 
GC content65% 
IMG OID637885339 
Producthistidine kinase 
Protein accessionYP_487043 
Protein GI86750547 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.416045 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0284383 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGCG TGCAGGGCGC GAGCGCCTGC GCTCAATCCG ATTCGATCAA AGGATTGGCG 
CAGTCGATCG CTAAACCGGC CTACCATCGA CTGTTGACGG CCGAGCCTGT GCTCCGCCGC
GCCGTCCCGA TCCTCATCAT CGCCTTCCTC ATCACGATCT GCTTCGGCGC CGCCGTGCAG
TTCATCGATC AGAGCCGGCA GAAGCGCGCT GCGCTCAATC GCGATCTCTG CGCCCTCGCC
GATCTGCTCG CCGAGCGGCT CGAACATATC GGCGTCGTGC GGCTCGATCG TCCGGCTTCG
ATCGAACGCC TGCAGAATTT GCTACCAGGC CTGATCCCGT CCTGGGGCAT CGCCGCCGGG
CGCCACGTCA TCGTCACCAG CGCCGATCAG CGCGTCCTGG CGCGCGTACC CGGCGACGAA
GGCACCAGCG CCGCCAGCAG TTTCCTCGAC ATCATCAGCG CCACCCCGCC GCTGACACGG
TCCGCGCAGC AAGGCACCAT CACCCAGATC ATCCTGCCGA GCGGCGCCTC GGCGCTGGCG
ACGTTGCACA CCGTCAAGTC GCTGCCCGGC CAAGTCATCG TCATTCACGA AGACACCGGG
TCGATCCTGC GTTCCGATGC GGCGCTGCAG ATCACGCTGT CGGCTACCAC CGGCTTCGTC
GTGCTGATCC TCGGCTTCGC CTTTCACTGG CAGTCGACCC GCGCCCAGGA GGGCGACCTC
ATCAACGATG CCGTGCGCAG CCGGATCGAC ACCGCGCTCA ATCGCGGTCG CTGCGGGCTG
TGGGACTGGG ACCTGTCGCG CGGCCGGATC TTCTGGTCGC AATCGATGTT CACTTTGCTC
GGCCTGGAGA GTCGCAACGA CCTCCTGACC TTCGGCGAGG TCAACGCGCT GGTGAACAGC
GACGACATCG ACCTGTTCGC GATCGGCGAC CAGTTGATCG CCGGCAACAC CGATCACATC
GACCACAGCT TCCGGATGCG CCACGCCAAC GGCCACTGGA TCTGGCTGCG GATGCGCTGC
GAACTCAGCC AGGAAACCCA ATCCGACAAC AAGCACCTGA TCGGCATCGC GGTCGACGTC
ACCGAGCAGA AGAGCCTCGC CGAGCGCACC GTCGAAGCCG ATCTGCGGCT GCGCGACGCG
ATCGAGACCA TCCCCGAGGC CTTCGTACTG TGGGACGCCG ACAATCGCCT GGTGCTGTGC
AATTCGCACT TCCAGCGGCT GCACAAGCTG CCCGACATCG CGGTCGCGCC CGGCACCTCC
TACGAGACGG TGATCGAGGT CGGACAGATG CCCGAGATCC GCACCCGTCT CTGCGACAAC
AGCGGAGGAT CGTCCCCCGG CGCGCGGACC TTCGAGGCCC AGCTCGCCGA CGGCAGCTGG
CTGCACATCA GCGAACGCCG CACCAAGGAC GGCGGCTACG TCGCGGTCGG CACCGACATC
ACCCGGATCA AGGCTCACGA ACAGAAGCTC GTCGACAACG ACCTGCGGCT GCGCGCCACC
GTCGCCGACC TCAAGATCTC GCAATACAAG CTGGAGCGTC AGGCGATCGA ACTCGCCGAT
CTGGCGCGGA AATATTCCGA GGAGAAGACC CGCGCCGAGG AAGCCAATCA GACCAAATCC
AAATTCCTCG CCAATATGAG CCACGAGCTG CGCACGCCGC TCAACGCCAT CATCGGTTTT
TCCGAGATCA TGGGCAGCGG CATGTTCGGC ACGCTGGGCT CGGAGAAGTA TCAGGAATAC
TGCCACGACA TCATGACCAG CGGGCACTAT CTGCTGGAAG TGATCAACGA CATTCTCGAC
ATGTCGAAGA TCGAGGCAGG TCGGATGCGG CTCGACATGG AGGAGCTCGA TCTCGCTCGC
ACCCTCGGCG AATCCCTCAA GGTCGTCGCC GGCCGCGCCG AGCACAAGCA GCTCGAACTG
CTCTCGGAGA TCGAGGACGA CATTCCGATC GTGGCGGACC GCCGGGCGGT CAAGCAGATC
CTGATCAATT TGTTGTCCAA CGCCGTGAAA TTCACCCCCG ACGGCGGACG GGTCACGGTC
CGCAGCCGGA CGCTGCCGAA CGCCATCGTG TTGATGATCG CCGATTCCGG CATCGGCATC
GCGCCGCAAT CGCTGCGCCG TCTCGGCCAG CCGTTCGAGC AGGTCGAGAG CCAGCTCAGC
AAGACCTATC ATGGCTCCGG CCTGGGTCTG GCGATCGCCA AATCGCTCAC CACGCTGCAC
GGCGGCTCGA TGCGGCTGCG CTCCACGCTC GGCGCCGGCA CGGTGGTGAT GGTGACGCTG
CCGCGCGACT GCCAGCGCCG CCGGCTGGCG GCCTGA
 
Protein sequence
MARVQGASAC AQSDSIKGLA QSIAKPAYHR LLTAEPVLRR AVPILIIAFL ITICFGAAVQ 
FIDQSRQKRA ALNRDLCALA DLLAERLEHI GVVRLDRPAS IERLQNLLPG LIPSWGIAAG
RHVIVTSADQ RVLARVPGDE GTSAASSFLD IISATPPLTR SAQQGTITQI ILPSGASALA
TLHTVKSLPG QVIVIHEDTG SILRSDAALQ ITLSATTGFV VLILGFAFHW QSTRAQEGDL
INDAVRSRID TALNRGRCGL WDWDLSRGRI FWSQSMFTLL GLESRNDLLT FGEVNALVNS
DDIDLFAIGD QLIAGNTDHI DHSFRMRHAN GHWIWLRMRC ELSQETQSDN KHLIGIAVDV
TEQKSLAERT VEADLRLRDA IETIPEAFVL WDADNRLVLC NSHFQRLHKL PDIAVAPGTS
YETVIEVGQM PEIRTRLCDN SGGSSPGART FEAQLADGSW LHISERRTKD GGYVAVGTDI
TRIKAHEQKL VDNDLRLRAT VADLKISQYK LERQAIELAD LARKYSEEKT RAEEANQTKS
KFLANMSHEL RTPLNAIIGF SEIMGSGMFG TLGSEKYQEY CHDIMTSGHY LLEVINDILD
MSKIEAGRMR LDMEELDLAR TLGESLKVVA GRAEHKQLEL LSEIEDDIPI VADRRAVKQI
LINLLSNAVK FTPDGGRVTV RSRTLPNAIV LMIADSGIGI APQSLRRLGQ PFEQVESQLS
KTYHGSGLGL AIAKSLTTLH GGSMRLRSTL GAGTVVMVTL PRDCQRRRLA A