Gene RPB_4104 details

Gene Information

Locus tag: RPB_4104
Symbol:
ID: 3911911
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4673771
End bp: 4675444
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 62%
IMG OID: 637886008
Product: PAS/PAC sensor hybrid histidine kinase
Protein accession: YP_487708
Protein GI: 86751212
COG category: [K] Transcription
              [T] Signal transduction mechanisms
COG ID: [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain
        [COG5002] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
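
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 4673771
END_BP = 4675444


def gene_length(start_bp, end_bp):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1674
print(protein_length(length_bp))  # 557
```

Under these assumptions both results agree with the listed Gene Length (1674 bp) and Protein Length (557 aa).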


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0461305
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCGAGC GGCTGATCGC CGACAAGCTG CGAACAGCGG AAGACGCACG GACCCAGCAA 
ATCGGAACCG GCATCGAACT GATCGCGCGC CGCAAGGACG GCAGCGAATT CCCGATCGAG
ATCATGTTGA GCCCGCTCGA CAGTCCCGAA GGCGTCCTGG TCACCGCTGC CATACGCGAC
ATCTCGGAGC GTAAGGACGC GGAAAAGCAC CTGGTGCAGA TGGAGGGACG ATACCAGGCG
CTGCTGGAGG CGGCGCCTGA CGCGATGGTG GTGGTGAATC AAACCGGGGA GATCGTCCTG
CTCAACCTGC AAGCGGAAAA GCAGTTCGCC TATCGTCGCG ACGAACTGCT CGGCCAGAAG
GTGACGAACA TCATCCCCGA AGGATTCGCC GAGCGGTTGA TCGCCGACCG GTTACGATCC
AGGGAAGATG CATTGGCCCA GCACATCGGC GCAGGAATCG AGCTCACCGG CCGGCGAAAG
GACGGCAGCG AATTTCCGAT CGAGATCATG CTCAGCCCGC TGGAAAGCGA AGGCAGCATC
CTGGTAACCG CCGCGATCCG CGAGATCAGC GCCCGCAAGC ACATGGAGCG CCTCAGGGAC
GAATTCGTCG CGACCGTCAG CCACGAATTG CGCACGCCGC TGACGTCGAT CTCGGGCTCA
CTCGGCCTTC TGGTGGGGCA ATGGGTCGGC ATTTTTCCGG AGCCCGCGGC GAGGCTCGTG
GCCATTGCCT ACAAGAACAG CCAACGGCTG GTGCGCCTGA TCAACGACAT CCTCGATATC
GAAAAACTCG GCGACGGTCG CGTGGTCTTC AATCTGTGTC GCGTCGACGT GCACGCCATT
GTGGAGCAGG CGATCGAGAG CAACCGCGGA TTTGCCGAAG GCTACGGCGT CAATGTCGGG
CTCACCACCG CATCGAGGAG CAGCGACGTC AACGCCGACC CGGATCGCCT CGCGCAGGTG
ATTACCAACC TTCTTTCGAA CGCGATCAAG GCCTCGCCTC CTGGTCGGGA CGTCCTTGTC
GCGGTCGAAC CACACGACGC GTTTGTCCGC ATTTCCGTTC GCGATCAGGG CGACGGAATT
CCAGCCACCT TCCGGCTGCA CATCTTCGAA AAATTCGCCC AGGCGGATGC GACCGACGCA
CGACAGAAAG GCGGTACCGG CCTCGGCCTC AGCATCGTCA AGCAGATCGT CGAACGGCTC
AATGGCCGGG TCAGCTTCGA AGACGCGGCC GGCGGCGGCA CGGTGTTCTG CGTCGATCTT
CCGGAATGGA AGATTCCGAT CGATGAACGG CCGTCGGTTC CAAAGTATCG AGTTCTGCAC
CTCGATGATG ATCCCAACAT TCTGGCGGCT GTCGCCCATG CCCTCAGCCC GGCAGCCCAG
GTGATATCGG TCGGGTCCCT CCAGGACGCA CGACGCATCT TGTCGGTCGA TCCGGTCGAC
ATGGTCCTGC TGGACATCTC GCTCGGCGAG CACTCCGGGC TGGAATTATT GGCTGATCTG
CACGACGCTG ACGGTGAAGC CATCCCGGTT ATCGTCTTTT GCACCAGCCC GATCGAACTG
ACACCCGACG GACAAGTGCA TGTGATCCTC GTCAAATCAC GGACCCCGCT GGATTCCTTG
CTGGCGTCGG TGCGGGACTG CCTGGCTCAT CGCGAACATC GCGAGGTGGC ATGA
 
Protein sequence
MPERLIADKL RTAEDARTQQ IGTGIELIAR RKDGSEFPIE IMLSPLDSPE GVLVTAAIRD 
ISERKDAEKH LVQMEGRYQA LLEAAPDAMV VVNQTGEIVL LNLQAEKQFA YRRDELLGQK
VTNIIPEGFA ERLIADRLRS REDALAQHIG AGIELTGRRK DGSEFPIEIM LSPLESEGSI
LVTAAIREIS ARKHMERLRD EFVATVSHEL RTPLTSISGS LGLLVGQWVG IFPEPAARLV
AIAYKNSQRL VRLINDILDI EKLGDGRVVF NLCRVDVHAI VEQAIESNRG FAEGYGVNVG
LTTASRSSDV NADPDRLAQV ITNLLSNAIK ASPPGRDVLV AVEPHDAFVR ISVRDQGDGI
PATFRLHIFE KFAQADATDA RQKGGTGLGL SIVKQIVERL NGRVSFEDAA GGGTVFCVDL
PEWKIPIDER PSVPKYRVLH LDDDPNILAA VAHALSPAAQ VISVGSLQDA RRILSVDPVD
MVLLDISLGE HSGLELLADL HDADGEAIPV IVFCTSPIEL TPDGQVHVIL VKSRTPLDSL
LASVRDCLAH REHREVA
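
The gene sequence begins with GTG while the protein sequence begins with M, which is what translation table 11 (bacterial) produces when GTG is used as an initiation codon. The sketch below shows one way to reproduce the first protein block from the first gene block. It is a minimal illustration, not the database's own pipeline: the function name `translate_cds` is hypothetical, and only a subset of the table 11 start codons is listed, as an assumption.

```python
from itertools import product

# Codon-to-residue mapping in the conventional T, C, A, G order.
# Table 11 uses the same residue assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of table 11 initiation codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a CDS, reading an initial start codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is decoded as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above (60 bases).
first_line = (
    "GTGCCCGAGC GGCTGATCGC CGACAAGCTG CGAACAGCGG AAGACGCACG GACCCAGCAA"
)
print(translate_cds(first_line))  # MPERLIADKLRTAEDARTQQ
```

The output matches the first two 10-residue blocks of the protein sequence (MPERLIADKL RTAEDARTQQ).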