Gene RPD_1872 details


Gene Information

Locus tag: RPD_1872
Symbol:
ID: 4022354
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2096442
End bp: 2099252
Gene Length: 2811 bp
Protein Length: 936 aa
Translation table: 11
GC content: 64%
IMG OID: 637962065
Product: sensor histidine kinase
Protein accession: YP_569008
Protein GI: 91976349
COG category: [T] Signal transduction mechanisms
COG ID: [COG5000] Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation
TIGRFAM ID: [TIGR00229] PAS domain S-box
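
The coordinate and length fields in this record agree with one another. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon (neither convention is stated in the record). The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2096442
END_BP = 2099252
GENE_LENGTH_BP = 2811
PROTEIN_LENGTH_AA = 936


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes an unspliced, complete CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions the span covers 2811 bp, which is 937 codons, leaving 936 amino acids once the stop codon is excluded.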


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.291512
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.670925
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCGAAACG GATTGAATTT ATCGACGCGG CTTACGATTG CGATCGTCCC GCTCGTGGTG 
CTGACCGCCG CGAGCGTCGG CTATCTCGGC TATCGGAACC TCGCGACTGT TGCGATCGGG
CGCACACTGG CGGCGATCGA TACCACTGCG ACCTCGCGGG CAATCGAGCT CGCGAGCCTG
GTCAAGAACG TTCGTGCCGA CGTGACGGCT TTTCGCGCTG CCATCGGCCT CGGCGAAATG
ACAACACTCA GTCGCAATCC TTCGCTGCAG ACGACCCGCG GCTGGACGCT AGCAGAATGG
AGCGCCGGGG TCGGACAACG GCTCGCTGGC GAACTCGAAG CCAAGCCCGA TCTCCTCAAG
TACCGCCTGA TCGGATTGGC GGATGGAGGC CGCGAACTCG TTCGGGTCGA ACGTCAGCGC
AACGGCGCCG TTCGCGTCGT TCCGGACGAA GAGTTGCAGC GCGCCAGCGA GCGCGATTTA
TTTGCGCAGG CGATCAACGT TGCCGAGGGA GAGAGCATCG TCTCGCCGGT CGAACTCGAT
CAGGATCATG GCGCGACGAC AAAGCCACAT GTGCCGGTGA TCCGCGTCTC GGCGCCGGTG
TTTGGGCCGG ACGGAACGCA GTTCGGTTTG ATCGTCGCCC ATATCGATCT GCGCGCCGCC
TTCGACCGGG TGACCGGTCT CGCGCAGCAG GGGCGGGTCG TCTACGTGAT CAACAACAAC
GGCGACTATC TGCTCCATCC CGACAAGACG CGCGAGTTCG GCTTCGAACT GGGCAGGCCG
GCCCGCATCC AGGACGACTT CCCCGCTCTC GTTGAAGCGA TTGCCAAGAA CCGGGAACGA
ACGTCGATTG TCGAGGATCG CAACGGAACG CCGTTCGGCG TGGCGCTCGA ACATGCCGAT
GGCGTGGCAC TGTCGATCGT CGAGACCATC CCACAGCGGA TCATTCTCGA CGCGATCATG
ACGGCTTGGC TGAGTTCCAC CTTACTCGGC TGCGCTTTCG CGGTGCTGAT CGCGATCGGG
CTCGCGCTGG TCATGGCCCG CACCATGACC AGACCGCTGT CGCAGATGAC CGCCGCGGTC
TCGTCCTTCG CAGATGATCG GCCGATCGAC CTACCGCTCG ACACCGGCGG CGAAATCGGT
GTCCTGGCGC GTGCTTTTCA GAAAATGGCG CTCGATTCGC GTGACAAAAC CGCGGCGATC
CGCCGCGAGA AGGAGATTTT CGAGCGGATC ATGAACGCGA TGGCCGAGGC GGTCCTGCTG
ATCGACAGGA AGGGACAGGT CATCTACGAG AGTCCCGGCG CAGTGAAGCT GAGGTCGCCG
ACGCCCGGCC GCCCGGTGCG GCCCTGGGCA GAGGCGATCG ACTCCTTCCT CGAAGACGGG
GTGACGCCGC TGCCTCCGGA TCGGCGCCCA GGCCAGCGCG CTTTGCAAGG CGAAACCGTC
GACCAGATCG AACTGGTCCT GCACGTTCGC GACGCCGGCC GCAACGTCGA GGTCATCGGC
AACGCCCAGC CTATCCGGAA CGCCGCAGGC CGGATCAACG GCGCGGTCGT CGTTTACAAG
GACGTCACCG AATTGAAGGA AGCCGAGCGC CGACTGCATC AGGCGCAGAA GCTGGAGGCG
ATCGGTCAGC TCACCGGCGG CGTCGCGCAC GACTTCAACA ACATGCTGAC CGTCATCAGC
GGGACCGCAG AGATCCTGAT TGAAGAACTC ACCGACCGAC CGAACCTCAG CAACATCGCC
AAGATGATCG AACAAGCTGC AGAGCGTGGC GCCGACCTCA CGCGGCAGTT GCTCGCCTTC
GCTCGCAAAC AGCCGCTGCA GCCGCGCAAT GTCGACGTCA ACGCGATCGT TCTCGAAACC
GAGCAATTGC TGCGGGCGAC CATCGGCGAA CACATCCAGG TCGACGTCCG GCTGGAGCAG
GACGTTGATG CGGCGCGGAT CGACCCGTCG CAGCTCTCAT CGGCACTGCT CAACCTCGCG
GTGAATGCGC GCGACGCGAT GCCGATCAGC GGCAAGCTGC TGCTCGAAAC CGGCGGCGTG
GTGCTCGACA ACGACTACGC CCAGCAAAAT CCAGACGTGC GCCCCGGGCG CTACGTGATG
ATCGCGGTCA GCGACACCGG CACCGGCATT CCCGCGGAGA TGCGCGATAA GGTGTTCGAG
CCGTTCTTCA CCACGAAGAG CCTTGGCAAC GGCACCGGCC TCGGGCTCAG CATCGTGTAC
GGTTTCGTCA AACAGTCGGG CGGCCACGTC AAGATCTACA GCGAGGAAAA TCAGGGCACC
ACGATCAAGC TGTATCTTCC GCGCACCGAC GCCGACATCG ACGGCGCGCC GATCGCAGCG
CCCGTTGTGG GCGGTAGCGA GACCATCTTG CTGGTGGAGG ATGACGAACT GGTCCGCAAA
TTCGCGATCG CCCAGCTCCA GGGCCTCGGC TATCGCACCA TCGCGGTATG CGACGGTCCC
TCGGCTCTGA AAGAGGTCGA GCGCGGCGCC GCGTTCGATT TGCTGTTCAC CGACGTGATC
ATGCCCGGCG GGCTGAATGG CCCGCAACTC GCCGAAGCGG TCGCGCGGAT CAGGCCGGTC
CGGGTGCTGT TCACGTCCGG CTACACCGAG AACGCGATCT TGCATCATGG CCGGCTCGAT
CCCGGTGCGC TGCTGTTGAG CAAGCCGTAT CGCCGGTCGG ATCTGGCGCG GATGGTGCGC
GCCGCTCTCG ATCAGGAATA CTACGTTCCC GCCGAGCCGT CGGCCTGCGC GGTCGCGGCT
AAATCACCCG ACCAGCTTTG GCCCAATACG GATCGCGTAG CCGGCGCTTG A
 
Protein sequence
MRNGLNLSTR LTIAIVPLVV LTAASVGYLG YRNLATVAIG RTLAAIDTTA TSRAIELASL 
VKNVRADVTA FRAAIGLGEM TTLSRNPSLQ TTRGWTLAEW SAGVGQRLAG ELEAKPDLLK
YRLIGLADGG RELVRVERQR NGAVRVVPDE ELQRASERDL FAQAINVAEG ESIVSPVELD
QDHGATTKPH VPVIRVSAPV FGPDGTQFGL IVAHIDLRAA FDRVTGLAQQ GRVVYVINNN
GDYLLHPDKT REFGFELGRP ARIQDDFPAL VEAIAKNRER TSIVEDRNGT PFGVALEHAD
GVALSIVETI PQRIILDAIM TAWLSSTLLG CAFAVLIAIG LALVMARTMT RPLSQMTAAV
SSFADDRPID LPLDTGGEIG VLARAFQKMA LDSRDKTAAI RREKEIFERI MNAMAEAVLL
IDRKGQVIYE SPGAVKLRSP TPGRPVRPWA EAIDSFLEDG VTPLPPDRRP GQRALQGETV
DQIELVLHVR DAGRNVEVIG NAQPIRNAAG RINGAVVVYK DVTELKEAER RLHQAQKLEA
IGQLTGGVAH DFNNMLTVIS GTAEILIEEL TDRPNLSNIA KMIEQAAERG ADLTRQLLAF
ARKQPLQPRN VDVNAIVLET EQLLRATIGE HIQVDVRLEQ DVDAARIDPS QLSSALLNLA
VNARDAMPIS GKLLLETGGV VLDNDYAQQN PDVRPGRYVM IAVSDTGTGI PAEMRDKVFE
PFFTTKSLGN GTGLGLSIVY GFVKQSGGHV KIYSEENQGT TIKLYLPRTD ADIDGAPIAA
PVVGGSETIL LVEDDELVRK FAIAQLQGLG YRTIAVCDGP SALKEVERGA AFDLLFTDVI
MPGGLNGPQL AEAVARIRPV RVLFTSGYTE NAILHHGRLD PGALLLSKPY RRSDLARMVR
AALDQEYYVP AEPSACAVAA KSPDQLWPNT DRVAGA
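
The sequences above are printed in blocks of 10 residues, 60 per line, so they need to be joined before any downstream use. The following is a rough sketch of how that could be done, and of how a GC fraction comparable to the reported GC content could be computed from the cleaned gene sequence. The helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of any tool associated with this record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (whitespace ignored)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "TTGCGAAACG GATTGAATTT ATCGACGCGG CTTACGATTG CGATCGTCCC GCTCGTGGTG"
first_line_length = len(clean_sequence(first_line))
print(first_line_length)
```

Passing the full gene sequence to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the GC content field.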