Gene RPB_3596 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3596 
Symbol 
ID3911398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4123837 
End bp4126617 
Gene Length2781 bp 
Protein Length926 aa 
Translation table11 
GC content67% 
IMG OID637885498 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_487202 
Protein GI86750706 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCCCGAC GATTGAACTT ATCCACCCGG CTCACCCTCG CCATCGTGCC TCTCGTGGCG 
CTGACCGCGG CGACCGTCGG TTATCTGGGG TACCGAAACC TCGCGGCCAT TGCGATCGAA
CGCACGCTGG CCGGGCTGGA TGCTACTGCG CGGTCGCGAG CTGTGGAACT CGCGAGCCAG
ATTCGGAACG TCAGCGCCGA CGTCGCGAGC TTTCGCACGA TGATCGGCCT GGGCGAATTG
ATCGCGCTCA GCCACGACGC GACGCTCCGG ACCGCCGGCG GCCGGACGCT GGCGGAGTGG
CGCGCGCGGA TCGAGCAGCG ATTCGCCGAC GAACTCGGAG CGAAAGCCTA TCTGATCCGA
TACCGCCTGA TCGGAGCGAG CAACGACGGC CGCGAGATCA TCCGGGTCGA GCGACGGAAC
GATACGGTCC GGATTGTTCC GGACGACGAG TTGCGCGGGC AGAGCGAATA CGCCTTCTTC
GAACAGGCCA TCCGAGCGGC CGGAAGCGAG GTGGTGGTCT CGCCGGTGGA ACTCGCCCGG
ACCGACGGCG CGATCCTGCA ACCTCCGATG CCGCTGATCC GCGTGTCGGC CGCGCTGTTT
GCGACCGACG GTACGATGTT CGGGCTGATC ATCGCCGATG TCGGTCTGCG CCCCGCCTTC
GCGACGGCCA CCGCAAAAAC GCGAAAAGGC CGCACCGTCT TCATCATCAA CGACCGCGGC
GACTACCTGC TGCATCCCGA CAAGTCTCGC GAGTTCGGTT TCGAATTCGA TCGGCCCGCC
CGCATCCAGG ACGACTTTCC AAGCCTCGCC ACCGCGATCA CCAGCGGCAA GGATCAGACG
GCGATCGTCG AGGACCGCAA CGGCGTGCCG ATCGGGGTTG CGATCGACCG TGTCGAAGGG
GCGCCCCTGG CCATCGTCGA GACCGTGCCG CAGCAATTCA TTCTCGACGA CATCATGACC
GCGTGGCTGG ATTCGACCTT GACCGGCGGC TCGGTCGCCG TGCTGACCGC CGTTCTGCTG
GGTTTCGTCA TGGCCCGGAC CCTGATCAAG CCGCTGTCGC AGATGACGAA GGCGGTGGCG
GGATTTGCCG AGGACGCGCC GCCGAAGATG CCGGTCGCGG CCAGTGGCGA AATCGGCGTG
CTGGCGCGGG CGTTCGACAC CATGGTGCAG GACGTGCAGG CGAAGACCGC CGCGATCCGG
CACGAGAAGG AGCTGTTCGA GAGCATCATG ACCACGATGG CCGAGTGTGT CGTGCTGATC
GACCGCAACG GCGAGGCCAT CTATCAGAAC CGCGCCAACC GGGAACTGCT CAGCGCACTC
GATATCAGGG TCGACCAGTG GCAGGAGCTC TACGACATCT ACACGCCGGA CGGCTCGACC
CGGCTGTCCG CCGACCATTG GCCCTCCGCC CGCGTCCTGC GCGGCGAGAC CGTCGATAAT
TACGAGATCG TCTGCCGAAG GCGCGATTCC GGCAAGACGG TTCATCTGAT GGGGAGCGCG
CGGCCGTTGT GGGAAGCCGC GGGCACGCAA ACCGGAGCGG TCGTGGTGTT CCGCGACGTC
ACCGAGATGC GGGCGACCGA GCACCGGCTG CATCAGTCAC AGAAGCTGGA AGCGATCGGC
CAGCTCACCG GCGGCGTCGC GCACGACTTC AACAACATGC TGACGGTGAT CAACGGCACC
GCCGAGATCC TGCTCGACGA ACTTGCCGAC CGGCCGGACC TCTGCAGCAT CGCCAGGATG
ATCGAGCAGG CCGCCGGGCG CGGCGCCGAC CTGACGCGGC AACTGCTCGC CTTCGCCCGC
AGGCAGCCGC TGCAACCGCG CAATATCGAC GTCAACGCCA TCGTGCTGAA CACCCAGCAA
TTGCTGAAAG CGACGATCGG CGAACACATC GACGTCGAAG TCAGGCTGGC GCAGGACGTC
GATGCGGCGC GGGTCGATCC GTCGCAACTC TCGTCGGCGC TGCTCAACCT CGCGGTGAAT
GCGCGCGACG CGATGCCGAA CGGCGGCAAG CTGATGCTCG AAACCGCCGA CGTGGTGCTC
GACGCCGCCT ACGGGCAGCA CAATCCCGAC GTCCAGCCCG GCCGCTACGT GATGATCGCG
GTCAGCGACA CCGGCACAGG AATTCCAGCC GAGTTGTGCG ACAAGGTGTT CGAGCCGTTC
TTCACGACCA AGAGCGCCGG CCAGGGCACC GGCCTCGGCC TCAGCATGGT CTATGGCTTC
GTCAAGCAAT CGGGCGGGCA CATCAACATC TACAGCGAGG AGGGCCACGG CACCACGCTC
AAGCTGTATC TGCCGCAGGC CGATTCCGAC CCGGCCGTCG ACAGCGCACC GGACGCCGGC
CCGGCGACCG AGGGCGGCAG CGAAACCATC CTGCTGGTCG AGGACGACGA GTTGGTGCGC
AAATTCGCGA TCGCCCAGCT CGCGGGTCTC GGTTATCGCA CCATCGCGAT GTGCGACGGC
CAGGCGGCGC TGCGTGAGGC GGAGCGCGGC ACCGCGTTCG ATCTGCTGTT CACCGACGTG
ATCATGCCGG GCGGCCTGAA CGGCCCGCAA CTCGCCGACG CGATCGCCCG GGTCCGGCCG
GTGCGGGTGC TGTACACCTC GGGCTACACC GAGAACGCGA TCGTGCATCA CGACCGGCTC
GACAGCGGCG CGCTGCTGCT GACCAAGCCG TATCGCAGGT CGGATCTGGC CCGGATGGTC
CGCGCCGCAC TCGGCAAGGA CGTGCACGTC CCGCCGACCG GGATCGCGGC GGCACCCTCG
TCGCGCGCCA GCGCCCGTTA G
 
Protein sequence
MPRRLNLSTR LTLAIVPLVA LTAATVGYLG YRNLAAIAIE RTLAGLDATA RSRAVELASQ 
IRNVSADVAS FRTMIGLGEL IALSHDATLR TAGGRTLAEW RARIEQRFAD ELGAKAYLIR
YRLIGASNDG REIIRVERRN DTVRIVPDDE LRGQSEYAFF EQAIRAAGSE VVVSPVELAR
TDGAILQPPM PLIRVSAALF ATDGTMFGLI IADVGLRPAF ATATAKTRKG RTVFIINDRG
DYLLHPDKSR EFGFEFDRPA RIQDDFPSLA TAITSGKDQT AIVEDRNGVP IGVAIDRVEG
APLAIVETVP QQFILDDIMT AWLDSTLTGG SVAVLTAVLL GFVMARTLIK PLSQMTKAVA
GFAEDAPPKM PVAASGEIGV LARAFDTMVQ DVQAKTAAIR HEKELFESIM TTMAECVVLI
DRNGEAIYQN RANRELLSAL DIRVDQWQEL YDIYTPDGST RLSADHWPSA RVLRGETVDN
YEIVCRRRDS GKTVHLMGSA RPLWEAAGTQ TGAVVVFRDV TEMRATEHRL HQSQKLEAIG
QLTGGVAHDF NNMLTVINGT AEILLDELAD RPDLCSIARM IEQAAGRGAD LTRQLLAFAR
RQPLQPRNID VNAIVLNTQQ LLKATIGEHI DVEVRLAQDV DAARVDPSQL SSALLNLAVN
ARDAMPNGGK LMLETADVVL DAAYGQHNPD VQPGRYVMIA VSDTGTGIPA ELCDKVFEPF
FTTKSAGQGT GLGLSMVYGF VKQSGGHINI YSEEGHGTTL KLYLPQADSD PAVDSAPDAG
PATEGGSETI LLVEDDELVR KFAIAQLAGL GYRTIAMCDG QAALREAERG TAFDLLFTDV
IMPGGLNGPQ LADAIARVRP VRVLYTSGYT ENAIVHHDRL DSGALLLTKP YRRSDLARMV
RAALGKDVHV PPTGIAAAPS SRASAR