Gene RPD_2046 details

Gene Information

Locus tag: RPD_2046
Symbol:
ID: 4022528
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2293042
End bp: 2294634
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 65%
IMG OID: 637962239
Product: periplasmic sensor signal transduction histidine kinase
Protein accession: YP_569182
Protein GI: 91976523
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
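
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon; the record states neither explicitly. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2293042
end_bp = 2294634
gene_length_bp = 1593
protein_length_aa = 530

# Assumption: coordinates are 1-based and inclusive on both ends.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which adds no residue.
codons = span // 3
residues = codons - 1

print(span, codons, residues)  # 1593 531 530
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.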


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.227598
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.375408
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGAGT CCGCCGAAAA GCCCGAAGTG TTCCAGCTTC CGGCCGAGAG CCCGGTGGTC 
GCCCCGGCAC GTAACCGGCG CGCTGCCGCG CAGCGCGTCC GCGAAGCTCG CGATCGGCTG
ACCTCGACCA GCGGCACCCG CCCGGCCTTC GATCACGAAC TCGTCCGGCA ATACGCGCAG
ACCCGGCTGT CGGCGTCCTT CGTCATCATG CTGCTGGTGG TCGTCACCGG CGCGCTGTTC
GGAGTCTGGC TCGATCCGAT CACTGCCGGC GCCTGGACCG TCGGGATGCT GTGCGTCCAT
GCCGTGGTGA TCCGCAACTG CAACCGCTTC CTGCAGGAAC CGGCCTCGCC GCCCCGCACT
CGCGGCTGGC GGCGGCGCTT CGTAGCGCTC GATCTGCTGT ATGGCCTCGC CTGGACCACG
ATCCTGGTGC ACCCTCTCGG GCTCACTGTC GTCTCCAACA CGCTGCTGAT GTTCTTGATG
CTGCTGGTGG TCGCGGTCTC CAGCATGCTG GCCGCGAGCC TGCCGATCGC GGCGCTGGCG
ACGACGGTGC CGGTGACCTG CGCGATCGCG TTGAACTTCA TCCTCGGCGG CACCTTCGAC
AGCTACGTGC TGGCGGCGCT CACGGTCGCG GCCGAGGGCT ATTTCGCCCT GCTCGCCCAT
CGCCTGCACT CGACGACGCT GGCGACCCTG GAAGCGCGCG CTGAAAAGGA CGCGCTGATC
GGCGAACTCG AACAGGCCAA GGCGATTTCG GACGAGGCGC GGCACCGCGC CGAAGCCGCG
AACGTCGCCA AGTCGCGGTT CCTCGCGCAG ATGAGTCACG AACTGCGCAC GCCGCTCAAC
GCCATTCTCG GCTTTTCCGA AGTGATGAAG AGCGAAATTT TCGGCGCGCA CGCCGTGCCG
GTCTACAAGG ACTACTCCGC GGACATCCAC AATTCCGGCG TCCACCTGCT CAACCTGATC
AACGAAATTC TCGACCTGTC GCGGATCGAG GCAGGCCGCT ACGAGCTCAA CGAGGAAGCG
ATCTCACTTG TCCATGTGGT GAGCGACTGT CACCACCTGC TGAAGCTGCG CGCCTCCAGC
CGCGGCATCA CTATCCACGA AGTTTTCGAA CAAGGCATGC CGCGGATCTG GGGCGACGAG
CGCGCCGTGC GTCAGGTCGT GCTCAATCTG CTGTCGAACG CGATCAAGTT TACGCCGCAG
GGCGGCGAAA TCTGGCTCAA GGTCGGCTGG ACCGCGTCGG GCGGTCAATA TCTCAGCGTC
AAGGACACCG GCTCGGGCAT CCCCGAGGAG GAAATCCCGA TCGTGCTGGC CTCGTTTGGC
CAAGGCTCCA ACTCGATCAA ATCTGCCGAA CAGGGAGCCG GCCTCGGACT GCCGATCGCC
AAGAGCTTGG TGGACATGCA CGGCGGCACC TTCACGCTGA AGTCGAAATT GCGGATCAGC
ACCGAAGTGA TCGTGACCTT CCCGCCAGAG CGTGTGATGT CGGCGCTGGC GCCGATGGCG
GACGAAGCGC CGCTGCAGCC GAGCCTCGTC GAGACCGACG AGCGCAGCCG TGTTCGCGGC
AAGCCGATCA TGAACGCCGG CACCGGGCTG TAG
 
Protein sequence
MSESAEKPEV FQLPAESPVV APARNRRAAA QRVREARDRL TSTSGTRPAF DHELVRQYAQ 
TRLSASFVIM LLVVVTGALF GVWLDPITAG AWTVGMLCVH AVVIRNCNRF LQEPASPPRT
RGWRRRFVAL DLLYGLAWTT ILVHPLGLTV VSNTLLMFLM LLVVAVSSML AASLPIAALA
TTVPVTCAIA LNFILGGTFD SYVLAALTVA AEGYFALLAH RLHSTTLATL EARAEKDALI
GELEQAKAIS DEARHRAEAA NVAKSRFLAQ MSHELRTPLN AILGFSEVMK SEIFGAHAVP
VYKDYSADIH NSGVHLLNLI NEILDLSRIE AGRYELNEEA ISLVHVVSDC HHLLKLRASS
RGITIHEVFE QGMPRIWGDE RAVRQVVLNL LSNAIKFTPQ GGEIWLKVGW TASGGQYLSV
KDTGSGIPEE EIPIVLASFG QGSNSIKSAE QGAGLGLPIA KSLVDMHGGT FTLKSKLRIS
TEVIVTFPPE RVMSALAPMA DEAPLQPSLV ETDERSRVRG KPIMNAGTGL
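
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline. It uses the amino-acid assignments of NCBI translation table 11 (the table named in the record), which are the same as those of the standard code. The helper names `clean`, `translate_cds` and `gc_fraction` are hypothetical. One simplification is labeled in the code: alternative start codons such as GTG or TTG are not converted to M, which is harmless here because this gene starts with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino-acid string for table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the blanks and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Simplification: alternative start codons are not mapped to M.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last display lines of the gene sequence above.
first_line = "ATGAGTGAGT CCGCCGAAAA GCCCGAAGTG TTCCAGCTTC CGGCCGAGAG CCCGGTGGTC"
last_line = "AAGCCGATCA TGAACGCCGG CACCGGGCTG TAG"

print(translate_cds(first_line))  # MSESAEKPEVFQLPAESPVV
print(translate_cds(last_line))   # KPIMNAGTGL
```

The two outputs agree with the first 20 and last 10 residues of the protein sequence listed above. The last line can be translated on its own because the 26 lines before it hold 1560 bases, a multiple of three. Passing the full gene sequence to `gc_fraction` gives a value to compare against the listed GC content.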