Gene RPD_0119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0119 
Symbol 
ID4020575 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp135398 
End bp138127 
Gene Length2730 bp 
Protein Length909 aa 
Translation table11 
GC content66% 
IMG OID637960296 
Productsensor histidine kinase with PAS/PAC 
Protein accessionYP_567260 
Protein GI91974601 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.553437 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTTGT CGATGCGTCT GACCTTGGTG ATGACGACGC TGGTGCTGTG CGCGGTCGCA 
GCGGTCGGCA TTCTGACCTA CTACAATGTC GGTCGTGCGG TGGTGCCGGC CGGGCTCACG
CGTCTGGCTG ACCGGGCCGA GGCGCGGCTC GGAGCTTTCG ACGGCATCCT CCGGCTGCTG
CAACTCGAAA TCCTCGCGGC GCGGACGTTC CCGCCTCATC AAGGCCTCGT CCGTGCCCGC
ATCAACGGCG GCCTCGATCC ACAGGACGAT CTCACCGAGG CGAGATGGCT CCGCCGTCTT
GGCGACGCCT ATCTCGGCCA GATGAGCGTG AAATCCGACA TCGTAAGGTT CAGCCTGGTC
AGCGCCGACG GTGGTCGCGA ATTGCTTCGC GTCGATCGCA GCGGTCCCGG CGGCGACATC
CGTGTCGTGC CGGATGACAA GCTCGAGCGG GTCGGCCAGG AGCTGTTCGA CCGGACCATC
GCGCTGTCCG AGGGAGAGGT CTACACCTCG CCGATTCACG CCGGGCCCGG TAACTCCGGC
GACGACGATC TGGCGCCGAT GGTCTCGATC GCAACGCCGC TGCGGATGCC AGAGGGCGAA
CCATTCGGCA TGCTCGTGCT CGATTTCGAT CTGCGGCCGA CCTTCGAGCG GATTCGGGCC
TCGCCCGACA ACAACACCAA GGCTTACTTT GTCGATGCCG GCGGTCCCTA TCTGCTCGAT
CTGCTCGACG GTCGGGTGAT TCCGTCGAGA CCACGCGGCC AGTGGCAGGA CGACTATCCC
GATCTGGCGA AGGCGCTGGG CGACAAGCCT GGCGCCGCCA CGGTTCTGAC CGCGCCCGAT
GGCGCGCGGG TCGCCGCCGC GATCGCGATC GCGCCGCTGG TCGGCAGACT GCGCGTCGGC
GTGATCGAGA CCGAGGCCTT CGAGCGCATC ATCGCGCCGG CAACCGCGCT GCGACAGACC
GTTGTGACCG TCGCGCTGTT TGCTGCGGGG GTGGCGGTGC TGCTGTCGGC GCTGCTGTCG
CGCTCGCTCG CCAAGCCGTT GGTGCAGATT ACCGCCGCGG TCGATGATTT CGCCAGCACC
GGACGCGTCG CCATGCCGCG CGGGCTCAGC GGCGAGACCA AGACGCTCGC CGCCGCGTTC
GAGCACATGG CCGCCGAGAT CGACACCACC ACCACGGCGC TGCGCGCCAA GTCGGAGGCG
CTCGACAAGA TCGTCGCCAG CATGGCCGAT GCGGTATTGA TGCTGGACGC CGAAGGTCGG
CGGGTGTTCG CCAATCCGAC CTTCTTCGCC CTGTTCGGCG ATATCGCCGA GATCGGCTCG
GAGCGTTGGC GACAACATTA CCAAGGTCTC CGGCCGGACG GCGTGACGCC GATTCCGGAC
GACGAGACGC CGTCCGCTCG CGCCCGGCGC GGTGAGAGCT TCGACAACCT CGACGTGGTG
CTACGTCGTG TCGGCCATTC GCAACTGGTA CATCTCGCCG CCAGCGGCCG GCCGATCGAA
ACCGCCAGCG GCGCGTTCGA CGGCGCCGTG GTGGTCTATC GTAACGTCAC GGCGTTGAAG
GAAACCGAGC GGCAACTTCG CCAGGCGCAA AAAATGCAGG CGATCGGTCA GCTGACCGGC
GGCGTCGCGC ATGATCTCAA CAACATCCTC ACCGTGCTGA CCGGCGGCAT CGAGATCCTC
GCGGATGGCG TGAGCGATCG GCCCGCGCTC AAGGACGTCG CCGTGATGGT CGATCAGGCG
GTGTCGCGCG CCAGCGACCT GACCAACGGA CTGCTGTCGT TCGCGCGCAA GCAGCCGTTG
CAGCCGCGCA GCATCGACGT CAACGCGCTG ATGCAGGAAA CCGCCCGGCT GCTGCGCGCG
ACCTTCGGCT CCCATATCGA GATCGCGTTC GAGCCGACGC CCGGTCTGCG GTCGGCGCTG
GCCGACCCGT CGCAACTCTC CGGCGCGCTG ATCAATCTTG CGATCAATGC CCGCGACGCG
ATGCCGGGCG GCGGCAAACT GCTGCTGGAG GCGGGCAATA TCGACCTCGA CCAGGCCTAT
GCCGACCACA ACGACGAGGT TGCTGCCGGC CGCTACGTGG TGCTGATGGT CACCGACACC
GGAACCGGCA TTCCGGCGGC GATCCGTGAT CGGGTGTTCG AGCCGTTCTT CACCACCAAG
GCGCTCGGCG AAGGCACCGG GCTCGGCCTC AGCATGGTCT ACGGCTTCGT CAAACAGTCC
GGCGGTCACA TCGCGATCTA CAGCGAAGAA GGCGTCGGCA CCACCATCAG GCTCTATCTG
CCCAGCGCTG ATCCGAAGAA CAGTCCGGGT GACGCCGCGG CGCCGCAACA GGCGCAAGGC
GGTCGCGAAT CGATTCTGCT GATCGAAGAC GATGTTCTGG TGCGTAGCTA TGTGGTGACC
GATCTTGCCG CGCTCGGCTA TACCGTTCAT GCCGCCGCCA CGGCTGCGCA GGCGATGGCG
ATGGTCTATG ACGAGCTCGA ATTCGATCTG CTATTCACCG ACGTGATGCT GGCGGGCAGC
ATCAATGGCT ACCAGCTTGC CGACGAATTG CGCAAGTATC GGCCGGACCT CAAGGTCCTG
CTCACCTCGG GCTACACCGG GAACATGCTC AGGCTGCAAG GGGGCCACGA GGACGGGACG
CCGTTCCTTG AAAAACCCTA TCGGCGAGCC GAACTGGCGC GAATGTTGCG GCTGGCGCTG
GACGGCAAGG CGCCGTCGTC GCCGACGTAG
 
Protein sequence
MRLSMRLTLV MTTLVLCAVA AVGILTYYNV GRAVVPAGLT RLADRAEARL GAFDGILRLL 
QLEILAARTF PPHQGLVRAR INGGLDPQDD LTEARWLRRL GDAYLGQMSV KSDIVRFSLV
SADGGRELLR VDRSGPGGDI RVVPDDKLER VGQELFDRTI ALSEGEVYTS PIHAGPGNSG
DDDLAPMVSI ATPLRMPEGE PFGMLVLDFD LRPTFERIRA SPDNNTKAYF VDAGGPYLLD
LLDGRVIPSR PRGQWQDDYP DLAKALGDKP GAATVLTAPD GARVAAAIAI APLVGRLRVG
VIETEAFERI IAPATALRQT VVTVALFAAG VAVLLSALLS RSLAKPLVQI TAAVDDFAST
GRVAMPRGLS GETKTLAAAF EHMAAEIDTT TTALRAKSEA LDKIVASMAD AVLMLDAEGR
RVFANPTFFA LFGDIAEIGS ERWRQHYQGL RPDGVTPIPD DETPSARARR GESFDNLDVV
LRRVGHSQLV HLAASGRPIE TASGAFDGAV VVYRNVTALK ETERQLRQAQ KMQAIGQLTG
GVAHDLNNIL TVLTGGIEIL ADGVSDRPAL KDVAVMVDQA VSRASDLTNG LLSFARKQPL
QPRSIDVNAL MQETARLLRA TFGSHIEIAF EPTPGLRSAL ADPSQLSGAL INLAINARDA
MPGGGKLLLE AGNIDLDQAY ADHNDEVAAG RYVVLMVTDT GTGIPAAIRD RVFEPFFTTK
ALGEGTGLGL SMVYGFVKQS GGHIAIYSEE GVGTTIRLYL PSADPKNSPG DAAAPQQAQG
GRESILLIED DVLVRSYVVT DLAALGYTVH AAATAAQAMA MVYDELEFDL LFTDVMLAGS
INGYQLADEL RKYRPDLKVL LTSGYTGNML RLQGGHEDGT PFLEKPYRRA ELARMLRLAL
DGKAPSSPT