Gene RPB_4224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4224 
Symbol 
ID3912032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4798798 
End bp4801431 
Gene Length2634 bp 
Protein Length877 aa 
Translation table11 
GC content66% 
IMG OID637886127 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_487826 
Protein GI86751330 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.249922 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGGCC TGCGGCAGGT GATGATGACG GCACGGGGAA CGGGGGGTGA CTGGTCTTCT 
CGGTTCGCCT TCTGGTTCGG CGGCCGCGGC GCCGGCACCC TGACACCTGC AAATGTCGAC
GAACGCGAGA TGCGTCTCAT CCGCGCCGCG CAGATCCACA GCGTCAGCCG GCTGGTGCCG
GTGACGATGT CGATCAACAT GATCAACGCG ACGCTGGTCC TGGTGGCGTT CTGGGACAAC
AACTCCCGCT TCTTCCTGCT GGCGTGGTTC GGCTCGATCG GCATCGCGGC GGCGCTGGCG
ATGCGGTCCT GGCTCAAGAC GCGGCACAAT CCGCCGCGCG AGGCCTCCGC GAATGCGATC
CGGCGGATGA GCGCGCAGGC GCTGATGCTC GGCCTGATCT GGGGCGCAAT GCCGATTGTG
CTGTTTCCCA ACGCCGAACC GACCGATCAG TTGATCATCG GCTGCCTTGT GACGGGCATG
ATGTCCGGCG GCGCGTTTGC GCTGTCCACG GTGCCGCGCG CGGGCCTTGC CTATACCTGG
TCGATGGTGC TCGGCTCTGC GATCACGCTG ATGCTGTGCA CCGGCACCGG CTATCAGATC
ACCATGATTT TTCTGATGCT CTACGCCGTG TTCATGTCGC GGAATCTGGT CTCGCATGGC
GAGATGTTCT TCGACAATCT CTGCGCGCAG TTCGAGCTGG AGCGCAACAC CGAGATCATC
TCGTTGCTGC TCAAGGACTT TCAGGCCAAC GCCAGCGACT GGCTGTGGCA GACCGATTCC
GAGTGCCGGC TGGTGCACGT GCCCGACCGC TTCGTCGAAG TCGCGCGTCT TCCCGTCAAT
ATCCTGCGCG GCGCACAACT CTCCGACGTG CTCGGCATGC TGTGCCCGGA AGACGGCCGC
TGTGCGGCCG CAGTCGCCGC CAAGATGGCG TTGCGCGAGC CGATGCACGA GCTCGTCGTG
CATGTGATGA TCGGCGGCAC CCAGCGCCTG TGGTCGCTGA CGGCGCGGCC GATGCTCGAT
CACAATGGCG AGTTCACCGG CTATCGCGGC GTCGGGCGCG ACGTCACCGA GCGGTGGCGG
GCCGAGCAGG CCGAGGCCGA GAACCGGGCG AAGTCCGACT TCCTCGCGAT GATGAGCCAC
GAAATCCGCA CGCCGATGAA CGGCGTGCTC GGGCTCGCCA ATTCGCTGCT CGAGACCAAG
CTCGATCCGG AGCAGCAGCA CGCGGTCACG ACGATCCGCG ATTCCGGCGA CAATCTGCTG
CGGATCCTCA ACGACATCCT CGACCTGTCG AAGCTCGAAG CCGGCCGCAT CGAGTTCGAA
CAAGTCAACT TCTCGCCGTC GGTGCTGGTC GATGCGGTGC GCTCCATCGT CGAGCCGAAT
GCGCGCGCCA AAGGACTGAC GCTGAAGATC GATGTCGATC CGCGGCTGCC GCCGGCTCTG
ACCGGCGACG CCCAGCGCAT TCGACAAGTG CTGCTCAACC TCGCCTTCAA CGCGGTCAAA
TTCACCGATC GGGGGGGCGT CACCATCGTG CTCACCTGCG TGAGGCGGGA CGATGCCTAC
GCCACGATCG AGTGGCAGGT CACCGACACC GGCATCGGAA TTCCGCTCGA TCGGGTCGGC
TCGCTGTTCA CCGATTTCGC GCAGGGCGAC GTGTCGATCA ATCGCCGCTT CGGGGGCACG
GGTCTCGGGC TCGCGATCAG CCGGCGCATC GTCGAGCAGA TGGGCGGCGC CATCGATGTA
ACCTCGAAGC CGGGTGAGGG CTCCACCTTC CGGTTCAGCC TCGACCTGCC GTGGACCAAT
GCGCTGATCT CGGACTATCG GCTCGACCGC GTCGGCAACG ACGATCTGCG AACCCGGATC
GCGATGCTCG GGCATCCGCT GCGGGTGCTG ATCGCCGAAG ACGATGCCAC CAACCAGATG
GTGGTCCTGA AGATGCTTCA GGAATTCGTC GCCGAAACGC GGGTCGTGTC CGATGGGGCC
GAAGCGTTGC GCGCGTTGGC CGAGGAGGAA TTCGACGTCG TTCTGATGGA TGTGCGAATG
CCGACGATGG ACGGCCTTGC CGCGACCCGT GCGATCCGTG CCCAGGGCGG TGCGTTCGCC
AAGCTGCCGA TCATCGCCCT GACGGCGAAT GCGTTCCCGG ACGACATCAG GACCTGCCGC
GAAGCCGGCA TGAGCGACTT TCTCGCCAAG CCGCTGCGCA AGCCGGCGCT GGTGGCCGCC
GTGCTGCGCG CTCTGCGTGG TGGCGGCGGC GTCGCGACCT TCGGTCCGCC GGCGCCTCTG
GCTCCGCCTC CGCCGCTCGA TCTGAATATC CTGACCGAGC TGACCGAGGA GATCGGCCGC
GAACAGGTCA ACGAAATGGT GGCCCTGTTC TTCAGCGAAA CCGAACGCCG GATCGCCTTG
TTTCGCGGGT TCGGCGATGC CATCGACCGC GACGTGCTGG CGATCGAGGC CCACGCGATG
AAGGGTGGCG CCGCGACGCT GGGATTTGCC ACCATCGCCG AGGTTGCGCG CGCCATCGAA
CTGGGGGCGA CGATCGTGTC GGTGGAAGCT CTCGAGGCGC AAACCGTTCA ACTCGCCAAG
TCGCTGGCCG AATTGCGCCG CCACTGCGAG GGCAGCTTTC GGCTGGCGAG CTGA
 
Protein sequence
MSGLRQVMMT ARGTGGDWSS RFAFWFGGRG AGTLTPANVD EREMRLIRAA QIHSVSRLVP 
VTMSINMINA TLVLVAFWDN NSRFFLLAWF GSIGIAAALA MRSWLKTRHN PPREASANAI
RRMSAQALML GLIWGAMPIV LFPNAEPTDQ LIIGCLVTGM MSGGAFALST VPRAGLAYTW
SMVLGSAITL MLCTGTGYQI TMIFLMLYAV FMSRNLVSHG EMFFDNLCAQ FELERNTEII
SLLLKDFQAN ASDWLWQTDS ECRLVHVPDR FVEVARLPVN ILRGAQLSDV LGMLCPEDGR
CAAAVAAKMA LREPMHELVV HVMIGGTQRL WSLTARPMLD HNGEFTGYRG VGRDVTERWR
AEQAEAENRA KSDFLAMMSH EIRTPMNGVL GLANSLLETK LDPEQQHAVT TIRDSGDNLL
RILNDILDLS KLEAGRIEFE QVNFSPSVLV DAVRSIVEPN ARAKGLTLKI DVDPRLPPAL
TGDAQRIRQV LLNLAFNAVK FTDRGGVTIV LTCVRRDDAY ATIEWQVTDT GIGIPLDRVG
SLFTDFAQGD VSINRRFGGT GLGLAISRRI VEQMGGAIDV TSKPGEGSTF RFSLDLPWTN
ALISDYRLDR VGNDDLRTRI AMLGHPLRVL IAEDDATNQM VVLKMLQEFV AETRVVSDGA
EALRALAEEE FDVVLMDVRM PTMDGLAATR AIRAQGGAFA KLPIIALTAN AFPDDIRTCR
EAGMSDFLAK PLRKPALVAA VLRALRGGGG VATFGPPAPL APPPPLDLNI LTELTEEIGR
EQVNEMVALF FSETERRIAL FRGFGDAIDR DVLAIEAHAM KGGAATLGFA TIAEVARAIE
LGATIVSVEA LEAQTVQLAK SLAELRRHCE GSFRLAS