Gene RPB_4358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4358 
Symbol 
ID3912173 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4941643 
End bp4942983 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content71% 
IMG OID637886264 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_487956 
Protein GI86751460 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.66153 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGGCT CGCTGCGCCT TCGGCTGTTT CTGGTGCTGT TCGGCGCCAC CGGCATCGTC 
TGGATCGCCG CGGTGATGTG GATCTACACG AGCAGCCAGC GCGAGCTCGA ACACGTGCTC
GACGCGCGGC TGCAGGAGGC GGCCCGCATG GTGGTGTCGC TGGTCGGCAA CATGGAGGGC
CGTGTTCCGG AAGCGCTGCC GGATGTGCCG GCGGCGCCGG CGACGCCAGG CAATTACGAG
CGGCAATTGT CCTGCCAGGT GTGGTCGGTG CAGGGCCGGC TGATCGCGCG CTCGGGCGGT
GCGCCGGAGC AGAGTCTCAG CGATCAGGGG GCGGGATTTT CCGAGCGGCA GATCGACGGC
GAAACCTGGC GGATCTATGC GACGGAGGAC GCCGCCCGCG GCTTTCGCGT GCTGGTCGGC
GACCGGCTCG GCTTGCGCGA GCGGCTCGTC GCCGATCTGA TCAAGGGATT GCTGTGGCCG
GCGCTGCTGA TCGCGCCGCT GCTCGGCCTG CTGATCTGGA TCAGCGTCGG CCGCGGGCTG
CGACCGCTGC AGGCGATTGC GTCGGATCTG GTCGGGCGCG ACGCCGACGA CATGCGGCCG
GTCGATGCCA GCCGCACCCC CTCCGAAGTG ATGCCGCTCG CCCGCGCGCT GAACGCGCTG
TTCGACAAGG TCGCGCTGGC GCGGCGTCAC GAGCGCGAGA TCACCGCCTT CGCGGCGCAC
GAGCTGCGCA GTCCGCTCAC CGGCCTCAAG ACCCAGGCGC AAGTCGCGCT GGCGACCACC
GATCCGGCGG TGGCGCGGGC GGCGCTGCAG CAGATCCTGG TCGCGGTCGA TCGCGCCACC
CGGCTGGTGC GCCAGTTGCT GACGCTGGCG CGGCTCGACG CCCAGGCCGG GACCGACGAT
CACGGCAGCG TCGCGATCCG GCCGCTGATC GACGAGGTGG CGCGGATGAC GCCGCGCCCG
GCCGGCGTCG CGGTGGTGAT CGACGACGAT CTCGCCGGCG CCGACTGGCG CGGCAATCGC
GAATGTCTCG AACTGGCGAT CCGCAATCTG CACGAGAATG CGATCCAGCA CATGCCGGGC
GGCGGCGAGG TGCGCTGGCG CGCCGGCGCG TCGCCGCGAT CGGTGGTGAT CGAGGACAGC
GGTCCCGGCG TGCCGGAGGA CGAATTGCCG AAACTCGGTC AGCGCTTCTT CCGCGGGCGA
AACAAATCGG CGATCGGCAG CGGCCTCGGC CTGACCATCG CGGCGCTGGC GCTGGAGCGC
ACCGGCGCGG AGCTGCAGTT CGGCAACCGC CCCGACCGCC GCGGTTTTCG CGCCGAAATC
GTCCTGCGCC GAACGGTGTG A
 
Protein sequence
MIGSLRLRLF LVLFGATGIV WIAAVMWIYT SSQRELEHVL DARLQEAARM VVSLVGNMEG 
RVPEALPDVP AAPATPGNYE RQLSCQVWSV QGRLIARSGG APEQSLSDQG AGFSERQIDG
ETWRIYATED AARGFRVLVG DRLGLRERLV ADLIKGLLWP ALLIAPLLGL LIWISVGRGL
RPLQAIASDL VGRDADDMRP VDASRTPSEV MPLARALNAL FDKVALARRH EREITAFAAH
ELRSPLTGLK TQAQVALATT DPAVARAALQ QILVAVDRAT RLVRQLLTLA RLDAQAGTDD
HGSVAIRPLI DEVARMTPRP AGVAVVIDDD LAGADWRGNR ECLELAIRNL HENAIQHMPG
GGEVRWRAGA SPRSVVIEDS GPGVPEDELP KLGQRFFRGR NKSAIGSGLG LTIAALALER
TGAELQFGNR PDRRGFRAEI VLRRTV