Gene RPB_4034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4034 
Symbol 
ID3911841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4603290 
End bp4605617 
Gene Length2328 bp 
Protein Length775 aa 
Translation table11 
GC content68% 
IMG OID637885938 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_487638 
Protein GI86751142 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTCGG GCATTGACAA TTCGGCCGAG TTGATAGTCG CAGAATTCGA GGCATTCGGC 
CAGTCCGGAT GCGAGCGCGA AGCGATTCAT CTGCCCGGTT CCATCCAGCC GCACGGCGCC
CTGCTGGCGC TGGCGCCGGA CGATCTGCGG ATCGTCCACG CCAGCGGCGA TACGACGTTG
CTGCCGGGCG CGCCGGCCGG ACCGCTGCCG GGCCGCCACG CCACCGAGAT CCTGTCGTCC
GACCAGATCG CGCGGTTGCG CGCCCTGCTC GGCCCGGATC GCAGGATCGA GCGGCCGCTG
CACGCCTTCG CGTTGACCGC GCCCGACGCC ACGCCGATCG ATGTCGTGGT GCATCAGTCG
TCGGGCCTGC TGGTTCTGGA ATTCGATCCG CGCCGCGAGC CAGCGCCGGA GAATTCGCTG
GCGCTGGTGC AGTCGATGAT CCGGCACGTG CAGCGCGCCG GAACGGTGCA GGCGTTCTGC
GAAGCGATCG TCGCGGAGCT CCGCGCGGTC ACCGGCTTCG AGCGGGTGAT GATCTATCGC
TTCAGCGCCG ACGACAGCGG CGAAGTGATC GCGGAATCAT GCGCAGCCGG CATCGAGAGT
TTCCTAGGGC TGCGCTATCC GGAATCGGAC ATCCCGAAGC AGGCCCGCGC GCTGTATCTC
ACCAACTGGA TCCGCGCGAT TCCCGATGCG CGCTACGCGC CGGCGCCGAT CACGCCCGCC
GTCAATCCGC GCACCGGCCT GCGGCTCGAC CTCAGCCAGA GCGTGATCCG CAGCGTGTCG
CCGGCGCACC GGGTGTATCT GGCGCATATG GGCGTGGTGG CGTCGATGTC GCTGTCGATG
ATCCAGCACG GCCGGTTGTG GGGGCTGATC GCCTGCCATC ACAGCGCGCC GCGCTATCTG
CCGTACCGGA TGCGCGAGGC GTGCGAGCTG TTCGCCGAGA TGGCGTCGTC GCAGCTCGAG
GCCAAGGTCG CCGCCGAACA GCTCGAGGCG CGGCTGCGCA GCACCCGGAT CCACGAAGAG
CTGGTGACGC GGATGAGCCA GGAATCCGAC CTCGCCGAAG GTCTGATCCG GTTTCATCCC
AATCTACTCG ACTTCATCCC GGGGACCGGC GTCGCGCTGT GGATCGACGG TCAGTTCACC
GGCCTCGGCC GCACCCCGGA CGCCGCCCAG ACCGAAGCGC TGATCGGCTG GTTGACGGCC
AACTCGAACG ACGGCGTGTT CCACACCGAC GGCCTGCCGC TGATCTATCC GCCGGCGAAG
GCCTATGCCG ATTGCGCCAG CGGCCTGATG GCGCTGTCGC TGTCGAAATC GCCGCGCGAC
TACGTGCTGT GGTTCCGGCC CGAAGTGGTG CGCACCGTCA CCTGGGCCGG CAATCCGAAC
AAGCCGGTCG GCGTCGGCCC CGGCGGCGAA TTTCTGACGC CGCGCCGCAG CTTCGCCGCG
TGGCAGGAAT CGGTGCGACT CCATGCCGAG CCGTGGCGCG CCTCCGAGAT CGAGGCCGCG
CACCGACTGC GGCTGTCGCT GCTCGAAGTG GTGCTGCGGC GGATCGACGG CATCGCCCGC
GAACGCCGAT CCGCGCGGCT GCTGCAGGAA CAATTGATGC GGCAGGTCGA ACTCGGGCTG
CGCCGGTCGC ACGACGTCGC CCAGACGCTC CAGGAAGAGA CCCGGCGGCG AGTTTCGGTC
GAGGCCGACC TGTCGCAGGT GCTGCGCCGC ACGGTCGAGG ATCAGGAAGC CGAGCGGCTG
CGGATCGCGC GCGAGCTGCA CGATACGCTC GGCCAGTCGC TGACGTTGCT GCAACTCGGC
TTCGAAAATC TCGGGCAGGT CGCGCCGGAC AACGGCGAAT TGCAGAACCG CATCGCCGGC
ATGAGGAGCC TCACCGCCGA TATCGGCCAG CAGGTCAACC GGCTGGCGTG GGAGATCCGG
CCCACCGCGC TCGACGATCT CGGCATCCAG ACCGCGATCC AGCATCTGCT CGACGCCTGG
AGCGAGAAGG CGCAGGTGCA GTTCGACCTG CACATGACGC TCGGCGACCG CCGACTTCCG
CCGGCGATCG AGACCACCCT GTATCGCGTG CTGCAGGAAG CGCTGACCAA CATCGTCCGC
CACGCCGCCG CGAGCCATGT CAGCGTCATC CTCCGATTGT CGGACCAGCA GGTGACGATG
GTGGTCGAGG ACGACGGCCG CGGCTTCGTC AATCCCGATG CCGGTCTCCC GCCGGAGCGG
CTCGGCCTGC TCGGCATTCG CGAGCGGCTG ACGCTGGTCC GCGGCTCGCT CGAAATCGAA
TCCGCGCCCG GCAAGGGCAC CGCTCTATAC GCACGAATTC CGTTGTAA
 
Protein sequence
MNSGIDNSAE LIVAEFEAFG QSGCEREAIH LPGSIQPHGA LLALAPDDLR IVHASGDTTL 
LPGAPAGPLP GRHATEILSS DQIARLRALL GPDRRIERPL HAFALTAPDA TPIDVVVHQS
SGLLVLEFDP RREPAPENSL ALVQSMIRHV QRAGTVQAFC EAIVAELRAV TGFERVMIYR
FSADDSGEVI AESCAAGIES FLGLRYPESD IPKQARALYL TNWIRAIPDA RYAPAPITPA
VNPRTGLRLD LSQSVIRSVS PAHRVYLAHM GVVASMSLSM IQHGRLWGLI ACHHSAPRYL
PYRMREACEL FAEMASSQLE AKVAAEQLEA RLRSTRIHEE LVTRMSQESD LAEGLIRFHP
NLLDFIPGTG VALWIDGQFT GLGRTPDAAQ TEALIGWLTA NSNDGVFHTD GLPLIYPPAK
AYADCASGLM ALSLSKSPRD YVLWFRPEVV RTVTWAGNPN KPVGVGPGGE FLTPRRSFAA
WQESVRLHAE PWRASEIEAA HRLRLSLLEV VLRRIDGIAR ERRSARLLQE QLMRQVELGL
RRSHDVAQTL QEETRRRVSV EADLSQVLRR TVEDQEAERL RIARELHDTL GQSLTLLQLG
FENLGQVAPD NGELQNRIAG MRSLTADIGQ QVNRLAWEIR PTALDDLGIQ TAIQHLLDAW
SEKAQVQFDL HMTLGDRRLP PAIETTLYRV LQEALTNIVR HAAASHVSVI LRLSDQQVTM
VVEDDGRGFV NPDAGLPPER LGLLGIRERL TLVRGSLEIE SAPGKGTALY ARIPL