Gene RPB_3152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3152 
Symbol 
ID3910953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3605826 
End bp3607247 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content68% 
IMG OID637885054 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_486759 
Protein GI86750263 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.285761 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGCA AACGATTCTC AAATTCAACT CTCGCGCCAT GGCTGCTCGG CGCCACCCTG 
CTGCTGTTCA TCGTCATCGC CGGTGCGTTG ATCCTGAATC TGATGCGGCT GCGCGACAGC
TTCTCCTGGG TGCAGCACAC CAACGACGCC CTGCTGGCGA TTTCAGGCAT CCAGCGCGAA
GTGCTGGAAG CCGAGACCAA CGAGCGCGGC TATCTGCTCA CCGGCATCGA CAGCTACCGC
GAGAATTACA ACCATACGCG GGACACACTG GCATCGCGGC TCGACAGCCT GCGCTCGATC
GTCACCGACA ACCCCGAACA TGTCGCCCGG GTCGACGATC TCCGCCAGTT GATCGACATG
CGGACGGCCC AGCTCGGGCG GGTCATCGAA CTGGGGCCGG AGCGTGTGCG CGAGGCCCTC
GACATTCTCG AGCAGGCCCG CACCGACCGG CTGACCGAAC GCGTCGAAGC CAGCCTGAGC
GCCCTCACCC GTGTCGAACA AGCCCTGCTG ACGGAGCGCA TGTCGCGGCA CGATCACGAG
AGCCTCGCGG CGGCGCTGAT CACCGCCTGC CTGCTGATCC TCGCCGTCGC CAGCGCCGCG
ATCGCCGCAT TTCTGCTCGA GCACCAGCGC GCCGCGGCGC GGCAGCAGGA GGCGGACCAG
CGGCTGCAGA GCCTGCAGGC CGAATTGATG CGCGTGGCGC GGCTCAGCAC CATGGGCGAG
ATGTCGAGCG CGCTGGCGCA CGAGCTCAAC CAGCCGCTCG GCGCGATCAC CAACTACGTG
CAGGGCTCGC GCCGGCTGGT CGAGGCCAGC AGCCATCCCG ACAAGGCGAA GATCGGCGCC
GCGCTCGACA AGGCCGCGCA GCAGACGCTG CGCGCCGGTG CGGTGATCCA GCGACTGCGC
GAATTCGTCG GCCGCGGCGA GACCGACAAG ACGGTCGAGA GCCTGCGCGC GATCGCCGAG
GACGCGCTGG CGCTCGCCTC CGTGGTCACG CGCGACCGCC CGGTCGACGT CGCGCTGACG
CTCGATCCCG CGGTCGATCG CGTGCTGGTC GACAAGGTGC AGGTGCAGCA GGTCTTCCTC
AACCTGTTCC GCAACGCCTT CGAGGCGATG CACGAACTCC CGGAACGGCT GCTCTCCATC
ACCAGCCGGG CCGTCGAGGA CGGCATGATC GAGGTCGTGG TCGCCGATTC CGGTCCGGGT
CTCGATCCGC AGATCGCCGA CCGGATGTTC CAGCCGTTCG AGACCACCAA AGCGGAAGGC
ATGGGGATCG GCCTGTCGAT CTCGCAGACC ATCATCCAGG CTCATGGCGG CTCGATCAAC
GCCGAGCCCG CCCCAGCCGG CGGCACGATG TTCCGTTTCA CTTTGCCCTG CGCCGATCCC
GGCGCACACG AGCCCCGCCC CGCCGCCGTC TCGGTGTTGT GA
 
Protein sequence
MTRKRFSNST LAPWLLGATL LLFIVIAGAL ILNLMRLRDS FSWVQHTNDA LLAISGIQRE 
VLEAETNERG YLLTGIDSYR ENYNHTRDTL ASRLDSLRSI VTDNPEHVAR VDDLRQLIDM
RTAQLGRVIE LGPERVREAL DILEQARTDR LTERVEASLS ALTRVEQALL TERMSRHDHE
SLAAALITAC LLILAVASAA IAAFLLEHQR AAARQQEADQ RLQSLQAELM RVARLSTMGE
MSSALAHELN QPLGAITNYV QGSRRLVEAS SHPDKAKIGA ALDKAAQQTL RAGAVIQRLR
EFVGRGETDK TVESLRAIAE DALALASVVT RDRPVDVALT LDPAVDRVLV DKVQVQQVFL
NLFRNAFEAM HELPERLLSI TSRAVEDGMI EVVVADSGPG LDPQIADRMF QPFETTKAEG
MGIGLSISQT IIQAHGGSIN AEPAPAGGTM FRFTLPCADP GAHEPRPAAV SVL