Gene RPB_1960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1960 
Symbol 
ID3908040 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2227354 
End bp2228682 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content68% 
IMG OID637883854 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_485579 
Protein GI86749083 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.463219 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGAGC GGCTCGGCAG GTTTCGATTC AGCGGCATCC GCAGCCAGAT CGCGGTGCTG 
GTGTTGGCGT CGCTCATCGG GACGCAGTTG CTGATCATGG CGACTTTCCT GCTGCGCGGG
CCGGACCGCT TCGGTCCGCC CGAACATCGG CGCGAGCAGT TCGAAATCGC GGTCCTTCTG
ATCGCGGCGA CTCCCGAGGA GCAGAGGCCG CAGCTCGTCG AACAAATCAC CAGGACATTC
CCTCACCTCG AACTCCGCCT GCTCGACGCC GCCGCGGTGC CGCCGCGGTC CGACCGCGAA
CTGCCGGAAA TTCGCGACGT CGCGCGCACG CTCGGACCAT TGGCGAGGGT GTTCGCGCTG
CCCGGCGCGG AGCCGCCGCA AGTCGGCATT GCGTTGCCGG ACGGGACCGC GATCGCCGCC
GCGATTCCGG AGATGCGAGG CCGGCCTCCG ATCTCGAACG GGCCGTGGAT GAGTGCGTTC
GCCGGCGTGA TCATCAGCCT CGCGATGTTC GGCCTGTGGG CCGACCGGGC GCTGTCGACG
CCGCTGTCGG AATTCGCTGC CGCGGCGGAA AACTTCAAGC TGGACGGCAC CGACGAGCCG
CTGGCCGAGA GCGGGCCGGA CGAAATCCGC TCGCTGGCGC GGGCCATGAA CCGGTCGCGC
AACCGGATCA CGGCGCTGAT CGACGACCGC ACCCGGACAC TGGCTGCGAT CGGCCACGAC
CTGCGCACCC CGATCACGCG GCTGCGGCTG CGCAGCGAAT TCATCGAGGA CGCCACCCAG
CGCGACAACA TGCTGCGCGA TCTCGACCAG ATGCGCTCGA TGCTCGACGC GGTGCTGTCG
TTCCTGCGCA CCGGCCGCGC GCTGGAACCG ATGACGCGGA TCGATCTGGC GAGCACGCTG
CAACTGATCA CCGATCAGTT CACCGATCTG GGCCACAAGG TGACTTATCT GGGGCCTGAA
CATGCCGAAT TGTTGGCGCG GCCGGACGAT ATCCGACGCG CCGTCACCAA TCTGGTCGAC
AACGCAGTTC GCTATGGAAA GGACATCCTG GTCCGGCTGG AAGCGTCGCC CGGGCGCGTC
ACGATCGAGG TCGAAGACGA CGGCCCCGGC ATTCCCGAAG CCTGCAAGGC CGACGTGATC
GAGCCGTTCG TGCGCGGCGA CGATGCGCGC AACATGGACG AGACCTCCGG CTTCGGCCTC
GGCCTCTCGA TCGCCCGGAC CATCGTGCAG AATCACGGCG GCGAACTGAC GCTGCGCGAC
CGCAAGCCGC ACGGCCTGAT CGTGCGGCTC GACCTCCCCG GACAGCAGCA GGACAGCGGC
GCCGCGTGA
 
Protein sequence
MLERLGRFRF SGIRSQIAVL VLASLIGTQL LIMATFLLRG PDRFGPPEHR REQFEIAVLL 
IAATPEEQRP QLVEQITRTF PHLELRLLDA AAVPPRSDRE LPEIRDVART LGPLARVFAL
PGAEPPQVGI ALPDGTAIAA AIPEMRGRPP ISNGPWMSAF AGVIISLAMF GLWADRALST
PLSEFAAAAE NFKLDGTDEP LAESGPDEIR SLARAMNRSR NRITALIDDR TRTLAAIGHD
LRTPITRLRL RSEFIEDATQ RDNMLRDLDQ MRSMLDAVLS FLRTGRALEP MTRIDLASTL
QLITDQFTDL GHKVTYLGPE HAELLARPDD IRRAVTNLVD NAVRYGKDIL VRLEASPGRV
TIEVEDDGPG IPEACKADVI EPFVRGDDAR NMDETSGFGL GLSIARTIVQ NHGGELTLRD
RKPHGLIVRL DLPGQQQDSG AA