Gene RPC_3026 details

Gene Information

Locus tag: RPC_3026
Symbol:
ID: 3973479
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 3325227
End bp: 3326420
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 67%
IMG OID: 637926137
Product: histidine kinase
Protein accession: YP_532890
Protein GI: 90424520
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
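
The listed lengths can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the values in this record) and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3325227, 3326420)
print(length)                     # 1194
print(protein_length_aa(length))  # 397
```

Both results match the Gene Length and Protein Length fields above.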


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.38166
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGTTG CGGCGGTGAG CATCTCGACT GCGGCGCTGG TCTATCTGTT CAAGCCGCCG 
AACGAACGGC AGGCGATCGG TGCCCTTGCC GACCAGGTCG AACTGCTCGA TCGGGCCGCG
CGCGTCGCGC CGAGCCTGGT GAGCGTCCGG CAAGAGCCCG GCGGCGGCCA GGTCGAGGCC
AACCTCACCG CCTGGCTGCG CCAGGCCGTC GCCGAACGCG GCATGCCGCG CGACGTCGTG
GTGTCCCGGA ACGACCATCT GCCGCTGCTG TCAGTTCCGG TCGATCGCGG CTGGATTGTC
ACCGAGATCG CCGACCTACC CCCGGAAGGC GGCATCCTCA ACAGGCTGTT GAAGTGGCTC
GCCTTCGTCA CCTTCGGGGT GTTGCCGGTG GCACTGTTTT TCGCCAATCG CATGGTCAAG
CCGCTGGTGG TGATCGAGCG GGCGATCGCG AACGTCGGTC CAGACGGCCT GCTGCCCGAA
CTCCCGATCG AAGGACCAGC CGAGGTGCGT CTCGCGGCCA AGGCGCTCAA CTCGCTGTCC
TCGCGCCTGA AGGTGGCGAT GGACAGCCGG ATGCGCGTGG TCGCCGCCGC CGGCCATGAT
CTGCGAACCC CGATCACCCG CATGCGCCTG CGCGCAGAGT TCGTCGAGGA CGACGATGAT
CGCACGATGT GGCTGAAAGA TATCGATGAG CTCGATCGCA TTGCCGACAG CGCGATCCAG
CTGGTGCGCG AGGAAGCGGG GAAGACGACG CCCGAATCCC TCGGGCTCGA CCTCCTGATC
GCCGAAACGG TCGAGGAGTT GCGGGCGCTG TCTTACGACG TCAACCTCAC CCGCACCGCG
GCGGGCTGCG TGCTCGCCGA CCGGACCAGC CTGAAGCGCG GGTTTCGCAA TCTGATCATC
AACGCCGCCA CCCACGGCAA GCGCGCGCGG GTCGCGATCG AGAGCGCGCC GTCGGAAGTG
AGCGTCATCA TCGAAGACGA CGGCCCGGGG ATTCCCTCCG ACATGCTCGG GCGGGTGTTC
GAACCGTTCT TCAGCGCCAA TCGCGCGCGC ACCAAGCATT TCGGCGGCGC CGGGCTCGGC
CTCACCATCG CCCATGAAAT CGTGCAGCGC GCCGGCGGGA CCATCAAAAT CGAAAACGGT
CGCCTGCGCG GGTTGATGCA GACCGTGCGG CTGCCGGCTC ATCGATCGTC GTGA
 
Protein sequence
MIVAAVSIST AALVYLFKPP NERQAIGALA DQVELLDRAA RVAPSLVSVR QEPGGGQVEA 
NLTAWLRQAV AERGMPRDVV VSRNDHLPLL SVPVDRGWIV TEIADLPPEG GILNRLLKWL
AFVTFGVLPV ALFFANRMVK PLVVIERAIA NVGPDGLLPE LPIEGPAEVR LAAKALNSLS
SRLKVAMDSR MRVVAAAGHD LRTPITRMRL RAEFVEDDDD RTMWLKDIDE LDRIADSAIQ
LVREEAGKTT PESLGLDLLI AETVEELRAL SYDVNLTRTA AGCVLADRTS LKRGFRNLII
NAATHGKRAR VAIESAPSEV SVIIEDDGPG IPSDMLGRVF EPFFSANRAR TKHFGGAGLG
LTIAHEIVQR AGGTIKIENG RLRGLMQTVR LPAHRSS
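
The sequences above are printed in space-separated blocks of 10 characters. As a minimal sketch of how such text could be prepared for analysis, the hypothetical helpers below strip the whitespace and compute a GC fraction. They are assumptions for illustration, not tools provided by the source database, and only the first and last lines of the gene sequence are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGATCGTTG CGGCGGTGAG CATCTCGACT GCGGCGCTGG TCTATCTGTT CAAGCCGCCG"
last_line = "CGCCTGCGCG GGTTGATGCA GACCGTGCGG CTGCCGGCTC ATCGATCGTC GTGA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head))               # 60
print(head.startswith("ATG"))  # True
print(tail[-3:])               # TGA
```

Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field. TGA is a stop codon under translation table 11, which is why the protein sequence has one residue fewer than the number of codons.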