Gene RPB_2946 details

Gene Information

Locus tag: RPB_2946
Symbol:
ID: 3910745
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3357340
End bp: 3358389
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 60%
IMG OID: 637884852
Product: signal transduction histidine kinase
Protein accession: YP_486559
Protein GI: 86750063
COG category: [T] Signal transduction mechanisms
COG ID: [COG3920] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
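
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(3357340, 3358389)
print(length)                     # 1050
print(protein_length_aa(length))  # 349
```

Both results match the Gene Length (1050 bp) and Protein Length (349 aa) fields of this record.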


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.341682
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.680161
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
ATGCTTCAGA CCCCGACCGA ACATTTCGAG GCAGCGAGCA CACTTGCCGT TGTGGTGTCC 
TCGCACGAAC CGCTCCTTTT CCTCTCGGAG GAGCAGAGGG TCATTGCGGC GAGCGCGTCG
TTCTGCCGGG CCTTCGCGAT CGATCCTGCG ACGGTTTCGG GCAAATGCCT CAGCGAACTC
GGCAACGGTG AGTGGGCGAT GCCCAAGCTT GCTTCGCTGC TGAGGGCGAC AGCGTCGGGA
AGTGCACAGA TCGAAGCCTA TGAAATCGAC CTCGAGAGGC CGAACCAGAC AGCCCGGAAC
TTGGTCGTCA ATGCCCGGAC TCTCGATGAC GGAGACAGAG ATCATGTCCG GCTGCTCCTG
GCGGTTACCG ATGTGACCGA TGAGCGCGCC GCAGCGCGAC TCAAAGACGA TCTCATCCGT
GAAAAGGCGA TCCTTCTGCA AGAAGTGCAG CATCGGGTTG CGAACAGCCT CCAGATCATT
GCAAGCGTGC TTATGCAGAG CGCCCGTCGG GTCCAGTCCG AAGAAGCGCG CGGTCAGCTC
CGCAATGCCC ACAACCGGGT CATGTCGATC GCTGCTCTCC AACGCCAGCT CTCGACCGCG
AGTGGGGAAA CCGTAGAACT CCGGACCTAT TTTATCCAAC TCTCCCAGAG CCTCGGTGCG
TCGATGATCG ATGACCCGGA CCGCCTCTCG ATCCTTGTAA AATCGGACGA CACCACGGTG
AGCGCGGGAG TATCCGTCAG TCTCGGCCTT ATCGTCACCG AACTGGTGAT CAATGCCCTC
AAGCACGCAT TCCCGGATCA GCCCACGGGC CAAATCGTGA TCGACTATCA TTCGTCCGGC
AAGGACTGGA CACTTTCGGT TGCGGATAAC GGCGTCGGCA TGCCTTTGGG TGGCGATGCG
CCAAAGGCAG GTTTGGGCAC CGGAATCGTC GAGGCGCTCG CGAAGAACCT GTCAGGAACG
GTTGCCGCGA CGGACGCAAA CCCCGGCACG ATTGTGACGA TCAGCCATCG GGAGAATTCC
GATAGCCGAG ATGGCCTTCC ACAGGCGTGA
 
Protein sequence
MLQTPTEHFE AASTLAVVVS SHEPLLFLSE EQRVIAASAS FCRAFAIDPA TVSGKCLSEL 
GNGEWAMPKL ASLLRATASG SAQIEAYEID LERPNQTARN LVVNARTLDD GDRDHVRLLL
AVTDVTDERA AARLKDDLIR EKAILLQEVQ HRVANSLQII ASVLMQSARR VQSEEARGQL
RNAHNRVMSI AALQRQLSTA SGETVELRTY FIQLSQSLGA SMIDDPDRLS ILVKSDDTTV
SAGVSVSLGL IVTELVINAL KHAFPDQPTG QIVIDYHSSG KDWTLSVADN GVGMPLGGDA
PKAGLGTGIV EALAKNLSGT VAATDANPGT IVTISHRENS DSRDGLPQA
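
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below shows one way to reproduce that translation without external libraries. It assumes the standard codon-to-amino-acid assignments, which table 11 shares with the standard code, and it does not handle alternative start codons, which table 11 allows but which are not needed here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard amino-acid assignments, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon.

    Whitespace is ignored, so the 10-base blocks used in this record
    can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases of the gene sequence in this record.
print(translate("ATGCTTCAGA CCCCGACCGA ACATTTCGAG"))  # MLQTPTEHFE
```

The output matches the first ten residues of the protein sequence. Translating the full gene sequence the same way yields a trailing `*` for the final TGA stop codon, which is omitted from the protein sequence shown.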