Gene Rsph17029_2164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2164 
Symbol 
ID4895385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2291247 
End bp2294072 
Gene Length2826 bp 
Protein Length941 aa 
Translation table11 
GC content72% 
IMG OID640112758 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_001044039 
Protein GI126462925 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0320639 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCCCG CCTGGGCCGC GCGTCTCGGC GCGTCGCGCG ATCAGCAGCG CGCGCTTGAG 
CAAGAGCGGA GCCGGACGCA GGCCGCACTT CAGGTCATGC CGCACATGTT GATCGAACTG
GACGCCGACG AACGGCTCCT GCGGGGCCAT CCGGGCTGGC TCGACCGCCT GCCCACGCTC
GAGCCCATGG CTCCGGGACA AGGGCTGCAG GACTTTCTGC CGGCCGACCT CGCGACGGAG
GTGCGCCGTG TGCTGGCCGA AGTCGAGACG ACCGGCGCCC CCGTCAGCCG CGACTGGCAG
GTGCGGCAGG CGGGTGCCCG GGTGTGGCTT GCGATCTCTG CCGCGCGCCT GCCCCCGCTC
GCGCCGGACA GGCGGCCCGG CCATGCGGTC ATGATCTGCG ACACGACCGA GCTGCACGCC
CAGCGCAAGC AGGCCGACAG GCTGGGCAAG TTCGCCCAGC TCACCACCAA CCTCGTGATC
GTCACCGATC CCCACCAGCG CATCGAATGG GTGAATGCAG CCTTCGAACA GCAGACCGGC
CATCCGGGCC AGACGGTGCG CGGCCAGCCG CTCTGCGCGG TGCTGCAGGG CGAGCCCGCG
CCGGGCGGAT GCGGCGATCG CCTGCGCGCC ATGCTGGCCG AGGGTCGGAC GGCACGGGCC
GAGGTCGAGG CCACCGCCGC CTGCGGGCGC CGCTTCTGGC TCGATGTCTC GCTCCAGCCG
CTTATCGACG ACGAGGGTGC GCTCGATGGC TACATGGCCG TGGCCAGCGA CATCACCGCC
CACAAGCGGC AGGAGGAGCA GCTGGCCGCC ATGGCGGCCG AAGCGCGCGC CACCCGGGCG
CGGCTCGCGG CCGCCGTCGA CGTCTTGCCG GACGGGTTCG CCTATTTCGA CGCCGAGGAC
CGGCTCGTCG TCTTCAATCC TCAGTACCGC GACTGCTACC CCGGCGCCGC GGCGGCCATC
GTGCCGGGCG CGAGTTTCGA ACTGATCCTC CGTCAGGCGG TGCAGTCCGG CGACATCCTC
GAGGCGCGCG GCCGCGAGGA AGAATGGCTG GCCGCGCGTC TCGCCGCCCA CTACCGCGGC
AGCAACCAGC AGGAGCAGCA GCTGGCCGAC GGGCGCTGGC TGCGCGTGAT CGAACGGCAG
ACCCCTGACG GCGGCCGCGT GGGTCTCAGG GTCGATGTGA CGGCGCTGAA ACTGGCCGAA
CAGCGGACCC GCATCGACTT CTCGGCCACG ATGGACGCGT CGCAGGACGG AATTGCCTTC
ACCGATCCCG AAGGGCGCTA CATCTACATG AACCCGGCCC ATCGCGAGAT GTTCGGCATC
GCCACCGAGG ACGAGATCCT CGGCCGGTCC TGGCGCGAGC TCTATGCCGG CGACGTGGCC
GACCATATCG CCACCACGGC CCTCCCCGCG CTGCTGAGCA CCGGCGGATG GCGCGGCGAG
CTGATCGGCC GACGCCGCGA CGGCAGCGAG CTGCCACAGG AAGTGTCCCT TACCCTCAAG
GCGGACGGCG GGATCATCTG CATCTCGCGC GACATCTCCA GGCGGCTGCG CGAGCAGCAG
GAACGCATGC GCCTGCGCGA GGAGCTGCAG ATGGCGCAGC GGCGCGAGGT CATCGGGCAG
CTGGCCTCCG GCCTCGCGCA CGACCTGAAC AACCTGCTCG CGGCCATCGG CGGCTCGGCG
CTGCTGATCC AGGATCTGCA GTCCGGCGCG GCCGAGGTCC ATGCGCAGCG CATTCTGGCC
GCGACCGAGC AGGCCGGCGC GCTCGTCCGC CGCTTTCTCA CCCTCGGCAA GCGGCAGAGC
AACCGCTCGC GCATCGACCT GTGCCCCCTC CTGCAGGAGG CGGCCGATCT GGTGCAAGCG
GGCCTGCGCA ACCGCACACG GCTCACCCTC GCGCTGCCCG ATGCGCCGAT CTGGATCGAG
GCGGATCCGA CCGACATCCT GCAGGTGGTG CTGAACCTCG TCATCAATGC GCGCGATGCG
ATCTCGACCG CGCCCCCGCG CGAGGGCGGG CACGAGATCA CCGTGGCGCT CGCGCCGGCC
GGACCCGCTC AGCTTGCGCA GACCTACGCG GTGGGCGCGC CCTGCGCCGA CCGGCGCTAT
GTCTCGATGA CGGTCAGCGA CAGCGGCCCC GGGATCGAGC CTGCGATGCA GGCCAGGGTG
TTCCAGCCCT ATTTCTCGAC CAAGGGCGCC GCGGGCACCG GGCTCGGGCT CGCCATCGTG
ACGGGCGTGC TTACCGCCAA CCACGGCGCC ATCGCGCTCC AGAGCGCGCC CGGAGAAGGC
ACGCGCTTCA CCGTGCTCTG GCCGGTCGAG CCGCCGGACG CGGCGGCCCC TCCCGACGAG
CCCGCGCCGA ATTTCGCCCT GCCGGCCACC GCAGAGCCTG CCGCCTCCGC TCTGCCAGCC
GCCGCAGAGC CCGCCCCGGG GCCGGAGATG CGGGCTGCCG GTCGCCTCGC CGGGCGCACC
ATCCTCGTGG CGGACGACAA TGCCGATGTG CTGCGGGTGC TGGTGGCCTT TCTCGAGGGG
GCGGGCGCCG AGGTCGTGCC CTGTTCGGAT CCGCGCGACG CGCTGGCGGC GCTGCAGGAC
GACCCGCGGA TCTGGGACCT GCTGGTGACA GACTACGACA TGCCGCAGAT TTCGGGGGCC
GACCTCTGCC GCGCCGCCAA CGACCTTGCA CCGGATCTTC CGGTGCTGCT CATCACTGCC
CTGCCCGACT GGCGGAGCCG GATCAGCGCT CCGGCGCCCC GCTTCACTGC CGAGCTGGGC
AAGCCTCTCA GCCGCGCGAC GCTGCTCGAC GCGGCCGAGC GCGCGATCGG CGCGAAGGCG
GGCTAG
 
Protein sequence
MMPAWAARLG ASRDQQRALE QERSRTQAAL QVMPHMLIEL DADERLLRGH PGWLDRLPTL 
EPMAPGQGLQ DFLPADLATE VRRVLAEVET TGAPVSRDWQ VRQAGARVWL AISAARLPPL
APDRRPGHAV MICDTTELHA QRKQADRLGK FAQLTTNLVI VTDPHQRIEW VNAAFEQQTG
HPGQTVRGQP LCAVLQGEPA PGGCGDRLRA MLAEGRTARA EVEATAACGR RFWLDVSLQP
LIDDEGALDG YMAVASDITA HKRQEEQLAA MAAEARATRA RLAAAVDVLP DGFAYFDAED
RLVVFNPQYR DCYPGAAAAI VPGASFELIL RQAVQSGDIL EARGREEEWL AARLAAHYRG
SNQQEQQLAD GRWLRVIERQ TPDGGRVGLR VDVTALKLAE QRTRIDFSAT MDASQDGIAF
TDPEGRYIYM NPAHREMFGI ATEDEILGRS WRELYAGDVA DHIATTALPA LLSTGGWRGE
LIGRRRDGSE LPQEVSLTLK ADGGIICISR DISRRLREQQ ERMRLREELQ MAQRREVIGQ
LASGLAHDLN NLLAAIGGSA LLIQDLQSGA AEVHAQRILA ATEQAGALVR RFLTLGKRQS
NRSRIDLCPL LQEAADLVQA GLRNRTRLTL ALPDAPIWIE ADPTDILQVV LNLVINARDA
ISTAPPREGG HEITVALAPA GPAQLAQTYA VGAPCADRRY VSMTVSDSGP GIEPAMQARV
FQPYFSTKGA AGTGLGLAIV TGVLTANHGA IALQSAPGEG TRFTVLWPVE PPDAAAPPDE
PAPNFALPAT AEPAASALPA AAEPAPGPEM RAAGRLAGRT ILVADDNADV LRVLVAFLEG
AGAEVVPCSD PRDALAALQD DPRIWDLLVT DYDMPQISGA DLCRAANDLA PDLPVLLITA
LPDWRSRISA PAPRFTAELG KPLSRATLLD AAERAIGAKA G