Gene RPB_0413 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0413 
Symbol 
ID3908851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp456237 
End bp458186 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content70% 
IMG OID637882299 
Productperiplasmic sensor hybrid histidine kinase 
Protein accessionYP_484035 
Protein GI86747539 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.486331 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGGT GGTTGCGATT CAGATCGAGA CGGCGCGCGT TCGCCGGACG GTTTCCGCGG 
CTTGTGTTCG CGCTGCGCTG GAGCATGATC TTTCTGGCGG CGTTCGGTGG CGCCTATGGG
TTCATCATCG GCAGCCGCGC CGAATCCTCC GGCTACAACC CGCATACTTT CGCGATCGGT
GCGAGCTTCC TGTTCGCGCT CGCCTGCCTC GGCCTCGCCG TCCAGAGCAT ACGACTGCGC
TGGTTGCGCC GGCGCCTACA GGCGTTGGAG CGGCACAACG AGACGCTGGC GGACCGCAAC
TGGGAGCTGA GGGATGCCGA GGAACGCGCG CGCGTCCTCC ACCAAGCTCG CGACGAAGCG
GACGCGCCAC GCCGCGACGA AGACGCAACG CGGCGGAAAT CGCGGCTGCT GGCGATGGCG
TCACACGAGA TCCGCACGCC GCTGAACGGC ATCATCGGCA TGAGCGGCCT GCTGCTCGAC
ACCGCGCTGA CGCCCGAGCA GGCGACCTAC GCCCGCGCGG TGAAGACTTC GGGCGACGCG
CTGGCGACGC TGATCGACGA GCTGCTCGAC TATTCCCGGA TCGAGGCCGG CAGGTTCGAA
CTCGACAGCC GGCCGTTCGC GCTGACCGCC CTGATCGAGG ACATCGTCGA GCTGCTGGCG
CCGCGGGCGC AGGCGGGCGC GCTGGAGATC GCGGCGGATA TCGATGACCG GGTACCGTCG
CGGGTGATCG GCGACGCGGC GCGCTTGCGC CAGGTGCTGC TCAATCTCGC CGGCAACGCC
ATCAAGTTCA CGTCCGTCGG CGGCGTCGTG CTGATCGTCC AGCCGGGCGA GCGCGACGGC
GAGATCAGCT TCGCGGTGCG CGACACCGGC ATCGGCATCG CCCGTGAGGC GCAGACGCGG
ATCTTCGGCG AATTCGAACA GGCCGACGAC GGCATCGCCC GCAGCTTCGG TGGCACCGGG
CTGGGGCTGA GTATCAGCCA GCGGATCGTC GAGCGGATGG GCGGTCGGAT CGCACTCGAC
AGCCAGCCGG GACGAGGCTC CACCTTCACG GCCTCGATCC CGCTCACAGC GGCGGACGAT
TCCGGCGCCT CCCCCATTGT CCAGCCGCCC GACCTCACCG GTCAGTCGAT CATGATCGCG
GCGCCGCAGA CCATCGAGGC GACGCTGGTC GCCCAGCGGC TGCAGCGCTG GGGCGCGCAC
GTGCTGGCGT TGTCCGAACT CTCCGCCGCG CGCACGGCGC TGCCCGAACG CGCCTGGTCG
GCCATCCTGA TCGATGCCGG CTTCGGCGCC GATGCGGCCG AGGCGCTGGC GCGGAGTGCC
CGACCGCATG CGGCGCAACG CGTCGTCATG CTGACGCCTG CCGCTCGCTC CGAACTATTG
CCGGATCTGC CACCTGCATT CACCGGCTAT CTCGTCAAGC CGCTGCGGGC GGCCTCGCTG
TCGGCGCGGC TGGCAGGGTC GTTGATCGAC ACCGTCGCGC CGGGGATCGC GGACGGCGAT
CCGCCGGAGA CCGCGGCAAC GGATGCGACG GCGCCAGGCG GCCGGCGGCT CTCCGTGCTG
GTCGCAGAAG ACAACGAGAT CAACGCGCTG CTGATCCGCT CGCTGCTGGC TCGGCTCGGT
CATCGCGTGG CTGTCGCCGG CGATGGCGAG CAGGCGCTGC AAAGCTGGCG CGCGGCCGCG
GCCGACGGCG CGCCCTTCGA TCTGGTGCTG ATGGACGTGC AGATGCCGAC CAGCGACGGC
ATCGCCGCCA GCCGGCAGAT CCGCGCGCAG GAGGCCGTCG GCGACGGACG CCGCACGCCG
ATTCTGGCGC TAACGGCGAA TGCGCTCGCA GAAGATCGTG AGGCCTGCTT CGCGGCGGGT
ATGGACGGCT TTCTGGTCAA GCCGCTCGAT CGCGACAAGC TGATGACGGC GCTCGACCGG
GTCGCGGCGG CGAGGCCGAT GGTCGCGTAG
 
Protein sequence
MKRWLRFRSR RRAFAGRFPR LVFALRWSMI FLAAFGGAYG FIIGSRAESS GYNPHTFAIG 
ASFLFALACL GLAVQSIRLR WLRRRLQALE RHNETLADRN WELRDAEERA RVLHQARDEA
DAPRRDEDAT RRKSRLLAMA SHEIRTPLNG IIGMSGLLLD TALTPEQATY ARAVKTSGDA
LATLIDELLD YSRIEAGRFE LDSRPFALTA LIEDIVELLA PRAQAGALEI AADIDDRVPS
RVIGDAARLR QVLLNLAGNA IKFTSVGGVV LIVQPGERDG EISFAVRDTG IGIAREAQTR
IFGEFEQADD GIARSFGGTG LGLSISQRIV ERMGGRIALD SQPGRGSTFT ASIPLTAADD
SGASPIVQPP DLTGQSIMIA APQTIEATLV AQRLQRWGAH VLALSELSAA RTALPERAWS
AILIDAGFGA DAAEALARSA RPHAAQRVVM LTPAARSELL PDLPPAFTGY LVKPLRAASL
SARLAGSLID TVAPGIADGD PPETAATDAT APGGRRLSVL VAEDNEINAL LIRSLLARLG
HRVAVAGDGE QALQSWRAAA ADGAPFDLVL MDVQMPTSDG IAASRQIRAQ EAVGDGRRTP
ILALTANALA EDREACFAAG MDGFLVKPLD RDKLMTALDR VAAARPMVA