Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2356 |
Symbol | |
ID | 3909354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2707866 |
End bp | 2710568 |
Gene Length | 2703 bp |
Protein Length | 900 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637884253 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_485972 |
Protein GI | 86749476 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0113546 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.244783 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAACG CCATTGCAGC GTTCATCCAG AGCGTGAGGT CGCGCACGCT TTCGATCGGA CAGATGACGT TCGGCAGCTT CCTGTTGCTG CTGACCATCA TCGCCATGAC CAGCATCGGC AGCGTGGTCG CGATCCGTCA TATCGACGCC ACCTTCGCCG AATTGCAGCG GCTGCAAAGT GTCGGCGATC TCGCCGAGGA GATCGAGCGC CGGATGAGCG ATCTCCGCTT TGCGGCGCGC CAGGTCGTCA GCGAACCGAG CGCGCAGCCG GCCGGGCGGG TGTTCGAGGC CGCCTCGACC CTGACCGCGC TATTGAAGAA GACGCGGCTC GAGCTCGATG CCGATCAGCG CGAGATGATC GACGGCGTCA CCGATCGGCT CACCAATTAT CGCGACGGGC TGGAGCGGAT CACCGCGCTG ATGCCGCGGC GGGCGGACAT GGTCGCCACC GTCCCGCCGC TGCGCGAGGC GTTCGAGGCG GTGGTGGTCG GGCTGCAGGA CCGTTCGCTG GCGGCGGCGC TGCACGGCGA GCACAGCAAG ATCGCCGCTG CGCTGCTGTC GCGCGATCTG ATCGGCGCGG GGGAGGCGGC GCAGCGGATG CGCGCGATCC CGATCGGAGA TGTCGCCGCC GCGACCGCGG TGCGCGACTA TGCCGACGTG ATCATCTCGA CCACGGCGGT CGAGCACGAG ATCGCATCGC TCGATCGCAA CGTGCTCGGT GCGGAAGGAC GGTCGATCGC GCGTGTCACC GAGCTGCTGC GTGATGTCGC CGCGCGGCGG GGCAGGGTGC TGTCGCGCGA TCTCGCCCGG ACGCTCACCG AGGACAAGTG GCAGAGCATC GTGCTCGGCT CCGCCGGCGT GCTCGTCGGC CTGTTGGCGG CGGCGTTCGT CGTGCGCCGC ACCGTCGGTC CGCTCACCGC GATCACCAAG GCGATCCGCC AGCTCGCGGC CGGCCAGCAA TACACCGCGA TTCCGGCGAT CGAACTCAAG AACGAGATCG GCGACATCGC CCGCGCGGCG GAAGTGTTCC GTCGCACGCT GGTCGAGGCC GACAGCGCGC GCGAGGCAGC GGTGCGGGCG CTCGCCGAAC AGCGCCTCGC CGAGGAGAGC TACCGCAAGC TGTTCGAAGG CTCGATCGAC GGCATCTATG TGACGACGCC GGACGGCGCG CTGCTGAACG CCAATCCGGC GCTGGCGCGG ATGATGGGCT ACGACGATCC GGCCGACCTG ATGCGCGCCA CCGGCGACGT CTCCAACTCG ATCTATCTCG ATCCGCGCAA GCGCGACGCC TATCGCGCGC TGATGCAGCG CGACGGCATG GTGCGCGAGT TCGAGTATCA GGCGCTCAAG CGCAACGGCG ACGTGCTGTG GCTGTCGGAC AGCGCCAGCG CGGTGCGCGA CGAGACCGGC GCGGTGATCC GCTACGAGGG CGCCGTCCGC GACATCACCG ACCAGAAGCG TGCGGAGACC GCGGTCGCCG AGGGGCGCCG CCTGCTGCAA CAGGTGATCG ACACCGTTCC GGCGGTGATC AACGTCAAGG ACACCGAGCT TCGCTATCTG TTGATGAACC GCTACATGGC GACGATCTTC GGCATCGATC CGAACGACGC GATCGGCCGC ACCACCGCCG ACCTGATGTC GCGCTACGGC TCGCACAAGA CCGACGCCAA CGACAAGAAG GTGCTGGCGA CCGGCGAGGG ACTCGGCTTC TACGAAGAAG AATATCAGGA CGCCACCGGC GCGATGCGGC AATGGCTGGT CAACAAGATG CCGCTGAAGG ATTCGCAAGG GAATATCGAG CGGATCGTCA CGGTCGCGCT CGATATCGGC GAGCGCAAGC GCGGCGAACT GGAAATGCGC AAGGCCAAGG ATGCCGCCGA GGCGGCGCTG CGCAATCTGC GCGAGACCCA GCAATCGCTG ATCGAGGCGG AAAAGCTCGC CGCCCTCGGA CGTCTGGTCG CGGGCGTCGC CCACGAGGTC AACAATCCGG TCGGGATCAG CCTCACGGTC GCCTCCGCGC TGGAGCGCAA GACCTCGGTA TTCGCCGCCG AAGTCGAGCG CGGCGATCTC AAACGCTCGC GGCTCAACGA ATTCCTCGCC ACCAGCCGCG ATGCGGCCTC GCAGCTCGTC GCCAATCTCA ACCGCGCCGC CGAGCTGATC CAGTCGTTCA AGCAGGTCGC GGCCGACCGC AACTATTCGG ATCAGCGGCC GTTCGATCTC GGCGACCTGA CCGAGCAGGT GGTGATGAGT CTGCGGCCGG GCCTGCGTAA GCACAATCTG ACGCTGGATG TCGATTGCCA GCCCGGTCTG ACGATGAATT CCTATCCCGG CCCCTACGGT CAGGTACTGA CCAACCTGTT TCTAAACTCG GTGGCGCACG CATTTCCGAA TGGCCGCCCC GGCACGGTGG AGATCAAGGT CCGTGCCTCC GGCAAGGACA ATGTCGAGGT GATCTACGCC GACGACGGCT GCGGCATGAG TCTGGATGTG CGCAGGCGGG CGTTCGACCC GTTCTTCACC ACGCGGCGGG ATCAGGGTGG CACCGGGCTC GGCCTGCATA TCGTCTACAG CATCGTCACC AACCGCCTCG GCGGCCGGCT CGATCTCGAT TCCGCGCCGG GCAACGGCAC GCGCATCCAG ATGATCCTGC CGCGCGTCGC GCCGCGCGAC GTGGTGGACG AGCCGATGAC CGCGCAGGCC TGA
|
Protein sequence | MPNAIAAFIQ SVRSRTLSIG QMTFGSFLLL LTIIAMTSIG SVVAIRHIDA TFAELQRLQS VGDLAEEIER RMSDLRFAAR QVVSEPSAQP AGRVFEAAST LTALLKKTRL ELDADQREMI DGVTDRLTNY RDGLERITAL MPRRADMVAT VPPLREAFEA VVVGLQDRSL AAALHGEHSK IAAALLSRDL IGAGEAAQRM RAIPIGDVAA ATAVRDYADV IISTTAVEHE IASLDRNVLG AEGRSIARVT ELLRDVAARR GRVLSRDLAR TLTEDKWQSI VLGSAGVLVG LLAAAFVVRR TVGPLTAITK AIRQLAAGQQ YTAIPAIELK NEIGDIARAA EVFRRTLVEA DSAREAAVRA LAEQRLAEES YRKLFEGSID GIYVTTPDGA LLNANPALAR MMGYDDPADL MRATGDVSNS IYLDPRKRDA YRALMQRDGM VREFEYQALK RNGDVLWLSD SASAVRDETG AVIRYEGAVR DITDQKRAET AVAEGRRLLQ QVIDTVPAVI NVKDTELRYL LMNRYMATIF GIDPNDAIGR TTADLMSRYG SHKTDANDKK VLATGEGLGF YEEEYQDATG AMRQWLVNKM PLKDSQGNIE RIVTVALDIG ERKRGELEMR KAKDAAEAAL RNLRETQQSL IEAEKLAALG RLVAGVAHEV NNPVGISLTV ASALERKTSV FAAEVERGDL KRSRLNEFLA TSRDAASQLV ANLNRAAELI QSFKQVAADR NYSDQRPFDL GDLTEQVVMS LRPGLRKHNL TLDVDCQPGL TMNSYPGPYG QVLTNLFLNS VAHAFPNGRP GTVEIKVRAS GKDNVEVIYA DDGCGMSLDV RRRAFDPFFT TRRDQGGTGL GLHIVYSIVT NRLGGRLDLD SAPGNGTRIQ MILPRVAPRD VVDEPMTAQA
|
| |