Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_3109 |
Symbol | |
ID | 4023614 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 3455982 |
End bp | 3458687 |
Gene Length | 2706 bp |
Protein Length | 901 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637963310 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_570236 |
Protein GI | 91977577 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.143082 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAACG TAGTCGCAGC GTTCATCCAG AGCGTGAGGT CGCGCACGCT CTCGATCGGG CAGATGACGT TCGGCAGCTT CCTGCTGCTT CTCACCATCA TTGCCATGAC CAGCATCGGC AGCGTGGTCG CGATCCGCCA TATCGACAGC ACTTTCGCCG AGCTGCAGCG CTTGCATAGC GTCGGCGATC TCGCCGAAGA GATCGAACGG CGGATGAGCG ATCTGCGTTC CACGGCGCGG CAGGTTGTCA GCGAGCCGAG CGGCCAGCCG GCCGGCAAGG TGTTCGAAGC CGCCTCGACA CTGACCGCGC TTCTGAAAAA GACCCGTCTG GAAATCGACT CCGATCAGCA GGAGATGATC GACGGCGTCG CCGCACGCTT GAGCAACTAT CGCGATGGAC TGGAGCGCAT CACTGCGCTG ATGCCGCGGC GGGCAGAGAT GGTCGCGAGC GTGCCGCCGA TCAGCGAGGC GTTCGAGGCC GCCGTCGACG GGCTGCAGGA CCGCACGCTC GCGACGACGC TGCACGCCGA ACACAACAAG ATCGCGGCCG CGCTGCTGTC GCGCGATCTC GGCGGCGCCG GCGCGGCGGC GCAGAGGATG CGCGCCATCC AGATCGGCGA TCCGGCCGCT GCGATCGCCG TGCGCGACTA CGCCGACGTG ATCATATCGA CCGCGGCGGT GGAGCAGGAG ATCGCCCAGC TCGACCGCGA TGTGCTGGGC GCCGAAGGCC GGCTGATCGC GCGGGTCACC GAATTGTTGC GCGACGTCAG CGCGCGGCGG GGCAGGGTGC TGTCGCGCGA TCTTGCCCGG ACGCTTGCGG AGGACAAGTG GCAGAGCATC GTTCTCGGCT CCGCCGGCGT GCTGGTCGGA TTGATGGCGG CGGCTTTCGT CGTGCGGCGC ACCGTCGGCC CGCTGGCTGC GATCACGGCG GCGATCCGTT CGCTTGCCGC CGGCCAGCAA TACACCGCGA TCCCGGCGAC CGACGTCAAG AACGAGATCG GCGATATCGC CCGCGCCGCC GAGGTGTTCC GCAGGACATT GGTCGAGGCC GACAGCGCGC GGGAAGCCGC GGTGAGGGCG CTCGCCGAGC AGCGCCTCGC CGAGGAGAGC TATCGCAAGC TGTTCGAAGG CTCGATCGAC GGCATCTACG TCACCACGCC CGAGGGCGCG TTGCTGAACG CCAATCCGGC GCTGGCGCGG ATGATGGGTT ACGAGAGCAC GGCGGAGCTG ATGCGGGAGA CCGGCGAGGT TTCGCAGAAG ATCTATGTCG ACCCGCGCAA ACGCGACGAA TATCGCGCGC TGATGGAGCG CGACGGCATG GTGCGCGAAT TCGAGTATCA GGCGTTCAAG CGCAATGGCG ACGTGCTCTG GCTGTCCGAC AGCGCCAGCG CGGTGCGCGA CGAGACCGGC GCGGTGATCC GCTACGAAGG CGCGGTTCGT GACATCACCG ACCAGAAGCG CGCGGAGAGT GCGATCGCCG AAGCACGCCG TCTGCTGCAG CAGGTGATCG ACACCGTGCC CGCGGTGATC AACGTCAAGG ACACCGAGCT GCGCTACGTG CTGATGAATC GCTACATGGC GAGCATCTTC GGCATTCATC CGAAGGACGC GATCGGCCGA ACGACGGCCG AATTGATGTC GCGCTACGGC TCGCACAAGA CCGATGCCAA CGACAAGCGC GTGCTCGCGA CAGGCGAGGG GCTCGGATTC TACGAGGAGG AATATCAGGA CGCTACGGGT GCGATGCGAC AGTGGCTGGT CAACAAGATG CCGCTGAAGG ATTCCGGGCA GCGAATTGAG CGGATCGTGA CGGTCGCCCT CGATATCGGC GAGCGCAAGC GCGGCGAACT CGAGATGCGC AAGGCAAAGG ACGCAGCCGA GGCGGCGCTG CGCAATCTGC GCGAGACCCA GCAATCGCTG ATCGAGGCCG AGAAGCTGGC CGCGCTCGGG CGTCTGGTCG CCGGCGTTGC GCACGAGGTC AACAACCCCG TCGGCATCAG TCTCACGGTC GCCTCGGCGC TCGAGCGCAA GACCGCGGTG TTCGCCTCCG AAGTCGGGCG CGGCGACCTG AAACGCTCCC GGCTCAACGA ATTCCTCGAC ACCAGTCGCG ATGCGTCCTC CCAGCTCGTC GCCAATCTCA ACCGTGCCGC CGAACTGATC CAGTCGTTCA AACAGGTCGC GGCCGATCGC AACTACTCGG ACCAGCGGGC GTTTGATCTC GGCGATCTGA CCGAGCAGGT GGTGATGAGT CTGCGTCCGG GTCTGCGCAA GCATAATCTG ACGCTGGACG TCGAATGCCA GCCCGGGCTG ACGATGAATT CCTATCCCGG GCCTTATGGT CAGGTGCTGA CCAACCTGTT CCTGAATTCG GTGGCGCATG CCTTTCCGAA CGGCCGCGCC GGCACAGTGG AAATCAAGGT CCGCGCGTCC GGTCCGAACG ATGTCGAGAT CGTCTATGCG GACGACGGCT GCGGCATGAG CCTCGATGTG CGGAGGCGGG CGTTCGATCC GTTCTTCACC ACGCGCCGCG ATCAGGGCGG CACCGGACTC GGGCTGCACA TCGTCTACAG CATCGTCACC AATCGCTTAG GCGGGCGGCT CGATCTCGAC TCCGAGCCTG GCGGCGGCAC GCGGATCAAG ATGATCCTGC CGCGCGTCGC GCCGCGGGAC CTGGTGGATC CGCCGGTCGC CAGCGCGCAA GCCTGA
|
Protein sequence | MPNVVAAFIQ SVRSRTLSIG QMTFGSFLLL LTIIAMTSIG SVVAIRHIDS TFAELQRLHS VGDLAEEIER RMSDLRSTAR QVVSEPSGQP AGKVFEAAST LTALLKKTRL EIDSDQQEMI DGVAARLSNY RDGLERITAL MPRRAEMVAS VPPISEAFEA AVDGLQDRTL ATTLHAEHNK IAAALLSRDL GGAGAAAQRM RAIQIGDPAA AIAVRDYADV IISTAAVEQE IAQLDRDVLG AEGRLIARVT ELLRDVSARR GRVLSRDLAR TLAEDKWQSI VLGSAGVLVG LMAAAFVVRR TVGPLAAITA AIRSLAAGQQ YTAIPATDVK NEIGDIARAA EVFRRTLVEA DSAREAAVRA LAEQRLAEES YRKLFEGSID GIYVTTPEGA LLNANPALAR MMGYESTAEL MRETGEVSQK IYVDPRKRDE YRALMERDGM VREFEYQAFK RNGDVLWLSD SASAVRDETG AVIRYEGAVR DITDQKRAES AIAEARRLLQ QVIDTVPAVI NVKDTELRYV LMNRYMASIF GIHPKDAIGR TTAELMSRYG SHKTDANDKR VLATGEGLGF YEEEYQDATG AMRQWLVNKM PLKDSGQRIE RIVTVALDIG ERKRGELEMR KAKDAAEAAL RNLRETQQSL IEAEKLAALG RLVAGVAHEV NNPVGISLTV ASALERKTAV FASEVGRGDL KRSRLNEFLD TSRDASSQLV ANLNRAAELI QSFKQVAADR NYSDQRAFDL GDLTEQVVMS LRPGLRKHNL TLDVECQPGL TMNSYPGPYG QVLTNLFLNS VAHAFPNGRA GTVEIKVRAS GPNDVEIVYA DDGCGMSLDV RRRAFDPFFT TRRDQGGTGL GLHIVYSIVT NRLGGRLDLD SEPGGGTRIK MILPRVAPRD LVDPPVASAQ A
|
| |