Gene Information |
Locus tag | Rsph17025_2740 |
Symbol | |
ID | 5085243 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | - |
Start bp | 2780405 |
End bp | 2783236 |
Gene Length | 2832 bp |
Protein Length | 943 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640484303 |
Product | PAS/PAC sensor hybrid histidine kinase |
Protein accession | YP_001168932 |
Protein GI | 146278773 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains; [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
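The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2780405
end_bp = 2783236

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the last codon is assumed to be the stop codon,
# which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the result agrees with the listed Gene Length (2832 bp) and Protein Length (943 aa).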
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGTCG CACTTGACAA GATTTTTGCC GAAGATCCAT CACCCTACGT TCTTCTGAAA CGCGATTTGA CAATGATCTG GGCGAATACC GCCTACCTGC GTGCAACCGG GCGCACGCGC GAGGACATTA TCGGCAAGTG CATGACCGAC GTCTTTCCCG CCGACCCGGA TTCCATCCCT GACCAGATGC TCCGCGGCTC CTTTCGCAGG GTGATCGAAC TGGGGCGCCC CGACCATCTG CCGCTCATCC CCTACCCGAT CACAAGCGGC GACGGCCGGA AGGAGGACCG GTTCTGGAGC GCGACGAACA CACCGATCAA GGATGCAGAC GGGCAGGTGG AGTTCATCCT GCAGAACACC AACGACATCA CGGCGCTTTA TCGTGAGGAC GGGGTCGGCG CGCAGCCTTC GGATCAGGCG CGGATGATCC GGCGCGCCGA GGCGGTGACG CGCGAGAACC TCGAACTCGG CAGCGCGATC GATTTCTTCC GCAGCGCGTT CGACCAGGCG CCGAGCTTCA TGGTCGTGCT TGAGGGCCGG GATCATGTCT TCCGCATCGT GAACCAGGCG TACCTGCAAC TCGTGGGGCG TCGCGACCTG GTGCATCGCA CGGTGCGCGA GGCGCTGCCG GAACTCGACG GACAGGGGTT CTTCGAGCTT CTGGACGATG TGTTGCTGAC CGGCCGGGCG TTCACCCGCC GCGCCGCGCC TGTCCTGCTG CGCAACGGCG CCGAGATGGA CCGCAGATAC GTCGATTTCA TCATGTGTCC ACTGCGGGGC CCGGATGGGG ATGTGCGCGG CGTCTTCGTC GAGGGCCATG ACGTCACGGG CCAGAAGCTG GCCGAAGCCG AGGTGATCGA GACGCGCGAG CGGTTCCGGC TGATGGCGCA GACGATGCCC AACCACGTCT GGACAGCGAG GACCGATGGC AGGCTCGACT GGATCAACGA TCGGTTGCGC GACTACTGCG GTCTGCCGGC CGAGGACCTG GTGGGAGCGG CGCCCATGTC CATCCTTCGC CCCGAGGACA GGGAGCGCGC ACAGGAAGCC TGGTCGAACT CGCTGACGCA GGGCGTTCCC TTCGAGCAGG AGGTCCGCAT CCGGCGACAT GACGGCGCCG ACCGATGGCA CCTCTCGCGG GCCTTGCCGA TCCGCGACGA CCAGGGCCAC ATCCTGCGGT GGATCGGCAC GAACACGGAT ATCGAGGATC GCAAGCTTGC CGAGGAAGCG ATCGCCGGGC TGAACGCCAC CCTTGAGGAG CGGGTTGAGC AGCGCAACCG TGAGCTGGAG GAACTGAGCA ACACGCTGCG GCAGAGCCAG AAGATGGAGG CCATCGGAAA TCTCGCAGGT GGAATTGCTC ATGACTTCAA CAATTTGCTT CAGACCATCA CGGGCAGCGT GCAGCTGGCT CTGCGATCGC TGGACGAAGG ACATACGGCA CGCCCCCGCC TCGACCAGGC CCTGAGCGCC GTCGGGCGCG GTGCAACTCT GGCGTCACAA TTGCTGGCCT TCGGCCGCCG GCAGCCCCTG GCGCCCCGCG TGATCAATCT CGGCCGGCTC CTGCGGGACG CCGACCATAT CGTCCGCAGT GCCGTGGGCG AGGGCATCGA GGTCGAGACG ATTGTGGCGG GCGGGCTGTG GAACACCTGC GTGGATCCGG CGAATGTCGA GACGGCTCTT CTCAACCTGG CGATCAATGC GCGCGACGCC ATGGACGGCC GCGGGCACCT CACGATTGAA ATCGGCAACT CCTGGCTGGA CGATGCCTAC ACCAAGGGGA TCCAGGATCT GGCACCGGGC 
CAGTATGTCA TGCTGGCCGT AACCGACACC GGCTGCGGCA TGGCCCCCGA CATCATCGAG CGCGTGTTTG AACCTTTTTT CACCACAAAG GCGGAAGGGC GCGGCACGGG TTTGGGGATG TCCATGGTCT ATGGATTCGT CAAACAGTCG GGCGGACACA TCAAGATCTA CAGCGAGGTC GGCAACGGCA CATCGATCAA GATCTACCTG CCCCGCTCCC TCGAGGCCGA AGATGCTGCG CGGCCTCCGG CGATCGGTCC CATGACCGGT GGCGACGAGA CGATTCTGCT GGTCGAGGAT GACGAGCAGG TGCGCCTCAC CGCAGCGGGT CTTCTGCATG ATCTGGGCTA TGCGGTCCTG CAGGCCCGGG ATGCCGATCG TGCGCTGACG ATCGTCGAAA GCGGCGCCCG CATCGATCTG CTCTTCACCG ATGTCGTCAT GCCCGGGCAG CTGAACAGCC GGCAGCTGGC AGAGAAGGCC CGGATGCTGA GGCCCGGTCT GCCGGTCCTC TTCACCTCGG GTTACACGCA GAATTCCATC GTGCACGGGG GCCGACTTGA CAGCGGCGTG CACCTGCTCA GCAAGCCCTA CACGCAGGAG GCCCTGGCAC GGAAGATCCG CGAAGTGCTT TCGACGGGCG GTTCACCGAG CGCGCCGCCG GCACCCGCGC GCCATGTGCT TGTGTGCGAG GACGATGTCA TCATCCGCAT GAACCTCGCC GAGACGCTGG CGGAGGCCGG CTACCGGGTT ACTCAGGCAG GGGCAGGTGG TGCCGCGCTC CTGCAGATCA GGCAGGAAGC CCCGGACATG CTGCTGGTGG ATCTTGGCCT GCCTGACATG AGCGGCCTCG AACTGGCCCG GACGGCCCGG ACCATCCGCC CGGATCTGCC GATCCTGTTC GCGACGGGCG ACAGCCAGAT CCCCGATCTC GAGTTCAGGC CGCACGCCGA CGTGCTGATC AAGCCGTTCG GAGACGAGGC GATGCTCGAA TGCGTGGCGC GACTCCTGGC CGGTGAGGTC ATTTCACGAT GA
|
Protein sequence | MLVALDKIFA EDPSPYVLLK RDLTMIWANT AYLRATGRTR EDIIGKCMTD VFPADPDSIP DQMLRGSFRR VIELGRPDHL PLIPYPITSG DGRKEDRFWS ATNTPIKDAD GQVEFILQNT NDITALYRED GVGAQPSDQA RMIRRAEAVT RENLELGSAI DFFRSAFDQA PSFMVVLEGR DHVFRIVNQA YLQLVGRRDL VHRTVREALP ELDGQGFFEL LDDVLLTGRA FTRRAAPVLL RNGAEMDRRY VDFIMCPLRG PDGDVRGVFV EGHDVTGQKL AEAEVIETRE RFRLMAQTMP NHVWTARTDG RLDWINDRLR DYCGLPAEDL VGAAPMSILR PEDRERAQEA WSNSLTQGVP FEQEVRIRRH DGADRWHLSR ALPIRDDQGH ILRWIGTNTD IEDRKLAEEA IAGLNATLEE RVEQRNRELE ELSNTLRQSQ KMEAIGNLAG GIAHDFNNLL QTITGSVQLA LRSLDEGHTA RPRLDQALSA VGRGATLASQ LLAFGRRQPL APRVINLGRL LRDADHIVRS AVGEGIEVET IVAGGLWNTC VDPANVETAL LNLAINARDA MDGRGHLTIE IGNSWLDDAY TKGIQDLAPG QYVMLAVTDT GCGMAPDIIE RVFEPFFTTK AEGRGTGLGM SMVYGFVKQS GGHIKIYSEV GNGTSIKIYL PRSLEAEDAA RPPAIGPMTG GDETILLVED DEQVRLTAAG LLHDLGYAVL QARDADRALT IVESGARIDL LFTDVVMPGQ LNSRQLAEKA RMLRPGLPVL FTSGYTQNSI VHGGRLDSGV HLLSKPYTQE ALARKIREVL STGGSPSAPP APARHVLVCE DDVIIRMNLA ETLAEAGYRV TQAGAGGAAL LQIRQEAPDM LLVDLGLPDM SGLELARTAR TIRPDLPILF ATGDSQIPDL EFRPHADVLI KPFGDEAMLE CVARLLAGEV ISR
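The relationship between the two sequences can be illustrated by translating the start of the gene sequence. The following is a minimal sketch, not the database's own pipeline: it builds the codon table by hand (translation table 11 uses the same amino-acid assignments as the standard code and differs in its permitted start codons), and the helper name `translate` is hypothetical. The gene sequence is taken as listed, that is, already on the coding strand even though the gene lies on the minus strand of the replicon.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Amino-acid assignments are those of the standard code, which
# translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 12 bases of the gene sequence listed above.
head = translate("ATGCTCGTCG CACTTGACAA GATTTTTGCC")
tail = translate("ATTTCACGATGA")
print(head, tail)
```

The first call reproduces the opening residues of the protein sequence (MLVALDKIFA), and the second shows the final three residues followed by the TGA stop codon.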
|