Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0361 |
Symbol | |
ID | 4711351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 419270 |
End bp | 421885 |
Gene Length | 2616 bp |
Protein Length | 871 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639854824 |
Product | PAS/PAC sensor hybrid histidine kinase |
Protein accession | YP_001001957 |
Protein GI | 121997170 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.363033 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCGAT CCGAGCAGTC CGAGGCGCAG CGGCTGTACA CCCGCATGTT CACCGAGCAT GTCGCGGTTC AGCTGCTCAT CGATCCCGAG ACCGGCGCCA TCCTCGACGC CAACCCCAGT GCCGTGCGCT TCTACGGGTA CCCTCGGGAG GAGCTCTGCG GCATGCGGAT CCAGCAGATC AATACCTTGG ACGAGCCGCA AGTGCGCGCC GAAATGGCCC GCGCCCGGGC CGAGCAGCGG CGCTACTTCC GCTTCCAGCA CCGCTTGTGC GGCGGCGAAG TCCGGGACGT CGAGGTCTAC AGCGGGCCGC TGGAGCTCGC CGGACGGCAG TACCTTCACT CCATCATCCA CGACATCACC GATACACGCC GCTATCAGCG CCGCCTGGAG GTCTTCCACG ATCTATTCCG CTCCCTGCCG GTGGGCATCT ACCGCAACAC CCCCGGGCCT TCGGGGCGTT TCCTTGAGGT CAACCCGGCG ATGGTCGAGC TCTTCGAGGC CGAGAGCGAG GCGCAGCTGC TGGCGACGCC GGTCGCGGCG CTCTACCGGG ACCCGGAACG CCGGGCCGAG ATCAGTGCCC TGATCGAGCG CGAAGGCTCC GTGAGCTGGG CGGAGCTCCA GGCCCGGACG CTGCGCGGTC GGCCGCTCTG GCTGCGGCTT TCAGTGCGGC GGGTGGAGGA TGAGCACGGG CGGACGGTAT TCGACGGCAT GGTCGAGGAT ATCTCGGCCC ACGTCCGCCT GCAGGGTGAG CGGGATCGCC TGCTCGAGGC CATCAACGAA GGGGTGTGCG GGCTCGATGA CGCCGGGCGT TTCACCTTCC TCAACCCCGC CGCCCGGCAG TTGCTCGGGT TTGCCTCCGA GGAGGCCGCC CTCGGCCGGG AGGCCCACGC GCTGACGCAC CATAGCCGTC CGGACGGCAC CCCGTATCCG CTGGAGGAAT GTCCGATTTT CCGGGTGCTG CGCAGCGGCG AGCCCTTGGA GGCGTGGCAG GACTACTTCT GGCGGACCGA CGGCCGGGGC TTCGATGTGC TGGTTTACGC CGCCCCGCTG CGCGATGTGG AGGGCGGGAT CACCGGCATT GTGCTCTCCT TCCAGGACAT CAGCCGGCGC AGGCGCATCG AGCGCGAGCG CGATCAGATG CTGGAGATCC TCGACCACCA CCCGCACCTG ATCCAGCGCT TCCTGCCGGA TACCACGCTG CTCTACGCCA ACCGCATGGT GGCCGAGCTG TTCGGGATCC CGCCGCCGCA GATGGACGGC CGGCGCTGGA TCGAGTGGCT CGGGCCGGAG GCGCGCGAGC AGCTGGAGGC CTTCCTGTGC CGCTTCACCC GGGCGGCGCC GGTGGGCACC GTGCAGCTCG CGGTGCTCGC CGCCACAGGG GAGCAGCGCC GGGTGCAGTG GACCTGCCAG GCCTTCTTCG ATGAGGACGG ACTGATCAGC CACTTCCAGG CCGTGGGCAT CGACATCACC GAGCAGGTCG CGGCCGAGCG GGCGCGCGCC CAGGCCGACC GCGATCGACG AACCTTCCTC GCGGCGGTCA GCCACGATCT GCGCACGCCC CTCAATGCCA TCTACGGCTT CACTGATTTG CTCCAGGCCA CGGAGCTGAC CCCGCAGCAG CGCGAATACC TCGGGTTGTG CCGTAGCGCC AGCGAGAAGC TGCTGGCACT GATCGACACC CTGCTCGATC TCTCGCGCAT GGAGTCGGGG CGGCTGACCC TGCACGACGA GCCCTTCGAG CTGGCGGAGG TCGTGGAGCG CCAGGTGGCG GTGCTGCGGG CCGTCGCCGA GGAGGCCGGA CTGCGCCTCA CCGTCCGGGA GGCGCCGGGC ACGCCGCAAT GGGTGCGGGG CGACGCCACC CGCTTCGGCC AGGTCGTGCA TAACCTGATC AGCAACGCCA TCCAGTTCAC CGAGCACGGC GAGGTGGTGC TGGAGGTGGC GCCCGAGGGC AGCGGTTGGC TCTACGTGGC GGTCCACGAC ACCGGGGTGG GGATCGCCCC GGAGGATCGG GAGCACATCT TCAAGGCGTT TGCCCAGGGG AACCCGTCGT TCCGGCGCCA GCCCGGCAAC GGCCTGGGGC TGCGGATCTG CCAGGAGTTG GTGCGGCTGA TGGGCGGGCG GTTGGAGATG ACCAGCGAGC TGGGCAAGGG GTCCACCTTC TATTTCACGG CGCGTCTGCC CCGCGTGGAG CCGCCGCAGG CGGCGTTCGA GGCGACGCCC GAGCCCCACC GCACGACCCA TCTGCGCGTC CTGGTGGCCG AGGATGACCC CACCAACGCC CTGCTCATCC AGGCGCAGCT GGAGCTCGCC GGGGTGACCC CGACCCTGGT GGAGGACGGC CGCCAGGCGG TGGACGCCTG GCAGGAGCAG GACTGGGACC TGGTGCTCAT GGACGTACAA ATGCCCGAGG TTGACGGTCC GGATGCGGTG CGCACCATTC GCGCCCGCGA GGCCGAGCGC GGCCGAGCCC GCACCCTGAT CATCGCCCTG AGCGCCCACG CCGTCGATCA GGTCCGCGAG GAGTGCCTGG AAGCCGGCTG CGACGAGTAC CTGACCAAGC CGGTGGACCG GCAACGGCTG GCGGCCCTGC TGGCCGGGAT CGCCGGCCGC GACTGA
|
Protein sequence | MNRSEQSEAQ RLYTRMFTEH VAVQLLIDPE TGAILDANPS AVRFYGYPRE ELCGMRIQQI NTLDEPQVRA EMARARAEQR RYFRFQHRLC GGEVRDVEVY SGPLELAGRQ YLHSIIHDIT DTRRYQRRLE VFHDLFRSLP VGIYRNTPGP SGRFLEVNPA MVELFEAESE AQLLATPVAA LYRDPERRAE ISALIEREGS VSWAELQART LRGRPLWLRL SVRRVEDEHG RTVFDGMVED ISAHVRLQGE RDRLLEAINE GVCGLDDAGR FTFLNPAARQ LLGFASEEAA LGREAHALTH HSRPDGTPYP LEECPIFRVL RSGEPLEAWQ DYFWRTDGRG FDVLVYAAPL RDVEGGITGI VLSFQDISRR RRIERERDQM LEILDHHPHL IQRFLPDTTL LYANRMVAEL FGIPPPQMDG RRWIEWLGPE AREQLEAFLC RFTRAAPVGT VQLAVLAATG EQRRVQWTCQ AFFDEDGLIS HFQAVGIDIT EQVAAERARA QADRDRRTFL AAVSHDLRTP LNAIYGFTDL LQATELTPQQ REYLGLCRSA SEKLLALIDT LLDLSRMESG RLTLHDEPFE LAEVVERQVA VLRAVAEEAG LRLTVREAPG TPQWVRGDAT RFGQVVHNLI SNAIQFTEHG EVVLEVAPEG SGWLYVAVHD TGVGIAPEDR EHIFKAFAQG NPSFRRQPGN GLGLRICQEL VRLMGGRLEM TSELGKGSTF YFTARLPRVE PPQAAFEATP EPHRTTHLRV LVAEDDPTNA LLIQAQLELA GVTPTLVEDG RQAVDAWQEQ DWDLVLMDVQ MPEVDGPDAV RTIRAREAER GRARTLIIAL SAHAVDQVRE ECLEAGCDEY LTKPVDRQRL AALLAGIAGR D
|
| |