Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4547 |
Symbol | |
ID | 5736943 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5817743 |
End bp | 5820664 |
Gene Length | 2922 bp |
Protein Length | 973 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641281709 |
Product | multi-sensor hybrid histidine kinase |
Protein accession | YP_001547306 |
Protein GI | 159901059 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTAT TATGGCTTTT TTCCACACTG GCCGAGTTTA TGAATAGTTT TAGCGCTACG TCCTTCCAAC CAGCCATGGA GCAACCAACG CTCGCTGAAT TAACCCAAAC TATCGCTCGT TTAGAGGCTG AGCTGGCTGC TACCAAGCAG GCCTTGCTCG CCAGCGAACA GCGCAATCTG CAATTTTTTC AACAAACTCA AGCGATTGAG TTAATTGTTG AGCCTGATAC TTGGCGAGTT TTGGCGGCCA ATGCTGCTGC TTGCCGTTTT TATGGCTATA GCCAAGCCGA GTTTAGTAAT CTCAACGTTC GCGATTTAAA TGTATTGACC AGTAAAGAAC ATCGGGTGGC CATTCGCAAC GGTGAGCTTG GTGGTAGCAA TCGCTATGAG TTTCGCCATC GCTTACGTTC AGGAATGGTC TGCGATGTTG AAGTTTATAT CACTCCCTTT GAATATCAAC AGCAACCAGC CTTATTTTGT TTAATTCACG ACGTAACTGC CCATAAACAG ACGATTCGCC AAATGCAACG CCAAAATGCC TATCTTGGCT CGCTGCACGA TGTGACCTTG GCCTTGATCG ATCAGGCCGC CCTGCCCGAT TTGCTCGAAA TTATTCTGTG GAAGGCAACC GATTTGGTCG GCAGCGAATA TGGTGTGATG TATATTCGCA GCGCCGATGG TCAAGCCATG GAATCTGTCG TGAGTCGCGG GGCGATCAAC GTAATCCAGC ACATCGAATT AGGCCAAGGT GCAGTTGGCA CGGTCGGCCT TAGCGGTCAG CCATTAATTA TTGATGATTA CTCCGCCTGG AGTGAGCGTT TATTTACGCC GCAAGATGGT TTTGGTAATG GCCTGTTGGT ATTACCATTA TTCCGTGAAC AAACAGTCGC TGGCGCACTA GCGATTATCT ACGAGCCAAG TCAGATTCAA ATTAGCATTA CGATGCTTGA AATGGTGCAA CAATTTGCCC GCTTTGCCTC GCTTGCACTG GAGAAGCATA TGCTCTACAC CGCAGCCCAA ACTGAACTGG CCGAACGTCG GCGGGTCGAA CAGCAGTTGC GTTCGCTGAA TGAGCGGTTT GAATTGGCCC AAACGGTGCT GAATGGCGGC ATCTACGATT GGGATATGAT TCAGCAGACC ACAGTGGTGA ATCGGAGCTT TACAACGGTG TTTGGCTATA CCAGTGAAGA GGTTGCCAGC AGCCCAACCT GGTGGGAAGA ACATCTGCAC CCCGATGATC GTGAGTTGTT CATCCAACGA ACTGCCGATA TTTTCGCTGG CCATAGCGAT GTTTTTTCGG CTGAATATCG CTTTCTCGAT CAACATCAAC ATTATCGCTT TGTGCAGGAT CGCGGCCATG TGCTGCGTGA TCCCGATGGC CAAGCTTTGC GTATGGTTGG CACACTGATC GATATCTCCG AGGAATATCG CCATATTGTT GAGATGCTGC CCTTGCCGCT GTTTGTAGTG CAGGATGATC GAATTGTCTA TGCCAATCAG GCTGGACTGG CGTTGACCCA AACCACCAGC AGCGAATTGC TTAGCCAGTC GGTGCTTAGT GGCTTACGGC CACAAGATCC TGATGATTTT TATGTGCTGA TTCGCCATGC CAGCACCATG CCCACCAGTT TTATTGAAAC AACCTTTTTG CGCCGTGATG GCACGCCAAT TTATGGCGAA TGTTCAACCT CAGCAATTTT ATTTGAGCAT CGAAATTCAG TCCAAGTGAT TATTCGCGAT ATTAGCGAAC GCAAGCAGGC CGAGCAAAAA CGCCTAGAAA TTGAACAACA CTTTTCAGAA ACCCAAAAAT TAGAAAGCCT TGGCGTGTTT GCTGGCGGGA TTGCCCACGA TTTCAATAAT GTGTTCATGT CAATTCTTGG TAATACCCAT TTAGCTATGC TCGAACTGCC TGAGCAAGCC AACCTGCGTC CGCTTTTAGC CGAAATCGAA CAATCGGCCA AACGGGCAGC GCAAATCACC AAGCAAATGG TTGCCTACAC CGGCCAAAAT CTTGATGTGC GGCGCAGCCT GATGATCAAC GAATTAGTTC AGACCTCAAT TCGTTTGCTG CCCTCAAGCC TAACCCAGCA ACGTCATATT CAGAGCAACT TAGCTGATAA CTTGCCGCTG ATCGAGGGTG ATCAACAACA TCTGCGCCAA GCGATCAGCA ATGTACTGAT CAATGCGCTT GAAGCGTCGA GCAGCGGAAC GCTTACAGTT ACCACCAGCT TGCGCGATTT GCAAGGAGCA CCCGAGGGCC ACACCTATAC GACAGGCACA TTTTTGCCCG CCCAATATAT CGCGATTGAA ATTAGCGATC AAGGCTTAGG CATGGATCAT GCGACGATCT CGCGCATGTT CGACCCATTT TTTACCACCA AATTTACTGG ACGCGGGTTG GGTTTAGCGG TTACTTTAGG GATTGTGCGC GGCCATCGCG GTGGCATTGC GGTCAAGAGT CGCCCTCAAC AGGGCACGAC AATAGGAATT TATCTGCCAA TTATGACAGC AGTACAAACT GAACTAGCCA GCGAACAACC AGCCTTAGTC GATTCTGGCC CAGGCAAAAT TTTGGTGATC GACGACGAGC CTGATGTACG AATCGTGATT GATCGGATTT TGCGTCGTTT GGGCTACGAA ACCTTACTAG CCGAGGGTGG TAATCGGGGC ATTAGCCTCT TCCGCGACCA TCATCAAGAA ATTGCGGGCG TTTTTCTCGA CCTAACCATG CCCGATCTCG ATGGTCAAAG CACGCTCGAA GTGCTCAAAC AGATTCAGCC TGAGATTCGC GTGGTGATTA TGAGCGGCTA TAATGAACAA CAAGTGAGCC AACAATTTGG TACTACTGGT ACAACCCAAT TTCTGGCAAA ACCATTTATG ATTGAGGAGA TCCGCGATAA GCTCACGGCA TTACGGCTCT AG
|
Protein sequence | MKLLWLFSTL AEFMNSFSAT SFQPAMEQPT LAELTQTIAR LEAELAATKQ ALLASEQRNL QFFQQTQAIE LIVEPDTWRV LAANAAACRF YGYSQAEFSN LNVRDLNVLT SKEHRVAIRN GELGGSNRYE FRHRLRSGMV CDVEVYITPF EYQQQPALFC LIHDVTAHKQ TIRQMQRQNA YLGSLHDVTL ALIDQAALPD LLEIILWKAT DLVGSEYGVM YIRSADGQAM ESVVSRGAIN VIQHIELGQG AVGTVGLSGQ PLIIDDYSAW SERLFTPQDG FGNGLLVLPL FREQTVAGAL AIIYEPSQIQ ISITMLEMVQ QFARFASLAL EKHMLYTAAQ TELAERRRVE QQLRSLNERF ELAQTVLNGG IYDWDMIQQT TVVNRSFTTV FGYTSEEVAS SPTWWEEHLH PDDRELFIQR TADIFAGHSD VFSAEYRFLD QHQHYRFVQD RGHVLRDPDG QALRMVGTLI DISEEYRHIV EMLPLPLFVV QDDRIVYANQ AGLALTQTTS SELLSQSVLS GLRPQDPDDF YVLIRHASTM PTSFIETTFL RRDGTPIYGE CSTSAILFEH RNSVQVIIRD ISERKQAEQK RLEIEQHFSE TQKLESLGVF AGGIAHDFNN VFMSILGNTH LAMLELPEQA NLRPLLAEIE QSAKRAAQIT KQMVAYTGQN LDVRRSLMIN ELVQTSIRLL PSSLTQQRHI QSNLADNLPL IEGDQQHLRQ AISNVLINAL EASSSGTLTV TTSLRDLQGA PEGHTYTTGT FLPAQYIAIE ISDQGLGMDH ATISRMFDPF FTTKFTGRGL GLAVTLGIVR GHRGGIAVKS RPQQGTTIGI YLPIMTAVQT ELASEQPALV DSGPGKILVI DDEPDVRIVI DRILRRLGYE TLLAEGGNRG ISLFRDHHQE IAGVFLDLTM PDLDGQSTLE VLKQIQPEIR VVIMSGYNEQ QVSQQFGTTG TTQFLAKPFM IEEIRDKLTA LRL
|
| |