Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1473 |
Symbol | |
ID | 5733358 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1718640 |
End bp | 1722782 |
Gene Length | 4143 bp |
Protein Length | 1380 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641278611 |
Product | multi-sensor hybrid histidine kinase |
Protein accession | YP_001544245 |
Protein GI | 159897998 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTGTT ATGCCAATGA ATCGCTGCTG GAACTCCAAT TAAAGCAACA AGTCAGCCTG CTTGAAACCG AACTTCAGCA TGTTCGCAGC CATACTGCCA AACTCCAAGC CTTGATTGAC CATACTGAGA GCCTGATTTG GTCGATTGAT AGTAGTCAGC ATTTAATTAT GAGCAACCGC ACCTTTCATG ATTATTTGCA TCAATGGTAC GACCAACATA TTGAGCTGGG CCAGCCAATG AGCATGTTGC ATCTGCCTAA ACCAATTGCT AAGCCGTGGA ATGAAGCCTA TCGCCGCGCG TTGCAAGGTG AGGCGTTTGC TCAAGAATTC GTGCATCACT ATGCTGGGGC CACTCGTTAC TATGAATTTC ACTTTGCGCC AATTCTCGAT GAACAGCGAG TGGTGCAAGG GGTTAATATT TCGGGCCGCG ATATTAGCAG CCGCCGTTTG GCCGAAGTGA TGTTGCGCAG CAGCGAGGCT CGCTATCGGG CAATTAACGA GGCCGCTCCG TATGGGGTCT TTTTCTGCGA TACCCAAGGC CAATGTGTCT ATGCCAACAC TGCGCTCTTG CATATGCTTG AAGTTAAGCT CGAAGATGTA CTGGGTGATG CTTGGATGGA ATTTTTACAC CCTAAAGATG CCTCAGATAG CGAGAGTTTT TGGCACAATT TGGCGCTTTC AGTCAATCAA CAGCCGATTC CTAGCGTTTT GTTTGATCAA ACGGTGCGTT TGCATACCAA GCTGAATCAA TTAATTTGGC TGCGGTTGCG CATTTCGCCA GTAATTGAAG CTGGCGTGCT CTGTGGCTTT GTTGGCATCA CTGATGATCT GAGTGCCTTG AAAAAAGCTG AAACGGCGGC CCAAGCCAAG CAGCATTTTA TTCAAAAAAT TGCCGATACC TTGCCATCGC AACTGTTTAT TTACGATCGT GCTTCGGCCT CGTTGATCTA TACCAACGAA GTGAGCCGCC AATGGTTCAA AAGCTTCAAC ATTGATCCTA GTGATATGAA CACCATTCGG CCCCTATTCC ACCCTGACGA TCATGCTATC ATCCGCAAAG TCCTACAAGG ACGTACAGTT ACTAGCAGCG ATCCCCAATT AATTGTTGAT TTTGAATGTC GCTTACGCTC ACCAGAAGGC ATCTATCGCT ATTTTGCTCT ACGTATGACC CCTTTTGTAA TCGATGCTGA TGGTACAGTT TCACAATTAT TGGGCGTGGC AAGCGATATT ACTGAACGTA AATTACAAGA GCAGCAAATT CGGCAATTGA ACGAGCAGCT TGAACAACGT GTTATCGAAC GAACCCAAGA GTTAGCTCAA TCGTATCGGT TTCAGCGCAC TATGATTCAA CATGCACCTT CAATTATTAT TTCGCTTGAT GCTGGAGGCG TGGTGCGCGG CTTTAATCGC GCGGCTGAAT TTGAGTTTGG CTATCAAGAA ACTGAATTAT TAGGCCGTCC CTTCCCGACT AAATTATTTG ATCGCTCCGA TTTGAGGTAT CGCTGGGAGA CCGAGCAACA TCGTAATCGT GGCTTGTTCT ATTCCGATTT GGATATTTTG CTGGCCGAGG CGCGGCGTGG TGTGGCTGAG CCTCACGAAT GGCAAGCTAT CCACCGTTCT GGCTCACGCT TTCCGCTTGA ATTAACCATC ACGCCACTGT TGCATGCCGA TGTGCTCGAA GGCTTTTTGC TGATTGGCAA TAATATCGCT GCCCGCAAAC GCACCGAAGA AGAATTTCAT TTGCTCTATC GTACAACCCG CTCGGTCAGT GAGGCCGCTG ATTTTAATAG TGCGCTTGAA GTGGTGTTGC GCAACATCTG TGCAGCAATT GGCTGGGATT TGAGTGTAGC CTGGGTTCCA GATGCCAACC AAGATTTCTT GGCGCTTGCC CCGATTCGCT GGAGCAGCAA CGAACGTTTT CAACATTTTT ATACGTATCT TCAAAGCTTG GAATTACCGC AAGGTGCTGG TTTGGCTGGG CGGGTCTGGC AAACCAAACA CTCAGCCAAT TATGCGATTG AGCACGATCA TTGGACTGAA GGCAATCTTA ATCAACGTGG CTACGAATTA GCCCAGCAAG TTGGGCTTCA ATCGGCGGTT GCTGTGCCAA TTTTGGCTAA TGATCAAGTT GTGGCAATTT TGGAATTTTT TCGCAGTAGT CGTGGGGTTG ATCACGAACG CACCACTAAT TTGATTTCAG TGATTGCCAC CCAATTGGGC ACGTTATTTC AACGCAAGCA TGCCGAAATG CAACTGCGCC AGAGCGAAGC CAAAAATCGA GCGTTGCTAG CAGCCATGCC CGATTTGATG ATTCGATTTA CGCTCAAGGG CGAGATTTTA GATTATCACA CCAACGATCC CTCGGATCTC TTTTTGCCCC AAGATGCCAT GATTGGGGCG AATGCCCATA ATCATCAGCC GCAACCGCAA ATTCTGAATA TTATGAGCGC TACTCAACGG GCGATTGAGA CCAATAGCAC CCAAAATGTT GAGTATGAAC TGACCTTGCC CAAGGGGAAT ACCGTATTTG AGGCGCGGAT TGCGCCAAGT GCTAATGATG AGGTCGTGAT GGTGATTCGC AATATTACCG AGCGCAAGCG GATTGAACAA ACCTTGCAGC AACAAACCGA TGAATTGAGC ATTGCCAATG CTGAACTAGC CAAAGCAGCA CGGCTTAAAG ATGAATTTTT GGCTAGTATG AGCCATGAAC TGCGTACACC CTTAACTGGA ATCTTAGCCT TTACTGAAGC CTTGCGCTAC GACCAATATG GAGCATTAAA CCAAGCTCAA GCACAAGCAT TGCAACAAAT TGATGAAAAT AGCCGCCATT TACTCGATTT GATCAACGAT ATTCTTGATC TTTCCAAAAT TGAAGCTGGC AAACTTACAA TTAATCATCA ATCAATGCTG ATTGATGAAA TTTGCCAAGC CAGCATTCGC ATGGTGCAAA AATTAGCTCA GAATAAACAA CTTGAATTAT TGTATGAACC ATGTGCCGCC GATGCGATGC TTTGCGCCGA TTCACGTCGT TTGAAACAAA TGTTGGTCAA TTTATTAAGC AATGCGGTGA AGTTTACACC TGCTGGTGGC CGAATTGGTT TATCAGTCAA GTTGGATAGC CAATTCCATC AGGTTGAATT AACCGTTTGG GATACTGGGA TTGGCATCAA TACTCAGGAT ATCCCAAAAT TATTTCGGCC ATTTTCACAG CTTGATAGCA AACTTTCACG CCAATATGCA GGAACTGGCT TAGGTTTGGC CTTAGTGTAC CATATGGCCA ATTTGCATGG TGGACGAGTT GAATTACAAA GTGAAGTTGG GGTTGGTAGT CAGTTTCGCC TAGTGCTGCC ATGGTATGGC AATGCTGAAA TTGTTGAGCC AACCGAGCAA CCGATGCTAC TGGCGGTGAC CGAAGATCGC ACCCTGAATA TGCAATTGGC GGCCTATGCC GAACAATTGG GTTTAGCGCT ACGTTTTTGC AAACCATATT TCGATGTACA AGCTTACTTA CAACAACCCA ATCAAACGCT CATTTATGAT TTACGCCAAA CAAGCTTATC ACAACAGCTT TTTGAGCAAC TTCGCCATCA AACTGCCGCT CATCCAGTCA TTTTATTCTG TGATCAAGAT TTCAAGCTTG ATCTGACGAT TCCTGCGCAT TGGCATTGCA TGTATCAGCC GTTAAATCAG GCTCGTTTGT TGAATGCATT GCAATATATC GATTCGCGCT ATCAGCTGCC GCAAAAATTG CCGATTAATC AAGCCAAGCA GATTCTGTTT GCTGCCGATA ATTTGGCTAA TAGCTTATTG ATTCGCGATT TCTTGAGTGA ATTTGGCTGG AATGTCAACT TAGTCTACAA CCAACACGAT ATCTATGAAG CAATTGCCAA TCAGCCAATT GATTTATTGA TGCTCGATTT ACAATTAGCT GGCGGTGATG CACTCCAGAT GGTTAATACA ATTCGACGAC ATAAGCGTTA TTATGATCTG CCGATTATAG CTTTAAGCGC TTTAGCAATT CTTGATCAAC CACAGCTTGC CGAGCAAGCC GATACAGTTT TATATAAACC ACTTAATTTA GTTGAACTTG AACAATTAAT TAATACCTTG TGCAACCAAC AAAGGAGTCT TGATCGTGAA TAA
|
Protein sequence | MTCYANESLL ELQLKQQVSL LETELQHVRS HTAKLQALID HTESLIWSID SSQHLIMSNR TFHDYLHQWY DQHIELGQPM SMLHLPKPIA KPWNEAYRRA LQGEAFAQEF VHHYAGATRY YEFHFAPILD EQRVVQGVNI SGRDISSRRL AEVMLRSSEA RYRAINEAAP YGVFFCDTQG QCVYANTALL HMLEVKLEDV LGDAWMEFLH PKDASDSESF WHNLALSVNQ QPIPSVLFDQ TVRLHTKLNQ LIWLRLRISP VIEAGVLCGF VGITDDLSAL KKAETAAQAK QHFIQKIADT LPSQLFIYDR ASASLIYTNE VSRQWFKSFN IDPSDMNTIR PLFHPDDHAI IRKVLQGRTV TSSDPQLIVD FECRLRSPEG IYRYFALRMT PFVIDADGTV SQLLGVASDI TERKLQEQQI RQLNEQLEQR VIERTQELAQ SYRFQRTMIQ HAPSIIISLD AGGVVRGFNR AAEFEFGYQE TELLGRPFPT KLFDRSDLRY RWETEQHRNR GLFYSDLDIL LAEARRGVAE PHEWQAIHRS GSRFPLELTI TPLLHADVLE GFLLIGNNIA ARKRTEEEFH LLYRTTRSVS EAADFNSALE VVLRNICAAI GWDLSVAWVP DANQDFLALA PIRWSSNERF QHFYTYLQSL ELPQGAGLAG RVWQTKHSAN YAIEHDHWTE GNLNQRGYEL AQQVGLQSAV AVPILANDQV VAILEFFRSS RGVDHERTTN LISVIATQLG TLFQRKHAEM QLRQSEAKNR ALLAAMPDLM IRFTLKGEIL DYHTNDPSDL FLPQDAMIGA NAHNHQPQPQ ILNIMSATQR AIETNSTQNV EYELTLPKGN TVFEARIAPS ANDEVVMVIR NITERKRIEQ TLQQQTDELS IANAELAKAA RLKDEFLASM SHELRTPLTG ILAFTEALRY DQYGALNQAQ AQALQQIDEN SRHLLDLIND ILDLSKIEAG KLTINHQSML IDEICQASIR MVQKLAQNKQ LELLYEPCAA DAMLCADSRR LKQMLVNLLS NAVKFTPAGG RIGLSVKLDS QFHQVELTVW DTGIGINTQD IPKLFRPFSQ LDSKLSRQYA GTGLGLALVY HMANLHGGRV ELQSEVGVGS QFRLVLPWYG NAEIVEPTEQ PMLLAVTEDR TLNMQLAAYA EQLGLALRFC KPYFDVQAYL QQPNQTLIYD LRQTSLSQQL FEQLRHQTAA HPVILFCDQD FKLDLTIPAH WHCMYQPLNQ ARLLNALQYI DSRYQLPQKL PINQAKQILF AADNLANSLL IRDFLSEFGW NVNLVYNQHD IYEAIANQPI DLLMLDLQLA GGDALQMVNT IRRHKRYYDL PIIALSALAI LDQPQLAEQA DTVLYKPLNL VELEQLINTL CNQQRSLDRE
|
| |