Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0852 |
Symbol | |
ID | 5732753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 962161 |
End bp | 965328 |
Gene Length | 3168 bp |
Protein Length | 1055 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641277984 |
Product | GAF sensor hybrid histidine kinase |
Protein accession | YP_001543628 |
Protein GI | 159897381 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACGATG GTGATCGCGC GGTGTCAACC ATTTGGCATG AGCAACTTGA GTTGTTAACA AAAACCGCTC AAGAACATTT TTGCCTACAC AATGCCGAAT TGTTGATTCA ACGCATTTGC GCACTGATCA GCCACTATTT GCAAACGCCG CGGGTCACCA TCGCCCTCAT TCAACACGAT AGTTTTGTGG CAGTTGCCAG CGCCCTTGGC CCTTTAGAAG ACCCAAGCTA TCAGCAAGCA CCACGTTTGG TCTACAGCGC TGGAGCGGCT TGGTCGGCGG TTTGGCAACA GCAACGCAGC ATAATCACTC CTGCGCCAAC CCCTGATGGA ATGTGCCAAC TCAGCGTGCC AATTCTGCGC AATGCTCAAT TACTGGGATT AATCGATATT CAAAGCCCAC AGCCTGAGCA TAAACTAAGC CTTTTACAGC CAATTATCGA GATGATCGCC CAGCAATTGG CGCTAGCCTT GGGCACACTT CCTAGCCATC GTCAACATTA TCGCGAAGAT CAATGCACCC AAATCATCGA AAAAATCAAT CTGCATATTT TGAAACATCT TGATGTATTA GCTGAGCCAC AACAGATTAT TGAATTAATT TGGCGTTTTT TGCCGTTGCA AGGCGCTGCA TTGTATTTAT ACGATGAACA AGGTTTGGGC TTATTAACCC ATATCAATGC AGCCCATGTG CCAACTCAGC GTGGAATTCA GCCGCTTAGC CCCAGCTATT ACGAGCAAAT CGGCCAGCAA TCAAGCCATA ACTCGGCCAA AATGTTGCTA GCAGGGCAGC ATGGCATCGA TATTCATGTA CCGGGCAATC AAGCCAATCT TGGTTTGTTA CACTTAATTC CTGCCACACC GTTTGAAGCT AATGATTTGC TGGCCTTGCA AGAGTTTGGC GCACGCTTAG GCTACATTTT AGAGCATAAT CGGCTGTTTC GCTCGATGTA TGTGGCCAAT GAACGCAATG TGCTGTTTGC CCGCATCGTC AGCCATATTC GCCAAACCAT GAATTTGCGC GAGGTCTGGC AGGATGTATT AGCAACCCTC GGTTTGGGTT TACGGGCCGA TTTCTGCACC GTGGCTTTGT ATGAGCAAGA GCAACGCTTG CGCTTCCATG GAACCTACAC TACCTTGCTG CTACCCGATC ATCTGACGGC TGCCGAGGCT TTGTTGCATA GCGAAATGCT GCAAGCTTTT CAGCAAAAGC GCTCGCTGAC GATCGACGAA TATCACGATA GCAATGACCC TGAATTGCGC GAACGGCTGC TGGCCTTGAA TATTCATGCC TTGGCTTGGG TTCCCTTGGT CGCCAACGGC CAATGGCTGG GCTTTGTTTG CGTCTACAAA GTGCAGCGCC CCTTCCTCTG GACGCTTGAC GATTTGCGCT TAATCAACGA TGTAGCCGAT CAATTGGCCT TAGCTTTGCG CCAAATGCAG CTCTACGAGG CCGAACGCCA ACGCCGCCGC GAATTAGAAG CCTTACAAGA AATTATTCGA GCGATCAGCG GCGAACTCAA TTTACATGCT TTATGTGGCA ATGTGGTCGA AAAAGTAATC GATGTGTTTA AAGTTGCCGC CGCTGCGGTG CTGATGTGGC ACAACGATGG CAGCGCCATG CAAGTCGTGG CCCACGCAGG TTTTTCCGAG CGCTATCTCG ATTCGCTGGA GATGAACACC GATACAGTGA ATTATTGGAT TTCGCGCTTC AAGCCACCAA CACCCTTGTA TATTAGCGAT ATTCGGCGTG TTTCGCTGAT TGGCGGCGAT CCAGCTAGCG CCGAGGGTTT GTCATCACTC TTTGCTCAAC CACTGATGGT TGATGGGCGG TTTAGTGGTT GGTTGCAAAT GTACAGCCGT GGCAATGTGC GGGTTTGGAA TCCTGAGGAA AGTCATCTAG CGGCCTCGAT TGCCCAACAA ATTTCTCAAG CAATTCATAC AGCCCGGCGT TATGAGCAAG AGCATTTGCT ACGCACCGAT GCCGAGCAAT CGTATTACCA GTTGCGCAGC GTGCTTGACG AGCTAGAAAA CACTCGCGAA ACCTTGATCA ATTCGGAAAA GCTGCGGGCT TTGGGTCAAC TTGCCAGTGG CGTTGCCCAC GATTTCAACA ATCTCTTAGC TTCAATTTTA GGTAATGCCC AATTTTTGCT GATTCACGAA GATGATGCCG ATCGTCGCGA TGCCCTGCAA GTAATCGAAC GTGCCGCCAA AGATGGGGCG GTCACGGTGC GCCGGATTCA AGAGTTTGCG CGGGCCTCCG AAACAATCTA CGACGATATT GTTGATTTGC GTGATGTGGT TAATATTGCG CTCGAATTTA CCCGCCCTTC GTGGCGTGAT AAAACCCAGC AACGTGGGAT TAAGCTCGAT ATTAGCACCC ATTTGGCTGC CGCCCATGTT CAGGGTAGCG CCGCCGAATT ACGCGAAGTC TGCGTTAATT TAGTGGTCAA TGCAATTGAT GTCTTGCCCC AAGGCGGCAC AATTACGATC AGCACTGGCA CAACTGGCGA GTGGTCGTAC TTTACGATTG CTGATAACGG CCCAGGCATC GCCCCCGAAG ATCGCACCCG TATTTTCGAG CCATTTTTCT CGACCAAGCC AATTGGCGAG GGCACAGGCA TGGGCTTGGC AGTGGCATTA AGCATTGTGC AACGCCATCG TGGCAAATTA TTGACTGAAG ATGTGTATCC GCATGGCGCT CGCTTCGTGG TGTTACTGCC AATTCATCAT GCACCCCAAC CAAAACCCCG CCCCGTTGCG ATTGTGCCCA AAACTGCGGC CCAGCGTATT TTGGTGGTTG ATGATGAGCC AGCAGTACGC AATATCGTGG CCAAAGTATT GCGCCACGAT CAGCATGAAG TGACCTTGGC TGGCTCTGGT GAAGAAGCTT TGCGCTGGAT CGACGAACAA GCCTTTGATC TAATTATTTC CGATTTGGGA ATGCCTGGGA TGAACGGCTG GGATGTGCTT GAACAAGCGC GACAACGCCG CCCAAATATC ATCGCGATTT TAATTACTGG CTGGGGCTAC CAACATGATG CTGATTACGC TGCGGCGCGA GGTGTTGATA GCGTGCTCGG CAAGCCCTTC GAGATGCAAA CCCTACGCAG CACCGTCGCC GATTTGATTC AAGCTCGCAA CACACAAGGC CCAAACCGTG TATCATAG
|
Protein sequence | MYDGDRAVST IWHEQLELLT KTAQEHFCLH NAELLIQRIC ALISHYLQTP RVTIALIQHD SFVAVASALG PLEDPSYQQA PRLVYSAGAA WSAVWQQQRS IITPAPTPDG MCQLSVPILR NAQLLGLIDI QSPQPEHKLS LLQPIIEMIA QQLALALGTL PSHRQHYRED QCTQIIEKIN LHILKHLDVL AEPQQIIELI WRFLPLQGAA LYLYDEQGLG LLTHINAAHV PTQRGIQPLS PSYYEQIGQQ SSHNSAKMLL AGQHGIDIHV PGNQANLGLL HLIPATPFEA NDLLALQEFG ARLGYILEHN RLFRSMYVAN ERNVLFARIV SHIRQTMNLR EVWQDVLATL GLGLRADFCT VALYEQEQRL RFHGTYTTLL LPDHLTAAEA LLHSEMLQAF QQKRSLTIDE YHDSNDPELR ERLLALNIHA LAWVPLVANG QWLGFVCVYK VQRPFLWTLD DLRLINDVAD QLALALRQMQ LYEAERQRRR ELEALQEIIR AISGELNLHA LCGNVVEKVI DVFKVAAAAV LMWHNDGSAM QVVAHAGFSE RYLDSLEMNT DTVNYWISRF KPPTPLYISD IRRVSLIGGD PASAEGLSSL FAQPLMVDGR FSGWLQMYSR GNVRVWNPEE SHLAASIAQQ ISQAIHTARR YEQEHLLRTD AEQSYYQLRS VLDELENTRE TLINSEKLRA LGQLASGVAH DFNNLLASIL GNAQFLLIHE DDADRRDALQ VIERAAKDGA VTVRRIQEFA RASETIYDDI VDLRDVVNIA LEFTRPSWRD KTQQRGIKLD ISTHLAAAHV QGSAAELREV CVNLVVNAID VLPQGGTITI STGTTGEWSY FTIADNGPGI APEDRTRIFE PFFSTKPIGE GTGMGLAVAL SIVQRHRGKL LTEDVYPHGA RFVVLLPIHH APQPKPRPVA IVPKTAAQRI LVVDDEPAVR NIVAKVLRHD QHEVTLAGSG EEALRWIDEQ AFDLIISDLG MPGMNGWDVL EQARQRRPNI IAILITGWGY QHDADYAAAR GVDSVLGKPF EMQTLRSTVA DLIQARNTQG PNRVS
|
| |