Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3933 |
Symbol | |
ID | 5735794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4926913 |
End bp | 4928181 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281084 |
Product | integral membrane sensor signal transduction histidine kinase |
Protein accession | YP_001546695 |
Protein GI | 159900448 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5002] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTAATC GATTGCGTTG GCGCTTAACC TTAATTTATG CTTGCACAGC CTTGCTGCTG TTGGCGGCAG TTGGCGGTAG CGTTTATTGG ATGACCGTGC GCTATTTTCG CTTTGCTACC GAACGGGCCT TGCTTGAACG CATGACCTTG GAATTTGAGC AATTGGCAGC GCCCTTGCCG CCCGCCCTGC AAGCCTATCG CCCCCAGCAA TTGCCCGATG AACGGCCCGT TGATAATGGA ACACTGGCTA GCACGTTCGT TTTTACCATC GACCAAAACG GCCAAGTCTT CAACCCAAAC CCGTGGAACC CGCCAATTCA GCCTGATCAA GCGGCGATTG AGGCCGCCAA GGCTGGCAAA CTTGATCTGC GCACCATCAA CTTGGCTGAT GGAACTCAAG TCGCTTTGTT GACTGAAAAA TTGACTCGCA GCGATGGCCC AGCTTTTTTG CAAGTTGGGC GGGTGCTCAA CGAGCAAGAA GCCGCCTTGA GCACCTTGTT AAAAGGCTTG GTGGCTTTAT GGGCTGGAAG TGTAGTGGTT TTGGGCTGGT TTGGTTGGTG GTTGGCCGGG CGATCGTTGC GGCCAGCCCA ACAAGCTTGG GAACGCCAAC AAGCGTTTAT CGCCAATGCC AGCCATGAAC TGCGTGCTCC TCTGACCTTG ATGCGAGCCA GCAGCGAAAT TGCCCTGCGC GAATCGACCG ACCCTGCCGA GCAACAAGAA TTGCTTGAAG ATATTTTGGC CGAAACCTAT CACATGGCAC GTTTGGTTGA AGATTTGCTG TTGCTCTCGC GGCTTGATGC TGCCAAAACC CATCTTCAGC GCGAAACGAT CGATCTGGCT GAGTTATGCC AGGATGTGGC TAAGGATGCT GGACGCTTGG CACACGATGC TGGGGTGGAG GTTCGTGTGG CGCATGCCGA GGGTCAGATC AAAGTTGATC GCACCCGTTT GCGCCAAGTG TTATTAATTT TGCTGGATAA CGCAATTGCC CATACGCCGC GTGGTGGGAG CGTAGTGATC CATGCTGAAC GCAAGCAAAC TAGCTATCAA ATTAGCGTAA TCGACACGGG CAAAGGCATT GAGCCTAAAC ATCTCAAGCA TATTTTCGAG CGTTTTTATC GGGTTGATAG CGCTCGGATC GCTGGCGGAC GTGGTAATGG GCTTGGTTTA TCAATTGCTC AAGCGCTGAT TCAAGCCCAC AATGGCACGA TTGAGGCCCA AAGCAACGTG GGCACTGGCA CAACAATGTT GATCAACTTG CCAAAATAA
|
Protein sequence | MLNRLRWRLT LIYACTALLL LAAVGGSVYW MTVRYFRFAT ERALLERMTL EFEQLAAPLP PALQAYRPQQ LPDERPVDNG TLASTFVFTI DQNGQVFNPN PWNPPIQPDQ AAIEAAKAGK LDLRTINLAD GTQVALLTEK LTRSDGPAFL QVGRVLNEQE AALSTLLKGL VALWAGSVVV LGWFGWWLAG RSLRPAQQAW ERQQAFIANA SHELRAPLTL MRASSEIALR ESTDPAEQQE LLEDILAETY HMARLVEDLL LLSRLDAAKT HLQRETIDLA ELCQDVAKDA GRLAHDAGVE VRVAHAEGQI KVDRTRLRQV LLILLDNAIA HTPRGGSVVI HAERKQTSYQ ISVIDTGKGI EPKHLKHIFE RFYRVDSARI AGGRGNGLGL SIAQALIQAH NGTIEAQSNV GTGTTMLINL PK
|
| |