Gene Information |
Locus tag | Haur_4240 |
Symbol | |
ID | 5736094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5406080 |
End bp | 5409274 |
Gene Length | 3195 bp |
Protein Length | 1064 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281395 |
Product | multi-sensor hybrid histidine kinase |
Protein accession | YP_001547000 |
Protein GI | 159900753 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5002] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| |
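The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; neither convention is stated in the record. The function names `span_length` and `coding_codons` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields above.
start_bp = 5406080
end_bp = 5409274
gene_length_bp = 3195
protein_length_aa = 1064


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid count, assuming an unspliced CDS with one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))   # 3195
print(coding_codons(gene_length_bp))   # 1064
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.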
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTAGCTA ACCTGTATAG CTCAGATCAA CAACGCTCAC GCTTTATCTG GTTAATGGGC ATTCAAGCGG TCTTGGTGGT TGTGTTGCTG ATCGCCTCAA CCCTCTTGAC GGTTGGCATG AACCGTTTAC GCGATCGTGA TGAAACGTTG CTGAGCTATC GGGTGCGTTT AAATGACGTT AATATTGCTC TGTTATCGTT GCACAACAAT TTGCGCGGCT ATACCAGCAC TGGCTCAGAT ATTTTCTACG CGGAAATTCA AAAAGAACAG GCAGTTGTTA CCGAGGGTTT GCAATTTCTC TTAGTCAACC CGCCCGACCC GAGCTTGCCC CAAGTCGCTG AGCAAGTCGA TCAATGGCTG CTCAATACCT ATCAGCCAGT GCTGGATGCA GTGGCCAACC AAGAAATGCT GTTGGCCGAA TTGACGCTTG AGCAAGGTCG CCCACAAATC GAGGCGGTAG TTAACACGAC AACCAGCTTG CGCACTGAAT TGCGCCAGCG CTCGGAAGAT TATCGCAACC AAATTGATAC TTACAATTGG ATTAAACTGG CCTCGATTGG TTTGCTCTCA GCATTAATTG TGGTTTCGAT GATTGTGACA GTGCGCTTCT GGCGCACCCA ACAGCACTTA ATTCACGAAA TTGAAGATAA AGGCAGTCAA CTGCTGCAAA GCAACCATGA ATTAAATTTC AATACCAATC AACTTTCAAT TATCAACAGT ATTTTGGGCG TGCGGATCAA CGAGTCGCGG GTGTTGCGCG AAATCAGCGA TTATTTGGTC AATAATCCTA CCCCCGCTGA GGCCTATAGC TTTGTGGCGC AAACTGTTGG TAATGTGCTG AATACCTGGT GTAGCATTGC GCTGCGTTTG CCCCCGCCCC GCGAGGAATT TCTCGATGTG GTGGCTTCAT ATCATTCGTT GCCCAGTCGC CAAGCCTACA TCGACCAAAT GGTGCAAACA GTCCAATTTC GCATCGATAA TGGTCTGTAT TCGCCAGTCT TTACCCAGCA TGCTCCCTTG GTTGAACTCT TCAATGTGCC CGTTGAGCAA CGTCACCCCA ACTATTTGAC CAACGAGGTT CGCGAACACT TGGAGCCATT CACGCTCTAT TCGTATATCG CTGTGCCGAT TAAAATTCAG GATGAAGCAG TTGGCCTGAT TTCGGCAGCC TCAGATTCGC CTGAACGGCT GTTTGACCAT GATCAAACCT TGTTTGTGCG CCAAGTCGCT GATCGTTTGG CCGCTTGGCT TGAAAATATT CAATTATTTT ATTTGCTCAA ACAACAGGCC AACGAGCTGC AAACGAGCTT CGATAGCCTC GACGATATTG TAGTTTCCTA CGATAGCCGT GGTCATCAAA CGCGAATCAA CGAGGCTGGC ACGCAGTTTT TTGCTGGTCG CCACTTTGAT TTTCTCTCCA ATAATTTGGT TTGGCGCACG GCTAAGGGCA ATGTATTGGG CTTCGATGAG CATCCGATTC AGCAGGCGTT GGCTGGCACA ACCGTGCGTG ATGTTGAAGT TTCGTTATCA CGCTCCGATG GTGTGCCGAT TATTCACGAA GTCAGCGTTT CGCCGCTGCG ATCTGCCGAT GGTTCAATTG AAGGCATTGT GCTGGTGGCG CGAGATTTGA GCGCTCGCAA GGAGCTTGAT CGGCTCAAAG AAGAACTTGT TGCCAATATG AGCCACGAGT TGCGCACGCC GCTCACTGCC ATTTTGGGCT ATAGCGAATT GCTGCTCAAA CGCCGCACCG AAGTGCTTAC GCCTTGGCAT ACCACCAAGA TTGAGGGGAT TCGCACTGGC 
GGCCAGCGTT TGCTGAGCCT CGTCAACGAT CTGCTAGATA TTGCCAAGCT TGATGCTGGC CGCATCGAGT TGCAACGCCA AACCACCATC ATTAATAGCT TGTTGCAAGA GCAAGTGGCG ATTTTGCAGC CGATGCTTCG CGAGAAACAG CAAACCCTCA CCTTGCAACT GGGCCAGCAC ATCCCCTTGC TCATGATCGA TCCTGAGCGG ATTGGCCAAG CAGTGACCAA TTTGTTGAGT AACGCCATTA AATTTACGCC AGAGCAAGGT ACAATCACCT TAGCCTCAAC CGCCTTGAAC ATTGATGAGC ATGGCCAAAT TGACTGGCTG GATCAAGTGT TGGCGACCGA AGTGCCGCCG ATGTTGGCAG GCCAATATGT CTTGATTCAG GTCAGCGATA GTGGGGTGGG CGTGCCAGCC GAAGCGCTCG TAAAATTGTG GGATCGTTTT TATCAAGTTG AGGGTGGCTC GACGCGCCGT TTTGGGGGCA CAGGCTTAGG TTTATCGATC GTTCAGCAAT TAGTTGAATT ACATGGTGGA CGGACATGGG CGACTAGCGC AGGCGAAAAT CAAGGCAGCA GCTTCACAAT TATGTTGCCA GTCAGCCGCG GTGCCCAATT TGTGAGCCTC ACTCAAGGCT TACGGCGTTC GATTTTGGTG ATTGAAAACG ATCACCAAAC GGCCCAATAT TTGGAAGAAC AATTGCAAGC GGCTGGTTTT GAAGTGATTG TGGCGACTGA TCATCATAGT GCCTTGACTT GGGCCAAAGA TCATTCACCA GCGGCAATCA CCCTCGATTT GCTCATGCCC AATAGCGAAA GCTGGGAAAC CTTGGCAGCC TTGCGCGAAA TTGACCATTT AGCGCAAGTT CCCGTCTTGA TCGCAAGCGA TGCTTCGGTA TACAATGAGC TACCAGGGGT TGGTGTCTCC ACATATGTAG TCAAGCCAAT TGATAGCCAA ATTCTGATTC GGATTATTCG TCAACTGATT GGGGCTCAAG GCCAAACAGG GTTTATCTTG GTGGTTGATG ATGATTATGA TATGGCCGAA CTGCTGTGTG CTACTTTGCA AGAGCATGGC TACGTTACCC AAGCCTCATA CGATGGGGCG GCGGCGCTCG ATCTGATTCA ACAGGGCAAT TATCCCCAAC TAATTTTGCT TGATTTGATG ATGCCAGAGG TTGACGGCTT TCAGCTACTC GAAAAACTGC GGGCAAATCC TGAAACTCGT AATATTCCCG TGATCATTGT GACTGCCCGT GACTTAACCA ATGAAGAAAT TCGCCAATTG CGCCAAGCTG CGCAAGCCAT TCAGACCAAA CATACCCTCA GTATGCGTAA GCTGGTTGCT GAAGTCCAAC GGTTTGCCCC GTTGAAGGAA TCGGATACCC CATGA
|
Protein sequence | MLANLYSSDQ QRSRFIWLMG IQAVLVVVLL IASTLLTVGM NRLRDRDETL LSYRVRLNDV NIALLSLHNN LRGYTSTGSD IFYAEIQKEQ AVVTEGLQFL LVNPPDPSLP QVAEQVDQWL LNTYQPVLDA VANQEMLLAE LTLEQGRPQI EAVVNTTTSL RTELRQRSED YRNQIDTYNW IKLASIGLLS ALIVVSMIVT VRFWRTQQHL IHEIEDKGSQ LLQSNHELNF NTNQLSIINS ILGVRINESR VLREISDYLV NNPTPAEAYS FVAQTVGNVL NTWCSIALRL PPPREEFLDV VASYHSLPSR QAYIDQMVQT VQFRIDNGLY SPVFTQHAPL VELFNVPVEQ RHPNYLTNEV REHLEPFTLY SYIAVPIKIQ DEAVGLISAA SDSPERLFDH DQTLFVRQVA DRLAAWLENI QLFYLLKQQA NELQTSFDSL DDIVVSYDSR GHQTRINEAG TQFFAGRHFD FLSNNLVWRT AKGNVLGFDE HPIQQALAGT TVRDVEVSLS RSDGVPIIHE VSVSPLRSAD GSIEGIVLVA RDLSARKELD RLKEELVANM SHELRTPLTA ILGYSELLLK RRTEVLTPWH TTKIEGIRTG GQRLLSLVND LLDIAKLDAG RIELQRQTTI INSLLQEQVA ILQPMLREKQ QTLTLQLGQH IPLLMIDPER IGQAVTNLLS NAIKFTPEQG TITLASTALN IDEHGQIDWL DQVLATEVPP MLAGQYVLIQ VSDSGVGVPA EALVKLWDRF YQVEGGSTRR FGGTGLGLSI VQQLVELHGG RTWATSAGEN QGSSFTIMLP VSRGAQFVSL TQGLRRSILV IENDHQTAQY LEEQLQAAGF EVIVATDHHS ALTWAKDHSP AAITLDLLMP NSESWETLAA LREIDHLAQV PVLIASDASV YNELPGVGVS TYVVKPIDSQ ILIRIIRQLI GAQGQTGFIL VVDDDYDMAE LLCATLQEHG YVTQASYDGA AALDLIQQGN YPQLILLDLM MPEVDGFQLL EKLRANPETR NIPVIIVTAR DLTNEEIRQL RQAAQAIQTK HTLSMRKLVA EVQRFAPLKE SDTP
|
| |
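The protein sequence can be related to the gene sequence through translation table 11, as listed in the record. The following is a minimal sketch, not the database's own pipeline: it builds the codon-to-amino-acid mapping in pure Python (table 11 assigns the same amino acids as the standard code) and, as an assumption, treats a leading ATG, GTG, or TTG as an initiator methionine, which is only a subset of the start codons table 11 allows. The name `translate_cds` is a hypothetical helper. Only the first and last blocks of the gene sequence are used here, and the sequence is assumed to be given 5' to 3' on the coding strand.

```python
from itertools import product

# Codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiation codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


head = "GTGCTAGCTA ACCTGTATAG CTCAGATCAA"  # first 30 bases of the gene sequence
tail = "TCGGATACCC CATGA"                  # last 15 bases, in frame

print(translate_cds(head))  # MLANLYSSDQ
print(translate_cds(tail))  # SDTP
```

The first output matches the opening block of the protein sequence, and the second matches its final four residues, with the terminal TGA read as the stop codon.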