Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3458 |
Symbol | |
ID | 5735319 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4348577 |
End bp | 4351453 |
Gene Length | 2877 bp |
Protein Length | 958 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280605 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_001546222 |
Protein GI | 159899975 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [COG5002] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGAGT TGCCATTACA GATCTTGGTG GTCGATGACG AGCCACGCTT AAGTGGTCAG TTGCGGTTGT TATTGGAAGG GCAAGGCTAC AGCGTAACCA CGGCGGTTGG GGGTGCTGCG GGCATTGCCA CGCTCCAAGC AAGCGACGTT GATGTCGTGC TGACTGATGT CACCATGCCT GATGTTGATG GCTATACGGT GTTGCAATGG GTGCGCGAAA ATCGCCCAGA CATGCCGATT ATCGTCATGA CTGGCTATGG TTCGCTGGAA AGTGCCACCC GTGCGTTGCG CTTAGGGGCT TACGATTATA TTTTGAAGCC CTTTTCGTTG CCAACAGTGC AAGCAGCCTT GAATCGCGCT TCGGCGGCGG TTTCCAAGCG GCGTGCCGAT GGTCAGCGCA CCGCCGAGCT TTCAGCAATT GCGGCAATCG CCCGCCAAAT GGGGCGCTCG CTAGAGCCAG TGGCCATGGC CGATGCCACC TTGAGCGCAA TTGCCGAATC AACTGGCGCT CAAGTGGCCT TGCTCTACAC CGCCGCTGAA AATGGCATGC GCTGGCATTT ATGGCGACAC GTTGGCCTGA ATGAGCAACT TATTCAACAA TTAGCCGAAT TATTTCCGCC CAGCCCGCTG AGCAAAGCCA CAATCGTTTG GGAAGATCGT CCGCCTTGGT TGGCCGAGAT GCCCTACGAA TTGCTTCAAG GCCTGTGCAG CAGCGTTGGC GGCAAAGTCG CTTGGGGCGC ATTGCCACTC GATGCCGGCC CGCAAGCGCT TATGTCGCTG GTCTTGGTTG GCTCAAGTCG CAAAACTCAA GCTTGGTCGC CGCCATTTCT CATTTCGTTG GGCAATGCCT TGAGCATGGC CTTGGTCAAT ACGCGCTTGT TCAATGCAGT GCGCGAGGAA CGCGACCGCT TGCGCTTGCT CTATGGCATC AGCCGTGAGT TGGCTAGCTC GCTCGACCCC GATCAATTAC TTTCGCGAAT TATTCAACAC ACGGTGGCAG CAGTTAACGC CGAACGTGGC AGCATTATCA TTTCATCGCA AGATGGCCGC ACGACTCAGC GGATTGTTGC CCGTTATGGC ATGGATCAAT CGGTGACTGA ATCGGTAGCG GCGGCCTTGT TGCAAGCCGG GCTTTCGGGC TGGGTCTTTC GCCAACGTGA AGCCGCGCGA ATTGCCGATG TGCGCGTTGA TAAACGCTGG GTCGAATTGC CCTCAACTCG TGGTCGGGTG CGTTCGGCCT TGGCAGTGCC CTTGTTGCGC GAAGATCAAG TGCTTGGCGT GATGACCTTG ACGCACCCAC GGATCGATCA TTTTAGCGCT GCCGACTTGG AATTGGTGCG TTCGGTTTCG GCTCAAGCAG CCGTCGCGAT TGAAAATGCC AATTTGTTTG CTGAGCTAGA GCAGCGGGTA TTCGATCTTG AGGGCTTAAA TTCGACCAGC CGCGAATTGG CTAGCTCGCT TGATCCATTG GAAGTAGCGC GAAAAGTGGC TTATCGCTGT GCTGAAATGC TCGATGCCTC AATGGTAGCA TTGCTGCATG TCGATGATAA ACAAGGCACA TTGCCCTTGG TTTCCTTGAT TAACGGCCAA GAACAGCCGT TATTGAAATT GGGGCCGATT AGCGCTGCTC TGAATGATCC AGAACCTGTG CTATTAAACT CGGCTGGCAG CCAAATTGAA CTGTTGGTGA ATAACGAAGA ACCCTTGGGC GAAAGCTGGA TTGGCGTGCC CTTGATGCTG GGTGATGGGA TCAACGGCTT ATTGATTGCC GCCGATGAGC GCCGCGATGC CTTCGATGCT TACGAACGAC AATTACTCAC AGCCTTGGCA GGCCAAGCAG CGGTAGCAAT GGAAAGTGCG CGACTGTATG TCACTGCATC CGAGGAGCGA ACGCTCTTGG CAGCAGTGAT TGAATCGGTC AGTGATGGCA TTTTGCTGAC TGATGAAGGC CAAATTGTGG TTGCTAACCC TGCCGCTGGC GCAATTGCGG GCGTTTCTAA TAGTCGCTTG GTTAATCAAC CCTTGCTGAC CTTCTTCCCA ATGTTGGCAA TGCTAGCTCG CCGCGAAGAT CACGAAAGCA AAGAAATTGC CATCAGCAAC CGTTATTATG CGGTCAATAC TGCGCCATTG CAAAACAGTT CCTTGGGCGG CCAAGTGATC GTCTTGCAGG ATATTACCCA TTTCAAAGAG CTAGACCAAA TCAAGAGCCG CTTTGTTTCG ATGGTTTCGC ACGACCTCAA GTCGCCGCTG ACCGCAATTC AAGGCTATGC CCAATTGGTT GCCGACGGGC ATATGGGCAC GGTCAACGAG ATGCAGCGTG ATGCCTTGCA AGCAGTTGTG CGCAACACAG GCGCAATGAC TGCCTTGATC AGCGATTTGC TCGATTTGGG CAAAATCGAA GCAGGGATTG GCATTTCGCC GCAAGAAACT GATTTGGCGG TGGTGTTGCG CGAAGTTATC GACGAGCTGA AATTGCGGGC CAAAATGGGT CAAATCAGCG TCCAGCCTGA AATTCCGCCC AGCTTGCCCT TGGTGGCTGA TCCCTCGCGG ATGCGCCAAG TGTTTACCAA CATTCTTTCG AATGCAATTA AATACACGCC AAGCGGGGGG CAGGTCCAAA TTCGCGCCAA TAATGGCGAT GCTAAAATGC ATGTGCAAAT TCAAGATAGC GGCTTAGGAA TTCCCGAAGA TTCCTTGCCG CATATTTTTG AGCGCTTCTA TCGTGTCAAG CGGGATATTG ATTCGCCGAT TGAAGGGACT GGCCTTGGCT TAGCAATTAC CAAAAGCATT GTCGATGAGC ATGGCGGCAC GATCGAGGTG CAAAGTGTGA TCGGCGAGGG CACAACCTTC AATGTATATC TTCCCCAACA CAAATAA
|
Protein sequence | MNELPLQILV VDDEPRLSGQ LRLLLEGQGY SVTTAVGGAA GIATLQASDV DVVLTDVTMP DVDGYTVLQW VRENRPDMPI IVMTGYGSLE SATRALRLGA YDYILKPFSL PTVQAALNRA SAAVSKRRAD GQRTAELSAI AAIARQMGRS LEPVAMADAT LSAIAESTGA QVALLYTAAE NGMRWHLWRH VGLNEQLIQQ LAELFPPSPL SKATIVWEDR PPWLAEMPYE LLQGLCSSVG GKVAWGALPL DAGPQALMSL VLVGSSRKTQ AWSPPFLISL GNALSMALVN TRLFNAVREE RDRLRLLYGI SRELASSLDP DQLLSRIIQH TVAAVNAERG SIIISSQDGR TTQRIVARYG MDQSVTESVA AALLQAGLSG WVFRQREAAR IADVRVDKRW VELPSTRGRV RSALAVPLLR EDQVLGVMTL THPRIDHFSA ADLELVRSVS AQAAVAIENA NLFAELEQRV FDLEGLNSTS RELASSLDPL EVARKVAYRC AEMLDASMVA LLHVDDKQGT LPLVSLINGQ EQPLLKLGPI SAALNDPEPV LLNSAGSQIE LLVNNEEPLG ESWIGVPLML GDGINGLLIA ADERRDAFDA YERQLLTALA GQAAVAMESA RLYVTASEER TLLAAVIESV SDGILLTDEG QIVVANPAAG AIAGVSNSRL VNQPLLTFFP MLAMLARRED HESKEIAISN RYYAVNTAPL QNSSLGGQVI VLQDITHFKE LDQIKSRFVS MVSHDLKSPL TAIQGYAQLV ADGHMGTVNE MQRDALQAVV RNTGAMTALI SDLLDLGKIE AGIGISPQET DLAVVLREVI DELKLRAKMG QISVQPEIPP SLPLVADPSR MRQVFTNILS NAIKYTPSGG QVQIRANNGD AKMHVQIQDS GLGIPEDSLP HIFERFYRVK RDIDSPIEGT GLGLAITKSI VDEHGGTIEV QSVIGEGTTF NVYLPQHK
|
| |