Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3521 |
Symbol | |
ID | 5735382 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4432572 |
End bp | 4435640 |
Gene Length | 3069 bp |
Protein Length | 1022 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280668 |
Product | multi-sensor hybrid histidine kinase |
Protein accession | YP_001546285 |
Protein GI | 159900038 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAGGC CGCTCTTGCT GAGTTTTTGC AGTGCGACCA TAACCGCGCT CTTGTTTGGA TTGGGTTGGC TAGCCAATCT GCCGATGCCT TGGATTTTAG CCTTAGGGGT TGTGCAAGCA GGCATTATGT GGTGGCGCAC ACCATGGGCT GTTGGCAGTG CCATCAGCAT GGCGCTCATC GCATGGTTTG CGGGCATTAA TCAGGTTGGG TGGCTAGCAC TGTTGGTTGG TGGTTGGTTG GCTCAAGGCG GCTTGCCAGC CCTCATTTTT ATTCTGTATC GGTATCCCTA CGATATGTCG CGCGAATCGG CCCTGAATGG TTTTTTGGCC AGTGGGGTGT TGTTCAACAC GGCGATTAGC GCATTTTGTA TGAGCATTGT CGGCTGGCAA GTCGGCTGGA TCGAGGAACA TTCGTTGGTG GTAACCACAG CATTGATGTG GCTCACCAAT AGTTTGGGTT CGTTGATCAT CACCTTGCCA ATGCTGCGTT TAGGCACGCC ATGGCTCTCA ACCCGTGGCT GGTTTGGCAC GCCCGCCGCT TTGCAACGCA GCATTCATCG CAGCACCATC ACCCGTAGCG ACGTATTGCA ATTGATCTTT ATGCTGACCA TCGTCGGCCT GATCGGCTGG GGTATCGATC GGGTTGCCTA TATTCCTGTG CAATTTTTAA ATGTGTTGTT TCTCTTGCCA ATCGTGGCCT TTACTGCCCG CCATGGGTTT GATGGCGCAT TGCTCTCGGC CACAGGCAGT ATGTTAATCG CACTCTCATT TTTCCTCGAT GAAACCAGCC GCATGCGCCG CGTTGGCACG ATGGGCAACG ACCGTGTAGC CTTTGCTAGC ATTGTTTTTG AATTAGGCGT GTACTACTTA ATTAGCATGA TCGGCGGTTT ATTAATTGAT ATTCAACGCG CCGAAAAGCA ACGCCTCGCC ATGCTGAGCC ATGTCAATCG GATTTTCAAC CAAGCGCAAA AAGAAGATTT GCTTGATCAA TTGGTGCAAA CGGTCTGTGA AGCGCTCCAA ACCAACGTTG GAATTATGTT TCAATATGAT GCCCACAGCC AAAGCCTTAA GCCGCTAGCC TGGCAAATGC CCAAAGATTG CTCAATTGAT GTGATCGCCA CCGCGCTATT AACGTATTTT CCCGATTTGA AACAGATGAC CCAAACCTCG GCCAGCGCCT ACCATCACAA CGATCGTAAT GCAGCCACCA ACACCAGCGC ATTTTGGCTC GAAACTGGCT TAGTATCGGC CTTACTCATC CCATTGGTTG GACGCAACGG CACCTTGGGC TTGATGGCGC TGTTTGATCA ACGACCTGAG CGCTATTTTG GCCGCAGCAA TATCAATTTT GCTGAAGCAA TTTGTTCGCA AGCAGCGATC GCCCTCGAAC AACGCGATTT GATTACCACG CTCCAGCAAC AAACCGAGCA ACTGAATGCG GTTTCGAATA TTACCGCCAC ACTCAACGCA ACCCTCAACC TTGAAGTGGT TTGCCATCGA ATCGCCCAGC AAATCGAACG GGTCGTGCCC TACGATTGGG CTTGTGTGGC CTTGGCAACC GAGCAAACCC GCTTTTTCAG CGTACTGATG AAAACTGGCC GTGCCGAAGT TGACCTGCTT GATAAAACCT TGGTGCTTTC ACAGGAAATT TGGCAGGATT TTGGGCCGAT CGATGCGCCG CCCTATCGCA TGATTATGAA TTCATCGCCA GTTAGTCGCG CAGCGGAGTT GCGCCAAATT GGCTTGGCGG TAGCACTGTT GGTTCCATTG CGCCGCGATG ATCGCTGGCT GGGTGTGTTG GTGCTTTCCA GCTTTGATCC TGATGCCTTT CCACCAGCAC ACCAACAGTT GTTGCAAATT TTGGCGCGAC ACATGGCCTT GGCAATTTCC AATGCGCAAC TGTACCAGGA GCTTGAGCAA GCCTACCGCG CCAAACAAGA AGCCCAAGAT GTGTTGCTGC AAACCGAGCG CTTGCGGGCT TTGGGTGAAT TATCGAGCGG GATCGCCCAT GATTTCAACA ATTTGATTGC GGGAATTTTG GGGCACACCC AATTATTGCT GATCGAAGCG CCCGAGGAGC AACGCGAAGG CTTGGCGGTG ATCGAACAAG CAGCGCGTGA TGGTCGGCAT ATGGTCGAGC GGATTCAGCA ATTTACCCGT GCTCAGCAGC CCGAAGATCA TGAAATGGTC GATTTGAATA CAATTATTAA TGATGTGATT AAGCTGATTC GCCCACGTTG GCGGAGCCGC CCTGCCAGCA CCATGATTCA AACCCGCATT GAAAAAGGCC AAATTCCATT AATTTATGGT TCACCTTTTG CCTTGCGCGA AGTGCTAACC AACGTTGTAC TGAATGCGAC CGATGCCATG CCCAAAGGCG GAATTTTGAC GATCCGCACC GAGCAAGTTG AGGCTGATGT GATTATGGAA GTCAGCGACA CAGGCATTGG CATGGATGAA GAAACCCAAA TTCGCATGTG GGAGCCATTT TTCAGCACCA AAGGGGAGCA TGGCACAGGC CTAGGGCTTT CGATGACCCA TGCGATTGTG GTGCAGCATC ATGGCGGGCG GGTCGCAGTG CAAAGCGAAG TAGGTACTGG CAGCACGATT TCGCTGGTGT TCCCAATTCC GCGCCCCGAA AGCTCCAACG ACCCCTTGCA AGTTGGGGGA GCACAAGCCT TGCAACGGGG CACAATTCTG GTGGTCGAGA GCGATACGCG GGTCCAATCG GCGTTGGCGG GCTTGCTCGA AAGTTTAGGG CACACGGTGG TTTGCGCCGA TAGCGGCACC TTGGCGCTCG ATTTGGCCTA TGCCCGTGAT TTTGATGCCT TGATTGCCGA TAGCGGCCTG ACCGACATCA ATGCTTGGGA TTTGACCGAA CTGCTGAAAA CCCGCGACCC GCTGTTGGTG GCCATGCTGT TGACGGTTTG GAATGCGCCC GATCTTGATA CTCGCCGCCA TGTGTTTGAT GCAGTGCTGC CCAAGCCATT TGAATCGCAG GTGTTGGGCG AAACCATCGG TTTTTTGCTG AATAATCGCG CCAAACTAGC CAATAATATG GAGAACTAA
|
Protein sequence | MTRPLLLSFC SATITALLFG LGWLANLPMP WILALGVVQA GIMWWRTPWA VGSAISMALI AWFAGINQVG WLALLVGGWL AQGGLPALIF ILYRYPYDMS RESALNGFLA SGVLFNTAIS AFCMSIVGWQ VGWIEEHSLV VTTALMWLTN SLGSLIITLP MLRLGTPWLS TRGWFGTPAA LQRSIHRSTI TRSDVLQLIF MLTIVGLIGW GIDRVAYIPV QFLNVLFLLP IVAFTARHGF DGALLSATGS MLIALSFFLD ETSRMRRVGT MGNDRVAFAS IVFELGVYYL ISMIGGLLID IQRAEKQRLA MLSHVNRIFN QAQKEDLLDQ LVQTVCEALQ TNVGIMFQYD AHSQSLKPLA WQMPKDCSID VIATALLTYF PDLKQMTQTS ASAYHHNDRN AATNTSAFWL ETGLVSALLI PLVGRNGTLG LMALFDQRPE RYFGRSNINF AEAICSQAAI ALEQRDLITT LQQQTEQLNA VSNITATLNA TLNLEVVCHR IAQQIERVVP YDWACVALAT EQTRFFSVLM KTGRAEVDLL DKTLVLSQEI WQDFGPIDAP PYRMIMNSSP VSRAAELRQI GLAVALLVPL RRDDRWLGVL VLSSFDPDAF PPAHQQLLQI LARHMALAIS NAQLYQELEQ AYRAKQEAQD VLLQTERLRA LGELSSGIAH DFNNLIAGIL GHTQLLLIEA PEEQREGLAV IEQAARDGRH MVERIQQFTR AQQPEDHEMV DLNTIINDVI KLIRPRWRSR PASTMIQTRI EKGQIPLIYG SPFALREVLT NVVLNATDAM PKGGILTIRT EQVEADVIME VSDTGIGMDE ETQIRMWEPF FSTKGEHGTG LGLSMTHAIV VQHHGGRVAV QSEVGTGSTI SLVFPIPRPE SSNDPLQVGG AQALQRGTIL VVESDTRVQS ALAGLLESLG HTVVCADSGT LALDLAYARD FDALIADSGL TDINAWDLTE LLKTRDPLLV AMLLTVWNAP DLDTRRHVFD AVLPKPFESQ VLGETIGFLL NNRAKLANNM EN
|
| |