Gene Information |
Locus tag | Haur_4054 |
Symbol | |
ID | 5735912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5171495 |
End bp | 5174866 |
Gene Length | 3372 bp |
Protein Length | 1123 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641281205 |
Product | putative signal transduction histidine kinase |
Protein accession | YP_001546814 |
Protein GI | 159900567 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4585] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
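Note: the coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes inclusive, 1-based coordinates and a CDS that ends in a single stop codon. The function name `expected_lengths` is hypothetical and is not part of the source database.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive 1-based coordinates and one terminal stop codon,
    which is an assumption of this sketch rather than a documented rule.
    """
    gene_len = end_bp - start_bp + 1          # inclusive interval
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1           # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp / End bp fields of this record.
gene_len, protein_len = expected_lengths(5171495, 5174866)
print(gene_len, protein_len)  # 3372 1123
```

Both results agree with the Gene Length (3372 bp) and Protein Length (1123 aa) fields listed in the record.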
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATACA TAATGCGTGT AATTAGCATC TTTTTTGGAT GTTTTGTTGT CATCTCAATG GCAATTTCCG TTCGAACTAG CTTAAACACG ATTGATGATC CAGCGCTTGA ATTAATTACC TATTGGTCTG CAATAAACAA TTGTAATGAA ATTCATGCTA TAACCCCAGC TGAATGGATG CCCATTGCCC AAGAAATTGT TGCACCAGGC GATTGTATTC TCAGTGTTTC AGGTTATGGT TATAATTCGC CAGAAATGAT TGATCATCTG AAGAAGAAAC TAGAGCTAGA TTATGCTAAT CGCTTTGTAC CGGTAGAAAT TAGACGCGAT CAATCTCAGT TTACCGTTTA TCTACCAATC ACAAAAATTA GTATTCATCA TATTTTACAA ATCTATTTAG CAATCATCCT TACCGCAGCA TTACTATGGA TTCTAGGTAT TATTGTATTA ATTTCAAACC CAGATAAGGA AAATAATATT GTATTTGGTT ATATTCTTTG GTTTCTTGCT TTAGCTATTA CAAGTATTCG GCATAGAGTA CCAGATTTTG GCGAATATCT TACCCTTGTT GCAACTGTAG TGCCATTAGC ACTACTTGGT GGAAGTTTCT TTCATTTAGG GTATATCTTT CCAGTAAAGA TTGAGCGTTA TCATGGGTTT AAATATGTGC TGTATATCCC TTCGCTCTTT AGTATTGCTT TATATACCTA TGTTATTATT AATATTGCCT CAGGTAAAAG TAATATTCCT ACAATTTCCA ATCAAGCCAA CTATATCATT GCGACCATAT TTATCATCGG ATTTAGCTTT ACATTAATTC GTTGGGCATA TCTCTGGATC ACCTATCGCC ATCAAACTAA ATCGAAAATT ATTTTGCAAG CTAAAATTGC GTTTACCACT TGGTTAATAG GTGGATTATT CAACGTTATA TTGATGATTG TGTATCAGGA ATTTAATTTA AAATTACCAA TAATCGGGAA ATCAATATTT TTTGGCTTAT TTCTATTAGT CATTCCGGCA GCTGGGACTG CATTTACGCT ATTACGTTAT GAATCAGTTC ATGCAAAACG TTCATTTAGC TTAGATTTAT TAATGATAAT TTTGATTAGT GCGGTTATTG TGGATATTTC AATCCTTATT AATAGTTATA TTGAAATTAA TGGAATTATA TTTATCAATA TCTTCTTTGT TGCAGTAATT ACCTCAATTT TCTGGTATAT CGACAACCCA TTTCGTAGAA TATTTGAAAA ATATTTCCTG CGCCATCAAA ATGATGCTGA AATATTGTTT GGCTTGTTAA ACCATCTTGA TATCACTAAA GATATTCATT CAGCAATCTA CTCATCAATT GATTATTTGA CTAAGCAATT AGAGATAGAA TCTTTGCGTT TTGTGATCGA AAAGAATATT ATCAAAACAT CTGAACATTT TTATATTATG GAAAACGAGC CAATTGATTT ATTTTTTGAG CACAATTCTG AAGCAGGATC GTTTTCAAAC TCGCTCAGTA AGTTCTATAA ACATCGAGAA ATTGTGTATG ACAATAACAA CAATCAAATT GGATACTTAT ACCTTGGGCC AAAAATTACC AATGAAGATT TTGATACCAA AGACTATGAG CTTATTCGAT TAATCACTCA ATATTTGAGT TGGTTTGTGC TGGCGAAAAA TCAGTTAATC TTAATCAACC AAATTATCCA AAGGATTATC AAGGCGCGTG ATAGTATTTT TGGTGATATC AATCATGTTA TTCACGACGA TATTTTAGGC AAACTAAACT CGGTTACGCT TGGCATTGAT 
ATGATCTGTG AGTTTGATCA GATTACACCT GATACAAAAG CTCGACTTGT TCAATACAAA GCTTCGACCG ACCAAACAGT AGAAATCCTG AAACGCTTTA TCATTCAAAA ACAAATTGCA ATTCCAGGTG TAAAAAAGAA TTTCTTACCA GAAATTCATC GGCTAATTCG TGAGCTTATT CAACATCAAC AAATCGAACT TCATTGGGAA ATGCCACCAA GAGATGATTT GAATCTTTGG AATAATCTAT CAATTGACAA AAAACGTGAT ATTTTTCGGA TTATTCATTC AGCAGTTGCA AATACCCTTG CTTATGCTCA AGCAAGAAAT ATTAATATTA CATTTAGTAA AAATAATGCT ATGTTATCCT TGTCAATCAT TGATGATGGT GTTGGTTTTG AGATTGATAA AGATATTAAA CAAAATAGCA CTGGCTTGAT GACGATGTAT GAGCGCACAA AAAATATTGA TGGTATCATC GAAATACGTT CGATTATCGA TCAAGGTACA ACAATCGAGT TATCAATTCC AATGGAAATA GCTATTCCGA CTCTGATCCA TGAACAATCA ACTGAGCAAG CAACTCCATT AATTAAAAAG CCCAAGCTAG AAATGCTCCA TTCAGATATT CAAGTCAAAG AACAATCCAA GAAATATAAT TTAGCATTAA TCTGTATGGC ATTTATGCTT GGCATTGGCA TAACATTTAT TTTAAAAACC GATTTATTAT CGCTAGAAAA AGTAGCCTCA CAACCAAAAG TCTGGTTTGA TGTTGATGTT CGTTTAGTTA ATCAACAAGC TGAGAATGGT GAATTTTGGC GAACTCGTTT ATTTGATGGT AGTCGTTTTA TCCAAAATTA TGATACTATC GCCGTCGGGA CAAGAGTAAC AATGACATTT AGTTTAAGAA ACCATAAGGC TAAACCAGCT TTACTCCGAA ACTTAGTCGC AGGGGCACGT GGTCCTAACG TCTTAGAACA AGGCTGGAGT GCCTCCACAA TGGATTTTCC TTCAGTTCAG AATATTCTGA TAGAACCATA CAAAACCTAT ACATTTACGG CTAGCCGAAT TTATGATCAG CCAGGTAATT ATTTTCTAGA GCCAATGTTT GAAGATGAGG CTGGTGATTG GCGAGCTATT GCTGACTTTA CAAGAATTAC GTTTTTTGTG GCCGATTTAA CTAATCCAAT TGTTAGTGAA GTCGTTGAGG TTGCTGCTGA TCGTGATTGG AAATATACGC CAATCTATGT TCAACCTGGC GATACAATCG AATTTATAGC TCAAAAAGGT TCTTGGACAA CTGATATGAA TAGTCTACCA TTCGTTAATG CTGATGGGTA TCAAAACCAG CATTACGATT GGACAACACT GCCAGCTGCA AATTATGGGC AATTAATTGG CTCAATTGGC GATTGGAAGT TTGCAATTGG CAAAGAATCG AGCATTCAAG CCCCGAACTA TCAGGGAATC TTACGACTTG GAATTAATGA TGCGCATTGT GCCGATGTCT GTTTGAGCGA TAATCGTGGC TCGATGATGG TTGCTATTGT GGTCAGACGC GCTAAAAAAT AA
|
Protein sequence | MKYIMRVISI FFGCFVVISM AISVRTSLNT IDDPALELIT YWSAINNCNE IHAITPAEWM PIAQEIVAPG DCILSVSGYG YNSPEMIDHL KKKLELDYAN RFVPVEIRRD QSQFTVYLPI TKISIHHILQ IYLAIILTAA LLWILGIIVL ISNPDKENNI VFGYILWFLA LAITSIRHRV PDFGEYLTLV ATVVPLALLG GSFFHLGYIF PVKIERYHGF KYVLYIPSLF SIALYTYVII NIASGKSNIP TISNQANYII ATIFIIGFSF TLIRWAYLWI TYRHQTKSKI ILQAKIAFTT WLIGGLFNVI LMIVYQEFNL KLPIIGKSIF FGLFLLVIPA AGTAFTLLRY ESVHAKRSFS LDLLMIILIS AVIVDISILI NSYIEINGII FINIFFVAVI TSIFWYIDNP FRRIFEKYFL RHQNDAEILF GLLNHLDITK DIHSAIYSSI DYLTKQLEIE SLRFVIEKNI IKTSEHFYIM ENEPIDLFFE HNSEAGSFSN SLSKFYKHRE IVYDNNNNQI GYLYLGPKIT NEDFDTKDYE LIRLITQYLS WFVLAKNQLI LINQIIQRII KARDSIFGDI NHVIHDDILG KLNSVTLGID MICEFDQITP DTKARLVQYK ASTDQTVEIL KRFIIQKQIA IPGVKKNFLP EIHRLIRELI QHQQIELHWE MPPRDDLNLW NNLSIDKKRD IFRIIHSAVA NTLAYAQARN INITFSKNNA MLSLSIIDDG VGFEIDKDIK QNSTGLMTMY ERTKNIDGII EIRSIIDQGT TIELSIPMEI AIPTLIHEQS TEQATPLIKK PKLEMLHSDI QVKEQSKKYN LALICMAFML GIGITFILKT DLLSLEKVAS QPKVWFDVDV RLVNQQAENG EFWRTRLFDG SRFIQNYDTI AVGTRVTMTF SLRNHKAKPA LLRNLVAGAR GPNVLEQGWS ASTMDFPSVQ NILIEPYKTY TFTASRIYDQ PGNYFLEPMF EDEAGDWRAI ADFTRITFFV ADLTNPIVSE VVEVAADRDW KYTPIYVQPG DTIEFIAQKG SWTTDMNSLP FVNADGYQNQ HYDWTTLPAA NYGQLIGSIG DWKFAIGKES SIQAPNYQGI LRLGINDAHC ADVCLSDNRG SMMVAIVVRR AKK
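Note: the relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the gene sequence is shown in coding orientation (it starts with ATG and ends with TAA, even though the Strand field is "-") and uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. The helper names `CODON_TABLE` and `translate` are hypothetical, and only short fragments copied from the record are translated here.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, with codons
# enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()   # the record groups bases in blocks of 10
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":                    # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence, as printed in the record.
print(translate("ATGAAATACA TAATGCGTGT AATTAGCATC"))  # MKYIMRVISI
# Last twelve bases of the gene sequence, including the TAA stop codon.
print(translate("GCTAAAAAATAA"))                       # AKK
```

The two fragments reproduce the first ten residues (MKYIMRVISI) and the last three residues (AKK) of the protein sequence listed above.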