Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4477 |
Symbol | |
ID | 5736328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5728801 |
End bp | 5730546 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281640 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_001547237 |
Protein GI | 159900990 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00493244 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACC AACCCGCAGC TTCAGATTCA GCCGCATCAA CCAATCTTCA GACAGCCCTC AACGCTGAGG TGGCGTTTGG TCTTCGCCCA ACCGTAACAG GATTGGGTTT TTTATATTTG TTATTTAGTA TTGCCCATGC GTTGGTTTTG CCCAGCCCAA TTAAACTACC AATGGTCATC GTAGCGCTCA GTAGTGTCAT TTTTTTTGGA TTTTGGTGGT GGCGCTTGCA AAGCTGGCGG CCAGCCCCCG AGCTGACCCA TCCGCTGGCC ACGCTCTTTA TTGTGGTTGG CGGCTTCAAT AGCGTTTTGC ATATCTGGCT GAGCGGCGAG ATTCACCAAA GCACCAATAT TGCCTTTATT TTAATTGGCA CTGGCTGTTT GTTGCTTTCT TGGAATTGGT TCATCGTAGC GAGTGGGGCT ATTTTGCTGG CGTGGATTGC GGCGATCATT TCATTGCCAA CCTCACCGTT GACCATGCAT TTTATTTTTA TGGTGGTTAG TGCCACGATT GCCGCTGCCA CAATTCAAGG CATTCGTTTG CGCACGGTCA AGGGCTTGAT CAAATTGCGC TTGCAAGAAA GCACCTACAA ACAAGAGCTA CAAGAGGCCT TAATCCAAAT CAAAATGAGT GAAGAGCGTT TTCGCGCCTT GGCCGAAGCA ACCTCCGAAG GGGTGGTATT GCAAGATGAA GGCGTGGTGA TGGATGCCAA CGAACGCTTT GGCGAGATGT TTGGCTATCA TCGTGATGAA ATTCTCGGCC ACTCGTTACG CGAATTTGTC GAGCCACAAT CGCTGCAACG GGCAATGCAA AAATATAAAG ATGGTGCGCC CTACGAAGTT ACCGCACTGC GCAAAGATGG CAGCACCTTT ATCGCCTTGG TGTTGGGCAC CAATTTGCCC TATAGCAATC GGGTGGTGCG GGTTGCGGCG GTGCGCGATA TTACTGAGCA GCGCCATTTT GAAAATTTAT TGCTGACTGC CAAAGATGAT GCTGAGGCCG CCAACCGCGC CAAAAGCACT TTCCTTTCAA CTGTCAGCCA CGAATTACGC ACGCCACTGA ATGCGATTGT TGGCTATAGC GAAATGATCT ACGAGGATTT GATCGATCGC AGCATGCCTG AGTTGGCCAT GGATATGACC CGCATTCGTA GCGCTAGCGA CCGCTTGTTG AGCTTGATCG ATGGCGTTCT GACGATTACC GATCTTGATG CGGAAGTTGT GCGTTTGGAG TATGAAACGA TCGATTTGGC GCTGGCGATT GGCAGCATCA GCGACCAATT GCAAGCCAAA GCCCAAGATA ACAAAAATAC TGTGCAATTA TTGGGTAGCC AAAACTGGGG TTCGATTATC AGCGATGATC ATAAATTGCG CATGATTATT TACCATCTGC TGGATAATGC AATTAAATTC ACCCACGCAG GCTTAATTAG CATCTCGGTG CAACGCTTGC AACACGCTGC TGGCGGTTGG CTCGAAATTG CAATTCGCGA TACGGGGATT GGCATTGCCC ATGAGCAATT TGAACGGATT TTTGAGCCAT TTGTTCAAGC CGATTCCTCG GCCACTCGCC AATATGAAGG TACCGGTTTG GGCTTGGCCG TGAGCATGCG GCTGGCTCGT GCTTTGGGTG GCACGATCGA GCTTGATAGC CGCTTAGGGA TTGGCTCAAC GTTTACTCTG CATATGCCCG AACATCCCAC CAAACCCAAT GTGCCTTCAC CTCAAATGTC ACACGCGAAC GTATAG
|
Protein sequence | MSDQPAASDS AASTNLQTAL NAEVAFGLRP TVTGLGFLYL LFSIAHALVL PSPIKLPMVI VALSSVIFFG FWWWRLQSWR PAPELTHPLA TLFIVVGGFN SVLHIWLSGE IHQSTNIAFI LIGTGCLLLS WNWFIVASGA ILLAWIAAII SLPTSPLTMH FIFMVVSATI AAATIQGIRL RTVKGLIKLR LQESTYKQEL QEALIQIKMS EERFRALAEA TSEGVVLQDE GVVMDANERF GEMFGYHRDE ILGHSLREFV EPQSLQRAMQ KYKDGAPYEV TALRKDGSTF IALVLGTNLP YSNRVVRVAA VRDITEQRHF ENLLLTAKDD AEAANRAKST FLSTVSHELR TPLNAIVGYS EMIYEDLIDR SMPELAMDMT RIRSASDRLL SLIDGVLTIT DLDAEVVRLE YETIDLALAI GSISDQLQAK AQDNKNTVQL LGSQNWGSII SDDHKLRMII YHLLDNAIKF THAGLISISV QRLQHAAGGW LEIAIRDTGI GIAHEQFERI FEPFVQADSS ATRQYEGTGL GLAVSMRLAR ALGGTIELDS RLGIGSTFTL HMPEHPTKPN VPSPQMSHAN V
|
| |