Gene Information |
Locus tag | Haur_2729 |
Symbol | |
ID | 5734610 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3485642 |
End bp | 3487639 |
Gene Length | 1998 bp |
Protein Length | 665 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641279872 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_001545495 |
Protein GI | 159899248 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4585] Signal transduction histidine kinase |
TIGRFAM ID | |
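
The coordinate and length fields above are mutually consistent, and the check can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(3485642, 3487639)
print(length)                     # 1998
print(protein_length_aa(length))  # 665
```

Both values match the Gene Length (1998 bp) and Protein Length (665 aa) fields of this record.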
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGTTC TTGATGCCCC AACCAATCAA GCACGACCAA CCCCTCAACC ACGATTAACC CCAGCCATGA GTTTTGGCTT GGCAACACTT GGTGTTTTTG TGGGGGTTGG CTGGCTCTGG CTTGCGCCAC ACTGGCTAGC ATTGCTAATC ACGCTTGGTT CATTAAGCCT TTTGCTCGGC CAAGCGTGGG CCAGTATCCA ACGCACTGAG CGCGAACGCA CGGTTTTAGC TCAAGCACAT AGTGCCCTCG ATAGCGAATA TAACAGCATT CAAGCCGATT ATGCCACGCT CGAACGGGCC TACGAAGATT TGCAACAATC GTATCAACGC TTGCAACGGC GGATCGATGA ACGCACCGCC GAACGCGATC AATCGACCAA TCAGTTGCAA ACCTTGCTTG ATGATACCCG CTCGTATGTC GCCCAACTAA CGGCATTGAA TGAAGTTTCA GTTGCTTTGA ACGCCACGCT TGATCGCGAT GAGGTGTTGG CTTGTATTTT GCGCCAACTT GAACGGGTGG TTACCTTCGA TAGCGCTTCA GTTCAGCAGC TTGAAGATGA TGAATTGCAA GTAATCGCTG CCCGTGGCAT GAATAACGAA GTCTATGATA TGCGGATTTC GGTGCCTGAT AATGAATTGG CGCGGAAAGT GGTTGGATCG CCTGCACCAG TGGTTTTGGG CGATGTGCGC AAAGATCCAT CGTTTGTGAT GCAGCCTGGG CCAATTCGTT CATGGATTGG GGTGGCCTTG CGGGTGGGCG AACGCACGGT CGGCATTTTG ACCGTCGATA GCCATCGCGA AAATGCCTAT ACCGCCGATG ATGGTCACTT GGTCGCCAAT TTTGCCAACC AAGCTGCCTT GGCCTTGCAC AACTCACATC TGTTTGCGGC TGCTGAACGT CGCGCCACCG AAATGGCCTT GCTTTTGGAA ATGACCCGCA CGGTTGGCTC GACCTTGCAC TTGCCCGAAG TCTTGCTCCG TGCCGCCGCC GCCATTGGCG AGGCCTTGCA TGCTGAAGAT GTATTGGTGT TATTGCTTGA TGAACAGGGC GAGCGTTTGA CTCCCCAAGC TGGCGCAATG GGTGATCATA ACTACTTGCG CATGCGTTGG GCCAGCCCAA CCTCGTTGAG TAACGAGCCA ACTTTAGCCA AAGTGATCAA ACAAGGCTGC GCCAGAGTAA TTCATGCGGT TTCGCCGACG ATTCCTTACC AAACCTTGTT GGCCTTGCCA TTGAGCATCA AAGAGCAAGT TTTAGGGGTT GTCTTGGTCG CCACACCTGA TCGCGATGCC CCATTTGGGC CACAGCAATT GGTTTTAGCT GAAGGTTTAG CCACCTCAGC CGCCATCGCG ATCGAACAAG CACGTTTGTA TGATCAAGCT CGACGCGCGG CTCAAGCTGA GGAGCGTTCA CGTTTGGCGC GTGAATTGCA CGATTCGGTC ACCCAGACCT TGTTTAGCAT GACCTTGACG GCTGAAGCTG CCCGTGCCCA AGTTGAGCGC AACCCAGTTC GGGCGGCCAC TCAAATTGAT CGCTTGAAGG CCGCAGCCCA TCAATCACTA GGCGAAATGC GTGAACTCTT GCTGCAATTA CGCCCAACGC CGTTGCAAGA ACATGGCATT ATCAAATCGC TGCGCGACCA TATTGCCAGC TTGAATGCTC AAGAAGTTGC CATTAATTTG GAGTGTGAGG GCGAGGATTC GGTACTTAGC CCAGCCAATG CTGCTGGCTT ATATCGAATT GCCCAAGAAG CGATTGCCAA TGCGCTGCGC CATGCCCAAG CAACTAATGT ACGAGTGAGC 
TTCAGCTTTG CCAGCAAATC GACGACCTTG ATGGTGAAAG ATAATGGCTG TGGCTTCGAC CCCGATTTAT TGGAGCGCAG TGGCCGCCAT TTGGGGCTAA CCAGTATGGC CGAGCGTGCC GGAGAACTGG GTGGCACGCT CGATGTGCAA AGTGTCGTTG GCGAAGGTAC CTGCTTAACC GTGCACCTGC ATGGGTAA
|
Protein sequence | MRVLDAPTNQ ARPTPQPRLT PAMSFGLATL GVFVGVGWLW LAPHWLALLI TLGSLSLLLG QAWASIQRTE RERTVLAQAH SALDSEYNSI QADYATLERA YEDLQQSYQR LQRRIDERTA ERDQSTNQLQ TLLDDTRSYV AQLTALNEVS VALNATLDRD EVLACILRQL ERVVTFDSAS VQQLEDDELQ VIAARGMNNE VYDMRISVPD NELARKVVGS PAPVVLGDVR KDPSFVMQPG PIRSWIGVAL RVGERTVGIL TVDSHRENAY TADDGHLVAN FANQAALALH NSHLFAAAER RATEMALLLE MTRTVGSTLH LPEVLLRAAA AIGEALHAED VLVLLLDEQG ERLTPQAGAM GDHNYLRMRW ASPTSLSNEP TLAKVIKQGC ARVIHAVSPT IPYQTLLALP LSIKEQVLGV VLVATPDRDA PFGPQQLVLA EGLATSAAIA IEQARLYDQA RRAAQAEERS RLARELHDSV TQTLFSMTLT AEAARAQVER NPVRAATQID RLKAAAHQSL GEMRELLLQL RPTPLQEHGI IKSLRDHIAS LNAQEVAINL ECEGEDSVLS PANAAGLYRI AQEAIANALR HAQATNVRVS FSFASKSTTL MVKDNGCGFD PDLLERSGRH LGLTSMAERA GELGGTLDVQ SVVGEGTCLT VHLHG
|
| |
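
The relationship between the gene sequence and the protein sequence can be sketched with a small translation routine. This is an illustrative sketch, not the pipeline that produced this record: the `translate` helper is hypothetical, and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). Only the first and last few codons of the gene are used here.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G,
# with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCGTGTTC TTGATGCCCC AACCAATCAA"))  # MRVLDAPTNQ
# Last 18 bases of the gene sequence, ending in the TAA stop codon.
print(translate("GTGCACCTGC ATGGGTAA"))               # VHLHG
```

The two fragments reproduce the first ten and last five residues of the protein sequence listed above.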