Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4910 |
Symbol | |
ID | 5736746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 6248870 |
End bp | 6249913 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641282077 |
Product | histidine kinase |
Protein accession | YP_001547668 |
Protein GI | 159901421 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4585] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00236658 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGCCG ACAAAGACGC ATTATTAAAA GAATTGCGTG ACCAACAAGA CCGCATTCGT CGCCAAATGA CCGAACTTGA TGCCTTGGTT CGCCAAAATC AAGCTGAAGT CGATAAAATG TCGCAACGTG AAATGTCGGT TTCCAGTCGC CGTCGCGATA TGGAAGTCAA TTTCGAGCGC TATGAAAAAG TTGATATTAA AAATTTCTAC ACTTCGGCCC AAGAGGTTCA AACCCGTGTC CAAATGATGC GCAGCCAAGT GGAGCAACTG CAAACTAAAC AACAAGTGCT GCGCGAACAA CAAGATACGC TGCAAAAACT GATTCAAAGC CTCGATGAGG TCAGCGTTGT GGCCGAAACA ACCATTTCAA ACTTGCCACA CGTTAATCCA CAAGAACAAA TTGCCGCGAT TATTCAAGCC CAAGAAAAGG AGCGCCTGCG GATTTCGTTG CAAATGCACG ATGGCCCAGC TCAATCGATG AGTAACTTGG TGTTACGCGC CGAAATTTGC GAACGCTTCC TTGATCACGA TACCAATCAG GCTCGTTCAG AGATGTCTAG CCTCAAAACG GCGATCAATA CTGTGTTGCA AGATACACGG CGTTTTATCT TCGATTTACG CCCGATGACC CTTGACGATT TAGGGTTGTT GCCAACGCTC AAGCGTTATA GCCAAGAATT TGGCGATAAA AACAATATCG AGATCAATTT AATGGTACAA GGTTTGGAAA CCCGCTTGCC TAGTCATTAT GAAGTAACGA TTTTCCGTTT TGTCCAAGAA GCGCTGAATA ACGTGCAACG CCACGCCAAT GCTTCGCATG TCCGAATTAT CCTTGAGGCT GATGCTAGCC GCATTCAAAT TGCGATTGAG GACGATGGGG CTGGCTTCCA TGTTGCCGAA ACCCTCAACG ACCCAACAGG TAAACGCAAT ATGGGGATAG CCAGCTTACG TCAACAGGCC GAAGTGCTCT TACGTGGCCA AATGGGCATA GAAAGCACTG TAGGCAGGGG TACTCGGGTT GTAGCAGTCG TACCTTCACC CTAA
|
Protein sequence | MAADKDALLK ELRDQQDRIR RQMTELDALV RQNQAEVDKM SQREMSVSSR RRDMEVNFER YEKVDIKNFY TSAQEVQTRV QMMRSQVEQL QTKQQVLREQ QDTLQKLIQS LDEVSVVAET TISNLPHVNP QEQIAAIIQA QEKERLRISL QMHDGPAQSM SNLVLRAEIC ERFLDHDTNQ ARSEMSSLKT AINTVLQDTR RFIFDLRPMT LDDLGLLPTL KRYSQEFGDK NNIEINLMVQ GLETRLPSHY EVTIFRFVQE ALNNVQRHAN ASHVRIILEA DASRIQIAIE DDGAGFHVAE TLNDPTGKRN MGIASLRQQA EVLLRGQMGI ESTVGRGTRV VAVVPSP
|
| |