Gene Information |
Locus tag | Haur_3020 |
Symbol | |
ID | 5734877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3814273 |
End bp | 3816168 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280164 |
Product | histidine kinase |
Protein accession | YP_001545786 |
Protein GI | 159899539 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
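
The coordinate and length fields above are arithmetically related, and the relationship can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 3814273
END_BP = 3816168


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a complete CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1896
print(protein_length_aa(length))  # 631
```

Under those assumptions the computed values agree with the Gene Length (1896 bp) and Protein Length (631 aa) fields.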
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000249854 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATTATC TTTTCGATCA TCCAACATGT ATTCGGCGTA CATCGCCTCA ACATTTAGCA AGTTATGCCT ATGGCGTGCG GGCAAGCTGC TCGGTGCGCC ATCCGGCTGG CGATGGAGTC GATGACCCAC GCTTTTTATT ATTTGGCCCC CAACGTGGCA ATTATCGCCA TTGGTTCGAT ACCTATGAAA TTAAGCCCGA TAATGCGCCC GATTGGTATG CCCTGACCTT TCCCAAGCCT ACCAAATTAA ATGTGGTTTA TTGGATGCAT GGCCCCATGT TTGATGATCA TGGTTGGTGG ACTTCGCTCC AGGTCGAATA TCGCGATGCT GAGGGCGCTT GGCATAAAGT TGACGATTTG CAGATCACGC CGAATTACCA TTTTAGCCCG CAACGCGGCG ATCGTAAGCC CTTCAGTGCT TATGCTGCTC ATTTCAATCC AGTGCGAACC AGCGCGATTC GACTGATCGG CCAGCCCAGC GGCAAGCCGC AAATTACGAC CATGGCCTAT CTTGCCGCCG ATTGGAGCAC TTGCGAAGGC ATCGCCGCCC ACTTGCGCCA TATTCAGCAA CCATTGCCCG CTATTTTTGA TCTGTTACCA GCCAATACCT TGTGGGATAC CTTGGCCAGC CTGCGTGATT TAACCAAAAT TGCCTTTGAT CTACAAACCA GCGCGGGCTT GGGACTTGAC CACTTCTTGG AGCCACAGCA TTATCAACGG TTTCACGAAA GCCAGCGTCA ATGCTTCGCT GATGATTCGC TCTATCAATT AATTGGTCAG CATGCTGGTT GGCGCAAGCT GGGCCAAACC ATCGAGCAAG CTCGCGAACA AGCCTATGAA ACCAAACAAC CAGTCATCGC CCAACATCAT GGTGGTTTGG TCTGGCTGGT CGTGCCAGTC ATTAGCAATG ATCACGTTTT AGGCACAATC GAAAATCGCA ATTTTATCGC CCAAGAGCCA ATCGATTGGC AATGGCATCG TTGTTATGCC CAAGAATTGG GGCTGGATTG GGAGCGTTAT CGCACGGCGT TGGAGCAAAT TAGCACATTC AGTCAACAAC AAATCAATGC CACAATCAAT CATGTGCAGC AGCTGGTACG CCTAGCTCAA CAATTGCTCA AAGATAGCCA AGAAATTCAT ACGCTGAAAA GTAGCGCCTT GGCTGCCGAG GTGTTTAGTC GCTCCAAAAG CACCTTTATG AGCATGATGA GCCATGAATT ACGCACCCCG CTGAATGCCA TTATTGGCTA TAGCGAGCTG CTCATCGACG ATCTGAGCGC AACCAACCAA CAACAATCGA CTGATGATGC CCGCCAAATT CGCAGCGCAG GTCGCCATTT ACTGCATGTG ATCGAAACAA TTTTGCAGGT ATCCAACTTA GAATCAGGCG CATCAAATGT ACATTACGAC GATGTTGATC TTGAAATGCT GACCAATTCG TTGGAACTTA GTTTGCGCAC CCAATTTCAA AAACGCAGCA ATCAACTTGA AATTAAGATC GATCCTAATG CCTCGTGGGT CTATTCAGAT AGCTCGAAAG TGCGTCAGAT TATCTTCCAA CTGCTGAATA ATGCCAATAA ATTTACCGAT CATGGTCAAG TGCGGCTAAC GATTGCCCCT GATAGCCTTG ACCCAGAGAT GCTCGCATTC GAGGTTTGCG ACACAGGTAT TGGCATCGAT CATGAGCATT TGCCGCTCTT ATTTGCTGAA TTCAGCCAAC TTGATGCCTC AAGCACTCGC CGCTATGATG GCACAGGTAT GGGCTTGGCG CTTTGTCGCC ACTTGGCGCG GCTGCTTGGC 
GGCGATATTC GGGTTTGGAG CGAGCCTGGC ATTGGCTCGA CCTTTACCCT CGCCATCCCA CGTCAAAGCG TGATGACTCC AGTTAACGAC CTCTGA
|
Protein sequence | MHYLFDHPTC IRRTSPQHLA SYAYGVRASC SVRHPAGDGV DDPRFLLFGP QRGNYRHWFD TYEIKPDNAP DWYALTFPKP TKLNVVYWMH GPMFDDHGWW TSLQVEYRDA EGAWHKVDDL QITPNYHFSP QRGDRKPFSA YAAHFNPVRT SAIRLIGQPS GKPQITTMAY LAADWSTCEG IAAHLRHIQQ PLPAIFDLLP ANTLWDTLAS LRDLTKIAFD LQTSAGLGLD HFLEPQHYQR FHESQRQCFA DDSLYQLIGQ HAGWRKLGQT IEQAREQAYE TKQPVIAQHH GGLVWLVVPV ISNDHVLGTI ENRNFIAQEP IDWQWHRCYA QELGLDWERY RTALEQISTF SQQQINATIN HVQQLVRLAQ QLLKDSQEIH TLKSSALAAE VFSRSKSTFM SMMSHELRTP LNAIIGYSEL LIDDLSATNQ QQSTDDARQI RSAGRHLLHV IETILQVSNL ESGASNVHYD DVDLEMLTNS LELSLRTQFQ KRSNQLEIKI DPNASWVYSD SSKVRQIIFQ LLNNANKFTD HGQVRLTIAP DSLDPEMLAF EVCDTGIGID HEHLPLLFAE FSQLDASSTR RYDGTGMGLA LCRHLARLLG GDIRVWSEPG IGSTFTLAIP RQSVMTPVND L
|
| |
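
The sequence fields are printed in space-separated blocks of ten characters, so they need cleaning before any calculation. The following is a minimal sketch of how the GC content and the terminal stop codon could be checked. The helper names are hypothetical, the stop codon set is the one used by translation table 11 (TAA, TAG, TGA), and the example strings are only the first and last blocks of the gene sequence, not the full record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(raw: str) -> bool:
    """True if the last three bases form a stop codon."""
    return clean_sequence(raw)[-3:] in STOP_CODONS


# First block of the gene sequence: one G and two C in ten bases.
print(gc_fraction("ATGCATTATC"))            # 0.3
# Last two blocks of the gene sequence end in TGA.
print(ends_with_stop("AGTTAACGAC CTCTGA"))  # True
```

Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field, which the record reports rounded to a whole percentage.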