Gene Information |
Locus tag | Haur_1142 |
Symbol | |
ID | 5733034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1306365 |
End bp | 1308260 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641278281 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_001543918 |
Protein GI | 159897671 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase; [COG2202] FOG: PAS/PAC domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
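The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but the numbers fit) and that the last codon of the CDS is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1306365
end_bp = 1308260
gene_length_bp = 1896
protein_length_aa = 631

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the final codon is a stop codon and is not counted as a residue.
residues = codons - 1

print(span, codons, residues)  # 1896 632 631
```

Under these assumptions the span matches the listed gene length, and 632 codons minus one stop codon gives the listed protein length of 631 aa.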
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTCGGA AATTGAAATG GCCACTCGCT GGAGCATGGC AACGCCGCTT GATTAAAGCA ACTCGCCGTA AAGATGTAAC GCTTGAGCAA TTTCCGGTTG GCTTGGTTTG GTTTCAAGGC TCACAGGTTT TTTTTAATCA AGCGGTCACG GCGATGATTG GCTATAGCAA TGCGGAAATT GCGACAACTG AGCAATGGTT TAGCACCTTG TATGGCCCTG AAGCCGCCAG CATGCGCCAA TTGTATGCAA CAACCTCGGC AGCGACCTTA GGCCAGACAA TCCATGGCTT TGTCGTGAAT CGCCAGAACC AAAGTTGCCT TTTAGAATGT ACGTTGGCTT CCCATGGCCA TCGTCAGGTT TGGTTGGTAC GCGATATTAC CGAGAGCAAT CGGCTTGAGC GCTTGCTGTT GCAAACTGAA CAAACCGCCC GCGTCGGCGG CTGGGAGATC GATTTGCGCA CCAACCAAGT ATTTTGGACG CGCGAAATGT ACCATATTTT GGATACCACA GCACACGAAT ACACGCCCAC GATTGAGAAT CAGAATTTTT TCCATACCAA TGCCACGTTA ATCCAACTTG AGGCAATTTT TCGCCAAATG ATCGAGCAGC GTGGCTCGTT TGATATGAGT GTGGAAATGC GGACGTTTCG GGGGCGTTCG TTCTGGGGTC GTTTTACTGG GCGAGTTGAG CTAGAGTTTG GCCAGCCAAT CAAAATTTAT GGCTCGTTGC AAGATGTAAC TGAGCACCAT CAATTAACTG AGGCCTTGCG GGTGGCCGAG CACGACTATC GCACAATTTT TGAAACCACC AAAATTGGCA TCTTTCGCAT TACTCCCGAT GGACGGGTGT TACGGGCTAA TCCGGCGTTG GTGCGCTTGG CGGGCTTTGC CCATGAACAT GAATTGGTCG ATTATGTGGC CGATTTAACC ACGATGTATG TTGAGCCGCA GCGCTTTGAA TACTTGCGTG AGCTGCTGCA AACCAATGGC TCGTATGATG AAATTGAATC GGAAGTTTAT CGGCCTGCCA CTGGCGAACG GATTTGGATC AGCGAGACCT CGCGTTTGGT GTATGCTGAG GATGGCTCGA TGCTCTACGC CGAAGGCACG ATTCAAGAAA TTACGGCACG CAAACAGGTC GAAGAGGCGT TGCGCCATGC CCGTGATGCT GCCGAAGCCG CCAATCATGC CAAAAGCACT TTTTTGGCCA ATATGAGCCA CGAGCTGCGC ACGCCACTGA ATGCAATTAT TGGCTATAGC GAATTGCTGA TGGACGATAC TGATTTTGAT GATCCGACGA TGGTTGAGCA GTTTCGCCAT GATATTGCGC AAATTAATGA TGCAGGTCAT CAATTGCTCA ATTTGGTCAA CGATGTGCTT GATTTGGCCA AAGTTGAGGC TGGCAAATAT CAAGTTGCTG CTGAAACCTT CGATCTCAAC AGCCTTGTAC GTGATTTGAT TGCCACAATT AACCCAATGG CTCAGAAAAA TGCCAATAGC CTTTACTTTG AGCCAAACAA ACATCTGCCG TTAATTCATA CTGATCGCTC GATGTTGCGC CAGATTTTGC TGAATTTATT GAGTAATGCC GCCAAATTTA CCAAAGCAGG CAGCATCAAC ATCAGCGTCA GCTTTGATCC AGCCAGCCAA CATGTGCAAT GTCGGGTGCG CGATACTGGC ATTGGTATGA ACGATGAGCA AATGCAGCGT TTGTTTGAGC CATTTACCCA AGGTGATGCC TCGACGACGC GGCGCTATGG TGGCACTGGC TTGGGCTTGG CGCTTTGTCG CCATTTTATC 
GAACTATTGA ATGGCTCAAT TCAAGTTGAA AGTGTCTTTG GCCAAGGCTC GATCTTTACC ATTGTCTTTC CATGCTTGGT TGAGGCAATT GATTAG
|
Protein sequence | MPRKLKWPLA GAWQRRLIKA TRRKDVTLEQ FPVGLVWFQG SQVFFNQAVT AMIGYSNAEI ATTEQWFSTL YGPEAASMRQ LYATTSAATL GQTIHGFVVN RQNQSCLLEC TLASHGHRQV WLVRDITESN RLERLLLQTE QTARVGGWEI DLRTNQVFWT REMYHILDTT AHEYTPTIEN QNFFHTNATL IQLEAIFRQM IEQRGSFDMS VEMRTFRGRS FWGRFTGRVE LEFGQPIKIY GSLQDVTEHH QLTEALRVAE HDYRTIFETT KIGIFRITPD GRVLRANPAL VRLAGFAHEH ELVDYVADLT TMYVEPQRFE YLRELLQTNG SYDEIESEVY RPATGERIWI SETSRLVYAE DGSMLYAEGT IQEITARKQV EEALRHARDA AEAANHAKST FLANMSHELR TPLNAIIGYS ELLMDDTDFD DPTMVEQFRH DIAQINDAGH QLLNLVNDVL DLAKVEAGKY QVAAETFDLN SLVRDLIATI NPMAQKNANS LYFEPNKHLP LIHTDRSMLR QILLNLLSNA AKFTKAGSIN ISVSFDPASQ HVQCRVRDTG IGMNDEQMQR LFEPFTQGDA STTRRYGGTG LGLALCRHFI ELLNGSIQVE SVFGQGSIFT IVFPCLVEAI D
|
| |
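The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the database's own method: it builds a codon lookup using the amino acid assignments of NCBI translation table 11 (which are the same as the standard code; table 11 differs in its permitted start codons) and translates the first 30 and last 6 nucleotides of the gene sequence as listed. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (the layout NCBI uses).
# These assignments apply to translation table 11 as well as table 1.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 6 nucleotides, copied from the Gene sequence field.
gene_start = "ATGCCTCGGA AATTGAAATG GCCACTCGCT"
gene_end = "GATTAG"

print(translate(gene_start))  # MPRKLKWPLA
print(translate(gene_end))    # D
```

The translated fragments agree with the first ten residues (MPRKLKWPLA) and the final residue (D) of the listed protein sequence. Although the gene lies on the minus strand, the sequence as listed begins with ATG and ends with TAG, so it appears to be given in coding orientation and no reverse complement is applied here.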