Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3230 |
Symbol | |
ID | 5735098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4089256 |
End bp | 4091166 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280376 |
Product | signal transduction histidine kinase |
Protein accession | YP_001545995 |
Protein GI | 159899748 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3920] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00732898 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGATA CAAGCGCTCC GAGCGCAGCT ATTCTCGCGA TTCTGCAGAA TGATAATGCA GTGGCGCAGG CGCAGGAACT TCGGGCGTTT GGCGCAACCT GGCAAAGCCA AAACGCTGAT CCGCAAATGC TGACCAAACT TGGCATCGAA GCCGCTACCC AAGCTCGGCT TGATCCCAGT GCGGTGGTAA CCGATCTCTT GCTTGGCTAT GCCGCCGCCC GCGAGGAGCA ACGCACTCGC CATTGGCAAA ACCACTTGAT GCGCCGCGTG CAGCAGTTGG ATGGTTTGCA TCGTATTATT AGCGCCGCCA ACTCAACCCT CGATATTGAT GCCTCACTCC AAACCGTCGT CGATACAGTC AGCGATGTGA TGCGGGTCGA TGTTTGCTCG ATTTATCTCT TTGATCAGCA TCGGCGCATG TTGCGATTGG TCGCGACTCG TGGGCTGAAT CCCAAGGCAA TCGGCACGGT TGAGGTGGAA ATCGGTGTGG GCGTGACTGG TTGGGCTGGC GAATTGGGCA AGCCGGTGGC GATTTTTGAT GTGCGCAATG AGCCACGCTA CCAGCTTGAG CCACTGCTCG AAGAATGGCA TTTTCGCTCG CTGTTGGCAG TACCAGTAAT TTTATTTGCC AGCGAACGCC ACCATATCGA AACCCTGCAA GGCGTGATTA CGGTGCAAAA TCGCGATCCC CATGAATTTT CGCAAGAGCA AACCTCATAT CTCGAAGTTG TGGCGGGGGA AATTGCGCTC TCAATCGCCA ATGCCCAAAT GTATCAACAA ACTGATGCTC GTTTACACCA AAAAATTCGT GAATTAACCA CCTTGCAGCG CGTAACAGCG GCGCTAGCCT CAACCTTGGA TGTTGATACG TTGCTGCATT TGATTGTGGA GCAAGCTGTC AAGATTGCCG ATGTTGATCG GACTGATATT TTCCAAGTAC GCCCCAACAA CAAGGTGAAA ATGTTGGCTT CATATGGGCC TGGCCGCACT TCCGGCGTAG AAGATATGAT CGTGCAGGTG ATTCGTGAGC ATCGCGCGAT CGCGGTGCCA AATGCCTACA CCGACGAACG TTGGTCGGGC GTGCAGGCGA TTGCCTATCG CGAGGGCTTT CATTCGTTGT TTGCGCTGCC GATGCGCACG GGCAATCGGA TTATCGGGGC ATTATGTTTT TATAGCTATG CGCCGCGCCA TATTGAATAT GAGCAGGTGC AATTGCTCAC CACCTTTGCC GATGAAGGTG CAATCGCGAT TGAAAATGCC CGCTTGTACG AAGAAACCCA GCGCAATCTG ACGATCAAAT CAACCTTGCT GCAAGAAATT CATCATCGCG TCCGCAACAA CTTACAAACG ATCTCAGCGT TGCTGCAAAT GCAGGCACGG CGTTTGAACA CCGAAACTGA AGGTCGGCAA GCACTTGATG ATAGTGTACG CCGCATCCAT GCCATTGCCG CAGTGCATAA TTTGCTCAGC CACGATGGCG AAGGCCAAAC CACGGTGCAA GATATTGCCA AGCAAATTGC CGAAAATATT CAGATGAGCC TGCCAAGCGA AACGCCAGTT GAGTTTTTGA TCACTGGCGA TTCAGTTTCG GTCAATGCGC GGGCGGCTAC TGTGCTGGCG ATCGTGATCA ACGAATTGGT GCATAATGCG CTTGATCATG GCTTGAGTGC CGAAGGCGGC ATTATTGGCA TCGACGGCTG GATGGAAAAC GAAGAACAGG CTTGTGTCCA AGTGCGCGAC GATGGCCCAA TCCGACCGGA GCCAGTCAAA CGTCGTGTAA GCACTGGGCT AGGCCTTGGC ATTATCGAAA CTCTCGTCAA CACCGATCTT GGCGGCAAAT TTGAGTTCAA ACGTGAAACT GAATGGACAC GGGCATTAAT TACTTTTGCG CCCGATGAGT TGGATGATTA A
|
Protein sequence | MIDTSAPSAA ILAILQNDNA VAQAQELRAF GATWQSQNAD PQMLTKLGIE AATQARLDPS AVVTDLLLGY AAAREEQRTR HWQNHLMRRV QQLDGLHRII SAANSTLDID ASLQTVVDTV SDVMRVDVCS IYLFDQHRRM LRLVATRGLN PKAIGTVEVE IGVGVTGWAG ELGKPVAIFD VRNEPRYQLE PLLEEWHFRS LLAVPVILFA SERHHIETLQ GVITVQNRDP HEFSQEQTSY LEVVAGEIAL SIANAQMYQQ TDARLHQKIR ELTTLQRVTA ALASTLDVDT LLHLIVEQAV KIADVDRTDI FQVRPNNKVK MLASYGPGRT SGVEDMIVQV IREHRAIAVP NAYTDERWSG VQAIAYREGF HSLFALPMRT GNRIIGALCF YSYAPRHIEY EQVQLLTTFA DEGAIAIENA RLYEETQRNL TIKSTLLQEI HHRVRNNLQT ISALLQMQAR RLNTETEGRQ ALDDSVRRIH AIAAVHNLLS HDGEGQTTVQ DIAKQIAENI QMSLPSETPV EFLITGDSVS VNARAATVLA IVINELVHNA LDHGLSAEGG IIGIDGWMEN EEQACVQVRD DGPIRPEPVK RRVSTGLGLG IIETLVNTDL GGKFEFKRET EWTRALITFA PDELDD
|
| |