Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_3004 |
Symbol | |
ID | 3681239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 3725337 |
End bp | 3728162 |
Gene Length | 2826 bp |
Protein Length | 941 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 637718350 |
Product | Signal transduction histidine kinase |
Protein accession | YP_323509 |
Protein GI | 75909213 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2203] FOG: GAF domain [COG3920] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCTTA TCATCAATTA TTCATCAAAT AAAGCAATAA TGTTGTTTAC CGACAGTGAA AGCCTACGCT TAGAAGCGCT CTATCAATAT TGCATTTTAG ATACACCACC AGAAGAAGTA TTTGACGATC TGGTAAATAT TGCTGCCGAT AGCTGTAATA CACCAATTGC TCTGATCAGT ATTTTAGATT CTCAAAGAGA ATGGTTTAAG TCAAAAGTAG GAATTTCTGA ATTAGAAATA CCACGTGATA TATGTCTTGG TATTCATACA ATTGGGCAAA ATGACATATT GATTATTCCT GACACCTGGC AAGATGAACG ATTTGTAGAA AATCCCCTAG TGAGGCAAAA GCCAAGCGCT TTCCGTTTCT ATGGAGGAGT TCCCTTAATT AACTCTGAGG GTTTTGCCTT GGGTTGTCTG GCCGTGATAG ATTTTACCCC CCGTAACTTA AGTTTAAAAG AACAGCAAAT ACTCCAACGT TTAGCGCGTC AAATCATCAG ACAATTAGAG TTACACAAAA AGCGAATTAA TCATGAAAGC AGCTTCAATG CTCATCTCTT GTTCACTAAT AATCCTCGTC CTATGTGGAT ATGTGAGATG CAAAGTCTAC AAATTTTAGA TGTTAATCAA GCTGCTATTA CACAATATGG TTACTCAAAA ACAGAGTTTT TACAAATGCA GTTTGCCCAA GTTTTTGTAC CTGAGTTTAT ATCGGACTTA ATCAGGGATA TAGAACAGGA ATATTCTCAA CTTCCCTTCC TAATGGAATG TCAACATCGT CTAAGGGGTG GACAAGTTAT TGATGTTGAA TTAGCTATTA ATTATATAGA ATATTCAGGT TATCAAGCTT GTTTAGTTGA TACCATAAAT ATTACTGAAC ATATTCAAAT AGAACGGAAT CTACAAAAAA GTGAAGCCAG AGTTAGAACT ATTCTGGAAG CAATTCCCGT ACCTTTGGTG ATTTCCCGCG TTGATGATGG CTTAATTTTA TATACTAATT CAGAGTTTCT GCAAACATTC CAACTATCTG GAAATGATTT AATTAATCAC TATGCCGCAG ATTTATATGA AAACTCCGAA GACCGACAGC AGATATTAGA AGCTCTTAGT CAACATGGAT CACTTCAGAA TTATGATATT CAATTTAAGA AAAGTGACGG AACTTCATTT TGGGCGATCG CCTCAATTCA GTACTTAAAT TTCAACAACG AGTATGCAAT TTTAACCGTC CTCTACGATA TTACAGAGCG CAAAAATATT GAAGCCAAGT TACAAGAGCA AAATGCACTT TTACAAAGTA TTTTTGCTGG TATCCCGTTG ATGATTGCAC TGATTAGTCC TGAAGGTCAG GTTCAATGGA TAAATCAAGA ATTAGAGCGT CTTTTAGGTT GGAGTTTGAG AGATTATCAA ACCCTTGATA TTTTTGCAGA GTTATATCCT CAGCCTGAAT ATCGTCAATC GGTCATCAAG TTTATGCAAT CAGGAGAATG TATTTGGGGT GATTTTAGAA CTCAGACACG ATATGGGCAA GTATTAGATA CTTCTTGGAT AAATATCAAA CTTGCCGACG GTCGAATAAT TGGTATTGGT CAAGAAATTA CCGAGCGTAA ACAAACCGAA CGGGCTTTGA AAGGACAGAT TGAGCGAGAA CAGTTAATGC GTGCTGTTGC TCAACGAATT CGCCAATCGC TGAATCTACA AAATATTCTG AATGCCACAG TTAAAGAAAT TAAAGACTTG CTTGGGGTTG ATCGGGTCGT GGTTTATCAG TTTGCCCCAG ATATGAGTGG TAAAATCGTA GCAGAATCGG TGAAGCCTGG ATGGAAAATT GCTCTAGGTG CAGATATTCA AGATAATTGT TTCCAGTCAG GCGCAGGAGC AGATTATTGT CAAGGACATA AAAGAGCGAT CGCCAACATC TACACAGCTG GATTAACTGA TTGTCATCTG CATTTATTAG AACAATTTCA AGTTAAAGCT AACTTAGTTG TACCTATTCT GTTAGAAGTG AGTGAAGGCA ACACTGTGCC GCAGCTTTGG GGTTTATTAA TAGCCCATCA ATGTTCTACC CCACGGGACT GGGAAGCGCA TGAACTAGAT TTGCTCGACC AACTTTCCGT CCCCATTGCG ATCGCCATCC AGCAATCAAG CATACTTCAG CAAGCCCAAA ATGAATTGGC TGAACGCCAA AAAGTAGAAG TTCGCTTGAG AAGTGCCTTA GCGGAAAAAG AGGTTTTACT CAAAGAGGTT CATCATCGGG TTAAAAATAA TTTACAGATA GTTTCTGGAT TATTACTACT TCATTCTCAA ACACTCAAAG ACCCAGAATT AATCAGAACT CTGCAAGAAA GTCAAAACCG TATTGAGTCC ATCTCAATGA TTCACAAGAA CTTATATACT TCACCAAATA TTGGGCAACT TGATGTTGTT GATTATGTTA ATAATTTAGC TACGAGTATT TTAATATCCT ATCAATTAGA GCCAGGGAGA ATCAGTTTAG AAACTCATAT TCACCCGGTT GATTTAAATC TTGATCAAGC CATTGCCTGC GGTTTAATTA TCAATGAACT AATTTCCAAT TCACTGAAAC ACGCTTTTCC TCAAAATACA ACAGGTGCAA TAAAAATTGA TTTACAAAAA GTCGATGACA AAATTGAGAT GACTATTCAA GATAATGGTA TCGGTTTACC AGATAATTTA GATTGGCGGT ATACAGATTC TTTAGGCCTT TCGCTAGTTC ATGACTTAGT AACAGAACAA CTAGAAGGCA CTGTTAGTAT CGAACGTCAA CCAGGAACTA CATTTAAAAT CCAATTTTCG CATTAA
|
Protein sequence | MQLIINYSSN KAIMLFTDSE SLRLEALYQY CILDTPPEEV FDDLVNIAAD SCNTPIALIS ILDSQREWFK SKVGISELEI PRDICLGIHT IGQNDILIIP DTWQDERFVE NPLVRQKPSA FRFYGGVPLI NSEGFALGCL AVIDFTPRNL SLKEQQILQR LARQIIRQLE LHKKRINHES SFNAHLLFTN NPRPMWICEM QSLQILDVNQ AAITQYGYSK TEFLQMQFAQ VFVPEFISDL IRDIEQEYSQ LPFLMECQHR LRGGQVIDVE LAINYIEYSG YQACLVDTIN ITEHIQIERN LQKSEARVRT ILEAIPVPLV ISRVDDGLIL YTNSEFLQTF QLSGNDLINH YAADLYENSE DRQQILEALS QHGSLQNYDI QFKKSDGTSF WAIASIQYLN FNNEYAILTV LYDITERKNI EAKLQEQNAL LQSIFAGIPL MIALISPEGQ VQWINQELER LLGWSLRDYQ TLDIFAELYP QPEYRQSVIK FMQSGECIWG DFRTQTRYGQ VLDTSWINIK LADGRIIGIG QEITERKQTE RALKGQIERE QLMRAVAQRI RQSLNLQNIL NATVKEIKDL LGVDRVVVYQ FAPDMSGKIV AESVKPGWKI ALGADIQDNC FQSGAGADYC QGHKRAIANI YTAGLTDCHL HLLEQFQVKA NLVVPILLEV SEGNTVPQLW GLLIAHQCST PRDWEAHELD LLDQLSVPIA IAIQQSSILQ QAQNELAERQ KVEVRLRSAL AEKEVLLKEV HHRVKNNLQI VSGLLLLHSQ TLKDPELIRT LQESQNRIES ISMIHKNLYT SPNIGQLDVV DYVNNLATSI LISYQLEPGR ISLETHIHPV DLNLDQAIAC GLIINELISN SLKHAFPQNT TGAIKIDLQK VDDKIEMTIQ DNGIGLPDNL DWRYTDSLGL SLVHDLVTEQ LEGTVSIERQ PGTTFKIQFS H
|
| |