Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_2995 |
Symbol | |
ID | 9340798 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 3085455 |
End bp | 3088463 |
Gene Length | 3009 bp |
Protein Length | 1002 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | |
Product | multi-sensor hybrid histidine kinase |
Protein accession | YP_003721911 |
Protein GI | 298491734 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.175208 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGGAA AGGACAATTA CCAAGAGCAA CAGGAAATTA ATTCTACTGA TGAAGAGGGG ATGATTTTCC AGCTTGCTGA TACCACTATA CAGGCGTGTA ATGCTGCTGC TGAGAGAATA ATTGGATATA AAAATGAGCA AATATTGGGA AAAACTCCCT TTAAACCTCC ATGGCAAATA GTTCATGAAG ATGGAACAAT TTTTACGCCA GAAACAGTTC CACCTATAGT TGCTTTACAA ACAGGTAAGC CATGCAAAAA CGTCGTCATG GGGCTTTATC AGCCAACTGG TCAGCTAGTT TGGTTATTGC TTAACTCCCA GCCTTTATTT CAAACTAACA AATCTACTCC CTATGCAGTT GTCACCACAT TTACCGACTT TACAAAAGTG AAATGTCGGC AATCTAGGAG TGATGCTTGT ATTACCAATA ATAAAAAACT AATAGTAACT TGGAATCAAG ATCAAGCTGT AGAAATTTCT CCAAATACGG ATAAGCAAAA GCTAAACCAA CCAGTTGCAG AACCTATATC TGACGATTGC ACCAAAGCCT TCTTATCCAT GCAAAACCAT ATTTTAGAAC TAATTGCTAC GGGTAATTCA CTCTCTGAGA TCCTCGAAAA GTTAGCTCAG TCAATAGAAG CGCAGTTGGC TGAAGTTTTC TGTTCCATCA TGTTACTAAA TGATACAGGC ACAAAATTAT ATCCTGGCGT AAGTTCTAGT CTACCAGAAA AATATATTCA AGAATTAAAG AATGGTGTTC CCGTAGGTGC AGATGTAGGT AGTTGTGGCA CATCAGCCTA TACCAAACAA ACAGTAATAG TCTCAGACAT TGCCAGCGAT CACAAATGGG TAAAATACCG TGACTTGACA TTGACGAATA ACCTCAAAGC TTGTTGGTCT ACACCCATTT TGGACAGTCA AGGCCATGTT TTAGGAACAT TTGCACTATA TTATCCCACA GTTCGCATTC CTCAAAAATC AGATCTGCAA CTCATTGAGA GAGCATCTCA CCTAGCAGGT GTAGCAATTG AACGTCAAAG ATGGCAACTA GCTTTGCAAC AAAGTGAATA TCAATTGCGA TTAATGACGG AAACAATTCC CCAACAGGTA TGGACAGCAC AACCTAACGG TCTGGTTGAT TATTATAACC AGCGGTGGTG TGACTTTACA GGTAAAACGC AAGAACAACT GAAAACTGAG AGTTGGGAGC AGATTATTCA CCCAGAAGAC TTACCAAAGG TAGAAGAATT GTGGGCTAAA GCGGTACAAA CCGGCAGTGA CTATGAAGTA GAAACTCGGA TGCTCTCTGG GAGTGGAGAG TACCATTGGA TTTTGGGACA GGCACGTCCT TTGCGAAATC AACAGGGACA GATTGTTAAA TGGTACGGTA CCAATACCGA CATTACAGAC CATATCCAAG CCAGAGAAGC TTTACGAGAA AGTGAGCTAA ACTTCCGCAC CTTGGCAGAT ACCATGCCGC AGATGATCTG GACTGCTAAA GGAGATGGCT GGTTAGAATA CTGTAACCAA CGTTGGTTGG ACTATACTGG TATGACACTG GAGGAAACTC AAGGTTGGGG TTGGCAACCT GTACTGCATC CCGATGATGC ACAAATCTGT ATAGATATTT GGAGTGAGTC AGTTCGCACA GGTCAAAATT ATCAAATTGA GTACCGCTTA CGTCGCGCTA GAGATGGGCA ATATCGTTGG CATCTAGGTC GAGCCTTTCC ACTACGCAAT CAAAAGGGAC AGATTATCAA ATGGTTTGGT TCCTGTACTG ATATTGACGA CCACAAGCGG TCTGAGGAAG CATTGCGAAA TGCCCTCCAA GAAAAGCAAG CAGCTTATGA AGCCGCTGAA ACAGCGAATC GCGTTAAAGA TGAGTTTCTG GCTATCCTTT CCCACGAACT CCGTACCCCG CTTAATCCGA TCCTGGGATG GTCTCAATTG TTAAAAACTG GCAAACTGGA CAAAGATAAG ACAGATGAAG CCCTCAAAAG TATTGAGCGT AATGCAAAAT TGCAGGTTGA CCTGATTGAC GATTTACTAG ATGTGTCACG CATCCTCCGA GGCAAACTAA CTCTGAATGT GACTACAGTT AATTTGGCAG ATATAATTTC TGCAGCATTG GAAACAATGA GTTTAGCAGC CTCTGTTAAA CAAATTGCCG TTCAAACCCT AATTGAACCG AATATAGGAA AAGTTAGAGG TGATCCGGCT CGCTTGCAAC AAATTGTCTG GAATCTTTTG TCTAATGCTA TTAAATTTAC ACCTCAAGGC GGAGAGGTAG TAGTACAGCT AGAGCAAATT GGTTCAATGG CTCATATTAT AGTTAGTGAT CAAGGTAAAG GTATCAACCG TAAATTTCTA CCTCATATAT TTGAACACTT CCGGCAAGAG GATGGCTCAA CTACCAGAAA ATTTGGTGGT CTGGGATTGG GACTAGCAAT AGTCCGCCAT TTAGTAGAAT TACACGGTGG TACAGTTGAG GCCGATAGTC CAGGAGAAGG ACAGGGCACA ACTTTAAAAG TTATACTACC GTTAATATCT TCCTTGCCAG TGACAGTTAC GAATGGTCAA TTTGAGGACG TAGCTTTAAG TTTAAAAGGA ATTAAAATTT TGGTTGTGGA TGATGATGTT GATTCGCTGG CTTTTGTTAA AGTTGTGCTG GAATTGTATG AGGCAACTGT GACCACAGCG GCATCAGCAC CAGAAGCATT ACAAATTGGG GTACAGTTAA AGCCTGATGT TTTGATCAGT GATATTGGAA TGCCGGAGAT GGATGGTTAT GAACTGTTAA GAAGAATGAG AGCTTTGTCG TCAGACTTGG GTGGAGACAT TCCAGCAGTA AGCAAAGCTA TACCGAAGGC GATCGCTCTA ACAGCATTTG CTGGAGAACT TGACGAACAA CAGGCAATGG CGGCAGGTTT TCAAATACAC ATACCTAAAC CAGTAGAACC AGAGGCGTTA GCAGCAGCTG TTGCCCAAGT TCTGAAAATA AATGTTTAG
|
Protein sequence | MQGKDNYQEQ QEINSTDEEG MIFQLADTTI QACNAAAERI IGYKNEQILG KTPFKPPWQI VHEDGTIFTP ETVPPIVALQ TGKPCKNVVM GLYQPTGQLV WLLLNSQPLF QTNKSTPYAV VTTFTDFTKV KCRQSRSDAC ITNNKKLIVT WNQDQAVEIS PNTDKQKLNQ PVAEPISDDC TKAFLSMQNH ILELIATGNS LSEILEKLAQ SIEAQLAEVF CSIMLLNDTG TKLYPGVSSS LPEKYIQELK NGVPVGADVG SCGTSAYTKQ TVIVSDIASD HKWVKYRDLT LTNNLKACWS TPILDSQGHV LGTFALYYPT VRIPQKSDLQ LIERASHLAG VAIERQRWQL ALQQSEYQLR LMTETIPQQV WTAQPNGLVD YYNQRWCDFT GKTQEQLKTE SWEQIIHPED LPKVEELWAK AVQTGSDYEV ETRMLSGSGE YHWILGQARP LRNQQGQIVK WYGTNTDITD HIQAREALRE SELNFRTLAD TMPQMIWTAK GDGWLEYCNQ RWLDYTGMTL EETQGWGWQP VLHPDDAQIC IDIWSESVRT GQNYQIEYRL RRARDGQYRW HLGRAFPLRN QKGQIIKWFG SCTDIDDHKR SEEALRNALQ EKQAAYEAAE TANRVKDEFL AILSHELRTP LNPILGWSQL LKTGKLDKDK TDEALKSIER NAKLQVDLID DLLDVSRILR GKLTLNVTTV NLADIISAAL ETMSLAASVK QIAVQTLIEP NIGKVRGDPA RLQQIVWNLL SNAIKFTPQG GEVVVQLEQI GSMAHIIVSD QGKGINRKFL PHIFEHFRQE DGSTTRKFGG LGLGLAIVRH LVELHGGTVE ADSPGEGQGT TLKVILPLIS SLPVTVTNGQ FEDVALSLKG IKILVVDDDV DSLAFVKVVL ELYEATVTTA ASAPEALQIG VQLKPDVLIS DIGMPEMDGY ELLRRMRALS SDLGGDIPAV SKAIPKAIAL TAFAGELDEQ QAMAAGFQIH IPKPVEPEAL AAAVAQVLKI NV
|
| |