Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_5067 |
Symbol | |
ID | 9342876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 5190162 |
End bp | 5193170 |
Gene Length | 3009 bp |
Protein Length | 1002 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | |
Product | multi-sensor hybrid histidine kinase |
Protein accession | YP_003723286 |
Protein GI | 298493109 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAATA AATACTTTAA CATACCGGAA AGTCTCGATT ATAGTAAATA TGAACAACAA CCAATTCACA TCCCTGGTTA TATACAGGCA CATGGTGTAC TCCTAGCTTT ACAAGAACCA GATTTAACAA TCTTACAAAT TAGTGAAAAT ACAAATCATT TTTTTGGGAT TCCCCCAGAG TCTTTACTGG GGCAAAACTT AAACTGCCTC TTATCTTCAG AACAAATAGA ATCAATTACT CATTCGTTAC AAGAGGATAA ACCCGTAGTT ATCCATTCAT TAGAAATTAC AATTTACAAC CAAAAAAAGT CTTCAAGATT ATTAGGAATC CTGCATCGTT CTAATGGAGT TCTAATCCTA GAGTTAGAAC CTCATGCAGC TATCACAACA AATGCATTAG TAGGATATAC TTATCTTTTA GAAACAGCAA TTTTCAACAT ACGTAACGCC TTCCATTTAT CGGATATTCA TCAAATAATT GCCCAAGAAG TGAGAAAAAT AACTCAGTTT GATAGGGTAA TGATATACAG CTTTACCCCA GACTACAGTG GAGTTGTTAT TGCCGAAGAT AAACAAAGTA ATTTAGAAAG TTATTTAGGA TTACATTATC CTGCTTTTGA CGTTCCTAAT CAAGCCAGAC AACTTTACTA CAAAAATCGG CTGCGAATCA TTCCAGATGT TAATTATCAA CCTGTAAAAA TTATTCCCAG CAATCATCCT CTTAGCAATG AACCCCTTGA TTTAAGTTGT GCAATATTAA GAAATATCTC ATCCTGTCAT GTAGAATACT TACAAAATAT GGGTGTGACT GCTTCAATGT GCATTTCACT CATTCATGAA AATAAGCTTT GGGGTTTAAT TGCTTGTCAC CATTACTCCC CCAGATATAT TGATTATGAG ATTCGGAAAA TTTGTGAAGT AATCGGTCAA TTCTCATCTA TTGAATTAGT TAATCATCGC GAACGAGCCT TAAATTTCTA CCGTCAAAAA GTCAAATTAA TTCAACAAGA ACTAAATCAC AATTTAACCA ATGAATTAGA TCACATCGGT AAGGTCTTCC AGCGGAATAA AACAAATTTA TTAAAATTAT TAAAAGCGAA TGGCACTGCT GTTTTATTAC AAGGACAATT AATATTAATT GGTAATACCC CATCAGCAGA AGATACGCGG GATCTAGTAG AGTGGATATT AAATCAAAAA AATCAACAAG AAATTTTTTA CACAGATTCG CTCCCACCAC TATATCCAAA CGCTAGCAAG TTTCAAGATA TAGCAACTGG TATTTTAGCA ATTTCAATAT TTCTCCATCA GCAGTCTTAT CATATTATCT GGTTCAGACC AGAGCAAGTT CAAACTGTAA ATTGGGCAGG TAATCCCCAT GAAGCAATAG CTGTGAACTC ACAGCAAGAA TATTACCTCA CCCCACGTAA ATCATTTGAA CTATGGAAAC AGACAGTTAA ACACAAATCA CTTCCTTGGC AAATATTTGA AATTGAAGTG GCACAAGAGA TGAGAAATAG CCTCATGCTT GCTGCACTAG AGTTTTCCCA AGTAGCATTA CAACAAGCAG CTGAACAGGC AGAAATTGCC AACCGTGCTA AAAGTCAATT CTTGGCCAAG ATGAGTCATG AATTACGAAC TCCACTCAAT GCCATCTTGG GTTTCACTCA AATATTAACC CGTAATAGCT CATTATCTCA AGAAGATCAA GAAAATTTAG ACATCATTAG TCGCAGTGGT GAGCATTTGC TCTCATTAAT TAATGATGTA TTGGAAATGT CTAAGATTGA AGCTGGTAAA CCAACCCTAA ATGAAACCTA TTTTGACTTG TATCGTCTCA TTTACTCAAT TCAAGAAATG TTTGCTCTCA AAGCTGCAGA TAAGGGAATA AATTTAATTA CTGAATTTGG TGTCGAGGTG AATAAATATG TTTTTGGAGA TGAAGGCAAA CTCAAACAAA TACTAATTAA TTTAATAGGT AATGCTATAA AATTTACCGT TGTTGGTCAT GTCGCTATCA GGGTATCCTG TCCTCCAAAT TATGCACCTT TAGCTGTAAC AAATGGCAAA GTTCTAATTG AATTAGAAGT GGAAGATACA GGTGCCGGTA TTGCTCTTGA AGACCAAGAA TTAATTTTTG AAGCATTCCT CCAGTCCAAA GGTGGAAGAC AGTTTATGCA AGGGACTGGA CTGGGATTAG CTATTAGCCG TCAATTTGCT AGACTCATGG GAGGTGACAT AACAGTTAAG AGTATTTTAG GTGAGGGTAC AACCTTTACT TGTCGTGTTC AACTCAGTCT TGCTAATAAG GGAGATATCA TTTCTCCTAT TAAAAATACC AAAAGAGTTG TTGGTCTAGA ACCAGGACAA CCTAGATACA GAATCTTAAT TGTTGAAGAC ATTTTAGAAA ATCGTCTTTT ACTAGTTAAA CTACTAGAAT CTGTTGGGTT AGATGTACGT GCAGTAGAAA ATGGTTTACA AGCAATTGAT ATTTGCCAAG AGTGGGAACC CCATCTGATT TGGATGGATA TACAAATGCC AGTGATGAAC GGGTATGAAG CAACCAAGCA AATTAGAGCC ATGAATCAAG GAAAAAATAT CATCATGATT GCTCTAACTG CAAGTGCTTT TGAAGAAGAT AAACAAGCTA TTTTACAAGT CGGTTGTGAC GATATTATTT CTAAGCCTTT TGAAGAAAAG ATGCTCTTTG AAAAAATGGC TCAGTATCTA AATTTACGCT ATCTCTATGC AGAGATTCAT CAGCAACAAA AACAAAATAA GCTGAAAACT GATCATAGAT TAATAAAATT GAGTAACTCT GATTTACAGG TTATGGCACC ATTTTGGATT GCTCAGGTTC ATGCAGCAGC AATAGTGATT GACGATGCAC AACTTTATGA ACTGTTTGAA CAAATACCCG CGGAACACAG ACAACTAGCT GATGGTTTAA AAAATTTAGT TTATAACTTC CATATAGAAA CAATTATTAA CCTCACTGCT CCTGACTAA
|
Protein sequence | MKNKYFNIPE SLDYSKYEQQ PIHIPGYIQA HGVLLALQEP DLTILQISEN TNHFFGIPPE SLLGQNLNCL LSSEQIESIT HSLQEDKPVV IHSLEITIYN QKKSSRLLGI LHRSNGVLIL ELEPHAAITT NALVGYTYLL ETAIFNIRNA FHLSDIHQII AQEVRKITQF DRVMIYSFTP DYSGVVIAED KQSNLESYLG LHYPAFDVPN QARQLYYKNR LRIIPDVNYQ PVKIIPSNHP LSNEPLDLSC AILRNISSCH VEYLQNMGVT ASMCISLIHE NKLWGLIACH HYSPRYIDYE IRKICEVIGQ FSSIELVNHR ERALNFYRQK VKLIQQELNH NLTNELDHIG KVFQRNKTNL LKLLKANGTA VLLQGQLILI GNTPSAEDTR DLVEWILNQK NQQEIFYTDS LPPLYPNASK FQDIATGILA ISIFLHQQSY HIIWFRPEQV QTVNWAGNPH EAIAVNSQQE YYLTPRKSFE LWKQTVKHKS LPWQIFEIEV AQEMRNSLML AALEFSQVAL QQAAEQAEIA NRAKSQFLAK MSHELRTPLN AILGFTQILT RNSSLSQEDQ ENLDIISRSG EHLLSLINDV LEMSKIEAGK PTLNETYFDL YRLIYSIQEM FALKAADKGI NLITEFGVEV NKYVFGDEGK LKQILINLIG NAIKFTVVGH VAIRVSCPPN YAPLAVTNGK VLIELEVEDT GAGIALEDQE LIFEAFLQSK GGRQFMQGTG LGLAISRQFA RLMGGDITVK SILGEGTTFT CRVQLSLANK GDIISPIKNT KRVVGLEPGQ PRYRILIVED ILENRLLLVK LLESVGLDVR AVENGLQAID ICQEWEPHLI WMDIQMPVMN GYEATKQIRA MNQGKNIIMI ALTASAFEED KQAILQVGCD DIISKPFEEK MLFEKMAQYL NLRYLYAEIH QQQKQNKLKT DHRLIKLSNS DLQVMAPFWI AQVHAAAIVI DDAQLYELFE QIPAEHRQLA DGLKNLVYNF HIETIINLTA PD
|
| |