Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_4843 |
Symbol | |
ID | 9342650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 4956503 |
End bp | 4958227 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | |
Product | integral membrane sensor signal transduction histidine kinase |
Protein accession | YP_003723119 |
Protein GI | 298492942 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAAATC AAAAGCCAAC CTTTGTTGAC AAAAGTTCGG AACTCAAAAA AGTGTCAACT CAAGAGCGGA AAATAGAAGA ACTCCCAACT ATAGAATTTC CCTCTCGTCG GAAACTCAAA GCCAGTTCCT GGCGTATCCA TCAAAAAATT GGCTATGGTT ACTTTGTGGC TATTGCTATT GGCTTTTTTG GCTCATTGAC AGGATTGGTA CTGGCTAACT ATTACCGAGG TAGAGAAGTC AGGCAGTTCA ATCAAGCTAA TGAGCAAGAA CGATTACTAA ATAATTATAA AGATGCAGTA ATTGAGGCTC AACTCCATAG CTCAAGCTTG ATAGCGGTAT TGAACGACCC AGAAAAATTA AAAATTAAAA AAGATAAGTT TCTCAAGTCT GTTGCTCAAG CACAAAAATT AGAAGAAAGC ATTGATACAT ATATCAACAG CGATCCTCAG ATCTTAGCAG CAAACAAATC GACTTTAGAA ATTTTATTTC TCAATTATTC TAAGTATTTA GAAGAATTTA TTAACCAGAT AGAACCTATT TTGACGAGAA TTGACACGCC AAGAATACAA CCGGAACAAG TAGAGATAGC GCGAGAGCAG TTACTGGGAG TTATGAATAG TCAGATAGTT GAGCAGCTCA ATCTTTTATC GGATCAATTA ACTAGAATTT TAGAGACTGC TCAAACACAA GAGCTAAAGA GACGGACAGA TGTAGAGCAA GCTAGATTTG TTGAGAGAGC TATTGTCATA GTTAGTATGC TGGTTTCGGT TGCTGTTGCT GCTGTTGTGG CATGGCGTAC CAGTCGAGCG CTCGCAGAAC CTGTAATTAC ATTAACACAA GTAGCTGAAC AGGTAGCTAG AAAATCCAAT TTTGATTTAC GCGCTCCCAT TACTACTGAC GATGAAATTG GGTTGCTAGC TAAATCGCTA AATCGCTTAA TTGAGCGAGT ATCTGAGCGT ACTAGACAAC TCCAACAAGC CAAAGAATTA GCCGAAGCTG CTAGTAAAGC CAAAAGCCAA TTTTTGGCCA ATGTTAGTCA TGAGTTACGT ACACCTTTAA ATGCAGTAAT TGGTTTAAGT CAACTGCTAA AAGATGATGC TGTTGATATT GGTGCATCGC CAGATTTTAT CACCGACTTA GAAACTATCA ATTCTGCTGG TAGACATTTA CTAGAACTAA TTAACGACAT CCTAGATGTA TCAAAAATCG AAGCGGGGAA AATGACTCTC TACCCAGAGA CATTTGACAT CATAACTCTG ATTAATAATG TTGTTCTGAC AGTGAAACCA GCAATAGAAA AAAATGGCAA CACCTTAGTA TTAGAATGTG ATGAGTATTT GGGGACGATG TATGCTGACC AAACGAGGAT GCGACAGGTA TTATTAAATT TACTGAGCAA TGCAGCCAAG TTTACTACCA ACGGCAAAGT AACACTAACT GTTAAGATTG ATAAAACAAT CATCCTTAAA GAAGCACCCT TCGGATCAAT TATTTTCACC GTAACTGATA ACGGAATCGG AATGTCACCC AGTCAAGAGC AGAAATTGTT TCAACCCTTT ATACAAGGGG ATATTTCCAC TACCAAAAAA TATGGTGGTA CCGGCTTGGG ATTAGCTATT AGCCGTCATT TTTGCCAGAT GATGGGGGGT GAAATTCTTG TTAACAGTCA GCTTGGGGTT GGTTCTAATT TTAAGGTAAG TTTACCTTTG ACAGTGCGGC AATGA
|
Protein sequence | MPNQKPTFVD KSSELKKVST QERKIEELPT IEFPSRRKLK ASSWRIHQKI GYGYFVAIAI GFFGSLTGLV LANYYRGREV RQFNQANEQE RLLNNYKDAV IEAQLHSSSL IAVLNDPEKL KIKKDKFLKS VAQAQKLEES IDTYINSDPQ ILAANKSTLE ILFLNYSKYL EEFINQIEPI LTRIDTPRIQ PEQVEIAREQ LLGVMNSQIV EQLNLLSDQL TRILETAQTQ ELKRRTDVEQ ARFVERAIVI VSMLVSVAVA AVVAWRTSRA LAEPVITLTQ VAEQVARKSN FDLRAPITTD DEIGLLAKSL NRLIERVSER TRQLQQAKEL AEAASKAKSQ FLANVSHELR TPLNAVIGLS QLLKDDAVDI GASPDFITDL ETINSAGRHL LELINDILDV SKIEAGKMTL YPETFDIITL INNVVLTVKP AIEKNGNTLV LECDEYLGTM YADQTRMRQV LLNLLSNAAK FTTNGKVTLT VKIDKTIILK EAPFGSIIFT VTDNGIGMSP SQEQKLFQPF IQGDISTTKK YGGTGLGLAI SRHFCQMMGG EILVNSQLGV GSNFKVSLPL TVRQ
|
| |