Gene Information |
Locus tag | Aazo_4674 |
Symbol | |
ID | 9342481 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 4777397 |
End bp | 4780405 |
Gene Length | 3009 bp |
Protein Length | 1002 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003723010 |
Protein GI | 298492833 |
COG category | |
COG ID | |
TIGRFAM ID | |
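
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4777397, 4780405)
print(length)                           # 3009
print(expected_protein_length(length))  # 1002
```

Both values agree with the Gene Length (3009 bp) and Protein Length (1002 aa) fields of this record.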
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.597863 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATTGA AAAATCACTG GTTGACTCCC AATAGAGGCG CTTTGGCACA AATGCTTAAG TGGGTGAATC TCCGACCAGA GGAGTTAGAA CGCACTTGGA CGATGTTTGC CTTCTACACG ATTGTATCTG TGGGATTGCG ATGGGCAGAG GATAGCACTC TAGCTCTGTT TTTGGATCAA TATGGTGCGG AAAAACTACC TTGGATTTAT ATTGCCAGTG CGGTGACAGG TGCTGGACTG GTAGTTTTAT ATTCTTGGTT ACAAAAGGTT TTTCCCTTAC GTGCAGTGAT TGTGGCGATC ACACCGGGCA TGGTGGTGCC ATTATTTCTG TTAGTGTTTT TGTATGGAGG GATAAATATT CCCTACCTAG CAGTAATTAT CATCTTTCTG CTCAGGTTAT GGGTAGATGC CTGCTATGTG GTCAATGACC TTAACACATC TATTGTTGCC AACCAACTAT TCAACATTCG CGAGATTAAG CGCACCTACC CGTTAGTCAG TAGTGGCATA TTAGTAGCTG ATGTGATCAG TGGCTTTAGT TTACCTTGGT TACTCAAATT CACCAACCTA AATACAGTTA TTCTCATCGC CTGTTTTGTC ATTTTATTCG GCTCAGGAAT TTTATTTTAT TTAACTTATA AATATCCGAC AGCTTTCCCC CATACGCCAC AACGGGAGAT TACGGAAGAA CAAGCCTCTC GTCATCGCCG CTTAGAAACA CCTCTAAAAC GCTATGTTTG GCAGTTGTTT GCATTTTTTG CACTGTTACA AGTGATTGGC TTGTTAATAG ATTTTCAGTA TTTGCGCGAA CTGAATTCTA GTTTAGGTCA ACAAGAACTA GCCAGCTTTT TAGGGGTGTT TGGTGGCATT GTAGGACTGT GTGAATTAGT TACCCAATGG TTTATTTCTA GCCGATTGAT TGAAAAGGTG GGAGTATTTT TCACAGCCGC ACTTTTACCA ATTACTGTGG GCTTTTTATT ACCAGGTGGA ATCTCAGTTT TGAACTTATT TCCAGCGATT GAAGAGCCTG GATTTTTTTG GGGTTTAATG AGTCTCAAGT TCTGTGATGA ACTCTTGCGT TATACCTTTG TGATTAGTAG CGGTCCGGCA CTTTATCAAC CTATTCCTGA CCGAATTCGT AGCCGGATGC AAGCTTTATC CAGTGGGACA GCCGAAGCGA TCGCCTCTGG TTTGACGGGA TTAGTCATTT TTGGTAGTGT ATCGCTGATT GATCAATTTG TCCCTCAATC ACTGCAAAAG TGGGTATTAA TAGGTGAAAT AGTCATCGTT GCGGCCACCT GTTTAAAAGT TGTTTGGGAA TTGCGATCGC GCTATGTGGA CGTATTAGTT TTAAGTGCAG AACGGGGTGG ATTGAGTCCC GTAACTGTGG GTTTGCGAGC CTTCCAACAG GGAATAGTTA AAGCTTTGGT AGAAACAGGA ACTACAGCCG ATAAAAGCTC TTGCTTAGAA CTTTTAGCCC AAATTGATCT CCCCAGCGCA GGACAAGTTT TAGCACCATT ACTGCTTAAA TTACCTACAG ATTTACAATC CCAAAGCCTA GAATTAATGC TCAAATCAGG TATAAATCCC CTTTATGTAC CTGAAGTCCG TCGATTATTA GACCAACCCC AAGCAACCAT TGACCCAGAA GTGTTTGCTT TGGCTATGCG TTACCTTTGG CTGGCCGAAG AAAATCTCAA TTTAAATCTT GTAGAAGAAT ACCTGCATCA ACAAAACCAT CCATTGATCC GCGCCACCGC AGCTTCATTA CTACTGCGTC AGGGAACAGA ACAACAAAAA 
ACAACAGCCA CCAAAACAAT GCGTCGGATG CTCACCCATC AACAAGAACG AGAACGGATT AATGGAGTCA GGGGACTCAA AGAATTGGTT TATTTACAAG CTCTGCGGGT TCACATTCCC AATTTATTGC AGGATGATTC ATTGCGGGTA CGCTGTGCTG TATTAGAAAT GATTACCGCT ACCTGCTTGG AGGAATACTA TCCCACACTC TTAGCAGCAC TCTACTACAA ATCCACTCGA CAAACAGCCA TGCAATGCTT GGTGAGTTTG GAAAATGAAG CCATCCCGAT GTTGTTGAAA CTAACCACAA ATATTTACAA ACCAGACGTT GTGAGAATAT GCGCTTGGCG TATCATTGCT CAAATCGGCA CACTAGAAGC AAGAGAAACT CTATGGCTAC ATTTGGAAAC ATCCAGGGGT AATACGAGGG ACTATATTCT CCAAAGCTTA CTGAAGATTC AGCAAAAACC AGGAAATATC AATGTCGTAG ATCAATTCTA TGAAAGTAGG GTAGAAATTT TAATTGAGCA AGAATTACGA TTTCTCGGTG AAATTTATGC TGCGTACATA GACTTCCAAA ATCTATACTC TCTAGAAAAT TATCAAGGAA ATGAGAGGCT TTTGACTATT GCTCAATTGC TGCAACGCTC ACTACTAGAA TTAGCATTGG ATGTGAGAGA TAGGTTATTG CTGTTGTTGA AGTTGCTTTA TCCCGCAGAA AAAATGCAAG CCGCAGCCTT CAATCTCCAA TCTAAGTCAT TAATCAATTT AGCAAGAGGC TTAGAAATAT TAGATCACAC TGTAAATTTG CCTTGTAAGT CTTTGTTGTT GAATATTTTA GATCGACGAC CAGAACATGA AAAGCTCAAA TATCTGATCG AAGCGGGATT TTGGCAAAAT GAAAATATGC CAGTCAAAAA GCGCCTCTCT AAGATGATAT CTCAGGGACA TTTGCTTTCT GATTGGTGTT TAGCTTGTTG CTTGCACTTT GCACAAGCTG CTTATGTTCG ACTCACGACT GCGGAAATTT TAGCAAATTT GCGCCATCCT ACAGGGTTTG TTAGGGAAGC AACAATTTCA TACTTGAGTG TAGTTTCCCA GCGCGTTCTT CAGGAAATTC TCCCCCATTT AAAAACAGAT CCCCATCCAC TGGTGGCTGC TCAAGTCAAC GAGTTGATAG CAAAATATAA AGAAGGACTT CAGGATTAA
|
Protein sequence | MELKNHWLTP NRGALAQMLK WVNLRPEELE RTWTMFAFYT IVSVGLRWAE DSTLALFLDQ YGAEKLPWIY IASAVTGAGL VVLYSWLQKV FPLRAVIVAI TPGMVVPLFL LVFLYGGINI PYLAVIIIFL LRLWVDACYV VNDLNTSIVA NQLFNIREIK RTYPLVSSGI LVADVISGFS LPWLLKFTNL NTVILIACFV ILFGSGILFY LTYKYPTAFP HTPQREITEE QASRHRRLET PLKRYVWQLF AFFALLQVIG LLIDFQYLRE LNSSLGQQEL ASFLGVFGGI VGLCELVTQW FISSRLIEKV GVFFTAALLP ITVGFLLPGG ISVLNLFPAI EEPGFFWGLM SLKFCDELLR YTFVISSGPA LYQPIPDRIR SRMQALSSGT AEAIASGLTG LVIFGSVSLI DQFVPQSLQK WVLIGEIVIV AATCLKVVWE LRSRYVDVLV LSAERGGLSP VTVGLRAFQQ GIVKALVETG TTADKSSCLE LLAQIDLPSA GQVLAPLLLK LPTDLQSQSL ELMLKSGINP LYVPEVRRLL DQPQATIDPE VFALAMRYLW LAEENLNLNL VEEYLHQQNH PLIRATAASL LLRQGTEQQK TTATKTMRRM LTHQQERERI NGVRGLKELV YLQALRVHIP NLLQDDSLRV RCAVLEMITA TCLEEYYPTL LAALYYKSTR QTAMQCLVSL ENEAIPMLLK LTTNIYKPDV VRICAWRIIA QIGTLEARET LWLHLETSRG NTRDYILQSL LKIQQKPGNI NVVDQFYESR VEILIEQELR FLGEIYAAYI DFQNLYSLEN YQGNERLLTI AQLLQRSLLE LALDVRDRLL LLLKLLYPAE KMQAAAFNLQ SKSLINLARG LEILDHTVNL PCKSLLLNIL DRRPEHEKLK YLIEAGFWQN ENMPVKKRLS KMISQGHLLS DWCLACCLHF AQAAYVRLTT AEILANLRHP TGFVREATIS YLSVVSQRVL QEILPHLKTD PHPLVAAQVN ELIAKYKEGL QD
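
The relationship between the gene sequence and the protein sequence can be illustrated by translating the nucleotides codon by codon. The sketch below is an assumption-laden example rather than the database's own method: it builds the codon table by hand (the amino acid assignments of translation table 11 are the same as those of the standard code; the tables differ only in which codons may act as start codons), ignores the spaces that separate the 10-base blocks, and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the block-separating spaces
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGGAATTGA AAAATCACTG GTTGACTCCC"))  # MELKNHWLTP
```

The output matches the first block of the protein sequence. This sketch does not handle alternative start codons such as GTG or TTG, which is harmless here because the gene starts with ATG.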