Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_0653 |
Symbol | |
ID | 9338439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 686625 |
End bp | 691340 |
Gene Length | 4716 bp |
Protein Length | 1571 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | |
Product | glutamate synthase |
Protein accession | YP_003720244 |
Protein GI | 298490067 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.58912 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCATC AATCATTGAA TCAGGGTCAA AACATTACAT TGTCAGACAT GGAAATTACG GATACTTACT CAGGACAACG ATGGTTATCA GAGGAAAGAG ATGCTTGTGG TGTAGGGTTT ATAGCCCATC GGCAAAATCT CGCCAATCAT GAAATTTTAA CAAAGGCTTT AACTGCTCTA ACTTGCTTAG AACATAGAGG AGGATGCAGC GCCGACCAAG ACTCTGGTGA TGGTGCAGGG ATTTTGACAG CGATTCCTTG GGAATTGTTT CAACAAGAAG GGATTAATGT TTCAGATAGT GGTAATATGG CAGCAGGAAT GATATTCTTG CCTCAAAACC AGGAATCAGC AAAAAAAGTC AAAGCTATAT TTGAGCAAGT AGCGGCTGAG GAAAAATTGA CTGTGCTGGG CTGGCGAGTA GTCCCCGTGC GTTCAGAAGT ATTAGGGATG CAAGCAAAAG AAAATCAACC CCAGATAGAA CAAGTTTTTT TAGCGTCGGC TGATAAAAGT GGCGATGAAT TAGAACGGGA ATTGTATATT GCCCGTCGCC GTATTGTTAA AGCAGCCAAA AACATTTCCG AAGAATTTTA TGTCTGTTCC CTATCGACCT GCACAATTGT CTATAAGGGT ATGGTACGTT CGGCGGTGTT GGGAGAATTT TACCCGGATT TAAAAAATCC GGCTTTCAAG ATTGCTTTTG CTGTTTATCA TCGTCGGTTT AGTACTAACA CGATGCCGAA ATGGCCTTTA GCACAACCCA TGCGGCTATT GGGTCACAAT GGCGAAATTA ATACACTGCT GGGTAATATT AACTGGATGA TGGCCAGAGA AGCTACCTTA GATCATCCGG TTTGGAATGG AAGAGCTGAG GAATTTAAAC CATTGGTAAA TACTGACAGC AGCGATTCAG CTAGTCTTGA TAACGTACTG GAGTTGCTGG TGCGTTCTGG ACGCAGCCCT TTGGAAGCTT TAATGATGAT GGTTCCAGAG GCTTACAAAA ATCAACCTTC TTTGCAAAAT TATCCTGAGA TTGTTGATTT TTATGAATAT TACAGTGGTC TGCAAGAACC TTGGGATGGC CCTGCGCTGT TGGTATTTAG TGATGGAAAA AGAGTGGGTG CGACTTTAGA CCGTAATGGT TTAAGACCAG CTCGCTATTT GATTACTAAA GATGACTATA TCGTAGTTGG TTCTGAAGCA GGTGTGGTGG AATTCCCAGA AGCCAATATT TTGGAAAAAG GCAGATTGGG TCCAGGACAA ATGATTGCTG TAGACTTAAC CAGCAATGAG ATCCTTAAGA ACTGGGAAAT TAAACAGCGT ATTGCCAAGC TACATCCTTA TGGGGATTGG TTGCAACAAT ATCGCCAAGA ATTGAAACAT TTGGTAAAAC CGTCAGCTGT CAATGCTAAC GGTAATGGTC ATCATAGGAC TGACAATGGA CATCTGACCA CTGACATACC AGAAAAACAA ACTTTACTCC AACAACAAAT TGCTTTTGGC TACACCACAG AAGATGTGGA AATGGTGATT CAGCCGATGG CTAATACTGG TGCAGAACCG ACTTTCTGTA TGGGGGATGA TATTCCCTTG GCGGTGTTAT CAGAAAAACC CCATCTGCTC TATGACTATT TTAAACAGCG GTTTGCTCAG GTGACAAACC CACCAATTGA CCCTTTACGG GAAAAGCTGG TGATGTCTTT GACGCTGGAA CTGGGTGAAA GAGGTAATTT ATTAGAACCG AAACCAGAAC ACGCTCGCAA ACTGAAGTTA GATTCGCCTG TTTTGACGGA GACTGAGTTA GCAGCAATTA AGTTGTCTGG TTTTGCGACT GCTGAGTTGT CAACTTTGTT TTCTATTGCT GCAGGTCCAG AGGGTTTGAA AGAGGCGGTG GAAGCTTTAC AAAGACAAGC AGTGGAATCT GTGCGCGCTG GTGCGAAGAT TTTGATCTTG AGTGATAAGA TCCCCCCAAC CCCCCTTGAA AAGGGGGGCG AAGAAGGCAT AAGTGGGATA AGTGCTGATT TTACTTATAT TCCACCTTTG TTGGCTATTG GTGCGGTTCA CCATTATTTG ATCCGCGAAG GTGTGCGGAT GAAAACATCT TTGGTTGTTC ACACCGCCCA ATGTTGGAGT ACCCATCATT TTGCTTGTTT GTTGGGCTAT GGTGCTGGTG TGGTTTGTCC CTATATGGCT TTGGATACTG TGCAGGATTG GTGGTCTGAT CCGAAAACGC AACAGTTTAT GGAAAGGGGG AAAATTAATA CTCTGACTCT AGAACAGGCG ATCGCAAATT ATCGCCAAGC TGTAGAATCA GGTTTGTTGA AGATTCTCTC AAAAATGGGG ATATCTCTAC TTTCTAGCTA TCAAGCAGCA CAAATTTTTG AAGCTATTGG TATCGGTGGG GATTTGTTGG CTTTGGGTTT TAGAGGAACA ACTTCCCGCA TTGGTGGCCT TAGTTGCAAG GAATTAGCTC AAGAGGTGCT TTCTTTCCAC AGTAAGGCTT TCCCGGAACT GACAGCGAAG AAGTTAGAAA ATCTCGGCTT TGTTCAGTAT CGTCCTGGTG GTGAATATCA TGGTAATTGC CCGGAACTGG TCAAGGCGCT GCATAAGGCT GTGGATGGTA AGAAATATGA ACATTACGAA GTTTATAAAC AGTATTTACA AGGTAGACCG ACAACGGCGT TACGGGATTT GTTGGATTTT GCCAGTGAGT GTCCTTCTAT TCCGATTGAA GAAGTAGAGT CTGTCAGCGA AATTGCTAAA CGCTTCTGTA CTGGGGGAAT GTCTTTAGGT GCATTATCAC GAGAGGCACA TGAAACTTTA GCGATTGCCA TGAATCGCAT TGGTGGTAAA TCTAACTCTG GTGAAGGTGG GGAAGATCCA GTTCGTTACA AAGTATTAAA TGATGTTGAC TCAACTGGTC ATTCATCCAA TTTCCCCCAT TTAAGTGGGT TGCGAAATGG TGATATAGCT TCAAGCGCCA TCAAACAAGT TGCTTCTGGT CGTTTTGGTG TCACACCGGG ATATTTAGCC AGCGCCAAAC AAATCGAAAT CAAAATCGCT CAAGGTGCCA AACCAGGGGA AGGTGGACAG TTACCGGGTC CAAAGGTCAG CCCCTATATT GCTATGTTGC GCCGTTCTAA GCCTGGTGTA ACTTTGATTT CTCCACCACC ACATCATGAT ATTTACTCGA TTGAAGATTT GGCGCAGTTG ATTTTTGACT TGCATCAAAT TAACCCAAAA GCACAGGTTT CTGTGAAGTT GGTTGCAGAA ATCGGGATTG GTACTATCGC GGCTGGTGTG GCGAAAGCTA ATGCTGATAT TATCCAAATT TCTGGTCATG ATGGCGGTAC TGGTGCATCT CCTCTAAGTT CTATTAAACA CGCTGGTAGT CCTTGGGAAC TCGGTTTAAG TGAAGTGCAT CGGGTGTTGA TGGAAAATGG ATTGCGAGAT CGCGTGATTT TAAGAGTTGA TGGTGGTCTC AAGAGTGGCT GGGATATTCT GGTTGCTTCC TTAATGGGTG CCGAAGAGTT CGGTTTCGGT TCTATCGCCA TGATTGCTGA AGGCTGTATT ATGGCACGGG TGTGTCATTT AAATTCTTGT CCCAAGGGTG TAGCCACTCA GAAGGAAGAG TTGCGTCAAC GCTTTACAGG TATCCCAGAC CATGTCGTTA ACTTCTTCTA TTTTGTGGCG GAAGAGGTTC GCAGTTTGTT AGCTAAACTG GGTTATCGAT CCTTAACAGA ATTGACTGGT AGGGCAGATT TGTTAACAGT GCATTCTGAT GTGAACCTGG CTAAAACCCA ATCCATCAAT TTAGGCTGCT TAACTAAGCT ACCAGACGCA AAACAAAACC GTAGCTGGTT GGTACATGAA GAGGTTCACA GCAATGGATC TGTGTTAGAT GACCAAATTT TAGCTGATAG GGATATTCAA GCTGCAATTA GCAATCAATC GACTATCAGT AAGACTTTTA CAGTCGTAAA CACTGATAGA ACAGTAGGTT CAAGACTAGC AGGGGCAATC GCATCCCAAT ATGGTGACAG TGGTTTTGAG GGTCAAATTA ACCTAAATTT CCAAGGTAGT GCAGGCCAAA GCTTTGGCGC GTTTAACCTT CCTGGTTTAA CCCTCGCTTT GACAGGGGAA GCTAACGACT ATGTAGGTAA GGGAATGCAC GGGGGAGAAA TTATCATTAA GCCCCCAGCA GATGCTAACT ATGACCCCTC ACAAAATGTG ATTGTTGGCA ATACCTGTCT TTATGGTGCA ACTGGTGGTG TATTATTTGC CAACGGTTTA GCCGGAGAAC GCTTTGCTGT ACGTAATTCT AAAGGTACAG CGGTAATTGA AGGGGCTGGT GATCACTGCT GTGAATATAT GACTGGTGGT GCAGTTGTGG TTCTGGGTAA AGTCGGCCGC AACGTTGGCG CTGGAATGAC TGGTGGACTG GGTTACTTCT TAGATGAAGA TGGTGCTTTC CCTGAGTTGG TTAATAAGGC TCTTGTGAAA ATTCAGCGGG TGGTGACGGA AGCTGGTGCA AAACAACTAT ATGATTTGAT TAAAGCTCAT GGTGATCGCA CTAGTTCACC AAAAGCACAA CTAATTTTAC AAAATTGGTC AGAATATCTC CCTAAATTCT GGCAAGTTGT ACCACCTTCA GAAGCTGACA GTCCAGAAGC TAATGGTGAA ACTGAGAAGG TAAGTAAACA GTTAAGTTCG GTTTAA
|
Protein sequence | MNHQSLNQGQ NITLSDMEIT DTYSGQRWLS EERDACGVGF IAHRQNLANH EILTKALTAL TCLEHRGGCS ADQDSGDGAG ILTAIPWELF QQEGINVSDS GNMAAGMIFL PQNQESAKKV KAIFEQVAAE EKLTVLGWRV VPVRSEVLGM QAKENQPQIE QVFLASADKS GDELERELYI ARRRIVKAAK NISEEFYVCS LSTCTIVYKG MVRSAVLGEF YPDLKNPAFK IAFAVYHRRF STNTMPKWPL AQPMRLLGHN GEINTLLGNI NWMMAREATL DHPVWNGRAE EFKPLVNTDS SDSASLDNVL ELLVRSGRSP LEALMMMVPE AYKNQPSLQN YPEIVDFYEY YSGLQEPWDG PALLVFSDGK RVGATLDRNG LRPARYLITK DDYIVVGSEA GVVEFPEANI LEKGRLGPGQ MIAVDLTSNE ILKNWEIKQR IAKLHPYGDW LQQYRQELKH LVKPSAVNAN GNGHHRTDNG HLTTDIPEKQ TLLQQQIAFG YTTEDVEMVI QPMANTGAEP TFCMGDDIPL AVLSEKPHLL YDYFKQRFAQ VTNPPIDPLR EKLVMSLTLE LGERGNLLEP KPEHARKLKL DSPVLTETEL AAIKLSGFAT AELSTLFSIA AGPEGLKEAV EALQRQAVES VRAGAKILIL SDKIPPTPLE KGGEEGISGI SADFTYIPPL LAIGAVHHYL IREGVRMKTS LVVHTAQCWS THHFACLLGY GAGVVCPYMA LDTVQDWWSD PKTQQFMERG KINTLTLEQA IANYRQAVES GLLKILSKMG ISLLSSYQAA QIFEAIGIGG DLLALGFRGT TSRIGGLSCK ELAQEVLSFH SKAFPELTAK KLENLGFVQY RPGGEYHGNC PELVKALHKA VDGKKYEHYE VYKQYLQGRP TTALRDLLDF ASECPSIPIE EVESVSEIAK RFCTGGMSLG ALSREAHETL AIAMNRIGGK SNSGEGGEDP VRYKVLNDVD STGHSSNFPH LSGLRNGDIA SSAIKQVASG RFGVTPGYLA SAKQIEIKIA QGAKPGEGGQ LPGPKVSPYI AMLRRSKPGV TLISPPPHHD IYSIEDLAQL IFDLHQINPK AQVSVKLVAE IGIGTIAAGV AKANADIIQI SGHDGGTGAS PLSSIKHAGS PWELGLSEVH RVLMENGLRD RVILRVDGGL KSGWDILVAS LMGAEEFGFG SIAMIAEGCI MARVCHLNSC PKGVATQKEE LRQRFTGIPD HVVNFFYFVA EEVRSLLAKL GYRSLTELTG RADLLTVHSD VNLAKTQSIN LGCLTKLPDA KQNRSWLVHE EVHSNGSVLD DQILADRDIQ AAISNQSTIS KTFTVVNTDR TVGSRLAGAI ASQYGDSGFE GQINLNFQGS AGQSFGAFNL PGLTLALTGE ANDYVGKGMH GGEIIIKPPA DANYDPSQNV IVGNTCLYGA TGGVLFANGL AGERFAVRNS KGTAVIEGAG DHCCEYMTGG AVVVLGKVGR NVGAGMTGGL GYFLDEDGAF PELVNKALVK IQRVVTEAGA KQLYDLIKAH GDRTSSPKAQ LILQNWSEYL PKFWQVVPPS EADSPEANGE TEKVSKQLSS V
|
| |