Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_3594 |
Symbol | |
ID | 9341400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 3661939 |
End bp | 3663450 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | |
Product | anthranilate synthase component I |
Protein accession | YP_003722305 |
Protein GI | 298492128 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATATTCC CCGATTTCCA GCAATTTACA GAACTAGCAA AACAAGGTAA TTTTGTCCCT GTATATCAAG AATGGGTCGC TGATTTAGAT ACCCCTGTTT CTGCTTGGTA CAAGGTTTGT GCAGGTCAAC CTTATAGCTT TTTGCTGGAG TCGGTGGAAG GTGGAGAAAA GGTAGGACGT TATAGTTTAC TTGGTTGTGA TCCGCTGTGG ATTTTGGAAG CGCGGGGAGA TAAAACTACT CAAACACACC GCGATGGTTC CCAGGAAGTT TTTACAGGTG ATCCTTTTAC TGTTTTAGCG GATTGTTTAG CACCTTATCA CCCAGTAAAA TTACCACAGT TACCTTCAGG AATCGGCGGA CTGTTCGGGT TTTGGGGTTA TGAATTGATT AACTGGATTG AACCGTGTGT ACCAATTCAT CCTCAAGATG AGCGTAATAT CCCTGATGGG TTATGGATGC AAGTAGACCA ACTGTTAATT TTTGACCAGG TGAAGCGAAA AATCTGGGCG ATCGCCTACG CTGATTTAAG GAATACTGAT AATTTAGCAG CAGCATATCA AAAAGCGAGC GATCACATCC AACAAATGGT GAGTAAGTTA TCTTTACCTT TATCACCACA AAATACCCAA CTTCCTTGGA CATCTCCCCA AAATAAACCC AAAGCGGGAA TGGAAGAATA TATCAGCAAT TTTACCCGTC CCGATTTTTG TGCTAGTGTG GAAAAAGCTA AAGAATATAT CAAAGCAGGT GATATTTTCC AAGTCGTGAT TTCTCAACGT CTATCCACAG AATATACAGA AAATCCCTTC GCTTTATATC GTTCCCTACG CCAAATTAAC CCTTCACCTT ACATGGCGTA TTTTAACTTC CAAGACTGGC AAATTATCGG TTCTAGTCCT GAAGTTATGG TGAAAGCAGA ACGAGATGAA GAAGGGGGAA TAATCGCCAC TGTCCGACCG ATTGCGGGAA CTAGACCCAG AGGTAAAACC ACCCAGGAAG ATGAGGCTTT TGCAGCAGAT TTACTTCAAG ACCCTAAAGA AGTTGCAGAA CATATCATGT TAGTTGATTT AGGACGCAAT GATTTAGGAC GAGTATGTAA AAATGGCACT GTTAAAGTTG ACGAATTAAT GATAATTGAA CGCTATTCTC ATGTTATGCA CATTGTCAGT AATGTGGTAG GGAAATTAGC GAAAAATAAA ACGGCATGGG ATTTATTAAA AGCTTGTTTT CCTGCGGGTA CAGTTAGCGG TGCGCCAAAA ATTCGCGCTA TGGAAATTAT CAATGAATTA GAACTAACCC GCAGAGGTGT ATATTCTGGT GTCTATGGAT ATTATGACTT TGAGGGACAA TTAAATAGTG CGCTCGCTAT CAGAACTATG GTTTTACATA ATCAAACTGT TACTGTCCAA GCTGGTGCAG GTTTAGTCGC TGATTCTGAA CCAGAAAAGG AATACGAAGA AACTCTCAAT AAAGCCAGAG GTTTATTAGA AGCAATTCGA TGTTTGCGGT AA
|
Protein sequence | MIFPDFQQFT ELAKQGNFVP VYQEWVADLD TPVSAWYKVC AGQPYSFLLE SVEGGEKVGR YSLLGCDPLW ILEARGDKTT QTHRDGSQEV FTGDPFTVLA DCLAPYHPVK LPQLPSGIGG LFGFWGYELI NWIEPCVPIH PQDERNIPDG LWMQVDQLLI FDQVKRKIWA IAYADLRNTD NLAAAYQKAS DHIQQMVSKL SLPLSPQNTQ LPWTSPQNKP KAGMEEYISN FTRPDFCASV EKAKEYIKAG DIFQVVISQR LSTEYTENPF ALYRSLRQIN PSPYMAYFNF QDWQIIGSSP EVMVKAERDE EGGIIATVRP IAGTRPRGKT TQEDEAFAAD LLQDPKEVAE HIMLVDLGRN DLGRVCKNGT VKVDELMIIE RYSHVMHIVS NVVGKLAKNK TAWDLLKACF PAGTVSGAPK IRAMEIINEL ELTRRGVYSG VYGYYDFEGQ LNSALAIRTM VLHNQTVTVQ AGAGLVADSE PEKEYEETLN KARGLLEAIR CLR
|
| |