Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_4588 |
Symbol | |
ID | 9342394 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 4681799 |
End bp | 4684831 |
Gene Length | 3033 bp |
Protein Length | 1010 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003722962 |
Protein GI | 298492785 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.200354 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGAAA TGTACATATT GTGGTGGATG GCACAAATAC CTGTAAACAC CCCCAACGTC ACACCAGCAC AGGCATCGGT TCTTACTTCC GGTCCGCGCT TTTTTGTGGC TTTAATTTCT GGGGTAATTC TCGCTTTTGC CTTCCAATTA GTGTTAACAA ATCTCTCAAT TGCCGCTGGG ATTTCCTATT TGGGGCATCC GTCAGAGTCG CAGGAAGTTG AAAGTTTCGG GGGTACGATT CGCAAAATTG GGACGAGGTT GGGAATTTGG ACTTTAGTTA CGGTAACAGT TGCTTTATTC ATAGCTTGTT TCTTGGCTGT AAAATTGAGT CTATTAATTT TAGATCCCAG ATTAGGTGCA ATTTTGGGCT TGGTAATTTG GGGTGCATAC TTTTTATTAC TAATGTGGGT CAGCACAACT ACGGTAGGAT CTTTAATTGG TTCGGTCGTG AATACGGCCA CTTCTAGTTT TCAGGCGATT ATGGGAACTG CTACGGCTGC TTTGGGTACT AGGGCTGTAA ATCAGCAAGT GGTGGCAACG GCGGAAGCGG CTGCATCGGC TGTGCGGCGG GAGTTAGGGA GTGCATTGAC ACCAGCAAAT ATTCGCCAAA ATATAGAAGA GTACATAGAA AAGCTACGTC CTCCAGAAAT TGATATATCG AGGATTCGTT CTGAATTTGA AAGGTTACTG AGTGAACCGC AATTTAAAGC CTATGCCAGT AGTTCAGACC TCCGTAATAT TGACCGTCAG CGGTTTATTG ATTTAGTCAG CAGTCGCACG GATTTATCTA AACGAGAAGT TAACCGTGTC GCTGATTCTT TATATGATGC TTGGCAACAA GTAATCACTC ATAACGTACC CGCAAATAAG CGCTTGGCTG AGTTGGTTGA TTATCTCAAA TCCCTGCCGC CAGGACAAAC TAAAACAGAC GAACTCAATG CTAAACTTGA CCAGTTAATT ACAGAAATTC ATTCTTCCAA AGAAACAGAA CAAAAGCCTG GTATGATAAA ACAGGCAATA TCAGCACTGA GTGCGGTAGT TTTAGATAGG GCAGATTTGT CTGATTTAGA TGTAGAAAAA ATTTGGGATT CCCTAGCTAC TGCTAGAGAA AAATTCACAA AACAAATTAT ACAACAGCCT TATAATCCAA TTCGCGCTGA TATTGAAAAT TACTTACTCA ATACCCATCC TTGGCAATTA AGTCCGAAGA ATATTGTTCA AGAATCTCGA GATGTGATCT ATGATCCTGC TGCTGATCCT GGTGTAATAC GTAGCGAATT AGAGAAAATT ACCCGTCAAG ATTTTGTCAA AATTTTGCAA GCCAAGGGAC TGTTAACTCA AGGTCAAATT CAGGAAATTG CTGATCAATT GGTAGCAGTA AAAAATGAAG TATTGATAAC AGTAATTGCC CAGGAAGAAA GAGAGATTGT CCAAGATTTG CAAAGAAGAG TTGAAAGTTA TTTACTTGTG ACTCCTAAAG CAGACTTGAC ATCAGCAGGA ATTGAGGAAA ATTTTAAACC TTTGTTAGCA GATTCAGAGG CAGATTATCA ATCTTTATCC CGGCGATTAG CACAAGTTGA ACGGGAACAA ATGGGAGAAA TCCTGTTAGG ACGTAATGAT ATTCAGGAGT GGGAATTAGA CCCGATTTTA GATGAGTTGG AAATGCAGCG TGATCGCGTG TTGTTAGAGT CTCTCAGCAT GTCCAAACAG GCACAACATC AAGTGGAAAC TCTGTGGTTG AATGTTGAAT CATATCTACG TAACACGGGT AGGCAAGAAT TAAATCCTGA TGCTATCCGC ACTGATCTGA AACGACTGTT AGAAGACCCC CAAGGGGGAA TTATGGCTAT TCAGGCGCGG TTGTCTCGCT TTGACCGAGA TACTTTGGTG CAATTGTTGA GTCAACGCCA AGATTTAAAT GAAGCACAAG TTAATCAAAT TATTGACGTG GTGGAAGAAA TATGGGGTGG TATTCTCCAC ACACCACAAA AAGCAAAAGA ACAATATGAT TCTATCACCT CTACTATTGC AGATTATCTG CGAAATACTG GTAAGGAAGA ATTGAATCCT GAAGCTATTC AGCAAGATTT AACTAGGTTA TTTGCACATC CCAGAGAAGG TGTTGTCGCA CTGCGTCACC GTTTATCGCA CATTGACAGG GATACTTTGG TGAAGTTGTT GACTCAACGT CAAGATTTGA GTGAGGAACA AGTTAATCAA ATTATTGATA GTGTACAGAC ATCAATTAGA AATATTATCC GTGCGCCGCG TCGGTTAGCA AGCAGGACAC AACAAAGAAT ACAAACTTTT CAAACTTATT TGCAGGAGTA TTTACGATTA ACTGGTAAAG CAGAATTGAA CCCCGAAGGC ATTAAGCGGG ATGTGCAATT ATTGTTGCAT GATCCACGAG TGGGGATGGA AAGTTTGAGT GATCGCCTCT CGCATTTCGA CAGAGACACA ATTATTGCGT TGTTGAAAAT CCGGGAAGAT ATAAGTGATG AAGAAGCAGG GAGAATTGCT GATAATATTA TCCTCGTGCG TGATCAATTT GTAGAACAGG TTCGGGGTAT TCAACGACGT ATTCAAGATG TAATTGAGGG GATTTTTGCC AGTATTCGCA ATTATCTTAA TTCTCTAGAA CGCCCGGAAC TTAATTATGA TGCCATTAAG CATGATATCC GCCAATTGTT TGAAGATCCC CAAGCCGGGT TTGATGCATT GCGCGATCGC CTCTCATCTT TCAATCATGA TACCTTGATA GCTATTTTAA GTTCTCGTGA GGATATCTCT GAGGACGATG CTAAACACAT TATTGACCAA ATTGAACGCG CCCGAAATAC TGTTTTACAA CGGGCTGAAC AGCTACAGCA CGAAGCGCAG CATCGGCTAG AACAGGTGAA ACATCAGGCA CAGCGTCAAG CCGAGGAAAC GCGCAAAGCA GCTGCTAATG CCTCTTGGTG GTTGTTTGCA ACAGCGGTTG TTTCCGGTAT TTCTGCGGCT TTAGGAGGTG CAATCGCTGT GGTGTTGATT TAA
|
Protein sequence | MQEMYILWWM AQIPVNTPNV TPAQASVLTS GPRFFVALIS GVILAFAFQL VLTNLSIAAG ISYLGHPSES QEVESFGGTI RKIGTRLGIW TLVTVTVALF IACFLAVKLS LLILDPRLGA ILGLVIWGAY FLLLMWVSTT TVGSLIGSVV NTATSSFQAI MGTATAALGT RAVNQQVVAT AEAAASAVRR ELGSALTPAN IRQNIEEYIE KLRPPEIDIS RIRSEFERLL SEPQFKAYAS SSDLRNIDRQ RFIDLVSSRT DLSKREVNRV ADSLYDAWQQ VITHNVPANK RLAELVDYLK SLPPGQTKTD ELNAKLDQLI TEIHSSKETE QKPGMIKQAI SALSAVVLDR ADLSDLDVEK IWDSLATARE KFTKQIIQQP YNPIRADIEN YLLNTHPWQL SPKNIVQESR DVIYDPAADP GVIRSELEKI TRQDFVKILQ AKGLLTQGQI QEIADQLVAV KNEVLITVIA QEEREIVQDL QRRVESYLLV TPKADLTSAG IEENFKPLLA DSEADYQSLS RRLAQVEREQ MGEILLGRND IQEWELDPIL DELEMQRDRV LLESLSMSKQ AQHQVETLWL NVESYLRNTG RQELNPDAIR TDLKRLLEDP QGGIMAIQAR LSRFDRDTLV QLLSQRQDLN EAQVNQIIDV VEEIWGGILH TPQKAKEQYD SITSTIADYL RNTGKEELNP EAIQQDLTRL FAHPREGVVA LRHRLSHIDR DTLVKLLTQR QDLSEEQVNQ IIDSVQTSIR NIIRAPRRLA SRTQQRIQTF QTYLQEYLRL TGKAELNPEG IKRDVQLLLH DPRVGMESLS DRLSHFDRDT IIALLKIRED ISDEEAGRIA DNIILVRDQF VEQVRGIQRR IQDVIEGIFA SIRNYLNSLE RPELNYDAIK HDIRQLFEDP QAGFDALRDR LSSFNHDTLI AILSSREDIS EDDAKHIIDQ IERARNTVLQ RAEQLQHEAQ HRLEQVKHQA QRQAEETRKA AANASWWLFA TAVVSGISAA LGGAIAVVLI
|
| |