Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_4454 |
Symbol | |
ID | 9342256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 4539060 |
End bp | 4540943 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003722882 |
Protein GI | 298492705 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCAT CAGAAATTTT GGCGAAGCTT ACTCAACAAG GTGTTCAATT TTGGGTAGAA AATAACAAAA TTAATATCCG TTCTCCTAAA GGTGTAATAA CATCAACAAT TAAGGCAGAA ATAGCTACAT ATAAAGGAGA TATTTTAGCA TTATTACAGG AGATGAACCT TGTTACAAAA TCTGCTTCTG AACCATTAAG TCAGGGAATC AGCTTGCCAA CTATTGGGAG ATTAATAGGT GGTTTTGCTG GTGAATCACC TGTAGGATAT CAGCCACCAA TTATTAACCC CAAATTAATG GCTCAAAACC TCAATGTTAC ATTTAGACCT TTACCTGATG GCTATCATAA TCAAATTATT ATGAAATTCC GGCAGGAATT AGCATTAAAA TTAAAAAGTT TTGGAGTTAA TGTTCTCTCT TGGCAGGAAG CGACAACAGA TATTTTTTAT GATATTAGAA TTCCTATATT AAACTTGAAT TGCTCATTTA AAATTAAGGG AGTTCGAGCA GAAATTGATG CAGTAATAGA TGTGGAAAGA CCAAATTCAT GGCTGAGAAA GTTAGGAATA TTCATAGCTG AAACTTTTTA TAAGTTATCT TATCCTTGGT TACTCAATCA GCAAAAAATG TCTGTGGTAC AAATTGCTAA ATTAAGTAGT TGGGCTGAAG ATCATGCTGC TAAATATGTT GAAGATCCAA CCAATACGCA GGTGATCATT CTTAGTGATA TAAATTATGA TTTTATCAAT CCCTTAACGA AATATCAGGA AAAAATTAGG ATTGGTATTA ATACACTAAT TAAAACATTC TCAGAAATCG TAATTGGCAT ATCTCCTGAG CAAATTTCTA TCCTGAATAT GAATCTTTCT GATTCTACTT TCTTTAAATC AGAAATGGAT GCTTTTGTTT CAAACTCACT TATTCCTAAA GTTTTTGTCC CTATTACTCC TCTATTAATG AGTAGATTTA AAATAGCACA ATATAATCCT TATATGTCTA AATATACCCC TAAATTGGTT AAACTAGGTC AAGAATTAGC CTCAACTGGT CTACTTCCAC CTGGATTTAA GTTGGCTGAA CTTATTAAAA GAAAATCCCA CAGAGATATT GTCAATGTTA TTGTTAATGG TAGAACTGGT GTTTCTTACG GGTTTGTAGC TTATGCTGAA CCTCCCTACT ATGTAGGAAA ACCAGAAATC TCTACTTATG AATGGGATAA ATTATTACCT GTTGCTGGAT TTAGTAGTAA TGAAATTCGG AAAAATGATG AAAGTAGACG TTATATAAAA ATCATCATTA ATGGAGAATA TGTATTTAAG CAAATTCCCG ATATTTGGCT AGTGAGTTCT CGTTCTGGTT CTAATAAAAC AGACTTAAAT CTTGAAGAAG ATATTATTCG TATTGGTTTA AAAGATGATT TACATTTGCA GTTACCTGTA GGAAGTTTGT CACGTAAATC TGATTTCAAA CCTTCTTATG ATATCTATGT GATGCTGGCT ATTAGTCTAG CTGCTGCTTT ATATACGCCA GAATTAATCA AAAATGGTGC GCCAATTGTT CATTTTCACG GTTATCCGGC ATTTGATTGG TTTAAAGAAA ATGAATATTG CGTCGGTGTT AATAATCCTT CTGTACCCTG TGGAACTTAT GAATCAGGTG TGTTTAATTT TTTAGGTCTT TCTAATTTAG CTAGTCAACA AACGAAAAAT ATTAAATTAG TGAGTTTGAT AGAACCAGAT CATGGTACAA ATTTTATTGC TCATGATATG GATTATCTAG TTGATAGGTT AAAACATGGG TGTGTAGCAG AACAAATTGA ACTAGGTGGA CAACATTTTG CTTCTTTGAA AGCAAATTTA GGTGATGGTG GGATTCCCAT ATAG
|
Protein sequence | MNASEILAKL TQQGVQFWVE NNKINIRSPK GVITSTIKAE IATYKGDILA LLQEMNLVTK SASEPLSQGI SLPTIGRLIG GFAGESPVGY QPPIINPKLM AQNLNVTFRP LPDGYHNQII MKFRQELALK LKSFGVNVLS WQEATTDIFY DIRIPILNLN CSFKIKGVRA EIDAVIDVER PNSWLRKLGI FIAETFYKLS YPWLLNQQKM SVVQIAKLSS WAEDHAAKYV EDPTNTQVII LSDINYDFIN PLTKYQEKIR IGINTLIKTF SEIVIGISPE QISILNMNLS DSTFFKSEMD AFVSNSLIPK VFVPITPLLM SRFKIAQYNP YMSKYTPKLV KLGQELASTG LLPPGFKLAE LIKRKSHRDI VNVIVNGRTG VSYGFVAYAE PPYYVGKPEI STYEWDKLLP VAGFSSNEIR KNDESRRYIK IIINGEYVFK QIPDIWLVSS RSGSNKTDLN LEEDIIRIGL KDDLHLQLPV GSLSRKSDFK PSYDIYVMLA ISLAAALYTP ELIKNGAPIV HFHGYPAFDW FKENEYCVGV NNPSVPCGTY ESGVFNFLGL SNLASQQTKN IKLVSLIEPD HGTNFIAHDM DYLVDRLKHG CVAEQIELGG QHFASLKANL GDGGIPI
|
| |