Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_4936 |
Symbol | |
ID | 9342742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 5050986 |
End bp | 5052248 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003723192 |
Protein GI | 298493015 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.350642 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAA TTGCTTATCT TCAATGTCCA ACGGGAATTT CCGGTGATAT GTGCCTGGGT TCGTTGGTAA GTCTGGGTGT TCCCGTAGAG TATTTAGTAG GAAAACTTAA TGCTTTGGGA ATTGAGCAGG AATACCAATT ACGAGCAGAA CTTGTTCATC GGCAGACTCA GCAAGCTACT AAAATCCATG TCCATTTAGT ACACCACCAC CATTATCACC ATCACCATCA TGGCCGCCAT CTACCAGAAA TAGAGAAGAT GATTCTCAAA GCTAAATTGC CATCACAGGC AGAAGCTTGG AGTTTAGCTG TATTCCGCCA GTTAGCAGTG GCAGAGGGAG CGGTACATGG TATTGAACCG GAAAAAGTTA ATTTTCATGA GGTGGGTGCT GTAGACGCGA TTGTAGATAT TGTTGGTACT TGCTTGGGGT TGGATTGGTT AGGTATCGAC AGCAATCAGG AAGGATTACC TTTGATATAC TGCTCACCGT TTCCGACTGG TGGGGGAACT GTTCGCGCTG CACATGGTCA AATGCCAGTA CCAGTACCAG CAGTATTGAA GTTGTGGGAA ATGCGGAGTT GTCCAGTGTA TAGTAATGGT ATTGACAAAG AACTAGTAAC ACCGACAGGG GCAGCGATCG CAACTACCTT AGCTAGAAGT TTTAGTTCAC CACCTCCAAT TATCCTCAAA CAAGTGGGAC TAGGAGCAGG TTCTCTCGAT TTACTCATCC CCAATATACT ACGACTCTGG ATCGGTGAAA GTGTAAATGA AAAGACAAAT ATTCCTGATT TTGCTAATAC TAGTCCTAAT TTAGAAATCA TCTCTGTCCT AGAAACTCAA ATTGATGACT TAAATCCCCA AGCTGTTGGC TATGTCTTTG ATGCTTTATT TGCCGCAGGG GCCGTGGATG TCTTTACTCA AGGTATAGGA ATGAAAAAAT CCCGTCCAGG GCTTTTGTTG ACTGTAATTT GTTATCCAGA ACATTTAGCC AGTTGTGAAG AGATTTTATT CCGCGAAACC ACTACTTTAG GAATTCGTCG GACTACTGAA CAACGCACTA TTTTACAACG GGAAATACAA CAAGTAGAAA CGCCTTATGG TAATGTGCGT GTGAAAGTGG CCTGGAAAGG ACAAGCAACA GAGAAAAGTA TTACTAACGT GCAGCCAGAA TATGAAGATT GTGCAGACTT AGCACGAAAA CATAATATTC CCTGCCGTGA AATTCAACGT TTGGCGTTAC ATAATTGGTA TTGTCAAACT TAA
|
Protein sequence | MNKIAYLQCP TGISGDMCLG SLVSLGVPVE YLVGKLNALG IEQEYQLRAE LVHRQTQQAT KIHVHLVHHH HYHHHHHGRH LPEIEKMILK AKLPSQAEAW SLAVFRQLAV AEGAVHGIEP EKVNFHEVGA VDAIVDIVGT CLGLDWLGID SNQEGLPLIY CSPFPTGGGT VRAAHGQMPV PVPAVLKLWE MRSCPVYSNG IDKELVTPTG AAIATTLARS FSSPPPIILK QVGLGAGSLD LLIPNILRLW IGESVNEKTN IPDFANTSPN LEIISVLETQ IDDLNPQAVG YVFDALFAAG AVDVFTQGIG MKKSRPGLLL TVICYPEHLA SCEEILFRET TTLGIRRTTE QRTILQREIQ QVETPYGNVR VKVAWKGQAT EKSITNVQPE YEDCADLARK HNIPCREIQR LALHNWYCQT
|
| |