Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_4841 |
Symbol | |
ID | 9342648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 4954064 |
End bp | 4955353 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | |
Product | homoserine dehydrogenase |
Protein accession | YP_003723117 |
Protein GI | 298492940 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00593981 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGTATAA AACTAGGAAT ATTGGGATTA GGTACTGTGG GAACGGGGAC TGTACAACTG TTGCAAGACA GCGAAGGTCG TCACCCTCTA TTAAAAGAAA TAGAAATATA TCGGGTGGGA ATGCGATCAC TCTCCAAACC CCGTTTGGTA GAATTGCCAC CAGAGGTAGT AACTACAGAT TTAGAAGCGA TTGTCAATGA CCCAGAAATA GATATAATTG TTGAAATTAT GGGTGGATTG GAACCAGCGA GATCGCTCAT CCTCACCGCT ATTAAAAATG GTAAGCACGT AGTTACTGCT AATAAAGCCG CTATCTCTCG ATTTGGTGCA GAAATATTTA CCGCCGCCAA TCAAGCCGGG GTATATGTGA TGATAGAAGC ATCCGTAGGA GGTGGCATTC CCGTAATTCA ACCCCTGAAA CAGTCTTTGA GTGTCAACCG CATTCATGCT GTCACAGGCA TTGTTAACGG TACAACTAAC TATATCCTGA CACGGATGCA AACAGAAGGC AGTGACTTCG ATGATGTCCT AGCTGATGCT CAACGATTAG GTTATGCCGA AGCTGACCCC ACCGCTGATG TTGATGGCTT AGATGCAGCC GATAAAATAG CTATCTTGGC CTCATTAGGC TTTGATGGCC GCATCAACTT ACAAGATGTG TATTGTGAGG GAATTCGTCA AGTCAGTAAG ACAGATATTA GCTACGCCAC CAAGTTAGGA TTTGTGATTA AATTATTAGC AATTGCCAAA GGGCAAACTA GCGATAACTC TCAACTATCA GTTAGAGTTC ATCCCACCCT AGTACCTCAA ACACATCCTT TAGCCAGTAT TAACGGTGTT TATAATGCCA TTCTTGTCGA AGGAGAACCC ATTGGCCAAG TGATGTTTTT TGGGCCTGGT GCTGGTGCTG GTGCTACTGC TAGTGCCGTT TGCTCGGATA TCTTAAATCT GTTAGCAACT CTGAAAATTA ATACCCCCAA ATCTAATCCT CTCTTAGCCT GTAGACATCA AAATTACCTA CAAATTGCAC CAATTTCTGA ACTTGTAACC CGATTTTATG CGCGGTTTTT GACAAAAGAC CAACCAAGTG TAATTGGTAA ATTAGGTACT TGCTTTGGCA AATATGGCGT GAGCCTGGAG TCCCTAGTGC AAACTGGTTT TCAGGCAGAA CTAGCCGAGA TTGTTGTTGT CACCCATGAT GTTCGGGAAG GAGATTTTAG ACAAGCTTTA GCCGAAATTC AAACCCTAGA AGCAATAGAT AGCATTCCCA GCATCTTAAG AGTACTTTGA
|
Protein sequence | MGIKLGILGL GTVGTGTVQL LQDSEGRHPL LKEIEIYRVG MRSLSKPRLV ELPPEVVTTD LEAIVNDPEI DIIVEIMGGL EPARSLILTA IKNGKHVVTA NKAAISRFGA EIFTAANQAG VYVMIEASVG GGIPVIQPLK QSLSVNRIHA VTGIVNGTTN YILTRMQTEG SDFDDVLADA QRLGYAEADP TADVDGLDAA DKIAILASLG FDGRINLQDV YCEGIRQVSK TDISYATKLG FVIKLLAIAK GQTSDNSQLS VRVHPTLVPQ THPLASINGV YNAILVEGEP IGQVMFFGPG AGAGATASAV CSDILNLLAT LKINTPKSNP LLACRHQNYL QIAPISELVT RFYARFLTKD QPSVIGKLGT CFGKYGVSLE SLVQTGFQAE LAEIVVVTHD VREGDFRQAL AEIQTLEAID SIPSILRVL
|
| |