Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_0368 |
Symbol | |
ID | 9338153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 369770 |
End bp | 371461 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_003720062 |
Protein GI | 298489885 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGAAA ATATTAAGAG TAAATTTATC ACACAAGGGG TACAGCGATC GCCTAACCGG GCTATGTTGC GGGCTGTTGG TTTTCAAGAT GCAGACTTCA ACAAAGCCAT AGTCGGTATT GCTAATGGTT ACAGTACCAT CACCCCCTGT AATATGGGGA TTAACCAACT AGCACAAAGG GCAGAAATGA GCCTCAGAAA TGCAGGTGCA ATGCCCCAAG TTTTCGGCAC AATTACCATC AGTGATGGGA TTTCAATGGG AACAGAGGGG ATGAAGTTCT CTCTGGTGTC ACGGGAAGTG ATCGCAGATT CCATTGAAAC TGCATGTACT GGCCAAAGTA TGGATGGTGT TCTGGCTATT GGTGGTTGTG ACAAAAATAT GCCAGGGGCA ATGTTAGCAA TCGCTCGCAT GAATATCCCT GCTATCTTTG TTTATGGTGG CACAATCAAA CCCGGACACT ACGATGGCAA AGATTTAACC GTTGTTAGTT CCTTTGAAGC CGTCGGCCAA CACAGCGCCG GCAAAATTGA CGAAACCGAA CTTTTAGAAA TTGAACGCCG TGCTTGTCCT GGTGCTGGGT CCTGTGGTGG AATGTTCACA GCTAATACCA TGTCTTCAGC CTTTGAAGCA ATGGGAATGA GTTTACCTTA TTCCTCAACC ATGGCCGCAG AAGATGCCGA AAAAGCGGAT AGCACAGAGA AATCCGCCTT TATCTTAGTC GAAGCCATCC GTAAGCAAAT ATTACCCCGG CAACTTATCA CCCGGAAATC TATCGAAAAT GCCATATCTG TAATTATGGC CGTTGGTGGT TCTACCAATG CAGTTTTACA TTTTTTAGCG ATCGCCCGCG CAGCTGGTGT AGAACTAACT TTAGACGACT TTGAAACCAT CCGGGCCCGT GTTCCAGTTT TGTGCGATTT AAAACCCAGT GGTAGATACG TAGCCACAGA CTTGCATAAA GCTGGTGGTA TTCCTCAAGT CATGAAAATC TTATTAGTTC GTGATTTACT GCATGGTGAC TGTCTAACTA TCTCTGGTCA AACTGTAGCC GAAGTCTTAG CAGACATACC AGCAGAACCA TCACCCAAGC AAAATGTAAT TCGTCCTTGG GATCGTCCCA TCTATGCACA AGGACATTTA GCTATTCTCA AAGGTAACTT AGGTACTGAA GGTGCAGTTG CTAAAATTAC TGGTGTGAAA AAACCCATCA TCACCGGGCC AGCGAGAGTA TTTGAATCAG AAGAATCTTG CTTAGATGCA ATTTTAGCAG GTAAGATTAA AGCAGGTGAT GTGATCATCA TCCGTTACGA AGGTCCAAAA GGTGGGCCTG GTATGCGGGA AATGTTGGCT CCCACCTCAG CAATTATTGG TGCGGGATTA GGTGATTCTG TGGGCTTAAT TACGGATGGA CGCTTTTCTG GCGGTACTTA TGGGATGGTA GTTGGTCACG TCGCTCCAGA AGCTGCAGTC GGCGGTAATA TTGCCTTGGT AGAAGAAGGT GATAGTATCA CCATTGATGC TAATTCTCGA TTATTACAAG TGAATATATC GGATGCAGAA TTAGCTAGTC GTCGTGCTAA CTGGCAACCG CGTCCACCAC GTTATACAAA AGGGGTGCTG GCGAAATATG CCAAGTTGGT ATCTTCTAGT AGTGTTGGTG CTGTTACAGA CTTAGATTTG TTTGGTAATT AA
|
Protein sequence | MSENIKSKFI TQGVQRSPNR AMLRAVGFQD ADFNKAIVGI ANGYSTITPC NMGINQLAQR AEMSLRNAGA MPQVFGTITI SDGISMGTEG MKFSLVSREV IADSIETACT GQSMDGVLAI GGCDKNMPGA MLAIARMNIP AIFVYGGTIK PGHYDGKDLT VVSSFEAVGQ HSAGKIDETE LLEIERRACP GAGSCGGMFT ANTMSSAFEA MGMSLPYSST MAAEDAEKAD STEKSAFILV EAIRKQILPR QLITRKSIEN AISVIMAVGG STNAVLHFLA IARAAGVELT LDDFETIRAR VPVLCDLKPS GRYVATDLHK AGGIPQVMKI LLVRDLLHGD CLTISGQTVA EVLADIPAEP SPKQNVIRPW DRPIYAQGHL AILKGNLGTE GAVAKITGVK KPIITGPARV FESEESCLDA ILAGKIKAGD VIIIRYEGPK GGPGMREMLA PTSAIIGAGL GDSVGLITDG RFSGGTYGMV VGHVAPEAAV GGNIALVEEG DSITIDANSR LLQVNISDAE LASRRANWQP RPPRYTKGVL AKYAKLVSSS SVGAVTDLDL FGN
|
| |