Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_5028 |
Symbol | |
ID | 9342836 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 5146633 |
End bp | 5148036 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | |
Product | 3-isopropylmalate dehydratase large subunit |
Protein accession | YP_003723260 |
Protein GI | 298493083 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.44739 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAGG GAACTCTGTT TGACAAAGTT TGGGACTTCC ACACCGTTGG GACACTTCCA TCAGGACTAA CGCAACTATT TATTGGACTT CATCTCATCC ATGAAGTTAC CAGTCCCCAA GCCTTTGCTA TGCTCAAAGA AAGGGGTTTA AAAGTTTTAT TTCCACAACG CACAGTAGCG ACAGTTGATC ATATCGTTCC TACAGAGAAT CAAGCCCGTC CCTTTGTGGA CAGTATGGCC GAAGAAATGA TCCAGGCTTT AGAAAAGAGT TCTCAAGAAA ATGACATAAC TTTTTACAAT ATTGGTTCAG GAAATCAAGG TATAGTTCAC GTCATTGCCC CGGAACTGGG ACTAACTCAA CCGGGAATGA CCATAGCTTG TGGAGATAGC CATACATCGA GTCATGGTGC CTTTGGTGCG ATCGCATTTG GTATTGGTAC AAGCCAAGTT CGCGATGTTC TAGCCTCCCA AACCTTAGCA TTATCTAAAC TCAAAGTCCG CAAAATCGAA GTTAACGGCA ACTTAAAACC TGGAGTTTAC GCCAAAGATG TAATTTTACA CATCATTCGC ACATTAGGCG TAAAAGGTGG TGTAGGCTAC GCTTACGAAT TTGCAGGAAC AACCCTTGCA AAAATGAACA TGGAAGAACG GATGACCGTT TGCAACATGG CCATAGAAGG TGGTGCAAGA TGCGGTTACG TCAACCCCGA TCATATTACC TACGACTATT TAAAAAATAG AGACTTCGCC CCTAAAGATG CCAATTGGGA ACAAGCCGTT ACTTGGTGGG AATCCCTACG GAGTGATGCC GATGCTGAAT ATGATGATGT AGTACTATTT AATGGCGAAT ACATTCCCCC CACAATCACA TGGGGAATTA CACCAGGTCA AGGAATTGGC GTAGATCAAA AAGTTCCCAC AGCCGAAGAA CTCTTAGAAG AAGACCGCTT TGTAGCCCAA GAAGCATATC GCTACATGGA CTTATACCCC GGTCAACCCA TCCAAGGAAC AAAAATTGAC GTTTGCTTCA TAGGTAGCTG CACCAACGGA CGGATTAGCG ACTTACGAGA AGCTGCTAAA ATTGCCCAAG GTCGCAAAGT AGCAGAGCAT GTGAAAGCTT TCGTTGTTCC CGGTTCAGAG AGAGTCAAAA AAGAAGCCGA AGCCGAAGGA CTAGATAAAA TATTTCTCGC AGCCGGTTTT GAATGGAGAG AACCAGGATG TTCCATGTGT TTAGCCATGA ACCCCGACAA ACTCCAAGGT AGACAAATTA GCGCCTCCTC CTCCAACCGC AACTTTAAAG GAAGACAAGG TTCTGCTTCC GGTCGTACCC TACTCATGAG TCCCGCAATG GTAGCTACAG CCGCTATTAA GGGGGAGGTG TCCGACGTGC GCGAATTGCT TTAA
|
Protein sequence | MSKGTLFDKV WDFHTVGTLP SGLTQLFIGL HLIHEVTSPQ AFAMLKERGL KVLFPQRTVA TVDHIVPTEN QARPFVDSMA EEMIQALEKS SQENDITFYN IGSGNQGIVH VIAPELGLTQ PGMTIACGDS HTSSHGAFGA IAFGIGTSQV RDVLASQTLA LSKLKVRKIE VNGNLKPGVY AKDVILHIIR TLGVKGGVGY AYEFAGTTLA KMNMEERMTV CNMAIEGGAR CGYVNPDHIT YDYLKNRDFA PKDANWEQAV TWWESLRSDA DAEYDDVVLF NGEYIPPTIT WGITPGQGIG VDQKVPTAEE LLEEDRFVAQ EAYRYMDLYP GQPIQGTKID VCFIGSCTNG RISDLREAAK IAQGRKVAEH VKAFVVPGSE RVKKEAEAEG LDKIFLAAGF EWREPGCSMC LAMNPDKLQG RQISASSSNR NFKGRQGSAS GRTLLMSPAM VATAAIKGEV SDVRELL
|
| |