Gene Information |
Locus tag | Aazo_4800 |
Symbol | |
ID | 9342607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 4906566 |
End bp | 4908224 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | |
Product | 2-isopropylmalate synthase/homocitrate synthase family protein |
Protein accession | YP_003723093 |
Protein GI | 298492916 |
COG category | |
COG ID | |
TIGRFAM ID | |
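The coordinate, length, and protein-length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4906566
end_bp = 4908224
gene_length_bp = 1659
protein_length_aa = 552

# Assumption: coordinates are inclusive, so length = end - start + 1.
computed_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon
# is a stop codon, which does not contribute an amino acid.
codons, remainder = divmod(computed_length, 3)
computed_protein_length = codons - 1

print("gene length matches:", computed_length == gene_length_bp)
print("whole number of codons:", remainder == 0)
print("protein length matches:", computed_protein_length == protein_length_aa)
```

Under these assumptions the three fields agree: 1659 bp is 553 codons, which is 552 amino acids plus one stop codon.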
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACAA CTCCCTCAAA TCAACTTTGG CTCTATGACA CTACTCTCCG GGATGGTACG CAACGGGAAG GACTATCAGT GTCTATAGAA GATAAGTTAC GCATTGCTCA CAGACTCGAT GAATTAGGCA TACCCTTTAT TGAAGGTGGT TGGCCAGGAG CTAATCCTAA AGATGTTCAA TTTTTCTGGC AACTCCAAGA AAATCCCCTC AAACAAGCAG AAATAGTTAC CTTTTGCTCA ACTCGTCGTC CTCACTCTAC AGCAGCAGAA GAACCAATGC TGCAAGCCAT ACTGTCTGCG GGTACTCGCT GGGTGACAAT TTTTGGTAAG TCTTGGGATT TCCACGTCAT TGAAGGACTC AAGACCAGTT TAGAAGAAAA CTTAGCCATG ATCAGCGATA CAATTGCATA TCTCCGTTCT CAAGGACGGC GTGTGATTTA TGATGCTGAA CATTGGTTTG ATGGCTACAA ACAAAATCCT GACTATGCTC TCCAGACAAT AAAGGCAGCC GCAACAGTAG GGGCCGAGTG GTTAGTACTA TGTGATACTA ATGGTGGTAC TTTACCTCAT GAAATTACGC GGATCGTTAA AAATGTTGTG TTGGCAACTG GGGACTGGGA ACTGGGGACT GGGAATACAG AAAAAACTCT GACCCAATCT CCACTTCCCC AAATCGGAAT TCACACCCAT AATGATTCAG AGATGGCGGT TGCTAATGCC CTAGCAGCAG TCATGGCAGG AGCGAAGATG GTGCAGGGTA CAATCAATGG TTATGGGGAA CGCTGTGGTA ATGCTAACCT GTGTTCTGTG ATTCCCAATT TACAGTTAAA GCTTGATTAT AGCTGTATCG GTGAACACCA GCTAAATCAA CTTACAGAAA CTAGTCGGTT TGTCAGCGAA GTTGTCAATC TTGCGCCTGA TGAACACGCG CCTTATGTGG GACGTTCTGC TTTTGCTCAC AAAGGTGGTA TTCATGTATC TGCGGTAGAA CGTAATCCTT TTACTTATGA ACATATTCAG CCGGAACAAG TGGGGAATCG TCGCCGCATT GTTATTTCTG AACAGTCTGG TTTAAGTAAT GTCATAGCTA AAGCTCGGTC TTTGGGAATG GAATTAGATA AAAATGATCC CCAAGCCCGG CAAATTCTTC AACGTATGAA AGAATTGGAG AGTGAAGGAT ATCAATTTGA AGCTGCAGAA GCCAGTTTTG CTCTATTAAT GTATGAAGCT TTGGGACTGC GAGAACAGTT TTTTGAAGTT AAAGGTTTTC AAGTCCACTG TGATTTAGTA GAGGTGAAAG AAACTACTAA TTCTTTAGCA ACGGTGAAAG TAGGTGTAAA TGGTATAAAT ATTCTTGAAG CAGCAGAAGG TAATGGACCA GTGGCGGCTT TAGATGATGC TTTACGTAAA GCTTTGGTGA ACTTTTATCC GCAAGTTGCC GACTTTGAGT TGACAGATTA TAAAGTCAGA ATTCTCAACG GAAATACGGG TACAGCAGCA AAAACCCGTG CTTTGGTAGA ATCGGGAAAT GGTCAACAAC GTTGGACAAC TGTAGGAGTT TCTACAAATA TTTTGGAGGC TTCCTATCAA GCTGTAGTTG AAGGTTTGGA ATATGGTTTG TTGTTGAATT TCCAAGCTGA AAAGGCTCTG AAAGTTTAG
|
Protein sequence | MTTTPSNQLW LYDTTLRDGT QREGLSVSIE DKLRIAHRLD ELGIPFIEGG WPGANPKDVQ FFWQLQENPL KQAEIVTFCS TRRPHSTAAE EPMLQAILSA GTRWVTIFGK SWDFHVIEGL KTSLEENLAM ISDTIAYLRS QGRRVIYDAE HWFDGYKQNP DYALQTIKAA ATVGAEWLVL CDTNGGTLPH EITRIVKNVV LATGDWELGT GNTEKTLTQS PLPQIGIHTH NDSEMAVANA LAAVMAGAKM VQGTINGYGE RCGNANLCSV IPNLQLKLDY SCIGEHQLNQ LTETSRFVSE VVNLAPDEHA PYVGRSAFAH KGGIHVSAVE RNPFTYEHIQ PEQVGNRRRI VISEQSGLSN VIAKARSLGM ELDKNDPQAR QILQRMKELE SEGYQFEAAE ASFALLMYEA LGLREQFFEV KGFQVHCDLV EVKETTNSLA TVKVGVNGIN ILEAAEGNGP VAALDDALRK ALVNFYPQVA DFELTDYKVR ILNGNTGTAA KTRALVESGN GQQRWTTVGV STNILEASYQ AVVEGLEYGL LLNFQAEKAL KV
|
| |
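The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the method the database used: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is built from the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its set of permitted start codons, which does not matter here because this gene starts with ATG). Only the first 30 bases and the last 9 bases of the gene sequence are used as input, to keep the example short.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments shared by
# translation tables 1 and 11. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the last nine bases of the gene sequence above.
gene_start = "ATGACCACAA CTCCCTCAAA TCAACTTTGG"
gene_end = "AAAGTTTAG"

print(translate(gene_start))  # MTTTPSNQLW
print(translate(gene_end))    # KV
print(f"GC content of the first 30 bases: {gc_percent(gene_start):.1f}%")
```

The two translated fragments correspond to the first ten and last two residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the GC content and protein sequence fields as a whole.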