Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_1372 |
Symbol | |
ID | 9339167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 1442218 |
End bp | 1443351 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | |
Product | homocitrate synthase |
Protein accession | YP_003720748 |
Protein GI | 298490571 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCAAA TCATTATTAA TGATACCACA TTGCGTGATG GAGAACAAGC AGCAGGTGTT GCTTTTAACT TAGAAGAAAA GGTAGCAATC GCACAATTTC TTGATGCCAT TGGTGTTCCC GAATTAGAAG TCGGTATACC CGCAATGGGA GAAGAAGAAA TTCGTGCAAT TCAAGCAATT GCTGACTTAG ATTTAGATGC TAAATTACTA GGTTGGAACC GCGCTGTTAT CTCAGATATT AAAGCTTCTA TTGCCTGTGG ATTAGAAAGA ATACATATTG CTATTCCTGT TTCTGGTATA CAAATTGCCG CTAAATTTCA TGGTCAATGG AGAGTAAGTT TACAAAGACT CAAAGACTGT ATTAGCTTCG CAGTTGATAA TGGTCTTTGG GTTGCAGTAG GAGGAGAGGA CTCTTCTAGA TCTGATGAAA ACTTCCTTGA AGAGGTTGCA TTAAACGCTC AAGAATGGGG TGCGTCAAGA TTTCGTTTTT GCGATACAGT TGGAGTTCTT GATCCCTTTG GAACTCACCT CAAAGTTAAG CGATTAGTAT CTACTTTATC AATTCCTGTA GAAATTCACA CCCATAATGA TTTTGGACTA GCAACTGCTA ACGCTATTGC CGGTATTAAA GCCGGAGCAA CTTCTGTTAA TACCACCGTT AATGGTTTAG GTGAAAGAGC AGGAAATGCA GCTTTAGAAG AAGTTGTCAT GGCAATAAAA TGTATCTACG GTGTTGATTT AGGAATTGAC ACTCGACATT TATTAGGACT ATCTCAATTA GTTGCTGCTG CATCCGGTTC AAAGGTCCCA CCTTGGAAAG CAATTGTTGG TGAAAATACC TTTGCTCACG AGTCTGGAAT TCATGCTCAT GGTGTACTCA AAAACCCAGA AACCTATGAA CCATTTTCAC CAGAAGAAGT AGGTTGGGAA CGTCGTTTAG TAATCGGTAA ACATTCTGGA AGGCATTTAT TATCTAACTT GTTAGAACAG TATGGGATCT TTTTAAACTC AGAAGAAACC CAGTCTGTAT TAGATGCAGT GCGCTTACAA TCAACACTGA GAAAACGCAG TCTCACCACA GAAGAGTTAT TGAATTTAGT AGGTGAACAG AGGTATTCCC ATGCAACGCG ATGA
|
Protein sequence | MSQIIINDTT LRDGEQAAGV AFNLEEKVAI AQFLDAIGVP ELEVGIPAMG EEEIRAIQAI ADLDLDAKLL GWNRAVISDI KASIACGLER IHIAIPVSGI QIAAKFHGQW RVSLQRLKDC ISFAVDNGLW VAVGGEDSSR SDENFLEEVA LNAQEWGASR FRFCDTVGVL DPFGTHLKVK RLVSTLSIPV EIHTHNDFGL ATANAIAGIK AGATSVNTTV NGLGERAGNA ALEEVVMAIK CIYGVDLGID TRHLLGLSQL VAAASGSKVP PWKAIVGENT FAHESGIHAH GVLKNPETYE PFSPEEVGWE RRLVIGKHSG RHLLSNLLEQ YGIFLNSEET QSVLDAVRLQ STLRKRSLTT EELLNLVGEQ RYSHATR
|
| |