Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_3904 |
Symbol | |
ID | 9341708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 3957728 |
End bp | 3958786 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_003722531 |
Protein GI | 298492354 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00581284 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGTTG TCATGAAAAT CGGTTCTCCA CAGGCAGAAA TAGACCGTAT GAGTCAGGAA CTAATTAGCT GGGGTTTAAC ACCAGAAAAA ATCATTGGAC AACATAAAGT AGTAATTGGT TTAGTAGGTG AAACTGCTGA TTTAGATCCG CTACAAATTC AGGAACTCAG TCCATGGATT GAGCAAGTGT TACGAGTGGA AGTACCATAT AAAAGAACTA GTCGTCAGTT TCGCCATGGG GAAGCTTCTG AGGTGGTGGT GAATACTCCC AACGAGGATG TGGTGTTTGG TGAACATCAT TCCTTAGTAA TCGTTGCTGG CCCCTGTTCG GTGGAAAATG AAGAAATGAT TGTGGAGACA GCGCGGCGTG TAAAAGCTGC TGGTGCTAAA TTTTTACGAG GTGGTGCGTA TAAACCCCGG ACTTCCCCTT ACGCTTTTCA AGGTCACGGT GAAAGTGCTT TGGAATTGTT AGCTAAGGCG CGGGAAGTGA GTGGACTAGG AATAATTACG GAAGTGATGG ATGGGGGTGA TTTGGAAAAA ATCGCCGAGG TTGCGGACAT GATTCAGGTT GGGGCGAGAA ATATGCAGAA TTTCTCCCTG CTCAAACAGG TCGGGTCGCA ATCGAAACCC GTGCTGTTAA AACGCGGAAT GGCGGCAACA ATTGAAGATT GGTTAATGGC GGCTGAGTAT ATTTTGGCGG CGGGTAATCC CAATGTGATT TTGTGTGAAC GAGGAATTAG AACTTTTGAC CGTCAGTATA CGCGCAATAC TCTGGATTTA TCAGTTGTGC CAGTGTTAAG AAAGCTGACC CACCTGCCCA TTATGATTGA TCCTAGTCAT GGTGTGGGTT GGTCTGAGTT TGTACCTTCG ATGGCGATGG CAGCGATCGC AGCTGGTACA GATTCTCTCA TGATTGAGGT ACATCCCAAC CCCGCCAAAG CCTTATCTGA TGGACCCCAA TCTTTAACAC CAGACCGATT TGATAAGTTA ATGTCAGAAT TGGCAGTAAT TGGTAAAGTC ATGGAACGCT GGCCACAAGC AGCACTTGTA GCAGTCTAA
|
Protein sequence | MIVVMKIGSP QAEIDRMSQE LISWGLTPEK IIGQHKVVIG LVGETADLDP LQIQELSPWI EQVLRVEVPY KRTSRQFRHG EASEVVVNTP NEDVVFGEHH SLVIVAGPCS VENEEMIVET ARRVKAAGAK FLRGGAYKPR TSPYAFQGHG ESALELLAKA REVSGLGIIT EVMDGGDLEK IAEVADMIQV GARNMQNFSL LKQVGSQSKP VLLKRGMAAT IEDWLMAAEY ILAAGNPNVI LCERGIRTFD RQYTRNTLDL SVVPVLRKLT HLPIMIDPSH GVGWSEFVPS MAMAAIAAGT DSLMIEVHPN PAKALSDGPQ SLTPDRFDKL MSELAVIGKV MERWPQAALV AV
|
| |