Gene Information |
Locus tag | Aazo_3314 |
Symbol | |
ID | 9341118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 3391591 |
End bp | 3392634 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | |
Product | thiamine-phosphate pyrophosphorylase |
Protein accession | YP_003722112 |
Protein GI | 298491935 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
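The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends with a single stop codon. The function names `gene_length` and `protein_length` are hypothetical, introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record for Aazo_3314.
length_bp = gene_length(3391591, 3392634)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1044 347
```

Under these assumptions the computed values agree with the listed gene length (1044 bp) and protein length (347 aa).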
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTACAG CCCATCATCA AGGACAAGAA ACACAACAAG TGGTATACCG CATTCTGGAT GCTAATTTGG ATCGCGCTCG TGAAGGGTTG CGTATTATTG AAGAATGGTG TCGCTTTGGA TTAAATGATG CCTCGTTAGC TGAAGCTTGT AAGCACTTAC GTCAAGAGCT CGGTCGGTGG CATACCGCAC AAATGAGGGC AGCACGAGAT ACACTTGGTG ATCCTGGTAC GGGTTTAACT CATCCTCAAG AGGAACAACG GGCTGATATC ACATCTTTGT TACAAGCTAA TTTTTGTCGT CTCCAAGAAG CACTGAGGGT TTTGGAGGAA TATGGCAAAC TGTATAACCC AAATATGGGG AGTGCTTTTA AGCAGATGCG TTATCAGGTT TATACTCTAG AAAGCACTTT GATGGGTTAT CAACGTCATC AATTACTGGG GCAATCGCGT CTATATTTGG TAACATCGCC AGTAGATCAT TTTTTGGAGA CTGTGGAAGC AGCTCTCAAA GGCGGACTAA TGCTGCTACA GTTTCGTGAA AAGACATCTG ATGATCTGAC TCATCTGGAA AGAGCCAGGA AACTCCAGCA ACTATGTCAC GATTATGGTG CTTTGTTTAT CATCAATGAC AGGGTAGATT TGGCGCTGGC TGTGGGTGCA GATGGAGTGC ATTTAGGACA ACAAGATATG CCGATCGCAG TTGCTAGGCA ATTATTGGGT TCACAACGGT TAATTGGGCG GTCTACCACA AATCCCCAAG AGATGCAAGG GGCTATTGCG GAAGGTGCAG ATTATATTGG TGTGGGTCCA GTCTATGAAA CACCAACTAA AGTAGGTAAA GCCGCAGCAG GTTTGGAATA TGTCAGATAT GCGTCCAAAA ATTGTCCAGT TCCTTGGTTT GCGATTGGGG GAATTGATGC GAGTAATATT AATGATGTGA TTGATGCTGG AGGTCAAAGA GTAGCTGTAG TACGAGCGCT CATGCAAGCT GAACAACCTA CATTAGTCAC ACAGTATTTT ATTTCTCAGC TTTATCGGAA GTAG
|
Protein sequence | MVTAHHQGQE TQQVVYRILD ANLDRAREGL RIIEEWCRFG LNDASLAEAC KHLRQELGRW HTAQMRAARD TLGDPGTGLT HPQEEQRADI TSLLQANFCR LQEALRVLEE YGKLYNPNMG SAFKQMRYQV YTLESTLMGY QRHQLLGQSR LYLVTSPVDH FLETVEAALK GGLMLLQFRE KTSDDLTHLE RARKLQQLCH DYGALFIIND RVDLALAVGA DGVHLGQQDM PIAVARQLLG SQRLIGRSTT NPQEMQGAIA EGADYIGVGP VYETPTKVGK AAAGLEYVRY ASKNCPVPWF AIGGIDASNI NDVIDAGGQR VAVVRALMQA EQPTLVTQYF ISQLYRK
|
| |
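The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the gene sequence is given 5' to 3' on the coding strand (it starts with ATG and ends with TAG, even though the gene lies on the minus strand of the replicon) and uses the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation codons. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
# Translation table 11 uses the same amino-acid assignments; it differs from
# table 1 in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGGTTACAG CCCATCATCA AGGACAAGAA"))  # MVTAHHQGQE
# Last 12 bases of the gene sequence (three codons plus the TAG stop).
print(translate("TATCGGAAGTAG"))  # YRK
```

The two fragments reproduce the first ten residues (MVTAHHQGQE) and the last three residues (YRK) of the listed protein sequence. Passing the full gene sequence to `translate` works the same way.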