Gene Information |
Locus tag | Aazo_4898 |
Symbol | |
ID | 9342705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 5010588 |
End bp | 5011871 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | |
Product | glycine hydroxymethyltransferase |
Protein accession | YP_003723157 |
Protein GI | 298492980 |
COG category | |
COG ID | |
TIGRFAM ID | |
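The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 5010588
end_bp = 5011871

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1284 427
```

Both results match the "Gene Length" and "Protein Length" fields of the record.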
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATAATA ACAACTCAGA CCTACTTAAA TCCGCCGATT CTGCTGTTAG CGAGTTAATT AACCAAGAAC TACAGCGTCA ACGTGACCAC TTAGAGTTAA TTGCCAGTGA GAACTTCACC TCGCCTTCTG TATTAGCGGC TCAAGGCTCT ATATTAACCA ATAAGTATGC AGAAGGCTTA CCAGGTAAAC GCTATTATGG CGGTTGTGAA TTCGTTGACA AAATTGAGCA AATAGCCATT GACAGAGCTA AACAGTTATT TGGTGCTGCT CATGCTAACG TTCAACCCCA TTCTGGCGCA CAAGCTAACT TTGCTGTGTT TCTAACTTTG TTAGAACCAG GGGACACAAT TATGGGCATG GACTTGTCAC ACGGTGGACA CCTGACCCAT GGTTCACCAG TTAATGTTTC CGGTAAGTGG TTTAAAGTAC GTCACTATGG CGTGAGTAGG GAAACAGAAC AACTGGACTA TGACCAAATC CGTGATTTAG CCCTGAAAGA ACGTCCTAAG CTGTTAATTT GTGGTTATTC AGCTTATCCC CGGATTATAA ACTTTGAAAA ATTCCGCAGC ATTGCTAATG AAATAGGTGC TTACCTGCTT GCCGATATTG CTCATGTTGC TGGCTTAGTG GCTACAGGAC ATCATCCCAA CCCGCTTCCT TATTGTGATG TAGTAACAAC AACAACTCAC AAAACTTTAC GGGGTCCCAG AGGAGGTTTG ATTTTAACCC CTGATCCAGA ACTGGGTAAA AAGCTGGATA AATCAGTTTT CCCTGGCACT CAAGGAGGGC CATTAGAACA CGTTATTGCT GGTAAAGCAG TAGCTTTTGG TGAAGCTCTC AAGCCAGAGT TTACAACCTA TTCTGGTGAA GTAATTGAAA ATGCTCGCGC TTTGGCTACC CAACTACAAA ATAGGGGGTT AAAGCTAGTA TCAGATGGGA CTGATAATCA TTTAATATTA GTAGATTTAC GTTCTATCGA CATGACCGGT AAGAAAGCTG ATCAGCTACT TAGCGGTGTG AATATTACTG CTAATAAAAA TACTGTACCT TTTGAGCCAG AATCACCATT TATTACCAGT GGTCTCAGAC TAGGTTCACC GGCAATGACA ACAAGGGGTT TAGGTGCAAC AGAATTTAGG GAAATTGGTG ATATTATTAG CGATCGCTTA CTTGACCCAG GTTCAGATAA AGTAGCCAAG GATTGTAAGC AACGAGTAGC ATCATTGTGC AATCGCTTCC CCTTGTATCC TCATTTAGAA ATTCCCCAGC CAGTTCTAGC GTAA
|
Protein sequence | MNNNNSDLLK SADSAVSELI NQELQRQRDH LELIASENFT SPSVLAAQGS ILTNKYAEGL PGKRYYGGCE FVDKIEQIAI DRAKQLFGAA HANVQPHSGA QANFAVFLTL LEPGDTIMGM DLSHGGHLTH GSPVNVSGKW FKVRHYGVSR ETEQLDYDQI RDLALKERPK LLICGYSAYP RIINFEKFRS IANEIGAYLL ADIAHVAGLV ATGHHPNPLP YCDVVTTTTH KTLRGPRGGL ILTPDPELGK KLDKSVFPGT QGGPLEHVIA GKAVAFGEAL KPEFTTYSGE VIENARALAT QLQNRGLKLV SDGTDNHLIL VDLRSIDMTG KKADQLLSGV NITANKNTVP FEPESPFITS GLRLGSPAMT TRGLGATEFR EIGDIISDRL LDPGSDKVAK DCKQRVASLC NRFPLYPHLE IPQPVLA
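The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates the idea on the first 30 bases of the record. The function name `translate_cds` and the `ALT_STARTS` set are hypothetical helpers, and the start set is deliberately a small subset chosen for illustration rather than the full list defined for table 11.

```python
# Amino acids of the standard genetic code in T, C, A, G order
# (table 11 shares these codon assignments; it differs in its start codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: an illustrative subset of the initiation codons of table 11.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a coding sequence, reading an initial start codon as M."""
    # The record prints the sequence in blocks of 10 bases; drop the spaces.
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in ALT_STARTS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate_cds("GTGAATAATA ACAACTCAGA CCTACTTAAA"))  # MNNNNSDLLK
```

The output agrees with the first ten residues of the protein sequence in the record. Outside the first position, GTG is still read as valine.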