Gene Aazo_4898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4898 
Symbol 
ID9342705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp5010588 
End bp5011871 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content43% 
IMG OID 
Productglycine hydroxymethyltransferase 
Protein accessionYP_003723157 
Protein GI298492980 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATAATA ACAACTCAGA CCTACTTAAA TCCGCCGATT CTGCTGTTAG CGAGTTAATT 
AACCAAGAAC TACAGCGTCA ACGTGACCAC TTAGAGTTAA TTGCCAGTGA GAACTTCACC
TCGCCTTCTG TATTAGCGGC TCAAGGCTCT ATATTAACCA ATAAGTATGC AGAAGGCTTA
CCAGGTAAAC GCTATTATGG CGGTTGTGAA TTCGTTGACA AAATTGAGCA AATAGCCATT
GACAGAGCTA AACAGTTATT TGGTGCTGCT CATGCTAACG TTCAACCCCA TTCTGGCGCA
CAAGCTAACT TTGCTGTGTT TCTAACTTTG TTAGAACCAG GGGACACAAT TATGGGCATG
GACTTGTCAC ACGGTGGACA CCTGACCCAT GGTTCACCAG TTAATGTTTC CGGTAAGTGG
TTTAAAGTAC GTCACTATGG CGTGAGTAGG GAAACAGAAC AACTGGACTA TGACCAAATC
CGTGATTTAG CCCTGAAAGA ACGTCCTAAG CTGTTAATTT GTGGTTATTC AGCTTATCCC
CGGATTATAA ACTTTGAAAA ATTCCGCAGC ATTGCTAATG AAATAGGTGC TTACCTGCTT
GCCGATATTG CTCATGTTGC TGGCTTAGTG GCTACAGGAC ATCATCCCAA CCCGCTTCCT
TATTGTGATG TAGTAACAAC AACAACTCAC AAAACTTTAC GGGGTCCCAG AGGAGGTTTG
ATTTTAACCC CTGATCCAGA ACTGGGTAAA AAGCTGGATA AATCAGTTTT CCCTGGCACT
CAAGGAGGGC CATTAGAACA CGTTATTGCT GGTAAAGCAG TAGCTTTTGG TGAAGCTCTC
AAGCCAGAGT TTACAACCTA TTCTGGTGAA GTAATTGAAA ATGCTCGCGC TTTGGCTACC
CAACTACAAA ATAGGGGGTT AAAGCTAGTA TCAGATGGGA CTGATAATCA TTTAATATTA
GTAGATTTAC GTTCTATCGA CATGACCGGT AAGAAAGCTG ATCAGCTACT TAGCGGTGTG
AATATTACTG CTAATAAAAA TACTGTACCT TTTGAGCCAG AATCACCATT TATTACCAGT
GGTCTCAGAC TAGGTTCACC GGCAATGACA ACAAGGGGTT TAGGTGCAAC AGAATTTAGG
GAAATTGGTG ATATTATTAG CGATCGCTTA CTTGACCCAG GTTCAGATAA AGTAGCCAAG
GATTGTAAGC AACGAGTAGC ATCATTGTGC AATCGCTTCC CCTTGTATCC TCATTTAGAA
ATTCCCCAGC CAGTTCTAGC GTAA
 
Protein sequence
MNNNNSDLLK SADSAVSELI NQELQRQRDH LELIASENFT SPSVLAAQGS ILTNKYAEGL 
PGKRYYGGCE FVDKIEQIAI DRAKQLFGAA HANVQPHSGA QANFAVFLTL LEPGDTIMGM
DLSHGGHLTH GSPVNVSGKW FKVRHYGVSR ETEQLDYDQI RDLALKERPK LLICGYSAYP
RIINFEKFRS IANEIGAYLL ADIAHVAGLV ATGHHPNPLP YCDVVTTTTH KTLRGPRGGL
ILTPDPELGK KLDKSVFPGT QGGPLEHVIA GKAVAFGEAL KPEFTTYSGE VIENARALAT
QLQNRGLKLV SDGTDNHLIL VDLRSIDMTG KKADQLLSGV NITANKNTVP FEPESPFITS
GLRLGSPAMT TRGLGATEFR EIGDIISDRL LDPGSDKVAK DCKQRVASLC NRFPLYPHLE
IPQPVLA