Gene Aazo_1090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1090 
Symbol 
ID9338886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1170741 
End bp1171886 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content42% 
IMG OID 
Producttransaldolase 
Protein accessionYP_003720565 
Protein GI298490388 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACCA ATCATTTATT AGAGATAAAA CAATACGGTC AAAGTATCTG GATGGATAAT 
TTGAGCCGTG AGATTATTGA ATCAGGGGAA CTCGAAAACT TGGTAAAAAA TCAAGGAATC
TCTGGTATTA CCTCCAATCC TGCCATCTTT GAAAAAGCAA TTACTGGTAA TGCCATTTAT
GATGCTGATA TTGAAGCCGG AATTCGTGAG GTTTTACCAA CATACAAAAT CTATGAATCA
CTAATTTTCG CAGATATCCG CAATGCTTGT GATATTTTGC GCCCTGTTTA TGAAGCCACG
AATGGTCTTG ATGGTTATGT GAGTATAGAA GTTCCACCAA CTATAGCTCA TGATACCCAA
GCAACAATAG CTGAAGCTCG TCGCTATTTC CAAGAGATTG GGCGAGAAAA TGTGATGATT
AAAATTCCTG GGACTGAGGC GGGTTTACCG GCAATAGAAC AAGCAATATT CGAAGGAATT
AACATCAACG TGACGCTGTT ATTTGCTGTC CAAAGTTACA TTAATACAGC TTGGGCGTAT
ATTCGTGGGT TGGAAAAGAG AGTGACTCAA GGTAGGGATA TCAGCAAAAT TGCTTCTGTA
GCCAGCTTTT TTCTCAGCCG GATTGATAGC AATATCGACG GTAAGATAGA TGCTAAATTG
CAGCGAGGCG TTGATGACAT TAATCATGAA GCGGTGCTGC GAGGGGTAAG AGGGAAAGTT
GCGATCGCCA ACGCCAAGAT AGCTTACCAA GAATACAAAA AAATCACCAG CACCGATGCC
TGGCAAGCCC TATCAACAAA AGGTGCAAAA GTCCAGCGGT TACTGTGGGC CAGCACCAGC
ACCAAAGACC CCAGTTACAG TGATGTCATG TACGTCGATC AACTAATTGG CAAAGACACA
GTGAACACCT TACCACCAGC TACTATAAAG GCTTGTGCTG ATCATTGTAA TGTAAGCGAT
TACCTGGAGA CAGGCACTTT AGAAGCTTAC ACCCTCATAG AAAGCTTGAA AGAACCGGAC
ATCAACATTG ATATTAATAC GGTAATGGAC GAACTACTCG CCGAAGGTAT TGATAAGTTT
GTCCAGCCCT TCCAGTCACT CATGAACTCT TTAGAAGGCA AAGTCAAGCT ATTGTCACCA
GTATAG
 
Protein sequence
MATNHLLEIK QYGQSIWMDN LSREIIESGE LENLVKNQGI SGITSNPAIF EKAITGNAIY 
DADIEAGIRE VLPTYKIYES LIFADIRNAC DILRPVYEAT NGLDGYVSIE VPPTIAHDTQ
ATIAEARRYF QEIGRENVMI KIPGTEAGLP AIEQAIFEGI NINVTLLFAV QSYINTAWAY
IRGLEKRVTQ GRDISKIASV ASFFLSRIDS NIDGKIDAKL QRGVDDINHE AVLRGVRGKV
AIANAKIAYQ EYKKITSTDA WQALSTKGAK VQRLLWASTS TKDPSYSDVM YVDQLIGKDT
VNTLPPATIK ACADHCNVSD YLETGTLEAY TLIESLKEPD INIDINTVMD ELLAEGIDKF
VQPFQSLMNS LEGKVKLLSP V