Gene Aazo_1659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1659 
Symbol 
ID9339451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1726010 
End bp1727683 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content43% 
IMG OID 
Product3,4-dihydroxy-2-butanone 4-phosphate synthase 
Protein accessionYP_003720947 
Protein GI298490770 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCACAGC CTAAGACTAG CCAAGCCTTT GAATTTGATT CTATTGATGC CGCTTTAGCC 
GACCTCAAAG CGGGTCGTGT GATTGTGGTG GTAGATGACG AAAATCGAGA AAATGAAGGC
GATTTAATTT GTCCTGCCCA ATTTGCCACC CCCGACATGA TAAATTTCAT GGCGGTAGAA
GCCAGAGGGC TAATTTGTCT AGCCATGACG GGCGAGCGCC TGGATGAACT AGACTTACCA
TTAATGGTCA GTAACATCAC AGATACTAAC CAGACTGCTT TTACTGTCAG CATTGACGCT
GGTCCTCACT TGGGCGTTAG CACTGGCATT TCCGCAGAAG ACCGCGCCCG TACTATTCAA
GTTACCCTCA ACCCTGTCAC ACAGCCGCTG GATTTACGAC GTCCTGGGCA TATTTTCCCG
ATTCGGGCTA AAGCTGGAGG TGTACTCAAA CGGGCAGGAC ACACAGAAGC AGCAGTTGAT
TTAGCAAGAC TAGCAGGATT ATACCCCGCT GGGGTAATTT GTGAAATTCA AAATCCTGAT
GGTTCAATGG CAAGATTACA ACAGTTAATC GAATATGCTA AAACGCATAA ACTTAAAATT
ATTAGCATTG CCGATCTTAT CAGCTATCGT CTCCAAAACG ATCGCCTGGT GTATCGAGAA
ATCGTTACCA AGCTACCCAG TCAATTTGGA CAATTTGATA TTTACGGATA CCGCCACACT
TTAGATAATA CTGACCATGT AGCGATCATC AAGGGCGACC CAGCAAACTT TCAAGATGAG
CCTGTGATGG TGCGGATGCA CTCCGAATGT TTGACGGGTG ATGCTTTAGG TTCTTTGCGC
TGTGACTGTA GAATGCAGTT AAATTCCGCA TTGAAAATGA TTGAAAATGC TGGTCAAGGT
GTGGTTGTTT ACCTGCGTCA AGAAGGACGG GGTATAGGCT TGATTAATAA ACTCAAAGCC
TACTCGTTGC AGGATATGGG ACTTGATACC GTGGAAGCTA ATGAGCGTTT AGGTTTTCCT
GCTGATTTGC GCGATTATGG AATGGGGGCG CAAATACTCA TGGATTTGGG TATTAAAAAG
ATCCGTCTGA TTACCAATAA TCCTCGTAAA ATTGCTGGTG TCAAGGGCTA TGGTTTGGAA
GTTGTAGATC GAGTGCCTTT GTTAATTGAA GCCAATGATT ATAACTGTTC TTACCTAGCT
ACAAAAGCCA AAAAGTTGGG ACATATGTTG TTACAAACTT ATCTGGTGAC AGTAGCATTA
CACTGGGAAG ATGAACCAGA TTCAGTGACG CAACGCTATG AACGTTTAGA GAAACTTCGG
CATTTAGCAA AAACGAATCA TTTACTTTTA CAAGAAGAAG CACGTCCGTT ACGAGTTGCC
TTATTTGATA AGCCATCGTT AACTGTTCAT TTGGGATTTG ACCAGCCAAA AGTTGCTGAC
GCTAATTGGT ATGAACAGAA GGGTCATCCT TACTTGCAAG CAATTTGTCA GATTTTGGAT
CAATTACTAA ATTTACCCTA TGTCCAGAAG TTGGATTTCT TAATTTCTTC TGGTAGTGAT
CCTCTGACTA ATTTGCAAGT ACAGTTAGAT CGGCAGGCAT TTAATTCTGA TGTACTACCT
TCTTCTTTGA GCGACCACTT GCAGACACAG CAGATTTATA GTTTTAGTAA GTAG
 
Protein sequence
MSQPKTSQAF EFDSIDAALA DLKAGRVIVV VDDENRENEG DLICPAQFAT PDMINFMAVE 
ARGLICLAMT GERLDELDLP LMVSNITDTN QTAFTVSIDA GPHLGVSTGI SAEDRARTIQ
VTLNPVTQPL DLRRPGHIFP IRAKAGGVLK RAGHTEAAVD LARLAGLYPA GVICEIQNPD
GSMARLQQLI EYAKTHKLKI ISIADLISYR LQNDRLVYRE IVTKLPSQFG QFDIYGYRHT
LDNTDHVAII KGDPANFQDE PVMVRMHSEC LTGDALGSLR CDCRMQLNSA LKMIENAGQG
VVVYLRQEGR GIGLINKLKA YSLQDMGLDT VEANERLGFP ADLRDYGMGA QILMDLGIKK
IRLITNNPRK IAGVKGYGLE VVDRVPLLIE ANDYNCSYLA TKAKKLGHML LQTYLVTVAL
HWEDEPDSVT QRYERLEKLR HLAKTNHLLL QEEARPLRVA LFDKPSLTVH LGFDQPKVAD
ANWYEQKGHP YLQAICQILD QLLNLPYVQK LDFLISSGSD PLTNLQVQLD RQAFNSDVLP
SSLSDHLQTQ QIYSFSK