Gene Aazo_2074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_2074 
Symbol 
ID9339868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp2158041 
End bp2159135 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content43% 
IMG OID 
Productdihydroxyacetone kinase subunit DhaK 
Protein accessionYP_003721245 
Protein GI298491068 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.71364 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGC TGATTAATCA ACCAGAAGAC TTTGTAAAGG AAAGTCTAGC AGGAATGGCT 
GTGGCTCATG CTGATTTAAT TCAGGTAAAT TATGAGCCTA GTTTTGTGTA TCGAACTGAT
GCACCTGTAC AGGGAAAGGT AGCAATCATT TCTGGTGGTG GAAGTGGTCA TGAACCTATG
CATGTGGGTT TTGTGGGGAT GGGAATGCTT GATGCTGCTT GTCCTGGGGA AGTTTTTACT
TCACCGACTC CTGACCAAAT GTTAGCCGCA GCACAGCAGG TCGATGGTGG TGCTGGTATT
CTTTATATCG TTAAAAATTA TAGTGGCGAT TTGATGAATT TTGAAATGGC GACGGAGTTA
GCCAGAAGTG AAGGTATCCG CACGTTAAAT ATTATTATTG ATGATGATGT AGCGGTGAAA
GATAGTTTAT ATACGCAAGG AAGAAGAGGT GTAGGAACAA CTGTGCTGGC GGAAAAAATT
TGTGGAGCCG CTGCGGAACA GGGTTATAAT TTGCAGCAGT TAGCAAATTT GTGTAGAAAG
GTAAATCTGC ATGGACGCAG TCTAGGTGTG GCGTTGAGTT CTTGTACAGT CCCGGCAAAG
GGTACGCCGA CTTTTGCTTT GGGGGATAAT GAGATAGAAT TGGGAATTGG TATTCATGGA
GAACCAGGAA GAGAAAGGGT TTCTATGAAA TCAGGGGATG AGATTACAGA GATTTTAGTG
CGTTGGCTTT GCCCGTCGCA AGCATCGCTC ATTGATAATA TTGACTATAG TCGCACAGTG
CGAGAGTGGG ATGAAGCTCA AGAGGGATGG GTTGATGTAG AACTGTTAAA TAAACCCCTG
CAAAAAGGCG ATCAGATCTT AGCTTATGTT AACAGTATGG GAGGTACTCC CGTTTCTGAA
TTGTATCTTG TCTACCGCAA ACTAGCAGAA ATCTGTGAAC AGGAAGGACT GCAAATAGTG
CGAAATCTAA TTGGACCCTA CATGACATCA TTAGAAATGC AAGGTTGCTC CATCACACTG
CTGAAGTTAG ATGACGAGAT GCTGCGGTTA TGGGATGCAC CAGTAAAAAC AGCAAGTTTA
CGCTGGGGAG TATGA
 
Protein sequence
MKKLINQPED FVKESLAGMA VAHADLIQVN YEPSFVYRTD APVQGKVAII SGGGSGHEPM 
HVGFVGMGML DAACPGEVFT SPTPDQMLAA AQQVDGGAGI LYIVKNYSGD LMNFEMATEL
ARSEGIRTLN IIIDDDVAVK DSLYTQGRRG VGTTVLAEKI CGAAAEQGYN LQQLANLCRK
VNLHGRSLGV ALSSCTVPAK GTPTFALGDN EIELGIGIHG EPGRERVSMK SGDEITEILV
RWLCPSQASL IDNIDYSRTV REWDEAQEGW VDVELLNKPL QKGDQILAYV NSMGGTPVSE
LYLVYRKLAE ICEQEGLQIV RNLIGPYMTS LEMQGCSITL LKLDDEMLRL WDAPVKTASL
RWGV