Gene Aazo_0419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_0419 
Symbol 
ID9338204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp428252 
End bp430144 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content44% 
IMG OID 
Producttransketolase central region 
Protein accessionYP_003720094 
Protein GI298489917 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.218492 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACCC AAGAGCAACT ATATCAATGG CAAGAACTAG CTCAACAGTT GCGTGTGGAT 
AGTATTCGCG CTACAACGAT CGCAGGTTCA GGTCATCCTA CTTCTTCTAT GTCTTCCGCT
GATTTGATGG CGGTTTTTCT ATCTAAATAT CTTCGCTACG ATTTTGATCA TCCAGAAAAT
CCGAATAGCG ATCGCTTTAT TCTTTCCAAA GGACACGCTG CACCTCTACT ATATTCCATG
TATAAAGCTG CGGGGGTCAT TTCTGACCAA GAATTACTAT CATTGCGTCA GTCTGGTAGC
CGTTTAGAAG GTCATCCCAC ACCAATTTTA CCTTGGGTGG ATGTGGCAAC AGGTTCTCTC
GGACAAGGTT TACCCATTGG TGTGGGGGTG GCTTTGGCAG GTAAATATTT AGACCAATTA
CCTTATAATG TTTGGGTATT ATTGGGGGAT GGTGAAACGG CTGAAGGTTC GATTTGGGAA
GCTTTTGATC ATGCTTCTCA CTACACATTA GATAATCTGC TCGCCATTAT TGATGTCAAC
CGGCTTGGTC AACGTGGTCA AACTGAATTA GGCTGGAATA CACAAGCTTA TGGCAATCGT
GCTAAGGCTT TTGGTTGGCA AGCAATAGAA ATTGATGGTC ATAATTTAAC AGAAATTGAC
CAAGCTTTTA GTGCAGCCGT GGCTATAAAT GACCGTCCCA CGGTGATTAT TGCTAGGACA
AAGAAAGGTA AAGGTGTGAA GGCTTTAGAA GATTTAGGTG GTTGGCATGG TAAAGCACTG
AAACAGGATC AAGAACAACA AGCTATTACG GAACTAGGTG GAGAACGTCA CATTACCATT
ACCGTTGATA AACCAGAAGA ACAAAGCCAA CCCGCTACAC TGGGAGTACC TCAACCCCTA
CAACTTCCCA TATATCAAAA AGGCAATAAA GTAGCCACCC GTCGCGCTTA TGGAGATGCT
TTATTAGCTT TAGGCGCATC GCAACCTGAT GTGGTTGCTC TTGATGCGGA GGTGAGTAAT
TCCACTTATG CGGAAGATTT CGCCGAAGCT TTTCCAGAAC GCTACTTTGA GATGTACATT
GCTGAACAGC AAATGATAGC AGCCGCAGTA GGCTTGCAGG TCCGAAAATA CAAACCCTTT
GCTTCTACTT TTGCAGCTTT TTTAACTCGT GGTTACGACT TTATTAGGAT GGGTGCGGTA
TCTCGTGCCA ACATTAAGTT AGTTGGTTCT CATGGGGGTG TCTCCATTGG TCAAGATGGT
GCTTCCCAAA TGGGATTAGA AGATTTAGCA GCTTTTCGCG CTGTGTGTAA TAGCACTGTA
TTGTATCCCA GTGATGCTAA TCAGACTGCT AAACTAGTAC CACAGATGAG TAATGCCCCT
GGTATAGTTT ACCTCCGCAC CACCAGAGAA AGCACACCTG TAATTTATGG TAGTGAAGAA
CAATTTTCCA TTGGTGGCAG CAAAGTTATC CACCGGTCCG AGCGCGACCA AGCCACAATT
ATTGCCGCAG GTATCACTGT ACATGAAGCC CTCAAAGCTT ATGACAGATT GAAAAATGAA
GGGATCACAG CCCGTATTAT TGATGCCTAT TCCGTTAAAC CCATTGATGT GCAAACACTA
CATCAAGCAG CAAAAGATAC CAACGGTAAT TTAGTAGTTG TAGAAGATCA TTGGCCAGAA
GGAGGATTAG GTGCGGCTGT CTTAGATGCC TTTGCTGGTA ATAGTACCAC CCCTGCCTAC
AAAATTCCGC AATTACAGAT TATTAAACTT GCAGTTCAAA ATATGCCAAC TTCTGGAACT
CCTGAAGAAC TACTCCATGC TGCTAAAATT GATGCAGATG CCATTGTAGA AGTTGTGAAA
TCACAAGTTA GGCGACTGGT AGGAGTATCT TAG
 
Protein sequence
MTTQEQLYQW QELAQQLRVD SIRATTIAGS GHPTSSMSSA DLMAVFLSKY LRYDFDHPEN 
PNSDRFILSK GHAAPLLYSM YKAAGVISDQ ELLSLRQSGS RLEGHPTPIL PWVDVATGSL
GQGLPIGVGV ALAGKYLDQL PYNVWVLLGD GETAEGSIWE AFDHASHYTL DNLLAIIDVN
RLGQRGQTEL GWNTQAYGNR AKAFGWQAIE IDGHNLTEID QAFSAAVAIN DRPTVIIART
KKGKGVKALE DLGGWHGKAL KQDQEQQAIT ELGGERHITI TVDKPEEQSQ PATLGVPQPL
QLPIYQKGNK VATRRAYGDA LLALGASQPD VVALDAEVSN STYAEDFAEA FPERYFEMYI
AEQQMIAAAV GLQVRKYKPF ASTFAAFLTR GYDFIRMGAV SRANIKLVGS HGGVSIGQDG
ASQMGLEDLA AFRAVCNSTV LYPSDANQTA KLVPQMSNAP GIVYLRTTRE STPVIYGSEE
QFSIGGSKVI HRSERDQATI IAAGITVHEA LKAYDRLKNE GITARIIDAY SVKPIDVQTL
HQAAKDTNGN LVVVEDHWPE GGLGAAVLDA FAGNSTTPAY KIPQLQIIKL AVQNMPTSGT
PEELLHAAKI DADAIVEVVK SQVRRLVGVS