Gene Aazo_2156 details

Gene Information

Locus tag: Aazo_2156
Symbol:
ID: 9339955
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 2240006
End bp: 2241586
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 44%
IMG OID:
Product: D-3-phosphoglycerate dehydrogenase
Protein accession: YP_003721293
Protein GI: 298491116
COG category:
COG ID:
TIGRFAM ID:
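
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2240006
END_BP = 2241586
GENE_LENGTH_BP = 1581
PROTEIN_LENGTH_AA = 526


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Number of codons in the CDS, minus one for the terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


print(span_length(START_BP, END_BP))            # compare with Gene Length
print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields listed in the record.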


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCAAGG TTCTTGTCTC TGATCCGATT GACCAAGCTG GTATTGACAT TCTTTCCCAA 
GTTGCTACGG TTGATGTCAA AACAGGTCTA AAACCAGCAG AACTAATAGA AATTATTGGT
GAGTATGACG CGCTAATGAT CCGTTCTGGA ACTCGCGTTA CTCAAGAGAT TATTGAAGCT
GGCACACAAT TAAAAATTAT TGGTCGTGCT GGTGTAGGTG TGGATAATGT GGATGTTCCT
GCTGCTACCC GCAAAGGTAT TATAGTAGTT AACTCTCCAG AGGGAAACAC GATCGCCGCT
GCTGAACACG CACTAGCAAT GATATTATCT TTATCCCGTC ATATCCCGGA TGCAAATGCT
TCCGTTAAAC GCGGTGAGTG GGATCGGAAA ACTTTTGTGG GTGCAGAAGT ATACAAAAAA
AATCTCGGTA TTGTTGGGTT AGGGAAAATT GGCTCCCATG TAGCGTCTGT AGCTAAGGCT
ATGGGGATGA AGCTATTAGC TTATGATCCC TTTATTTCTA CAGAACGGGC TGAACAAATG
GGTTGTCAGT TGGTAGATTT AGATTTGCTA TTCCAGCAAG CAGATTATAT TACTTTACAC
ATCCCTAAAA CTCCAGAAAC TACCAATTTA ATCAACGCTA AAACTTTGGC GAAGATGAAA
CCAACTGCTA GAATTATCAA CTGCGCTCGT GGTGGCATCA TTGATGAGTC AGCTTTGGCA
GCGGCGATTA AAGAAGGTAA AATCGGTGGT GCAGCGTTGG ATGTATTCGA TTCTGAACCC
CTGGGAGAGT CTGAGCTGCG ATCGCTCGGT AAAGATATTA TTCTTACTCC GCACTTAGGT
GCATCTACCA CAGAAGCCCA AGTTAATGTA GCCATAGACG TAGCTGAACA AATCCGTGAT
GTTATTTTAG GACTACCAGC CCGTTCTGCT GTTAATATTC CCGGACTCGG ACCCGATATC
TTGGAAGAAC TCAAACCCTA TATGCAGTTG GCGGAAACCT TGGGTAACTT GGTAGGACAA
CTAGCAGGAG GAAGGGTGGA AACACTGACT GTCAAACTAC AAGGGGAACT GGCAACTAAT
AAGAGTCAGC CTTTAGTAGT AGCAGCCCTG AAAGGACTAC TATATCAAGC TTTGCGGGAA
CGAGTAAATT ACGTCAACGC TAGCATAGAA GCCAAAGAAA GGGGTATTCG CGTTATTGAA
ACAAGGGATG CTTCAGCACG AGATTATGCA GGCTCACTGC ATTTAGAAGC TACAGGTACT
TTGGGCACTC ATTCTGTCAC AGGTGCTTTG TTGGGTGATA AAGAAATCCA CCTAACTGAT
GTTGACGGCT TCCCCATTAA CGTTCCACCT AGCAAATATA TGCTGTTCAC TCTCCACCGT
GATATGCCAG GAATTATTGG TAAACTCGGT TCTCTATTGG GTAGTTTTAA TGTCAATATT
GCCAGTATGC AGGTAGGACG AAAAATTGTC CGTGGTGATG CGGTCATGGC TTTAAGTATT
GATGATCCTT TACCCGATGG CATTTTGGAT GAAATTACAA AAGTACCCGG CATTCGAGAT
GCATATACAG TAACACTTTA A
 
Protein sequence
MSKVLVSDPI DQAGIDILSQ VATVDVKTGL KPAELIEIIG EYDALMIRSG TRVTQEIIEA 
GTQLKIIGRA GVGVDNVDVP AATRKGIIVV NSPEGNTIAA AEHALAMILS LSRHIPDANA
SVKRGEWDRK TFVGAEVYKK NLGIVGLGKI GSHVASVAKA MGMKLLAYDP FISTERAEQM
GCQLVDLDLL FQQADYITLH IPKTPETTNL INAKTLAKMK PTARIINCAR GGIIDESALA
AAIKEGKIGG AALDVFDSEP LGESELRSLG KDIILTPHLG ASTTEAQVNV AIDVAEQIRD
VILGLPARSA VNIPGLGPDI LEELKPYMQL AETLGNLVGQ LAGGRVETLT VKLQGELATN
KSQPLVVAAL KGLLYQALRE RVNYVNASIE AKERGIRVIE TRDASARDYA GSLHLEATGT
LGTHSVTGAL LGDKEIHLTD VDGFPINVPP SKYMLFTLHR DMPGIIGKLG SLLGSFNVNI
ASMQVGRKIV RGDAVMALSI DDPLPDGILD EITKVPGIRD AYTVTL
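
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch, not the database's own pipeline: it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ only in which start codons are permitted), and it translates only the first and last sequence blocks shown above. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Standard codon assignments in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and final 21 bases of the gene sequence listed above.
first_block = "ATGTCCAAGG TTCTTGTCTC TGATCCGATT"
last_block = "GCATATACAG TAACACTTTA A"

print(translate(first_block))  # MSKVLVSDPI
print(translate(last_block))   # AYTVTL
```

The two printed fragments match the start and end of the protein sequence listed above. The final codon of the gene, TAA, is a stop codon and is not represented in the protein.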