Gene Aazo_0840 details


Gene Information

Locus tag: Aazo_0840
Symbol:
ID: 9338628
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 891246
End bp: 892889
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 39%
IMG OID:
Product: thiamine pyrophosphate protein domain-containing protein TPP-binding protein
Protein accession: YP_003720383
Protein GI: 298490206
COG category:
COG ID:
TIGRFAM ID:
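
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 891246
end_bp = 892889
gene_length_bp = 1644
protein_length_aa = 547

# Assumption: coordinates are 1-based and inclusive.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length, computed_protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length and Protein Length.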


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.923558
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATACGG CAGATTTGTT AGTACAGTGT TTAGAAAATG AAGGAGTGCA ATATGTTTTT 
GGACTGCCAG GTGAGGAAAA TTTGCACGTT TTAGAAGCTT TAAAAAACTC ATCCATTCAA
TTTATTACTA CTCGTCACGA ACAGGGTGCA GCTTTCATGG CGGATGTTTA TGGGAGGTTA
ACTGGTAAAG CCGGAGTCTG TCTTTCCACT CTTGGTCCTG GTGCTACTAA TTTAATGACT
GGGGTTGCAG ATGCTAACCT TGATGGTGCA CCATTAGTAG CAATTACCGG ACAGGTGGGA
ACAGATAGAA TGCATATTGA ATCCCATCAA TATTTAGATT TAGTGGCTAT GTTTGCGCCA
GTTACTAAGT GGAATAAGCA GATAGTTAGA CCGAGTATTA CACCAGAAGT TGTGAGAAAA
GCATTCAAGC GCTCGCAAAC TGAAAAACCT GGTGCAGTCC ACATAGATTT ACCCGAAAAT
ATTGCTGCTA TGCCCGTAGA AGGCAAACCT TTACATAAGG ATAACAGCGA AAAAACCTAT
GCTGCTTTTG CTAGTATTCG CGCTGCTGCT GCCATAATTT CTCAAGCAGT TAATCCCATT
ATCTTAGTGG GAAATGGGGC GATTCGCGCT CAAGCTAGTG ATGCGGTGAC GCAATTCACC
ACCCAAATAA ATATTCCAGT CGTTAATACT TTCATGGGTA AAGGCGTAAT TCCCTACACT
CATCCTTTAG CACTTTATTC TGTAGGATTA CAACAAAGAG ATTTCATTAC TTGTGGTTTT
GATAATACCG ATTTAGTAAT TGCAATTGGC TATGATTTAA TTGAATTTTC TCCCAAAGAA
TGGAATCCTG ACGGCAAAAT TCCTATTATC CATATTGCTG CTATTTCAGC AGAAATTGAT
AGTAGTTACA TTCCTAAAGT CGAAGTTATT GGGGATATTT CTGATTCAGT TAATGAAATA
TTAAAATTAG CAGACAGACA AGGAAAACCC AATCCCTATG CCATCAGTTT ACGTTCTAAT
ATTCGCGCTG ATTACGAACA ATATGCCCAT GATGATGGCT TCCCAATAAA ACCGCAAAGA
TTAATTTATG ATTTGCGGCA AGTGATGGGA CCAGATGATA TTGTCATTTC TGATGTAGGT
GCACATAAAA TGTGGATTGC TAGACATTAT CATTGTCATA GTCCTAATAC GTGCATTATT
TCCAATGGAT TTGCAGCAAT GGGAATTGCC ATTCCTGGGG CTTTAGCTGC TAAACTTGTC
TATCCAGATC GTAAAGTTGT AGCAGCTACA GGCGATGGTG GCTTTATGAT GAACTGTCAA
GAATTAGAAA CAGCTTTGCG TGTTGGTACA CCTTTTGTTA CCTTAATTTT CAATGACGGT
GGCTATGGTT TAATTGAATG GAAACAAGAA AATCAATTTG GTAAAGGTAA TTCATGTTTT
GTGCATTTTG GTAATCCTAA TTTTGTCAAA TTAGCCGAAA GTATAGGATT AAAAGGTTAC
AGGGTTGAAT CAGCAACTGA TTTAATTCCT GTCGTCAAAG AAGCCCTAAT TCAAGATGTT
CCTGCGGTAA TAGATTGTCC TGTAGATTAT CGAGAAAACC GCCGTTTTAG TCAAAAAGCT
GGGGAGTTAA ATTGTGATAT TTAA
 
Protein sequence
MNTADLLVQC LENEGVQYVF GLPGEENLHV LEALKNSSIQ FITTRHEQGA AFMADVYGRL 
TGKAGVCLST LGPGATNLMT GVADANLDGA PLVAITGQVG TDRMHIESHQ YLDLVAMFAP
VTKWNKQIVR PSITPEVVRK AFKRSQTEKP GAVHIDLPEN IAAMPVEGKP LHKDNSEKTY
AAFASIRAAA AIISQAVNPI ILVGNGAIRA QASDAVTQFT TQINIPVVNT FMGKGVIPYT
HPLALYSVGL QQRDFITCGF DNTDLVIAIG YDLIEFSPKE WNPDGKIPII HIAAISAEID
SSYIPKVEVI GDISDSVNEI LKLADRQGKP NPYAISLRSN IRADYEQYAH DDGFPIKPQR
LIYDLRQVMG PDDIVISDVG AHKMWIARHY HCHSPNTCII SNGFAAMGIA IPGALAAKLV
YPDRKVVAAT GDGGFMMNCQ ELETALRVGT PFVTLIFNDG GYGLIEWKQE NQFGKGNSCF
VHFGNPNFVK LAESIGLKGY RVESATDLIP VVKEALIQDV PAVIDCPVDY RENRRFSQKA
GELNCDI
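
As a rough consistency check between the gene and protein sequences above, the sketch below translates the first and last stretches of the gene sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons). The names `CODON_TABLE` and `translate` are illustrative helpers, not part of any database tooling.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 24 bases of the gene sequence listed above.
first_block = "ATGAATACGG CAGATTTGTT AGTACAGTGT"
last_block = "GGGGAGTTAA ATTGTGATAT TTAA"

print(translate(first_block))  # MNTADLLVQC
print(translate(last_block))   # GELNCDI*
```

The two results match the start and end of the listed protein sequence, with the trailing `*` corresponding to the TAA stop codon.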