Gene Aazo_0368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_0368 
Symbol 
ID9338153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp369770 
End bp371461 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content45% 
IMG OID 
Productdihydroxy-acid dehydratase 
Protein accessionYP_003720062 
Protein GI298489885 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGAAA ATATTAAGAG TAAATTTATC ACACAAGGGG TACAGCGATC GCCTAACCGG 
GCTATGTTGC GGGCTGTTGG TTTTCAAGAT GCAGACTTCA ACAAAGCCAT AGTCGGTATT
GCTAATGGTT ACAGTACCAT CACCCCCTGT AATATGGGGA TTAACCAACT AGCACAAAGG
GCAGAAATGA GCCTCAGAAA TGCAGGTGCA ATGCCCCAAG TTTTCGGCAC AATTACCATC
AGTGATGGGA TTTCAATGGG AACAGAGGGG ATGAAGTTCT CTCTGGTGTC ACGGGAAGTG
ATCGCAGATT CCATTGAAAC TGCATGTACT GGCCAAAGTA TGGATGGTGT TCTGGCTATT
GGTGGTTGTG ACAAAAATAT GCCAGGGGCA ATGTTAGCAA TCGCTCGCAT GAATATCCCT
GCTATCTTTG TTTATGGTGG CACAATCAAA CCCGGACACT ACGATGGCAA AGATTTAACC
GTTGTTAGTT CCTTTGAAGC CGTCGGCCAA CACAGCGCCG GCAAAATTGA CGAAACCGAA
CTTTTAGAAA TTGAACGCCG TGCTTGTCCT GGTGCTGGGT CCTGTGGTGG AATGTTCACA
GCTAATACCA TGTCTTCAGC CTTTGAAGCA ATGGGAATGA GTTTACCTTA TTCCTCAACC
ATGGCCGCAG AAGATGCCGA AAAAGCGGAT AGCACAGAGA AATCCGCCTT TATCTTAGTC
GAAGCCATCC GTAAGCAAAT ATTACCCCGG CAACTTATCA CCCGGAAATC TATCGAAAAT
GCCATATCTG TAATTATGGC CGTTGGTGGT TCTACCAATG CAGTTTTACA TTTTTTAGCG
ATCGCCCGCG CAGCTGGTGT AGAACTAACT TTAGACGACT TTGAAACCAT CCGGGCCCGT
GTTCCAGTTT TGTGCGATTT AAAACCCAGT GGTAGATACG TAGCCACAGA CTTGCATAAA
GCTGGTGGTA TTCCTCAAGT CATGAAAATC TTATTAGTTC GTGATTTACT GCATGGTGAC
TGTCTAACTA TCTCTGGTCA AACTGTAGCC GAAGTCTTAG CAGACATACC AGCAGAACCA
TCACCCAAGC AAAATGTAAT TCGTCCTTGG GATCGTCCCA TCTATGCACA AGGACATTTA
GCTATTCTCA AAGGTAACTT AGGTACTGAA GGTGCAGTTG CTAAAATTAC TGGTGTGAAA
AAACCCATCA TCACCGGGCC AGCGAGAGTA TTTGAATCAG AAGAATCTTG CTTAGATGCA
ATTTTAGCAG GTAAGATTAA AGCAGGTGAT GTGATCATCA TCCGTTACGA AGGTCCAAAA
GGTGGGCCTG GTATGCGGGA AATGTTGGCT CCCACCTCAG CAATTATTGG TGCGGGATTA
GGTGATTCTG TGGGCTTAAT TACGGATGGA CGCTTTTCTG GCGGTACTTA TGGGATGGTA
GTTGGTCACG TCGCTCCAGA AGCTGCAGTC GGCGGTAATA TTGCCTTGGT AGAAGAAGGT
GATAGTATCA CCATTGATGC TAATTCTCGA TTATTACAAG TGAATATATC GGATGCAGAA
TTAGCTAGTC GTCGTGCTAA CTGGCAACCG CGTCCACCAC GTTATACAAA AGGGGTGCTG
GCGAAATATG CCAAGTTGGT ATCTTCTAGT AGTGTTGGTG CTGTTACAGA CTTAGATTTG
TTTGGTAATT AA
 
Protein sequence
MSENIKSKFI TQGVQRSPNR AMLRAVGFQD ADFNKAIVGI ANGYSTITPC NMGINQLAQR 
AEMSLRNAGA MPQVFGTITI SDGISMGTEG MKFSLVSREV IADSIETACT GQSMDGVLAI
GGCDKNMPGA MLAIARMNIP AIFVYGGTIK PGHYDGKDLT VVSSFEAVGQ HSAGKIDETE
LLEIERRACP GAGSCGGMFT ANTMSSAFEA MGMSLPYSST MAAEDAEKAD STEKSAFILV
EAIRKQILPR QLITRKSIEN AISVIMAVGG STNAVLHFLA IARAAGVELT LDDFETIRAR
VPVLCDLKPS GRYVATDLHK AGGIPQVMKI LLVRDLLHGD CLTISGQTVA EVLADIPAEP
SPKQNVIRPW DRPIYAQGHL AILKGNLGTE GAVAKITGVK KPIITGPARV FESEESCLDA
ILAGKIKAGD VIIIRYEGPK GGPGMREMLA PTSAIIGAGL GDSVGLITDG RFSGGTYGMV
VGHVAPEAAV GGNIALVEEG DSITIDANSR LLQVNISDAE LASRRANWQP RPPRYTKGVL
AKYAKLVSSS SVGAVTDLDL FGN