Gene Aazo_5179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_5179 
Symbol 
ID9342986 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp5303751 
End bp5304845 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content42% 
IMG OID 
Product3-dehydroquinate synthase 
Protein accessionYP_003723351 
Protein GI298493174 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.981519 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTCTG TAATTAAAGT AGATATACCA GGAAAATCTT ATGAAATTGT GATTGCACCT 
GGGAGTTTGG ATAACCTAGG TAAACAGATG GCGAGTTTGA AACTGGGTAA GAAGGTATTG
CTGGTTTCCA ACCCGATGAT ATTTAAACAT TATGGCGAAA GAGCAATCGC ATCTTTACAA
AATGCCGGCT TTGAGGTCAC AAGCTATAAC CTGCCACCAG GGGAACGCTA CAAAACCCTA
AACTCCATCC AAAAAATCTA TGATATTGCC CTCGACAACC GCCTAGAACG TTCCTCCACA
ATGGTGGCTT TGGGGGGTGG TGTAGTTGGC GATATGACTG GGTTTGCAGC TGCTACATGG
TTGAGAGGAA TTAACGTTGT CCAAATTCCT ACCAGCCTCT TAGCAATGGT AGATTCGGCT
ATTGGTGGTA AAACTGGGGT AAATCATCCG CACGGTAAAA ACTTAGTTGG CGCTTTCCAT
CAACCTAGCT TTGTCTTGAT TGATCCAGAA GTCTTAAAAA CCCTGCCAGC GCGTGAATTT
CGGGCGGGAA TGGCGGAGGT AATCAAGTAT GGCGTAATTT GGGACGCTGA ATTATTTACC
CAATTGGAAG CGAGTAAACA CCTTGACCAA CTCCGCTATG TAAAATCCGA CCTGATAAAT
TACATATTAA CTCATTCTTG TCAAGCAAAA GCAGATTGTA TCAGCAAAGA TGAAAAAGAA
TCTGGACTCC GTGCAATTTT GAATTATGGT CACACTATCG GTCATGCGGT GGAAAGCTTG
ACAAATTATC GTCTGTTCAA ACACGGTGAA GCTGTGGGTA CTGGCATGAT AGCAGCAGGA
GAAATTGCTG TGAAATTAGG ACTTTGGCAA AAAGCCAACA CAGAACGTCA AAACGCGCTG
ATTAAAAAAT CTGGTTTACC GACACAATTA CCAGCAGGTT TGGATATTCA AGCCATTATT
GATGCTTTGC AATTAGATAA AAAAGTCAAA TCAGGTAAAG TGCGGTTTGT GTTACCCACC
CAAATAGGTG AAGTGAAAGT CACAGACGAA GTACCCACAG ATATTATTAG GCAGGTATTA
CAGGAAATCC AATAA
 
Protein sequence
MSSVIKVDIP GKSYEIVIAP GSLDNLGKQM ASLKLGKKVL LVSNPMIFKH YGERAIASLQ 
NAGFEVTSYN LPPGERYKTL NSIQKIYDIA LDNRLERSST MVALGGGVVG DMTGFAAATW
LRGINVVQIP TSLLAMVDSA IGGKTGVNHP HGKNLVGAFH QPSFVLIDPE VLKTLPAREF
RAGMAEVIKY GVIWDAELFT QLEASKHLDQ LRYVKSDLIN YILTHSCQAK ADCISKDEKE
SGLRAILNYG HTIGHAVESL TNYRLFKHGE AVGTGMIAAG EIAVKLGLWQ KANTERQNAL
IKKSGLPTQL PAGLDIQAII DALQLDKKVK SGKVRFVLPT QIGEVKVTDE VPTDIIRQVL
QEIQ