Gene Aazo_1304 details

Gene Information

Locus tag: Aazo_1304
Symbol:
ID: 9339099
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 1380023
End bp: 1383022
Gene Length: 3000 bp
Protein Length: 999 aa
Translation table: 11
GC content: 37%
IMG OID:
Product: HAD superfamily ATPase
Protein accession: YP_003720702
Protein GI: 298490525
COG category:
COG ID:
TIGRFAM ID:
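
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1380023
end_bp = 1383022

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed gene length (3000 bp) and protein length (999 aa).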


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGATTCAAC CAATACACGC GAGTATCAAA GGGAGAACTA GATTTAAAAT TAAGGAACTT 
TATCGTTCAC CATCTTTGAA AAGTTATTTA GAGCGATCAA TTGTAAATTT CGCGGAAATA
AGATATATTT CTGCTAATAT ATTAACTAGC AATATCTTGG TTATCTTCGA TCAGGATATT
AGCTCTAAAG AAATAGAATT TTTAATTCGA AAGGTCTTAC TTAATTACCA AGATGAGTCC
GGTTTTAACC ATAAAAATAA AAAATTAGTA AGAACTGCCA AAGCCAAAAC AGATCGACAA
CTTGCTGTCA AGCAAGAGCA AAAAACTAAA AACTGGCATT TGATGGCAGT AGAACAGGTT
TTAACATCAT TAAATACCTC AAATATCTCA GGATTATCCA ATGAAACAGC TACTAATAAC
CTCAAAAAAT ATGGAAAGAA TGTGTTGTCA ACAATAGTGA AACGTTCTCA ACTATCTATG
TTGCTGGATC AGTTTAAATC ATTACCAGTT GCTTTACTAG TTATTGCATC TGGCATTTCT
ATTGCTACTG GGGGAATGAT TGATGCTGTA GTAATTTTGG GTGTAGTTGG TTTAAATGCT
GCAATTGGTT ACGCCACAGA AAGTCAGTCA GAACGCATTA TTAACTCTCT AAAGAATAGG
GCTGAACAAT CAACTCTGGT AATTAGAGAT AGCTATCAAA CAGAAATACC CACAGAAAAT
GTAGTTCTAG GAGACATTTT AGTTCTGAAC CCTGGTAGTT ATATTGCCGC AGATGCCCGC
ATTATTACAA CGGATAACTT AAATGTTGAT GAATCCGCCT TAACTGGAGA AAGTCTGCCT
ATAACTAAAA TAAGTACATG TTTAAATGGT GAAAATGTAG CTTTAGCAGA TCGTTTAAAT
ATGATTTATA AAGGCACATT AGTTATCAAT GGAAAGGGAT TAGCGGTGGT AGTTGCTACA
GGCGACTTTA CTGAAATGGG CGAAATTCAG CAATTAGTTG GTGAAGCAAC AACAATCCAA
ACTCCCCTTG CTAAACAGTT AGATCAAGTA GGTAGTCAAA TAGTTTTTAT CGGCATGGGA
TTATGTGGTT TAGTCTTTAG TTTGGGAATG TTACGAGGAT ATGGTTTATT ACCAATGTTA
CAATCTTCTA TATCTTTAGC AGTTGCAGCA GTTCCCGAAG GCTTACCGAC AATTGCCATC
ACTACCCTAG CTTTAGGCAT TCGTGATATG CGGAAGAATC ACGTTCTTGT CCGCAGTTTA
AATGCAGTAG AAGCGTTAGG TTCGGTACAG ACTATTTGTT TAGATAAAAC CGGGACACTC
ACAGAAAATA AAATGTCGGT AGTAGAAATT ATCACTAACA GTAAATATAT TCAAGTAAGT
AATGGTAACT TTATTGATGG TGAAGAAACG ATTAATCCCT ATACCTATAA TGAATTATTA
AAATTAATTC ATGTTTCAGT TCTTTGTAAT GAAAGTGAAG TCAGTAAACA AAATGACGAG
TATGTAGTTA CAGGTTCAGC AACAGAAAAT GCCTTGATTT ACACTGCAAT TAGCGCGGGG
GTAGATGTTA TTACATTGAA AGGAAAATAT CCCTTATTAC AAACTAATTT GCGTTCGGAA
AATCGCAACC TGATGAGTAC AATTCACGCA ACACATAATT GTCACCAGAT GGTTGCTGTT
AAAGGAAATC CTGCGGAAGT CTTGCAAATC TGCCACAGAT GGATGAAAAG CGGGCAAATA
GTACCTTTGA CCGCAGAAGA TCGGCAGAAA ATGGACATGG AAAATGATCG CATGGCAGGA
AAAGCCTTGC GAGTTTTAGG GATAGCTTAT GGTTATATTG AGGAAGGTGA CAACAGCAAT
AATCATCATG AATCAGATTT AATTTGGTTG GGTTTGGTAG GGATGGCAGA TCCTCTGAGA
ACAGGTGCAA AAAAATTAAT TGCAGACTTC CACCAAGCGG GAATAGATAC GGTAATGATT
ACTGGAGATC AAAGTCCTAC AGCTTATGCG ATCGCTAAAC AGTTAGAATT AAATAGACAT
ACCCAATTAG AAATTCTCGA CTCCAACAAC CTCAATAATC TCACCCCCGA AGCATTAGCA
GCACTCAGCG ACAAAGTAGA CGTTTTTGCC CGTATCAGTC CCAGTAATAA ATTACAAATA
GTTCAAGCAT TGCAGACATC CGGTAAAGTC GTCTCCATGA CCGGTGATGG CATTAACGAC
GCACCAGCAT TAAAAGCTGC CCAAATCGGT GTGGCAATGG GTAAAGAAGG GACTGATGTT
GCCCGTGAAG TGGCAGATAT TATCTTAGAA GATGACAGAT TAGAAACAAT GATGATTGCT
GTCAGCCGAG GTAGAACAAT TTACAACAAC ATCAGAAAAT CGGTCCATTT TCTCCTAGCT
ACAAATCTCA GCGAAATCAT GGTGATGATA ACCGCTACCG CAGTCGGTAT TGGTGAACCT
TTAAATGCAA TTCAACTTTT ATGGTTAAAC TTAGTAACTG ATATTTTCCC TGGACTTTCT
TTAGCTATGG AGGCACCAGA ACCAGACGTA TTAAATCAAC CACCACGTGA TCCCAATGAA
CCAATTATTA AAAATACCGA CTTTGGCAGA ATTGTATTTG AGTCTGCTGC TATATCAATT
AGTACCTTAG CTGCCTATGG TTATGGTATT TTTAAATATG GTATTAGTCC TAAAGCCAGG
GCTTTAGCAT TTATGACTTT AACATCAGGA CAATTACTAC ATACTATTAG TAGTCGTTCT
GAAAAACACA GTATTTTTAG CAAAGAAAAA TTACCACCTA ATCCTTATTT AAATGCTGCT
ATTTTAGGTT CTTTTGGTAT TCAGTTGTTA ACTTTAACCG TGCCACAGTT GCGAGGTTTA
TTGAAGATTA CCCGATTGAA TATAGTCGAT ATTGCTGTAA TTACTAGTGG TGCTTTATTA
CCGCTATTGG TGAATGAAGG AACTAAAAAT ATTCAAATCA CAGGAAAGAA TTCAACTTGA
 
Protein sequence
MIQPIHASIK GRTRFKIKEL YRSPSLKSYL ERSIVNFAEI RYISANILTS NILVIFDQDI 
SSKEIEFLIR KVLLNYQDES GFNHKNKKLV RTAKAKTDRQ LAVKQEQKTK NWHLMAVEQV
LTSLNTSNIS GLSNETATNN LKKYGKNVLS TIVKRSQLSM LLDQFKSLPV ALLVIASGIS
IATGGMIDAV VILGVVGLNA AIGYATESQS ERIINSLKNR AEQSTLVIRD SYQTEIPTEN
VVLGDILVLN PGSYIAADAR IITTDNLNVD ESALTGESLP ITKISTCLNG ENVALADRLN
MIYKGTLVIN GKGLAVVVAT GDFTEMGEIQ QLVGEATTIQ TPLAKQLDQV GSQIVFIGMG
LCGLVFSLGM LRGYGLLPML QSSISLAVAA VPEGLPTIAI TTLALGIRDM RKNHVLVRSL
NAVEALGSVQ TICLDKTGTL TENKMSVVEI ITNSKYIQVS NGNFIDGEET INPYTYNELL
KLIHVSVLCN ESEVSKQNDE YVVTGSATEN ALIYTAISAG VDVITLKGKY PLLQTNLRSE
NRNLMSTIHA THNCHQMVAV KGNPAEVLQI CHRWMKSGQI VPLTAEDRQK MDMENDRMAG
KALRVLGIAY GYIEEGDNSN NHHESDLIWL GLVGMADPLR TGAKKLIADF HQAGIDTVMI
TGDQSPTAYA IAKQLELNRH TQLEILDSNN LNNLTPEALA ALSDKVDVFA RISPSNKLQI
VQALQTSGKV VSMTGDGIND APALKAAQIG VAMGKEGTDV AREVADIILE DDRLETMMIA
VSRGRTIYNN IRKSVHFLLA TNLSEIMVMI TATAVGIGEP LNAIQLLWLN LVTDIFPGLS
LAMEAPEPDV LNQPPRDPNE PIIKNTDFGR IVFESAAISI STLAAYGYGI FKYGISPKAR
ALAFMTLTSG QLLHTISSRS EKHSIFSKEK LPPNPYLNAA ILGSFGIQLL TLTVPQLRGL
LKITRLNIVD IAVITSGALL PLLVNEGTKN IQITGKNST
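
The protein sequence corresponds to the gene sequence read under translation table 11. The following is a minimal sketch of that translation, assuming table 11 uses the standard amino-acid assignments and that an alternative start codon (here GTG) is read as methionine at the first position. The names `CODON_TABLE`, `START_CODONS` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of 10.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 60 bases of the gene sequence above.
first_block = "GTGATTCAAC CAATACACGC GAGTATCAAA"
last_block = "CCGCTATTGG TGAATGAAGG AACTAAAAAT ATTCAAATCA CAGGAAAGAA TTCAACTTGA"

print(translate(first_block))
print(translate(last_block))
```

Passing the full gene sequence to `translate` in the same way should reproduce the listed protein, subject to the assumptions above.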