Gene Aazo_1072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1072 
Symbol 
ID9338868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1146260 
End bp1149478 
Gene Length3219 bp 
Protein Length1072 aa 
Translation table11 
GC content36% 
IMG OID 
ProductSNF2-like protein 
Protein accessionYP_003720552 
Protein GI298490375 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAATCT TACACGGTAG CTGGTTACTA AAAAATCAAA ATAATTATTT ATTTATTTGG 
GGAGAAACTT GGCGAAATTC CCAGATAAAT TTAGAATTTA TAACTTTATC AAAATTGCCA
TTAAATCCCT TGGCTATGAT ACCGGAGGAA TTGAAAGCAT TCTTGGAAGC AAGTAATTTT
AAATTGCCTA ATAATATTAA TCAAAATTAC TCAGATGCTA ATAGCAGAAC TAAATTTAAA
AAAGGTAAAA ATACACCAGA AGAAATAGAC CTGCTGAATA CAACCCAAAT TATTGCTATT
TTAACAAATT TTATCGGCAA AGATACCATT TCGCCAGTAC ACTCTGCTAG TTTAGGCGTT
GATAAAAAAA CTCCACAATA TTTACAACCT TGGAAAGTAG AAGGTTTTTA TTTACAACCC
ACCACAGCCA TAAAATTTTT AACTTCTCTC CCATTGAGTT CAACCGATGG AAAAAACTCA
TTGTTGGGGG GAGATATATA TTTCTGGGTA CAAATAGCTC GTTGGAGTTT AGATTTAATC
TCACGCTGTA AGTTTTTACC AACAGTTAAA AAAAAAGCAG ATGGTTCTTT AGTTGCTGAA
TGGCAGGTAC TTTTAGACAG TGCTTTGGAT ACCACTAGGT TAGAAAGATT TGCTGCTAAA
ATGCCATTAA TTTGTCGTAC TTATCAAGAA GATAGGAGTG AAAATCTGCA ACATTTAGCA
GTTGATCTAC CATTACTACC AGAAGAATTA ATTTTAGAAT TTCTCACTAG CATCATAGAT
GCCCAAATAC GGGATATGCT TGCTTCTCAA TCTCCCATTG AAAATAGGTT AATGATGTCT
TTACCACCAG CCGTACAGCA GTGGTTACAG GCTTTAACTG GTGCATCTAA TAAAATTAAT
GACGATGCAA TGGGATTGGA AAGACTAGAT GCTGCACTCA AAGCTTGGAT TCTACCTTTA
CAATATCAAT TAACAGGCAA AGCTTCATTT CGTACCTGTT TTCAACTCCA TCCTCCAAAC
TCAGAAAAAA CAGATTGGAT ACTAGCTTAT TTTCTCCAAG CAGCAGATAA TTCTGAATTT
TTAGTAGATG CAGTAACTAT TTGGAAGTAT CCTGTAGAAA AATTAGTTTA TCAGAATCGG
ACAATTGAAC AGCCCCAGGA AACATTTTTA CGGGGTTTAG GTTTAGCTTC TCGATTGTAT
CCAGTTATAG CTCCTAGTTT AGAAACTGAA TCTCCGCAAT TTTGCCAACT CACCCCTATG
CAAGCTTATG AGTTTATCAA ATCTGTAGCG TGGAGGTTAG AAGATAACGG TTTAGGAGTA
ATTATACCAC CAAGTTTAGC CAATCGTGAA GGATGGTCTA ACCGCTTAGG ATTAAAAATT
AGTGCCGAAA CACCAAAGAA AAAACAGGGA CGTTTGAGTT TACAAAGTTT GTTAAATTTT
AAGTGGCAAT TAGCAATTAG TGGACAGACA ATTTCTAAAG CTGAGTTTGA TAAATTAGTA
GCGTTGAATA GTCCTTTAGT AGAAATTAAT GAAGAATGGG TTGAACTGCA TCCCCAAGAT
ATTAAAACTG CCCAAAATTT TTTTGCATCT CGTAAAGAAG AGATATCTTT ATCATTAGAA
GATGCTTTAA GGATTAGTAC AGGAGATACT CAAGTAATTG AAAAACTACC TGTAGTTACT
TTTGAAGCTT CTGGTGTATT GCAAGAATTA GTTGGGGTTT TAACTAATAA TCAAGATATT
CAACCTTTAC CAACTCCTAG CAATTTTCAC GGACAATTAC GACCTTATCA AGAACGAGGT
GCAGCTTGGT TGGCTTTTCT GGAACGTTGG GGTTTGGGTG CCTGTCTTGC GGACGATATG
GGACTGGGTA AAACGATCCA GTTCATCGCT TTCCTACTTC ATCTTCAAGA ACAGGATGCA
CTAGAAAATC CAACTCTCTT AGTTTGCCCT ACTTCTGTTT TAGGTAATTG GGAAAGAGAA
GTAAATAAAT TTGCTCCCAC CCTCAAAGTT TTACAACATC ATGGTGATAA ACGACCCAAA
GGAAAAATCT TTACAGAAAC AGTTAATAAA TACGATCTAG TAATTACTAG TTATTCACTC
GTGCAACGAG ATATAAAATT ATTACAAACC GTGAATTGGC AGATTGTTGT TTTAGATGAA
GCCCAGAATG TCAAAAATGC AGATGCAAAA CAATCGCAGG CAGTGAGACA GTTAGAAACT
AAATTTCGCA TTGCTTTAAC AGGGACACCA GTAGAAAATA GATTGCAGGA ATTATGGTCT
ATTTTAGATT TTCTCAACCC TGGTTATTTG GGTAATAAAC AGTTTTTCCA GCGACGGTTT
GCTATGCCCA TTGAAAAATA TGGTGATACG GCTTCTTTAA TGCAGTTACG TTCTTTGGTA
CAACCTTTTA TTTTGCGTCG CTTGAAAACT GATAAACAAA TTATTCAAGA CCTGCCAGAA
AAGCAGGAAA TGACTGTATT TTGTGGACTG ACTGCTGAAC AAGCGCAACT ATATCAACAG
TTGGTAGATG AATCTTTAGT TGAGATTGAA TCAGCGGAAG GTTTACAACG CAGAGGAATG
ATTTTAGCTT TATTAATTAA GCTGAAACAA ATTTGCAATC ACCCTGCACA GTATTTAAAA
GAATCAAGTT TAGCAAAACA CAATTCTGCT AAATTACAAC GTCTAGAGGA AATGTTAGAG
GAAGTTTTAG CAGAGGAAAA CCGCGCTTTA ATTTTCACCC AATTTGCTGA ATGGGGTAAA
TTACTTAAAC CATATTTAGA AAAGCAGCTA GGTAGGGAGA TCTTGTTTTT ATATGGTAGC
ACCAGTAAAA ACCAACGTGA AGAAATCATT GACCGTTTCC AAAATGACCC CCAAGGTCCA
CGAATTATGA TTCTTTCTTT AAAAGCTGGT GGTGTGGGTT TAAACTTAAC TAGGGCAAAT
CATGTATTTC ACTTTGATAG ATGGTGGAAT CCCGCTGTAG AAAATCAAGC TACAGATAGA
GTATTTAGAA TTGGACAAAC TAAGAATGTA CAAGTACATA AATTTGTGTG TACTGGGACT
GTAGAAGAAA AAATTAATGA CATGATTGAA AGTAAAAAAC AATTGGCACA ACAAGTTGTT
GGTGCTGGTG AAGAATTGTT AAGTGAATTA GATACAGACC AACTACGCAA TTTACTATTA
CTTGACCGTA ATGCCATAAT TGATGAGGAT GAAGAATGA
 
Protein sequence
MAILHGSWLL KNQNNYLFIW GETWRNSQIN LEFITLSKLP LNPLAMIPEE LKAFLEASNF 
KLPNNINQNY SDANSRTKFK KGKNTPEEID LLNTTQIIAI LTNFIGKDTI SPVHSASLGV
DKKTPQYLQP WKVEGFYLQP TTAIKFLTSL PLSSTDGKNS LLGGDIYFWV QIARWSLDLI
SRCKFLPTVK KKADGSLVAE WQVLLDSALD TTRLERFAAK MPLICRTYQE DRSENLQHLA
VDLPLLPEEL ILEFLTSIID AQIRDMLASQ SPIENRLMMS LPPAVQQWLQ ALTGASNKIN
DDAMGLERLD AALKAWILPL QYQLTGKASF RTCFQLHPPN SEKTDWILAY FLQAADNSEF
LVDAVTIWKY PVEKLVYQNR TIEQPQETFL RGLGLASRLY PVIAPSLETE SPQFCQLTPM
QAYEFIKSVA WRLEDNGLGV IIPPSLANRE GWSNRLGLKI SAETPKKKQG RLSLQSLLNF
KWQLAISGQT ISKAEFDKLV ALNSPLVEIN EEWVELHPQD IKTAQNFFAS RKEEISLSLE
DALRISTGDT QVIEKLPVVT FEASGVLQEL VGVLTNNQDI QPLPTPSNFH GQLRPYQERG
AAWLAFLERW GLGACLADDM GLGKTIQFIA FLLHLQEQDA LENPTLLVCP TSVLGNWERE
VNKFAPTLKV LQHHGDKRPK GKIFTETVNK YDLVITSYSL VQRDIKLLQT VNWQIVVLDE
AQNVKNADAK QSQAVRQLET KFRIALTGTP VENRLQELWS ILDFLNPGYL GNKQFFQRRF
AMPIEKYGDT ASLMQLRSLV QPFILRRLKT DKQIIQDLPE KQEMTVFCGL TAEQAQLYQQ
LVDESLVEIE SAEGLQRRGM ILALLIKLKQ ICNHPAQYLK ESSLAKHNSA KLQRLEEMLE
EVLAEENRAL IFTQFAEWGK LLKPYLEKQL GREILFLYGS TSKNQREEII DRFQNDPQGP
RIMILSLKAG GVGLNLTRAN HVFHFDRWWN PAVENQATDR VFRIGQTKNV QVHKFVCTGT
VEEKINDMIE SKKQLAQQVV GAGEELLSEL DTDQLRNLLL LDRNAIIDED EE