Gene Aazo_5348 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_5348 
Symbol 
ID9343214 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014250 
Strand
Start bp3332 
End bp6841 
Gene Length3510 bp 
Protein Length1169 aa 
Translation table11 
GC content40% 
IMG OID 
ProductPrimase 2 
Protein accessionYP_003723436 
Protein GI298501440 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.396058 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATAAATA ACCATCACGC ATTGCAGCTA AATCTTGAAC AAACACCGAA GTTAAAATTT 
GCTGTCAACA CTTATGGCAG AAATAAAGAC TGGGATTTTA AGAAACTAGC TGCTAACTTT
CAAGATAAAG AAGGCACAAT AGAGAACGTT AAAGAACATA TTAAAGCAGG TCACGCTGTT
TGTGCCAGCT TATTGGGTAA TAAATGGCGG AGTAAGGCCA ATGTTATCGG TTCTCAATGG
TTATTACTTG ACATCGATAA TTCTGATATG GCCCGTGATG CAGACGGTAA GCTAATTAAG
GATAATTATG GGAATTACAT CAAAGTTTAT AAGCACCAGT TAACTATTGA AGAAGCCTTA
ACCCATCCCT TCATTAGAAA ATACTGTGCT TTGATTTATA CCACTGCTAG TCATCAACCA
GACTGGCATA GATTCCGACT TATTTTCCTT CTGCCAGAAT ATGTTGAGGG TGCTGACATT
GTTGAAACTT GTATACACCT CTTAATGCAA CATCTACCCC ATGACCGTGC GTGTAGAGAT
GCTTCTCGTG TCTTTTACGG TAATACAGAA GCAGAATTCC TGTTGGTCAA TCCTGAAGCT
AGTTTACCAA ATGAATGGGT AAGGGAAGCA ATTACAATCA CACAACAGGA ACGTAGTGAG
TATCAGCTAC ACATTCAAGA AATTGAGTTA CGGCATAAGA AGTGGCGTGA AATTTCCCTT
ATTGAAGGCT GGGATATTGA CCAGCTTATT AAGAACGCAC TTTCATTAAT TCCACCGCGT
ACCCTTGGGA GTGGAAACTA TGATGAATGT CGCCAAGTAC TGATGGCTTT GGTCAATCAC
TACGGGGCAT CTGAAGCAGA AATTATTGCT GAGAAATGGT CGCCTTCGAT CAAGGGTACA
ACTTGGAACA TTCGTGCCAA AATCCGCAGT TTTAGGCGTG GTGGGATTAC GATCGGTACA
CTATTCCACA TCGCTAAGGA GTATGGTTTT CACTTTCCAA AACGGCAGTA TGAAGTATCC
GAAAACTATA AAGGACTCAC TAGTCCTCAA GAATGCGAAC TAGGACAATT CAGAGAAGAT
TTGACCAGCT TCCAGAATTT ATTAAAACAG GCGCTTATAC CTTTTGTTAA AGGTGCTAAA
GGATTTTCCA CTCCCATATT AAGGGGAGCA CAATTCGTAA GAGAGGAACC AAAACTTGAG
ATTAACTCCC CAATACGTAA GTTCATCGTC TATAAGCCGG GGAATATCCC ATACAAAAAC
GAATTTATAG ATGACATCTA TATTGAATGC CAAGCAGGAG AACATATCGC CGCATGGGTC
GAAGCAATTT CTAAAGGCTG GACACAAATT TTAGACAATT CTCATCCAGG TCAGGGTAAA
TCTCACAATG CTGGTAAACT GACGGCTGCC ATGTTCGGTA CAGATAAACT AATTTACCAA
GATGCAAACC ACAGAAACCC ATCTACCTTA CCGATTGAGA TTAATTTTGT TGACTTACCT
GTGCGGCACA ACGGCTTGAA AATTGACACT AACCGTACTA CTCCAACTGG TGCTAATTTT
CTAGTACAAC CTAAAGTGGG TGAAACTCCT GATACTGATA GTAACTGTAG GAGAACTAGT
TTATTTGGTG CGTTCCGTGA TAAGAACTTT GCTTCTTTAG ATTTTGAAGA AAGTGCTATT
AGCCCTATCT GTAAGGGTTG TATTAATCAC AATCAGTGCT GTTTCACTTA TGGTGATGGT
TTCGGCTTCC GTTTTCTAAA GCAGATAGTA ATCCAAAATT ACGCCGAAAT TCGAGCACAT
CCTGACTCTA CCCCAGTTGT TTTAACTAAC GGTGCTAATG AACCTTTCAC TGTGGGACGG
ATATGGGAAG AAGCAGGAAG TCTTATTAAG CCTGTGCGAT CAATTGAAGT ACGACTACAG
GATTTTGATA CTACCATTGG TTCCTTAATT GCTGGTGGTT TACCAACTGA GGAATGGGTA
AAACTCCAGC AGTATTTACC AGTTTTACGA TCACTTCTTA ATGGAGACAT TAAACCTAAT
TCTCGTTATG GCTTTAACGA TGCTGAATTA AGAGAACAAT TAGGTGAATT TCCGTCTGGA
ATAGACATTA AAATAATCCG GCAAATATTA CAGCCTAATT TAGAATTCTT GGGTGACCTG
GATTCTGTTG ATATCAATGG TGATCAACAA CTACGAAAAA GTGCTGCCGC TCATTTTGCT
GCTAAACGTG TAGCTAAGGA GAGCGCCCAA CAAGCGGGTA AACAGTTCTT TGATTTACCC
AATTACTGGT TACCTGATTT CCTCGAAGCT TGGAAAGGTG AGGGTTCCCT AGCTTATAAA
TGGGGTATAT TGACCATTTA TCGCCCTAAT ACTAAATACA GTGAATTAGC TTACTCAGCC
CAATTCAATA TTTACCTTGA TGCTACTATT AGGCCAGAGC ATTTGAAATT GAAACTTGGT
TACAATGATT CAATATTAGT CATCCAGCAG TCTCGTCCTG ACTACAGTAA TTTAAGAGTA
GTTAATGTCA TAGGGCTCGG TAAGTTGCCT AAAAAACGGT CAAGCTCCTT AACTGAACGT
GTTCTTTCCC TCAAAGAAAC TTTTCGTAAG ATACACAAGA AACTGGGGAT TATTGAGTGG
AAACAACTTG CTATTAAGAC GGAAGACTAT ATTGAATACA GTCACTTTGT TGATGGTCGT
GGTGTTAACC GCTTTAGTGA ATGTGATGTC ATTGCTTCTT TTGGCATTCC GTACCAGAAT
ATTGGTGTAG TAGCTGCACA TATAAAAGTA ATGACTGATG AGGCAGTCCA CGTGGGAAAT
TTAGAAGGCA TCTTACAAAA ATATCTCACC GAGTTAATCA GGGGTGAGAT TATTCAAGAA
ATTGGTCGAT TACGCGCTCA TCGTCGCTGC AATGAGGAGT TAATATTTTA CTTCTGTGCT
GACTATGACT TGAGTTTTTT GTTAGATGAA CTACCAGGTG TAAAGTTAGA GGTTATAGAT
GCCTGTACTT TATGTACGGA AGCTGGTAGC CGAGATCAGC AGACTGGACA TGCCATTATC
AAGGCCTTCA ACCAGCTTTT GGAATCCCAA CAGAAAATTT GTCAAACGGC AATTGCAAAA
ATTATAAACA CTACGCAAGG TTGGGTAAGC AGATTTACCC AACGTTGGGG TGGTTGGGTT
AGGTTTAAAA AATTATTACT TTTACTATTA GATAGTTTTC ATAGTGATAG TAATAAAAAT
CTCACAGACC TCAGCGATGA CGAAATTTGG TTTGCTCGTG AATACTTCCC GACACTACTT
GATAAAGCTA GATCATTACC TGAAGATCCT ATTGAATATA TAGCTGAAGT TACTCTAACG
GTTACTAAAC CTGTAATGTC GCCCATTCTG CGCCATTTTA GCCCTGCTGT AAAAACTGGT
TTGCTGTTGA TTGTTTTATC TTCTCTGGCT ACTGAGGTTT ACTTGCACTT TCTCCTGGTA
CAAGCCGTGA CCAATCCGAT AGTTATCTGA
 
Protein sequence
MINNHHALQL NLEQTPKLKF AVNTYGRNKD WDFKKLAANF QDKEGTIENV KEHIKAGHAV 
CASLLGNKWR SKANVIGSQW LLLDIDNSDM ARDADGKLIK DNYGNYIKVY KHQLTIEEAL
THPFIRKYCA LIYTTASHQP DWHRFRLIFL LPEYVEGADI VETCIHLLMQ HLPHDRACRD
ASRVFYGNTE AEFLLVNPEA SLPNEWVREA ITITQQERSE YQLHIQEIEL RHKKWREISL
IEGWDIDQLI KNALSLIPPR TLGSGNYDEC RQVLMALVNH YGASEAEIIA EKWSPSIKGT
TWNIRAKIRS FRRGGITIGT LFHIAKEYGF HFPKRQYEVS ENYKGLTSPQ ECELGQFRED
LTSFQNLLKQ ALIPFVKGAK GFSTPILRGA QFVREEPKLE INSPIRKFIV YKPGNIPYKN
EFIDDIYIEC QAGEHIAAWV EAISKGWTQI LDNSHPGQGK SHNAGKLTAA MFGTDKLIYQ
DANHRNPSTL PIEINFVDLP VRHNGLKIDT NRTTPTGANF LVQPKVGETP DTDSNCRRTS
LFGAFRDKNF ASLDFEESAI SPICKGCINH NQCCFTYGDG FGFRFLKQIV IQNYAEIRAH
PDSTPVVLTN GANEPFTVGR IWEEAGSLIK PVRSIEVRLQ DFDTTIGSLI AGGLPTEEWV
KLQQYLPVLR SLLNGDIKPN SRYGFNDAEL REQLGEFPSG IDIKIIRQIL QPNLEFLGDL
DSVDINGDQQ LRKSAAAHFA AKRVAKESAQ QAGKQFFDLP NYWLPDFLEA WKGEGSLAYK
WGILTIYRPN TKYSELAYSA QFNIYLDATI RPEHLKLKLG YNDSILVIQQ SRPDYSNLRV
VNVIGLGKLP KKRSSSLTER VLSLKETFRK IHKKLGIIEW KQLAIKTEDY IEYSHFVDGR
GVNRFSECDV IASFGIPYQN IGVVAAHIKV MTDEAVHVGN LEGILQKYLT ELIRGEIIQE
IGRLRAHRRC NEELIFYFCA DYDLSFLLDE LPGVKLEVID ACTLCTEAGS RDQQTGHAII
KAFNQLLESQ QKICQTAIAK IINTTQGWVS RFTQRWGGWV RFKKLLLLLL DSFHSDSNKN
LTDLSDDEIW FAREYFPTLL DKARSLPEDP IEYIAEVTLT VTKPVMSPIL RHFSPAVKTG
LLLIVLSSLA TEVYLHFLLV QAVTNPIVI