Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_0728 |
Symbol | |
ID | 9338514 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 766734 |
End bp | 769775 |
Gene Length | 3042 bp |
Protein Length | 1013 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | |
Product | cyclic nucleotide-regulated ABC bacteriocin/lantibiotic exporters |
Protein accession | YP_003720304 |
Protein GI | 298490127 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.637206 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATCAG CGTTTTCTCA GTCATATTTA GCAACACAAA TTACCAAAAT TGTGGGTGAT TCGCTATCAG ATCAAGAACT TGAAAATTGT GTTCAAGCAT TAGATATTAT TGAACCACCA ATAGCCAAAC AATTCTGGAT ATCGACAACA GCACCACCAG GAATTTATCT AGTTTTGTCG GGTAAGGTCA GGCTATTGGA TGGTGGAAAT AATTTGATTT CAACTTTGAC TTCTGGTTCA TCTTTTGGGG AGTTGACTTT ATTTCCCGAA CAAAATTTTA GTGCTTATGT AGCGAGGGCT TCTGTAAATT TAAAGGTCGG TTATCTTCCC CAGGAGGTAA TTAATAGATT TGTGGGTGTG AGCGATCGCC TATTCATAAA GGCAGAACTT TGGGATATAC TGGTGTTATT ATCTCAAAAC TCTGCTACAC CTCGTCATGA ATCGGTAGAG GAGATACTAA CAGCATTATC TCTATTTGCA AAACAAAATC TAGAAATTGG TTCTCTAAAT CCTCAAGTGA CCGAAAATAC CAAACTATTG CTAGTGTGTC AGGGGGAATT ACAACATTGT CAGGGTAAAA AATTAACGCC AGGTAACATT TATGTAAATC CCCACAAGGA AAAGTGGCAA GCAACACAAC CCAGCAGAGT ATACATTTTG CATGATGCTG ATTTGCAAAC AGCACTACAA TATTGGCCGC AATTAAGTAG GTTAATTGAT GTAGAAAATG AGCCAATTAC AGAACCAATC AAGCGCGAAG TTAAGGCTAG AAGCCGCAAT GTGATTCAAT TTCCCCAAAC CACAATCCAG GAACAATCAC AACCAAAACA AAGCCAGAAA TACTTTCCTA GTCCTACTGT CACCGCTGGA AATTGGTGGC GTAAAGTTAG TAAACGCTAT CCGTTTTGTG AACAACAAAG CGCCTCTGAT TGTGGTGCGG CTTGTTTGGT GATGATTAGT CGTTATTGGG GTAAGAATTT TACCATTAAT CGCTTGCGAG AACTGGCTAA TATTAACCGT GCTGGTGCAT CTATGCGGAG TTTAACTGCG GCGGCTGAAA GTATCGGTTT TGCTACTCGT CCGGTGAAAG CTAGTTTAGA TAAATTAGCA CAACAAACGT TACCTGCGAT TGCACACTGG GAGGGTAAAC ATTACATCGT CGTCTATGAA ATTACCGAAA AATGGGTAAT TGTCGCCGAT CCTGCTATCG GTCAACTTAA TCTGTCTATT CGAGAATTTA AAGCAGGTTG GACTGGTTAT GCATTGTTAT TACAACCAAC AAACTCACTT CAAACAATCC CAGAAGCTAA CACACCATTT TGGCAGTTAT TTGAGTTAGT AAAACCTCAT TACCAGGTAC TGCTAGAAGT ATTTGTTGCT TCAGTGTTAA TTCAAGTATT TGGATTGGTG ACACCTTTAT TTACTCAACT ATTATTAGAC AGAGTAATTG TCCAAGGTAG CACCATTACT TTAAATACTG TTGGGTTTGG GTTACTTATT TTTAGCTTAT TTCGTGTTGT TATCAATGGA CTCAGACAAT ATTTACTAGA CCACACAGCT AACAGAATTA GTGTGGCCTT GATGGTAGGT TTTATTAAAC ATACCTTTCG TTTACCTCTC TCATTTTTTG AGTCTCGTTA CGTTGGTGAT ATTGTTTCTC GTGTCCAAGA GAATCAAAAA ATTCAGCGTT TCTTAACTGG TGAAGCACTA TCTATTGTTT TAGATTTTCT GACAGTATTT ATCTACATCA GCTTGATGTT TTGGTACAGT CCTTCAATGG CTTTATTGGT TTTGGCAATT GTACCACCCT TTGTATTACT AGCCCTCTTT GCCACACCTT TTTTAAGACG AACTAGTCGT GAAGTTTTTA CGGCGGTGAC AAAAGAAAAT AGTTACTTAA TTCAAAGTTT AACGGGAATT TCCTCGATTC GCTCAATGGC TATTGAACAA ACAGTAAGAT GGCGTTGGGA AGAATTGCTG AATAATTTGA CTAAGAAAAA CTTCAGTGTT CAAGTAATTG GAAATCAACT GGAAATTATT AGTTCTACAA TTCAAGCCAT AGCCAGTACA GGATTATTGT GGTTTGGAGC ATGGTTAGTA ATTCAGAACC AGTTAACCAT TGGACAGCTA GTAGCTTTTA ATATGTTACT AGGTAACGTT ATTCAGCCTT TCCAAAGGTT AATTAGTTTG TGGAATCAGT TACAAGAAGT GATAGTTTCT ACAGAACGAA TTAATGATGT TTTAGAAGCA AGCCCAGAAG AGGATCTAGC AGTTCACTCA CGTCAAATAT TACCTAGATT ACGTGGCCAT TTACATTTTG ACAATGTAAC TTTTCGCTAT CATGCTGAAA GCGATATTAA CATCCTAGAA AATCTCAGTT TTGAGATCTT ACCTGAACAA ACTGTTGCAG TTGTAGGGCG GAGTGGTTCA GGGAAAACAA CTCTTTCTAA GTTGATTTTG GGTTTATATC CAGCGACAGA TGGCCGAGTA TTAATTGACA ATCAAGATGT GGCTAGTATT TCTCTGCATT CATTGCGCTC GCAAATAGGA GTTGTTGACC AAGATACATT TTTATTTGGC GGTACAATTC GAGAAAATAT TAGCATAGCT CATCCAGAAG CTACCCTAGA AGAAATTATT GAAGCAGCCC AACTTGCAGG TGCAGATGAG TTTATTAAAC GCATGGCTAT GGGTTATGAA ACCCAAATCG GTGAAGGTGG AGGAATGTTA TCAGGGGGAC AACGCCAACG CCTAGCAATA GCTCGTGCAT TATTAGGAAG TCCGCGCCTT TTAATATTAG ATGAAGCAAC CAGTCATCTT GATGCAGAAT CTGAGCGCAT TATTCAGAAC AATCTGAAAA CAATTCTCAA AGGACGCACG AGTTTTATTA TCGCCCATCG TCTTTCCACC GTGCGTCATG CTGACCTAAT TTTAGTTTTA GATCGGGGGA TTTTAGTTGA AAGTGGTACT CATGAAGAAT TAATTCTCAA GAGAGGACAT TATTACTATC TTAATCAACA ACAACTAGCG TCAACAGGCT GA
|
Protein sequence | MPSAFSQSYL ATQITKIVGD SLSDQELENC VQALDIIEPP IAKQFWISTT APPGIYLVLS GKVRLLDGGN NLISTLTSGS SFGELTLFPE QNFSAYVARA SVNLKVGYLP QEVINRFVGV SDRLFIKAEL WDILVLLSQN SATPRHESVE EILTALSLFA KQNLEIGSLN PQVTENTKLL LVCQGELQHC QGKKLTPGNI YVNPHKEKWQ ATQPSRVYIL HDADLQTALQ YWPQLSRLID VENEPITEPI KREVKARSRN VIQFPQTTIQ EQSQPKQSQK YFPSPTVTAG NWWRKVSKRY PFCEQQSASD CGAACLVMIS RYWGKNFTIN RLRELANINR AGASMRSLTA AAESIGFATR PVKASLDKLA QQTLPAIAHW EGKHYIVVYE ITEKWVIVAD PAIGQLNLSI REFKAGWTGY ALLLQPTNSL QTIPEANTPF WQLFELVKPH YQVLLEVFVA SVLIQVFGLV TPLFTQLLLD RVIVQGSTIT LNTVGFGLLI FSLFRVVING LRQYLLDHTA NRISVALMVG FIKHTFRLPL SFFESRYVGD IVSRVQENQK IQRFLTGEAL SIVLDFLTVF IYISLMFWYS PSMALLVLAI VPPFVLLALF ATPFLRRTSR EVFTAVTKEN SYLIQSLTGI SSIRSMAIEQ TVRWRWEELL NNLTKKNFSV QVIGNQLEII SSTIQAIAST GLLWFGAWLV IQNQLTIGQL VAFNMLLGNV IQPFQRLISL WNQLQEVIVS TERINDVLEA SPEEDLAVHS RQILPRLRGH LHFDNVTFRY HAESDINILE NLSFEILPEQ TVAVVGRSGS GKTTLSKLIL GLYPATDGRV LIDNQDVASI SLHSLRSQIG VVDQDTFLFG GTIRENISIA HPEATLEEII EAAQLAGADE FIKRMAMGYE TQIGEGGGML SGGQRQRLAI ARALLGSPRL LILDEATSHL DAESERIIQN NLKTILKGRT SFIIAHRLST VRHADLILVL DRGILVESGT HEELILKRGH YYYLNQQQLA STG
|
| |