Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4099 |
Symbol | |
ID | 3681564 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 5092701 |
End bp | 5098802 |
Gene Length | 6102 bp |
Protein Length | 2033 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637719447 |
Product | non-ribosomal peptide synthase |
Protein accession | YP_324595 |
Protein GI | 75910299 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01720] non-ribosomal peptide synthase domain TIGR01720 [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.908444 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGAACAGC TATCGCCTCA AAAACGCGAA CTTGTTCTGC AAAAATTGTT AGCACAGCAA TCTTCTACAA TCAACAATAA ACCTAAATTG CCGCGCATTG AAGCTGTTAG CAGCGACAAA TTAATACCTT TATCTTTTCC CCAACAAAGA CTTTGGTTTC TTGACCAAAT GGAGGGTAAT AGTGCTGCAT ATAATATGGC GGCTGCGGTA GAAATTACTG GCAATCTGCA AGTATCTATC CTAGAAAACA TCATTGCCGA AATCATTCAA CGTCATGCCA TTCTCCGGAC TAACTTTAAA AACGTTGATG ACAATGCTGT TCAGATAATT GCACCTCATC TCACAATAAA TATTCCTGTA ATTGACTTAC AAACATTGCC AGTAGCAGAA CAGTTTGCCG AGGTAGAATG TTTAGCGATC GCGGAACAAT TAAAACCTTT TGACTTAGCG AACGATTGTT TGTTACGGGT GACGTTGCTA CAGTTAGCCC CAGAATCTTA TGTGTTGCTG GTCACAATGC ACCATATTGT TTCTGATGGC TGGTCGATGG GGGTATTTAT TCAAGAATTT TCAACTCTGT ACACCGCTTT TTCGCAAAAT CAACCTTCTC TACTGCCAGA ATTAGCCATC CAATATGCTG ATTTTGCTCA TTGGCAAAGA CAATGCTTGC AAGGTGAGGT ATTAGAAAAT CAACTCAACT ACTGGCGGCA GCAGTTAGCA GGTATTCCCC CCATCTTAGA ATTACCTACA GACCGCCCGC GTCCTCCAGT GCAAACTTTT CAAGGTCAAA CTTTATATTT TGAACTCGAC CAAAATCTCA CCAAACAACT AAATATCCTG TGCCAAAAGT CAGGGACAAC CATGTTTATG ACCCTGATGG CAGCTTTTGC AACTTTACTG TACCGCTATA GTCGCCAGTC AGATATTGTC ATCGGTTCTC CTATTGCTAA CCGCGATCGC CAAGAAACAT TTCCACTAAT TGGTTTATTT GTCAATACTT TAGTGCTGCG AACTAATTTA GAGGGGAATC CGAGTTTTGC AGAATTACTG CAAAAAGTCA AACAAGTGGC TTTAGATGCC TATGCTCATC AAGATGTACC TTTAGAGCGA TTAGTCGAAG CTTTGCAACC AGAGCGATCG CTCTCCCATA TGCCATTGTT CCAAGTGGCG TTTGCGATGC AAAATGCACC GATGGGTAAA TTAGAATTAC CCAACTTGAG TTTAAATCTC TTAAAAATCG AAAATCGCAC AGCGAAATTT GATCTGGCGC TATCAATGCA AGAAACCGAG TCAGGACTTT TAGGAGAATG GGAATTCAAC ACCGATCTAT TTGATGCTAC GACGATTAGC CGCATGGCGA GACATTTCCA GACTTTGCTG GAAAATATTG TGGCTAATCC TCAACAACGA ATTGCGGAAG TATCATTATT AAGTGCTAGC GAGCAGCATC AATTACTTGT AGATTGGAAC AACACCACCA CTGATTATCC CCAAGGTAAA TGTATTCATC AGTTATTTGA AGAGTGGGTA GAACAAACAC CAGATGCGGT GGCGGTAGTC TTTGAAAATC AGCAAATTAC CTACAAAGAG TTAAATCATC GCGCCAACCA ATTAGCACAC CAATTGCAAA CCTTGGGTGT CAAACCAGAC GTATTGGTGG GGATTTGCGT TGAGCGTTCC CTAGAGATGA TAGTAGGATT ATTGGGAATT CTCAAAGCAG GTGGTGGGTA TGTACCTTTA GATCCTAACT ATCCGAGCGA TCGCTTGGCT TTTATGCTCA ATGATGCTCA ATTACCAGTA TTATTAACAC AACAGCAATT AGTAGAGAAA TTACCAGAGC ATCAAGCGAT CGCAATTTGT TTGGATGCAG ACTGGAACGA AATTGCCAAA AATAATAGTT TTAATCCCAC CAGTACAGTT ACCACTGCCA ATTTAGCCTA TGTAATTTAT ACCTCTGGAT CGACAGGCAA ACCAAAAGGC GTGATGGTAG AGCATACTGG GTTATGCAAC TTAGCCAAAG CGCAGATTCA GACTTTTGAT GTGCAAACAT CCAGCCGGAT TCTTCAGTTC GCCTCCTTTA GTTTTGACGC TTCTATTTTT GAAGTTGTGA TGGCTCTGGG AACGGGAGCT AGACTTTATC TGGGAACAAA AGAATCTTTA TTGCCTGGTT CATCATTAAT TCAGCTATTA CAAAAATACG GTATTACCCA CATCACCTTA CCACCCTCAG CTTTAGCGGT CTTACCTGCT GATGAACTCC CAGCGTTGCA AACCATCATC GTAGCAGGAG AAGCTTGTCC TCCCGATTTA GTCGAGCGTT GGTCTCGTGG TCGTCGTTTC TTCAATGCTT ACGGCCCGAC AGAAGCCACT GTTTGGTCAA CAGTTGCAGA ATGCAGCAGC AACAGCACTA ACAAACCCCC CATTGGTCGT CCAATTACCA ATACACAAAT ATATTTACTC GATCAAGATT TGCAACCTGT ACCTGTTGGT GTTCCAGGCG AACTGCATAT TGGTGGTATT GGATTAGCCA GAGGTTATCT CAACCGTCCT GAGTTGACAC AACAAAAATT CATTCCTCAC CCCTTTAGCA ATGAACCAGA AGCGCGACTT TACAAAACAG GCGACCTCGC TCGCTACTTA AGCGATGGCA ATATTGAATA CTTAGGACGT ATCGATCATC AAGTGAAACT ACGCGGCTTC CGTATTGAAT TGGGAGAAAT TGAAGCTTTA CTCAGCCAGC ATCCAGGAGT AATCCAAAAT ACACTGATCA TCCGTGAAGA TATCCCTGGT AGCCAACGTT TAGTCGCTTA CACAGTTGCC AATCCCGACC AAATACCAAC AATTAGTGAA CTGCGACAAT TCCTCAAGGA ACGGCTACCT GAGTACATGG TTCCTTCGGC TTTTGTGATG TTAGATACTC TGCCTTTAAC ACCAAACGGC AAAGTAGACC GCCGTGCATT ACCTGCGCCA GAATCCCGTC CTGAGTTAGC AGTTAACTTT GTTGCTCCAC GTACTCCACA AGAAGAAAAG TTAGCTGCGA TTTGGGCAGA TGTGTTGAGA TTACAGCAAG TTGGGATTCA TGATAACTTC TTTGAAATTG GTGGCGATTC TATCCTCAGC TTGCAGATTA TTGCCAGAGC CAACCAAGCC GGAATTCAAC TCAATATCAA GCAGTTATTT CAGCATCAAA CAATTGCTGA GTTGGCTGCT GTGGCAAATA CAATACCAAG TATCACGGCA GAACAAGGTT TAATTACAGG TTCGTTACCC TTAACACCGA TCCAGCATTG GTTTTTTGAC CAGAATCTAC CGCAACCAGC TTATTTTAAT CAGTCGGTGT TATTGGAAGT ACCAAATGAC CTGAAACTAG AAATCTTGGA ATCAGCATTG CAGCAATTAC TATTACACCA CGATGCTTTA CGGTTGAGAT TTGTCCAAGA AGGGGAAAGT TGGTCACAGA CTCACGCGGA TGCTAATGCT ACAGTAGCTT TAACTTGTGT AGATTTGTCA GAAAAAGCCC CACAAGCACA ACAAACAGCC CTCGAAACTA CCGCAAATCA GCTACACGCT AGCTTAAATC TGTCCCAACA GTTAATGCAA GCTGCCCTGT TCCACTTCGG TGCTACACAA TCTGCTCGGT TGCTGATTAT TGTGCATCAC TTAGCTGTGG ATGGTGTCTC TTGGCGAATT TTGGTAGAAG ATTTATTTCA TGCTTATCAG CAACTCAATC GTGGTGAAAC AGTTCAACTA CCAGCTAAAA CAACTTCCTT CAAAGAATGG TCACAGCGAT TGACTGAATA TAGCCACTCA GAAGCACTTG CAGGGGAACT CGACTTTTGG CTAGGTCAGT CATCCGGCTC TATAGCTTTG CCTGTAGACT ATCCCCAAGC GGTTAATACT GTGGCTTCAT CTGCACAAGT TTCGGTATCT CTCGATATAG AAAAAACTCG TGCTTTGCTG CAAGAAGTTC CTGCAATTTA CAACACCCAA ATCAACGATG TCTTGTTAAC TGCTTTGGTA CAAAGCTTTG CTCAATGGAC TGGTGAATCT TCCCTACTTG TTGATTTGGA AGGTCACGGA CGAGAAGAAC TGTTTGCAGA CATCGATTTA TCACGCACTG TCGGCTGGTT TACTTGCCTG TTCCCAATTA AATTAGAATT GGCAGCGATC GCCAATGTGG GTAAAACCTT AAAATCCATT AAAGAACAAT TACGCCCTTG CCAAAAACGG GGTATTAACT ACGGTATTCT CCGCTATCTC AACCCCAACC CCGCAATCCG CCATCAACTC ACAACTGCAC CCCAAGCACA AGTTAGTTTC AACTACTTGG GACAATTTGA CCAAGAGTTA TCGGAATCTG GCGCGTGGAA ATTGGCTCAA GAGTCTGCGG GTAATGAACA AGGTATATCG GGCGATCGCA CTCACTTATT AGAAGTTAAC GCCTTAGTAG CATCAGGTAA GCTGCAACTG AATTGGACTT ATAGCCAAAA CATTCACCAA ACATCTACTA TTGAGGCTTT AGCAGCTGGT TTCATCAATG CACTTACAGA GATTATCCAT CACTGTCAGT CTACTGATGT TGGCGGTTAT ACACCTTCAG ATTTTCCCGA AGCTGAGTTA ACTCAGGAAT ATCTGGACAA TTTAGTCGGA GAAATGGCAA ACCGTGACAC AGCCAGTCAC AAGAAAAATA TTGAGTCGAT TTATCCACTT TCCCCGATGC AGCAGGGTAT TCTCTTCCAT AGCCTGTATG ATCCAGAGTC CGGAGTATAT TGCGAACAAC TAAGCTGCAC GCTTCACGGC TCTATTAATA CCACAGCATT TGCACAAGCT TGGCAACGTG TAGTTGAGCG TCATTCGGCT TTGCGTACCT TCTTTGTTTG GGACAATCTG GATCAGCCAC ACCAAGTTGT TTGCAAAACT GTCAATCTAC CTTTTGCGGT TGATGACTGG CGTTCTCTAT CGCCTACCGA GCAACAAGAA AAACTGACAG CTTTTCAGGA AGCAGATAGA AACAAGGGTT TTGAACTTAA CCAAGCTCCC TTAATGCGTT GTAGTCTCAT TCAGACAGCA GATGATACTT ACGAATTCGT CTGGACATTC CATCATTTGT TGATAGATGG ATGGTCTTTA CCTGTCGTGG TTCAAGAAGC TTTTGCTTTC TATGAAGCTG CGAATCAGGG GCATGATTTA TATTTAAAAA CACCTCGTCC CTTCCGGGAT TATATAACTT GGTTGCAACA ACAAGACCTT TCCCAAGCCA AAGAATTTTG GCAGCGATCG CTGCAAGGTT TTACAGCCCC AACTCCGCTC ATGGCAGATA AATCTATTGT TCACAATTCC CAACAACAGC AGACTTATCA CGAGCAGCAT ATTCAATTAC CACAAGCATT AACAACTCAA CTGGAATCTC TTGCTAGACA AAATCAACTC ACTCTTAACA CTTTAGTACA AGGAATTTGG GCGCTATTAC TCAGCCATTA CAGTAATCAA AAAGATGTAG TATTTGGCAC AACTGTATCT GGTCGTCCCC CTGCACTTGT CGATGTAGAA ACTATGGTAG GGATGTTTAT CAATGCCTTA CCAGTCAGGG TACAAGTATC AGAGGATGAG CAAATATTAC CTTGGCTGAA AGATTTACAC ACACGGCAAG TAGAACGGGA GCAATACTCT TACACTCCCT TAGTGGAAAT TCAAAGAGTA AGTGAAATAG CTAGCGGGAC ACCAATGTTT GAGACTAACG TCGCCTTTTA TAATTACCCA GTAGATCCTG CTTTGCAAAA TTCTAGTAGC GGCTTAAAAA TTACCAACAT CAGTAATTAT GAACGCACAA AATATCCTTT GATGTTGGTA ATTATGCCTG GTGAAAATAT ATCTGTGGGA CTAAGTTATG AAGGAAATCG CTTTAATCAC GAAACTATTG CGGATATTCT AGAAAACTTT GCAACTGTAG CCAATAAAAT TACTGAACAA CCTGATGCCA AGTTACAAAC CGTAGCAGAA ATTCTTGTCC AAGCAGACAA ACAAAAACAA CTCATTAAAG ACCAAAAACT GGCAGCGACA GCAATTAATA AATTGCACAA ATTCAAGCGT AAATCAGCTT AA
|
Protein sequence | MEQLSPQKRE LVLQKLLAQQ SSTINNKPKL PRIEAVSSDK LIPLSFPQQR LWFLDQMEGN SAAYNMAAAV EITGNLQVSI LENIIAEIIQ RHAILRTNFK NVDDNAVQII APHLTINIPV IDLQTLPVAE QFAEVECLAI AEQLKPFDLA NDCLLRVTLL QLAPESYVLL VTMHHIVSDG WSMGVFIQEF STLYTAFSQN QPSLLPELAI QYADFAHWQR QCLQGEVLEN QLNYWRQQLA GIPPILELPT DRPRPPVQTF QGQTLYFELD QNLTKQLNIL CQKSGTTMFM TLMAAFATLL YRYSRQSDIV IGSPIANRDR QETFPLIGLF VNTLVLRTNL EGNPSFAELL QKVKQVALDA YAHQDVPLER LVEALQPERS LSHMPLFQVA FAMQNAPMGK LELPNLSLNL LKIENRTAKF DLALSMQETE SGLLGEWEFN TDLFDATTIS RMARHFQTLL ENIVANPQQR IAEVSLLSAS EQHQLLVDWN NTTTDYPQGK CIHQLFEEWV EQTPDAVAVV FENQQITYKE LNHRANQLAH QLQTLGVKPD VLVGICVERS LEMIVGLLGI LKAGGGYVPL DPNYPSDRLA FMLNDAQLPV LLTQQQLVEK LPEHQAIAIC LDADWNEIAK NNSFNPTSTV TTANLAYVIY TSGSTGKPKG VMVEHTGLCN LAKAQIQTFD VQTSSRILQF ASFSFDASIF EVVMALGTGA RLYLGTKESL LPGSSLIQLL QKYGITHITL PPSALAVLPA DELPALQTII VAGEACPPDL VERWSRGRRF FNAYGPTEAT VWSTVAECSS NSTNKPPIGR PITNTQIYLL DQDLQPVPVG VPGELHIGGI GLARGYLNRP ELTQQKFIPH PFSNEPEARL YKTGDLARYL SDGNIEYLGR IDHQVKLRGF RIELGEIEAL LSQHPGVIQN TLIIREDIPG SQRLVAYTVA NPDQIPTISE LRQFLKERLP EYMVPSAFVM LDTLPLTPNG KVDRRALPAP ESRPELAVNF VAPRTPQEEK LAAIWADVLR LQQVGIHDNF FEIGGDSILS LQIIARANQA GIQLNIKQLF QHQTIAELAA VANTIPSITA EQGLITGSLP LTPIQHWFFD QNLPQPAYFN QSVLLEVPND LKLEILESAL QQLLLHHDAL RLRFVQEGES WSQTHADANA TVALTCVDLS EKAPQAQQTA LETTANQLHA SLNLSQQLMQ AALFHFGATQ SARLLIIVHH LAVDGVSWRI LVEDLFHAYQ QLNRGETVQL PAKTTSFKEW SQRLTEYSHS EALAGELDFW LGQSSGSIAL PVDYPQAVNT VASSAQVSVS LDIEKTRALL QEVPAIYNTQ INDVLLTALV QSFAQWTGES SLLVDLEGHG REELFADIDL SRTVGWFTCL FPIKLELAAI ANVGKTLKSI KEQLRPCQKR GINYGILRYL NPNPAIRHQL TTAPQAQVSF NYLGQFDQEL SESGAWKLAQ ESAGNEQGIS GDRTHLLEVN ALVASGKLQL NWTYSQNIHQ TSTIEALAAG FINALTEIIH HCQSTDVGGY TPSDFPEAEL TQEYLDNLVG EMANRDTASH KKNIESIYPL SPMQQGILFH SLYDPESGVY CEQLSCTLHG SINTTAFAQA WQRVVERHSA LRTFFVWDNL DQPHQVVCKT VNLPFAVDDW RSLSPTEQQE KLTAFQEADR NKGFELNQAP LMRCSLIQTA DDTYEFVWTF HHLLIDGWSL PVVVQEAFAF YEAANQGHDL YLKTPRPFRD YITWLQQQDL SQAKEFWQRS LQGFTAPTPL MADKSIVHNS QQQQTYHEQH IQLPQALTTQ LESLARQNQL TLNTLVQGIW ALLLSHYSNQ KDVVFGTTVS GRPPALVDVE TMVGMFINAL PVRVQVSEDE QILPWLKDLH TRQVEREQYS YTPLVEIQRV SEIASGTPMF ETNVAFYNYP VDPALQNSSS GLKITNISNY ERTKYPLMLV IMPGENISVG LSYEGNRFNH ETIADILENF ATVANKITEQ PDAKLQTVAE ILVQADKQKQ LIKDQKLAAT AINKLHKFKR KSA
|
| |