Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4101 |
Symbol | |
ID | 3681566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 5100210 |
End bp | 5104247 |
Gene Length | 4038 bp |
Protein Length | 1345 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637719449 |
Product | amino acid adenylation |
Protein accession | YP_324597 |
Protein GI | 75910301 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins [COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.357734 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.611275 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACACTG AAATGACTAA TCTGATTCGT ATTTCTCCTC AACAAAAGCA TTTATGGTTA CAACAAAAAG ATAATTATCA AGCTTACTGT ACACAAATTG CTATTAATAT CACAGGGAAT CTGAACTCAG ATATTCTGAA ATTAGCTTTA GCAAATCTTA GCGATCGCTA TGAAATATTT CACACTGCTT TTCATTCTTT ACCAGGGATG AATATCCCAT TCCAATCAAT TAGCGATGCT CCAACTTACA ACCATATTTC CCTAGCAGAA TATGATTGGA GTCGTTTAAG TTCTCAGGTA CAAAAAGCCA AAATCGATAC ACTTTTTCAA GATACCAGCC TCCGAGTTTT TGATTTTGAA CAAGCGACAA ATTTACATTT TATCTTAGGG AAGCTAGCGA CAGATCAGCA TATACTGTTT ATCAGCTTGC CTGCTTTATA TGCAGATAAG ACAACACTCA ACTATCTTAT ATGTGAAATC GGTCGTGCTT ATACAGCTTG TTTAGATGGC GAAGAAATTT CTGAGGAAGT AATGCAATAT GCAGATGTCT CTGAATGGCT GAATGAATTA TTAGAATCTG AGGATACAGA AACAGCTAGA AATTATTGGC AACAACAAGA TATTTCTTCT GTGTTAAATT TGCGAATTCC TTTTGAAATC AAAAATGATC ATCAGTCATC CGGAGAAATT AGCTTTGCAC CTCAGTCGCT GTGTTTAGCG ATGAACCGCG ATACAGTAGA AAAACTAGAA ATTCTCCAAG AAAGTTATCA TACTTCAACA GAAGCTATTT TATTAGCTTG TTGGCAAGTG CTACTGTGGC GGTTAAGCGG TCAGTCGGAA ATTGTGATTG GGGCGGCTGG CGATGGACGC GAATATGATG AATTAAAAGG AGTTTTTGGA CTTTTAACTA AATATTTACC GATTCACTCC CGCTTAGAGG ATGCTTTAAA ATTCTCAGAA CTGATGGAAA GAGTTTCTGA GTCTTTGCGG TTTGTAAAAA AATGGCAAGA GTATTTTAAC TGGGAGAATT TCATCTCTAC TAATGAGAAA TTTGCTGCTC GCCCCTTTTT CCCATTCTGC TTTGATTTTA CACAACAGTC ACAGAGCTAT GTAAATGGAA ATATTGCCTT TTCGGAATAT AAAAAATCTG CCAACATCGA CAAATTCCAA ATTAAGCTAT CTAGCGGTTA TAGCCAAGAT AATTTGGTGG TGAATTTTGA TTACGATGCT AACTTATTTT CGGAGGATAG TATCAAAATT CTCGCGGAAC AATATCTAAC ATTAATCGCA AGTGTCACTT ATAACCCAGA AGCAGCAATC ATTGATTTAA ATATGATCAG TGAAAGCGAT CGCCAGAAAT TATTGATTGA TTTTAACAAT ACAAAAATAG AGTTTCCCCA CGATAAATGT CTTCATGAAC TATTTGCCGC ACAAGTAGAA CGTACCCCCA ACAACATCGC CGTTGAATTC AATCACGAAT CCCTCACTTA TGCCCAACTA AACGCCAAAT CCAACCAGCT AGCGCATCAT CTCCAAAAAT TAGGAGTAAA ACCAGAAGTA TTGGTGGGAA TTTGTGTAGA ACGTTCCTTA GATATGCTGA TTGGTATCCT GGGTATTCTC AAAGCTGGTG GTGCATATAT ACCATTCGAT CCCACCTATC CCCAAGAACG ACTGGGATTT ATGTTAGAAG ATGCTCAAAT TCCCATCCTG TTAACGCAAC AAAGACTGGT AGATAAATTT GTTGAACATA AAACCCAGAT AATTTGTCTG GATCGAGACT TACCAGAAAA TGCCACCCTC AGCATTGACA ACCCTGTCAG CAATGTTACA TCAGAAAATT TAGCTTATAT CATTTACACC TCTGGTTCTA CAGGAAAACC CAAGGGAACG ATGATTCCCC ACAGAGGCTT AGTTAACTAC CTGAGTTGGT GTACAAATGC TTATGCGTTA GCCCAAGGTT ATGGTGCGCC AGTACAATCT TCCATCGGGT TTGACGCGAC AATTACCAGC TTATTCTCAC CGTTATTAGT CGGAAAAAGG GTTGTATTGT TACCTGAGAA GGAAGAAATT GAAGCCTTGT CAGCACTTTT ACAGTCTGAT CAAAACTATA GCTTGGTGAA AATTACACCA GCGCACCTGG AAATGCTCAA TCAAATGTTA CCTAACCATA AAGGTGTCAC GGAAACAAGA GCATTAATCA TCGGTGGTGA AGCATTATTG GGTAAAAGCT TGAACTTTTG GCGCGATAAT GCCCCAAAGA CCAGAATCAT TAACGAATAT GGCCCCACCG AAACTGTTGT TGGTTGTTGT GTTTATGAGG TTGATGAGCA AACTTCTTTA TCAGGTGCAA TTTTAATTGG TCGCCCCATT GCAAATACTC AACTTTATTT GCTGGATGAC AAGCAAAAAC TTGTACCAAT CGGTGTTCCC GGCGAATTAT ATATTGGTGG TGCAGGAGTA GCTAGGGGCT ATCTCAATCG TCCAGAATTA ACACAACAGC GATTTATCCC TAATCCTTTT AGCGATGAAC CTAATTCTCG CCTATACAAG ACAGGAGATT TAGCGCGATA TTTACCTGAT GGTAATATTG AGTACCTCGG ACGCATTGAC CATCAAGTGA AAATTCGTGG TTTCCGTATT GAATTAGAAG AAATTGAGTC CTTACTAGCC CAACATCCCC TAGTAAATGC TGTCACTGTA ATTGCGAGAG AAGACCAACC TGGAGATAAG CGGTTGGTAG GCTATATTGT ACCGAAAGAG CAAGCCCCCA CCAGTAGCGA ACTGCGCCAA TTTTTACAGT CCAAGTTACC CGAATACATG ATACCCTCTG CCTTTGTGAT GCTAGAGGTC ATCCCTCTCA CCACTCATGG TAAGGTAGAC CGCCAAGCCC TACCGCAACC CGATACATCC CGTCTAGATA CACTCAAAGT AGAATTTATC GCACCACGCG ATCGCCTAGA ACTAAAATTA GCCAATATTT GGGAAAATCT ACTCAATGTC CATCCTATAG GTATCAAAGA TAGCTTTTTT GAACTCGGTG GTCACTCTCT GCTGGCTGTG CGCTTGATGG CTCATATTAA CCAAGAGTTT GGTAAAAACT TAGCCTTAGC CACACTTTTC CAAAACCCCA CAATCGAAAA ATTGGCTAAT CTGCTTCGCC AAACATCAGA AATCTCATCT TGGTCTCCCT TAGTAGAAAT TCAAACAGGT AATTCCCAGT ATCCTTTCTT CTGCTTACCT GGAGGCGGTG GTAATGTCCT TTATTTACAT GAGTTGGCGC GTGATTTAGG TTCTGAGCAA ACATTTTACG GTTTACAAGC GCCAGGTTTA AATGGAGAAT CAGACCCCTT GACTAGTGTT GAAGAGATGG CGGCTTATTA CATCCAGGCT ATACAAAGCG TGCAGCCGGA AGGGCCATAT TTTCTCGGTG GTCACTCCTT TGGGGGTATT GTGGCCTATG AAATAGCTCA ACAGTTAGTC AAATCCGGAC ATGAGGTGGC TTTAGTGGCG ATTTTAGACG CTCCAGCCCC AGTTGCTAGT GACAAACCTA TATATATTGA TGTTGATGAT GCAACGCGAT TGACCGAAAC TGCTCGCCTG ATTGAACGTT GGGCAGGTAA AAGCTTAAAT ATCAGCTATG AAATACTCCA GCCACTAGAA CTCGACGCAC AATTGGAATA TCTCAAAGAA CAATTAATTG CCGTTGGTTT ACTCCCGACT GGTACTGAGA CAAAGCAAGT GCGTGGGTTG GTGCAAGTGT TCGAGACCAA TCTGCAAGCT AGTATCAAAT ATTCACCACA GGAAGTTTAT CCCCATCGCC TGACACTGCT GCGGGCTAGG GAAGTGAACG CAGAGGATGC GGCACTATTA ACTGAGTTGC GCCAAGATCC CGCCTGGGGA TGGGGTCAGT TTTCGACTGA AAAAGTCGAT ATTCACATTG TTCCAGGCGA TCATATGACG ATGATGACTC AACCCCACAT CTCATCGGTT GCCAAACAAC TAAGAATCTG CATTGAGCAA ACAAATGTAG GTTCATGA
|
Protein sequence | MHTEMTNLIR ISPQQKHLWL QQKDNYQAYC TQIAINITGN LNSDILKLAL ANLSDRYEIF HTAFHSLPGM NIPFQSISDA PTYNHISLAE YDWSRLSSQV QKAKIDTLFQ DTSLRVFDFE QATNLHFILG KLATDQHILF ISLPALYADK TTLNYLICEI GRAYTACLDG EEISEEVMQY ADVSEWLNEL LESEDTETAR NYWQQQDISS VLNLRIPFEI KNDHQSSGEI SFAPQSLCLA MNRDTVEKLE ILQESYHTST EAILLACWQV LLWRLSGQSE IVIGAAGDGR EYDELKGVFG LLTKYLPIHS RLEDALKFSE LMERVSESLR FVKKWQEYFN WENFISTNEK FAARPFFPFC FDFTQQSQSY VNGNIAFSEY KKSANIDKFQ IKLSSGYSQD NLVVNFDYDA NLFSEDSIKI LAEQYLTLIA SVTYNPEAAI IDLNMISESD RQKLLIDFNN TKIEFPHDKC LHELFAAQVE RTPNNIAVEF NHESLTYAQL NAKSNQLAHH LQKLGVKPEV LVGICVERSL DMLIGILGIL KAGGAYIPFD PTYPQERLGF MLEDAQIPIL LTQQRLVDKF VEHKTQIICL DRDLPENATL SIDNPVSNVT SENLAYIIYT SGSTGKPKGT MIPHRGLVNY LSWCTNAYAL AQGYGAPVQS SIGFDATITS LFSPLLVGKR VVLLPEKEEI EALSALLQSD QNYSLVKITP AHLEMLNQML PNHKGVTETR ALIIGGEALL GKSLNFWRDN APKTRIINEY GPTETVVGCC VYEVDEQTSL SGAILIGRPI ANTQLYLLDD KQKLVPIGVP GELYIGGAGV ARGYLNRPEL TQQRFIPNPF SDEPNSRLYK TGDLARYLPD GNIEYLGRID HQVKIRGFRI ELEEIESLLA QHPLVNAVTV IAREDQPGDK RLVGYIVPKE QAPTSSELRQ FLQSKLPEYM IPSAFVMLEV IPLTTHGKVD RQALPQPDTS RLDTLKVEFI APRDRLELKL ANIWENLLNV HPIGIKDSFF ELGGHSLLAV RLMAHINQEF GKNLALATLF QNPTIEKLAN LLRQTSEISS WSPLVEIQTG NSQYPFFCLP GGGGNVLYLH ELARDLGSEQ TFYGLQAPGL NGESDPLTSV EEMAAYYIQA IQSVQPEGPY FLGGHSFGGI VAYEIAQQLV KSGHEVALVA ILDAPAPVAS DKPIYIDVDD ATRLTETARL IERWAGKSLN ISYEILQPLE LDAQLEYLKE QLIAVGLLPT GTETKQVRGL VQVFETNLQA SIKYSPQEVY PHRLTLLRAR EVNAEDAALL TELRQDPAWG WGQFSTEKVD IHIVPGDHMT MMTQPHISSV AKQLRICIEQ TNVGS
|
| |