Gene Information |
Locus tag | Ava_4837 |
Symbol | |
ID | 3679335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 6083299 |
End bp | 6086580 |
Gene Length | 3282 bp |
Protein Length | 1093 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637720194 |
Product | amino acid adenylation |
Protein accession | YP_325329 |
Protein GI | 75911033 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
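The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(6083299, 6086580)
print(length, protein_length(length))
```

With the values from this record, the sketch gives 3282 bp and 1093 aa, matching the Gene Length and Protein Length fields.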
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0536139 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAT TACCTATAGA AAATCAAACT GAATTGACCG CTATTGATTT CGATCCATTT GCAGATGGTG AATTATTATT AACTGCCCCT GCTACCGAAT CTCAAAAAGA AATTTGGGCA TCTGTACAGA TGGGGAGTGC TGCTAATTGT GCTTATAACG AATCCCAATC CCTAAACTTA AAAGGACAAT TCAATTTAGA AGCTTTTCAA TCTGCTATCC AACAGCTAGT TCAACGTCAT GAGGCTTTAC GAACAACCTT TAGTACTAAT GGTAGTACAC TCTGTATTGT GGCTTCGTTA CTAATTGATA TTCCTTTTAT TGATGTTTCT AATTTAGAGG TTCAAGAGAG AGAAATAAAA GTAGCTGAAA TCTTACAGCA AGAAGTAGGA CAACCTTTTG ATTTAGAACA TGGCCCACTA TTTCGAGCAA AAATAATTAA ATTGCATCCC CAAGAACATT TAGTTACTTT AACTGCCCAT CATATTATTT GTGATGGTTG GTCTTGGGGG GTGCTGATGC CTGATTTGGG TCAGATATAT TCTGATTTGC TGCAAGGGAT AATGCCTGAT TTAGAAGAGC CAGACAAATT TAGTGAATAC GCTATTTTAC AGGAAGAAGA AGAACAAACA TCAGAAGCGA TCGCCACAGA ACAATATTGG TTAGAGCAAT TTTCCGACTC AGTACCCGTA GTAGACTTTC CTACAGATCG TCCCCGTCCA CCACTGAGGA CTTTTAACGC CGCCCGTGAA GACTGGCAAT TAAAACCCGA ACTAGTTACC CACCTCAAAC AACTAGGAAC AAAATTTGGT TGTAGCTTTA TGACAACTAT TTTGGCAGGG TTTGAAGTGT GGCTGCATCG ATTGACCGGA CAAAATGATC TAGTGGTCGG TATTCCCGCC GCCGGACAAG CTGCCTCAGG ACAGTATAAT CTTGTAGGGC ATTGCGTAAA TTTATTACCA CTGCGGACTC AGATAGATAG CCAAAAATCT TTTAGTGACT ACTTACAAAC TAGACGTTCC ACCATCTTAG ATGCCTACGA TCATCAACAA TTCACCTTTG GTGGCCTAGT CAAAAAACTG TCACTGCCAC GGGATTCTAG CCGGATTCCT TTAGTGCCAA TTACATTCAA TATTGATCAA GGATTGGATA GTGATAAATT ACCCTTCGCT GGACTGGAAG TAGAATTTTT CTCTAATCCT CGCTCATTTG AAAATTTTGA ACTGTTTATC AACGCTACAG AATTGCGTGG TCAACTGACA CTGGAATGTC AGTACAACAC TAATTTATTT GATGCTGATA CTATCCGCCG TCGCCTGGCA GAGTTGGAAA CCTTGCTATT GGGTATAGTG TCTAACCCCG ATCAAATTAT TGCTAAGTTG CCCATCTTAC CAGCCGTTGA GCAACAGTTA TTAGCCGCAT GGAACCAGAC TCAAACTGAC TATCCCCAAG ATAAATCCAT ACACCAGTTA TTTGCGGAAC AGGTAGAGCG CACCCCCGAC GCTGTGGCGT TGATATTTCA AGAGCAACAA CTAACTTATC GAGAATTAAA TATTCGAGCC AATAAATTGG CGCAATATCT GCAAACACTA GGAGTCGGGG CAGACGTTCT GGTGGGTATC TGCGTGGAGC GTTCCTTAGA AATGGTGGTG GGTCTTTTAG GCATCCTAAA AGCAGGTGGC GCTTATGTAC CACTAGATCC TGGTTATCCA CCAGAACGCC TGGCTTTCAT GCTGGCAGAC ACCCAAATTC AATTATTATT GACTCAAAAA CACCTAGTTG AAAAGCTACC TACCCACACT 
GCAACATTAA TTTGTTTAGA TACCGATGGG GATACGATCA ACCAAACAAC AGTAGAAAAT CCCGTTAGTA ACGTCACAGC CAATAACTTG GCTTATGTGA TGTACACATC TGGTTCTACA GGTCAGCCCA AAGGTGTCAG TGTTATTCAT CAAGGCGTAG TGAGATTAGT CAAAGAAACT AACTATGTCA GCCTCACCAA TAAAGAAGTA TTTCTGCAAA TTAGCCCCAT CTCTTTCGAC GCTTCAACTT TTGAAATTTG GGGTTGTTTG CTCAACGGTG GAAAACTAGT AATATTCCCT CCAAACACAC CATCTTTAGA CGAATTAGAG CAAATTATTC AACAACACCA GGTAACAACT CTGTGGTTAA CGGCTGGGTT ATTTCATCTG ATAGTCGATG AAAAAATTGA TGCGTTAAAA CCCTTACGTC AATTATTAGC AGGTGGTGAT GTTTTATCAG TTTCCCACGT CCAGAAATTC CTTAGCACAG TAGAGAACTG TAAATTAATT AATGGTTATG GACCAACGGA AAACACAACT TTTACTTGCT GTTATGACAT CACAGCACCA CTAAAACCAG GGGCTTCTAT TCCCATTGGT CGCCCGATCG CCAATAGCCA AGTTTATATA TTAGACTCTC ATTTACAACC AGTTCCCATT GGGGTCACAG GTGAACTGTA CATTGGTGGG GATGGTCTAG CGCGAGAATA TTTCCATCGC CCTGATTTAA CTGCTGAGAG ATTTATTCTT CATCCTTTTA ACTCAGATCC TCAATCACGC TTATATAAAA GTGGTGATTT GGCACGCTAT TTACCCGATG GCGAGATTGA ATACCTGGGT CGGATTGATA ATCAGGTAAA AGTCAGTGGT TTCCGCATAG AATTGGGTGA AATTGAAATC GCCTTTTTAC AACATCCAGC AGTCAAAGAA GCTGTTGTAG TTGTTAGAGA AGATATTCCT GGAGAGAAAC TCTTAGTTGG TTATTTTGTT GCAGAAACTG GTCAAGATAA CTTGTCAATC ATCTCGGAAT TACGTCAATT TTTAAAACAA CAACTCCCAG AATATATGGT GCCAAAGGTT TTTATGGCAC TGGAAGCTTT ACCACTAAAT GCTAACGGCA AAGTTGATCG CCGAGTACTA CCAAAACCTG ATTCTTCTCG TCCAGAATTA GAAGCAAATT ATGTCGCACC GCGCACTCCC ATTGAACAAC AAATTGCTGA CATCTGGACG CAGGTTCTCA ATATCAAACG GGTGGGAATT TACGACAACT TTTTTGAACT GGGTGGATAT TCTCTAGTAG GCATCCAAGT AATATCTAGA CTACGCCAAG CCCTGCAAGT AGAAATCTTG ATGTCCAACT TATTTGAACT ACCAACCGTG GCAGATTTAG CAGAACGAGT AGAGACTCTG CGTTGGGCAA CTCAGAACGT TCAAGCTACT GGAGTTGAAA CCGCAGATGA TTACGAGGAA GGTGAGCTAT GA
|
Protein sequence | MKILPIENQT ELTAIDFDPF ADGELLLTAP ATESQKEIWA SVQMGSAANC AYNESQSLNL KGQFNLEAFQ SAIQQLVQRH EALRTTFSTN GSTLCIVASL LIDIPFIDVS NLEVQEREIK VAEILQQEVG QPFDLEHGPL FRAKIIKLHP QEHLVTLTAH HIICDGWSWG VLMPDLGQIY SDLLQGIMPD LEEPDKFSEY AILQEEEEQT SEAIATEQYW LEQFSDSVPV VDFPTDRPRP PLRTFNAARE DWQLKPELVT HLKQLGTKFG CSFMTTILAG FEVWLHRLTG QNDLVVGIPA AGQAASGQYN LVGHCVNLLP LRTQIDSQKS FSDYLQTRRS TILDAYDHQQ FTFGGLVKKL SLPRDSSRIP LVPITFNIDQ GLDSDKLPFA GLEVEFFSNP RSFENFELFI NATELRGQLT LECQYNTNLF DADTIRRRLA ELETLLLGIV SNPDQIIAKL PILPAVEQQL LAAWNQTQTD YPQDKSIHQL FAEQVERTPD AVALIFQEQQ LTYRELNIRA NKLAQYLQTL GVGADVLVGI CVERSLEMVV GLLGILKAGG AYVPLDPGYP PERLAFMLAD TQIQLLLTQK HLVEKLPTHT ATLICLDTDG DTINQTTVEN PVSNVTANNL AYVMYTSGST GQPKGVSVIH QGVVRLVKET NYVSLTNKEV FLQISPISFD ASTFEIWGCL LNGGKLVIFP PNTPSLDELE QIIQQHQVTT LWLTAGLFHL IVDEKIDALK PLRQLLAGGD VLSVSHVQKF LSTVENCKLI NGYGPTENTT FTCCYDITAP LKPGASIPIG RPIANSQVYI LDSHLQPVPI GVTGELYIGG DGLAREYFHR PDLTAERFIL HPFNSDPQSR LYKSGDLARY LPDGEIEYLG RIDNQVKVSG FRIELGEIEI AFLQHPAVKE AVVVVREDIP GEKLLVGYFV AETGQDNLSI ISELRQFLKQ QLPEYMVPKV FMALEALPLN ANGKVDRRVL PKPDSSRPEL EANYVAPRTP IEQQIADIWT QVLNIKRVGI YDNFFELGGY SLVGIQVISR LRQALQVEIL MSNLFELPTV ADLAERVETL RWATQNVQAT GVETADDYEE GEL