Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4745 |
Symbol | |
ID | 3679632 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 5954819 |
End bp | 5959582 |
Gene Length | 4764 bp |
Protein Length | 1587 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637720101 |
Product | Beta-ketoacyl synthase |
Protein accession | YP_325237 |
Protein GI | 75910941 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAACTC AAGACTTAAC ACCTTTACAA AAAGCTATTA TTGCCCTGAA AGAAGCCCGT ACAAAGATTG AAAGCTTAGA ACGGACACAA AGTGAACCAA TCGCCATTGT GGGCATGGGT TGTCGTTTTC CTGGGGATGC CAACAACCCA GAAAAATTTT GGGAGTTATT GCATCAAGGA AAAAATGGCA TTACCACAGT ACCGCCCCAA CGGTGGGATA TTGATGCCTA TTACGATGAT GATCCTGATG TTCCCAATAA AATGTATGCC CGTCATGGGG GCTTTATTAA CAACGTTGAT CAGTTTGATC CGCAGTTTTT TGGCATTACC CCCAGAGAAG CGATCGCCCT TGACCCCCAA CAACGGCTAT TGTTAGAAGT AAGTTGGGAA GCCCTAGAAA ATGCGGGAAT TGCCCCACAA AAGCTGACAG GTACGCAAAC AGGGGTGTTT GTCGGTATTG GTATAGATGA CTATGCTAAA CGGCAAATTA AACATCACAT TCCCATTGAT GCCTATACGG GATCGGGCAA TGCTTTTTGT TTTGCGGCAG GACGTTTATC TTATCTTTTG GGATTGCAAG GGCCGAGTTT AGCCATTGAT ACAGCTTGTT CTACCTCTTT AGTCACCATT CATTTGGCTT GTCAAAGTTT ACGCAATGGT GAGTGTAACT TGGCTTTAGC CGGGGGCGTG AGTTTAATGC TGTCCCCCGA AGTTACCCTG TATTTATCTA AAACCCGCGC CCTTTCTCCT GATGGTCGTT GTAAAACCTT TGACCGAGAT GCCAATGGCT ATGTACGCGG TGAAGGTTGT GGGATGGTAG TTCTCAAACG TCTCAGTAGT GCCGTTGCTG ATGGAGATCA TATTTTAGCG GTAATTCGTG GCTCGGCAGT TAATCAAGAT GGTGCTAGCA GTGGGTTGAC GGTTCCTAAT GGGACAGCTC AACAAGCAGT GATTCGTCAA GCTTTGGCTA ATGCCAAGGT GACACCAGAA CAAATTAGCT ATCTGGAAGC CCACGGGACG GGAACTGCTT TGGGCGATCC CATTGAAGTG CGGGCTATTG ATAATGTTTT TGGTAAAGGA CGTTCCCCAA ATCATCCCCT GATTCTCGGT TCGGTGAAAA CCAATATTGG TCACTTAGAA ATCGCTGCCG GTATGGCAAG TTTGTTGAAA GTGATCTTAT CCTTGCAACA TCAAGAAATT CCCCCCCATC TTCATTTTCA AGAACTCAAC CCCGATTTAG CTGCTTCGGC CAAGTCTTTA AAGATTCCTA CCAGCGTTAT TCCTTGGCAA CCTACAGAAC AACCGAGAAT GGCGGGAATT AGTTCATTTG GGTTAAGTGG AACTAATGCT CATATCATTA TTGAAGAACC ACCCCAGTTA ACGGTTACTC AGGCGGAAGT CGATCGCCCG CTTCATGTTT TAATGTTATC GGCAAAAAGT GAAGCTGCTT TACACACCTT AGCTGCTGAT TGGGAAAATT TGTTACGCAA CCATCCCGAA ACGAATTTTG CCGATTTGGC ATTTAGTGCC AATACCGGAA GGGGAAGCTT TAATCATCGC TTGGCAATCG TCGCCCAATC AACCGCACAA GCAAGGCAAA ATTTAGCCGC TTTTAATCAA AAACAACCAT CCTTGAACGT TTTCAGTCAA GAAGTAGAAA AAGGACGACA ACCGAAAATA GCCTTTTTAT TCACCGGACA AGGATCTCAA TATGTGGGTA TGGGCAGACA ACTCTATGAA ACCCAACCCA CATTCCGTCA AGCGTTAGAC GAATGCGATC GCTTATTACA ACCTTACTTA AAAGAATCCC TATTAAGTGT TTTATATCCC CAAACTCCAA CAGCCAACCC CCTAGTAAAC CAAACAGCTT ATACCCAAAC TGCCTTATTT GCTATTGAGT ATGCTTTGTG TAAACTATGG CAATCATGGG GCATTCAACC CCAAGGAGTC TTGGGTCATA GCGTAGGTGA ATATGTAGCG GCTTGCATCG CTGGAGTATA TAGTCTCCAA GAAGGCATAG AATTAATTGC CCAACGGGGA CAACTAATGC AGGCGTTACC CCAAACAGGG ACGATGGCGG CTGTATTTGC CCCTGTGGAA ACAGTGGCCA GAGCGATCGC TCCTTACGGA AATGAAGTAA CGATTGCTAC TATTAATAGC CCGGAAAATG TGGTAATTTC TGGGGTGAAA GCAGCGATCG CTGCGGTACA AGCTGATTTG ATTGCCCAGG GAATTGATGT CCGTCCTCTG CAAGTCTCCC ACGCCTTTCA TTCTCCCATG ATGGAACCCA TGTTAGGAGA ATTTAAACAA GTAGCAGCGA AGATTAATTA CCAAACTCCT CGCATTGATT GGATTTCTAG CGTTACAGGG GCAGAGATTA CCCACAGTAT CGATGCAGAA TATTGGTGTC AGCAAATACG TGACTGTGTG CAGTTTGCAC CTGCGATCGC CACATTAGCC CAACAAGGTT ATGATTTGTT GATGGAAATT GGCCCCCATC CTGTTTTAAC TAGATTGGGA AAACAAACTT TATCTGATCC CCAAATCCTC TGGCTATCAT CTCTGCATCG AGAACAAGAC AATTGGCAAT CTTTATTACA AAGTGTTGCC ACCTTATCAG TTCATGGAGT AGCACTTGAT TGGTCTGGGT TTGAGCAAGA TTATGTTCGT CGTCGTCTGC TTGTACCTAC CTATCCTTTC CAAAGACAAC GGTATTGGTT AGCCGAAACA GAATTTATTC AACCAGAAGT TGTACCTGTT GCTGCCATTT CCCCAGAAAC ATCAAAAATT GTTGCAGCCA CAGAAACCTT AGAAAGCCAG ATTCTCTCGT TAGTAGCGAA GATTACGGGG ATGAATCCTC AACAACTCAG TCTGGATGCC ACTTTAGAAG GGGGTTTAGG CTTAGACTCA ATTATGATGA CCCAATTAAT GAATGGGATT ATCAAATTTA TTCCCCCAGG ACAAAGAGAA AGTTTCCATC AGGTATTTTC GCTGCGGGAT TTAATGCAAA TATCCAATTT AGGAGAACTA TTAAAGGTTC TAGAACCCTG GCAAACTGTT GATTTACAAG AGACAACAGC AGTTGATGTA ATCCCATCTG CTGATATTCC CACAATAGAT CAGACAAGTG ACGTTGTAGA CATTCTCCAT AGTCAATTAC CCTTATTAGT GAGTTACTGG AGTCTCAATT CCAATAGCTT GTTTACTAAG GTACAAGTTG CAGGAGATTT TAACCTAGAA ATTGCTCAAC ACAGTTGGCA AAAACTGATA GATAGACATC CCATGTTAAG GGCGCGGTTT CACATTCCCC AAGGGGCAAC AAGCTTTGCA GACTATCAGC TGCAAGTGTT GAAAAATCCC AGCCCTCCGG CTATTCCCCT CAAGGATATC AGACATCTTG CCTCTGAGGA ACAAACACAA GCGATCGCTC AAGAAGTACA CCATTGGTTA AATTATCGCT GGTCATTAAC CCAATGGCCA CTACATAAGT TCTCAGTGTT ACAACTATCT GATTCAGTCT ATCAAATGTT CTTGGGGAAT GAACATTTGA TTGCTGATGG TTTGAGTAAT CATGTAATTA TCCGAGAATT TCTGGAAATA TACCGCGCTT GTATTGATCA AGATACCCCT GACCTTCCCC CGGCTTTATC AGTCGCTGAT TATCAAGCCC AGGTGCAAGC AATGAACGCT TGGCAAGATG TTGATGAAGA CCGAGCTTTA GCAGCATACA ACAACTCTCA ACGTCACACT GCTTACCTGT GGAATCCCCA ACAACAGATT CGTCAGCAAA CTCCTTTATT TGATAATCAA AAATATATCC TGTCTGCGGA AACCACTGCC AAGTTAATCA CCAAAACTCG TGAATGGCGA GTACCGATGA ATACCCTCCT CTTAGGGGCA TTTATTCAAA CTGTGTCCAA ATTGGATACC ACCTCAGAAA AAATTGGTAT TCAAATCCCT ACAAGTGGTC GGGTGTATCC GGGAGTGGAT GCGTCAGGAG TAATTAGTAG TTTTGCTCAA AATTTAGCCC TGAGTTTTAC CCCACCCCAA GCCCAACAAG ATTGGCAGAC ATTTCTTACC GAAATTCAGC AAACGGTACA ACAACATATT GGTAGTGGTT TAGACAGAGC TCAAACCAGA CAGATGGGAG TAATTTTCCG GGATAGTTTC GTCTTAGAAA ATGGCAGGAT TCCAGACCAC AGCCTTTCGT TAATCCAAGG AGCTTTAAAA TCTAACTTGT ATTTGCCTTA TACAGGTCAA ACCCACATTC ATCATTACTA TGGGTCTTTA TCTGTGACTG AGTATCAAGC AGGAGGGATG AATGCTTCCG GAACAATTGA TATTTTGCAA GAAATTTTTG ATAGTCGTCT GCACTTGTTT GCTAGCTACG ATAGCAACAC CTTTGATTTG TCTGTAATTG ATAGTTTGAT GAAATCCTAT CTAGCACAAA TTGAAGAATT AGCTACCTTA CCAATTGAAG AACAAGTTTC CTCAGCGTTT GTCTCTCCTA TTTTTATCAA TAAAGATATT GGGAAAAACT TACGCCAAAT TACTTCTGAA ATTTGCCATT GGGCAATTGA AGAATCAGAA ATGAGTGATG ATTTGGAAGC GGATTTAGGA CTCGATTCTT TAGAGCTAAT TCGTTTGGTG ACTCGCTTAG AAAGTGTGTA TGGCAAAAAT TATCGTCAAT GCCTCCTGAA TTGCCGCACT TTAGAGGAGA TGGTGGTTGT TTTGAGTGCT GAATCCCTAG CTATCAGTGC TTAG
|
Protein sequence | MQTQDLTPLQ KAIIALKEAR TKIESLERTQ SEPIAIVGMG CRFPGDANNP EKFWELLHQG KNGITTVPPQ RWDIDAYYDD DPDVPNKMYA RHGGFINNVD QFDPQFFGIT PREAIALDPQ QRLLLEVSWE ALENAGIAPQ KLTGTQTGVF VGIGIDDYAK RQIKHHIPID AYTGSGNAFC FAAGRLSYLL GLQGPSLAID TACSTSLVTI HLACQSLRNG ECNLALAGGV SLMLSPEVTL YLSKTRALSP DGRCKTFDRD ANGYVRGEGC GMVVLKRLSS AVADGDHILA VIRGSAVNQD GASSGLTVPN GTAQQAVIRQ ALANAKVTPE QISYLEAHGT GTALGDPIEV RAIDNVFGKG RSPNHPLILG SVKTNIGHLE IAAGMASLLK VILSLQHQEI PPHLHFQELN PDLAASAKSL KIPTSVIPWQ PTEQPRMAGI SSFGLSGTNA HIIIEEPPQL TVTQAEVDRP LHVLMLSAKS EAALHTLAAD WENLLRNHPE TNFADLAFSA NTGRGSFNHR LAIVAQSTAQ ARQNLAAFNQ KQPSLNVFSQ EVEKGRQPKI AFLFTGQGSQ YVGMGRQLYE TQPTFRQALD ECDRLLQPYL KESLLSVLYP QTPTANPLVN QTAYTQTALF AIEYALCKLW QSWGIQPQGV LGHSVGEYVA ACIAGVYSLQ EGIELIAQRG QLMQALPQTG TMAAVFAPVE TVARAIAPYG NEVTIATINS PENVVISGVK AAIAAVQADL IAQGIDVRPL QVSHAFHSPM MEPMLGEFKQ VAAKINYQTP RIDWISSVTG AEITHSIDAE YWCQQIRDCV QFAPAIATLA QQGYDLLMEI GPHPVLTRLG KQTLSDPQIL WLSSLHREQD NWQSLLQSVA TLSVHGVALD WSGFEQDYVR RRLLVPTYPF QRQRYWLAET EFIQPEVVPV AAISPETSKI VAATETLESQ ILSLVAKITG MNPQQLSLDA TLEGGLGLDS IMMTQLMNGI IKFIPPGQRE SFHQVFSLRD LMQISNLGEL LKVLEPWQTV DLQETTAVDV IPSADIPTID QTSDVVDILH SQLPLLVSYW SLNSNSLFTK VQVAGDFNLE IAQHSWQKLI DRHPMLRARF HIPQGATSFA DYQLQVLKNP SPPAIPLKDI RHLASEEQTQ AIAQEVHHWL NYRWSLTQWP LHKFSVLQLS DSVYQMFLGN EHLIADGLSN HVIIREFLEI YRACIDQDTP DLPPALSVAD YQAQVQAMNA WQDVDEDRAL AAYNNSQRHT AYLWNPQQQI RQQTPLFDNQ KYILSAETTA KLITKTREWR VPMNTLLLGA FIQTVSKLDT TSEKIGIQIP TSGRVYPGVD ASGVISSFAQ NLALSFTPPQ AQQDWQTFLT EIQQTVQQHI GSGLDRAQTR QMGVIFRDSF VLENGRIPDH SLSLIQGALK SNLYLPYTGQ THIHHYYGSL SVTEYQAGGM NASGTIDILQ EIFDSRLHLF ASYDSNTFDL SVIDSLMKSY LAQIEELATL PIEEQVSSAF VSPIFINKDI GKNLRQITSE ICHWAIEESE MSDDLEADLG LDSLELIRLV TRLESVYGKN YRQCLLNCRT LEEMVVVLSA ESLAISA
|
| |