Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4834 |
Symbol | |
ID | 3679332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 6076081 |
End bp | 6081021 |
Gene Length | 4941 bp |
Protein Length | 1646 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637720191 |
Product | Beta-ketoacyl synthase |
Protein accession | YP_325326 |
Protein GI | 75911030 |
COG category | [H] Coenzyme transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0001] Glutamate-1-semialdehyde aminotransferase [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.560935 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0357231 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGAATA TGAAAACAGT GGAAAACCAA GAGCCTCTTG ATGGTGTTGC AATTGTTGGC ATGGTAGGAC GATTCCCAGG TGCGAAAAAT GTCCAGGAAT TTTGGGGAAA TCTTTGTGCA GGGAAAGAGT CAACTACTTT CTTTCAAGAT GAGGAGTTAG ATCCCAGCAT AGATCCGCAC CTCGTACAAG ATCCGAGTTA TGTGAAAGCT AGAGGGATAA TTCCTGGGGG AGAAACCTTT GATGCTGCCT TCTTTGGAAT TAATCCCAGA GAAGCCGAAG TGATGGACCC CCAAAGTCGG GTGTTTTTAG AACTAGTTTA TGAAGCCCTG GAAAACGCCG GTTACGATGC AGAAGCATAC AGTGGCTTAA TTGGTTTGTA TGCTGGCTGT GGTCAAAATA CCTATTTCGC TAATCATATC TGCGGTCGCC AGGAAATTAT CGATCGCGTC GGTGAATTTC AGACAATGCT GGCCAATGAA AAGGATTTTT TGACTACCCG CGCTGCCCAT AAGCTCAACC TCAAAGGCCC GGCTGTTAGT ATCAGTACTG CTTGTTCTAC TTCTTTGGTG GCAGTCATTC AAGCCTGTCA AAGCTTAAGT AACTATCAGT GTGATATGGC TTTAGCTGGT GGTGTGTCCA TGACTACACC ACAAAACAGT GGTTATATCG CCCAAGAAGG AACTATGTTA TCTGGGGATG GTCACTGTCG TCCTTTTGAT GCCAAATCTC AAGGCACAAT GTTTAATAAT GGTGCGGGAT TAGTAGTATT AAAGCGTGTA GAAGATGCAA TACAAGATGG CGATCGCATC TATGCAGTCA TTCGAGGCTT TGGTATAAAC AATGACGGTG CTGACAAGGT AAGTTTCACC GCACCTAGTG TAGATGGGCA AGCCGAAGCC ATAGCAATGG CTCAAGCCTA TGCTAACTTT CATCCAGAAA CGATTTCTTA CATTGAAGCT CATGGTACTG CCACATCTCT AGGCGACCCC ATTGAGATTG AAGCTTTGAC CCAAGCCTTT CGCATCCATA CCAATAGCAA ACAATTTTGT GCGATCGGTT CTCTCAAAAG TAATGTGGGA CATTTAGTAG CGGCGGCTGG TGTTGCAGGT TTGATTAAAA CTGCCCTTGC CCTTTATCAT CAGCAGATTC CACCCAGTTT AAACTTTGAG GCTCCAAATC CCAATATTGA TTTTGCCAAT AGTCCTTTTT ATGTCAACAC GCAATTAACT CCTTGGCCGC AAGGAGAAAC GCCCAGGCGT GCTGGTGTGA GTTCCTTTGG TGTGGGTGGG ACTAATGCCC ACATTGTCCT AGAAGAAGCA CCAGCGATTC TACCTTCTAA CTTATCCCGT CCCAGCCAAT TATTATTGCT TTCAGCCAAA ACCAGTACAG CTTTAGAGAC AGCTACAGTA AATTTGCAGG AGCATTTACG AAACAATGCC TCTATCAACT TAGCTGATGT TGCCCATACC CTACAAAGGG GACGTAAAGC CTTTAACTAT CGGCGCTTTG TGGTTTGTCA AAATACTAAA GAGGCCATCA CCACGCTACA GTCTTTAGAA CCCAAGCGCG TGTTTACTCG CCACACACAA GTCCGTGATC CCGAAATTGC CTTCATGTTT CCCGGACAAG GTTCACAATA CGTCAATATG GGATTTAATC TCTACAGCCG CGAAATTGTG TTTCGGCAAG TAGTAGATCA GTGTGCAGAA ATTCTCAAAC CAATATTGGG TAGAGATTTG CGAGAAATTA TTTATCCAGT AGCCACGGAT ATTGAAACTG CATCTACAGC ACTGCAACAA ACCTGCTTTA CCCAGCCTGC ATTATTTGTA GTTGAATATG CCCTGGCGCA ATTGTGGCAA AGTTGGGGAG TGAAGCCCCA AGCAATGATC GGTCACAGTA TTGGGGAATT TGTGGCTGCT TGTCTTGCCG GTGTCTTTAG TTTAGAAGAT GCCTTAATGT TGGTAGCTAA TCGTGGTCGA TTGATGTGGG AGTTACCAAG GGGGGCAATG TTATCTGTGC GTTTGTCCGC CCCAGAGGTA GAAAAACGAT TGGCAGGAGA ATTAGCGATC GCCGCCATCA ACAGCCCTTC TTTATGTGTA GTATCGGGAA CCACAGAAGC GATCGCCGCC TGGCAAACAC AACTAGAGAC AGCAGAAATT GTCTGTCGTC TGTTACACAC ATCCCACGCA TTTCATTCTC CCATGATGGA GGCGATCGTT GCTCCCTTTG CTGAGTTAGT CGGGAAAGTT AAATTATCGC CACCCCAAAT TCCCTTTGTT TCCAGTGTTA CAGGTGATTG GATTACCCCA GAAGAAGCCA CCAATCCCAT GTATTGGGCC CAGCATTTGC GCCAGACAGT CAGATTTGCT GATGGTGTCA AAACTTTATG GCAGCAGCCA GAACGCCTCT TACTAGAAGT AGGGCCACGC ACAACCACCA CCACCCTAGC ACGGCAACAA GCTAAAGATA TTAAACAACA AATAACCATC GCCTCTCTCA GCGACAATGC CGATAACGAG GCAGAATGGA CAGCCCTACT GCAAGCAATG GGGCAACTAT GGTTAGCAGG AGTCACCATC AACTGGAACA ACTTCTATCA AGAAGAAAGA CGACAACGTA TTCCTCTACC TAACTATCCC TTTGAACGTC AACGCTTTTG GATTGATCCT CTACCCCATC CCAACCGTAA TACCAATCAC AAACCTGCCC ATTATCAACT AGAAAAAACT CAAACTATGT CAGCCCAAAC TACACTCATT TCACTGCTAA AAGAAATTAT TGAAGAAACC TCTGGACTAG AAATTGCCAG CGTTGACGCA TCAACAACAT TTCTGGAAAT GGGGTTAGAC TCTTTGTCTC TCACACAAGT TGGACTGGCA TTGAAGAAAA AATTCAAAGT TAAAGTTTCA CTGAGACATT TACTAGAAGT TTATCCAAAT TTAGCAACAC TGGCTGATTT TCTCCAGCAA AACCTATCTG CCGAAGCTTT ATCTGCATTA GTTCCTTCCG AAACAACATC ACCATCCGCC ACCTTAAGCC TGCAAGAAGT TTCGACAAAC GGCTCAAATG GCAAGAATGG CAAGAATGGT TCTACCCCAT CCCCAATGTT GCTAGTACCT CCAACTTCAA CAAATGGAAA TGGATCAAAG AATGGCTCCT TACCCCAAGT TTCTTTGCCC AATGTAGCAT CCAGCGCCCT CGAAGGAGTG ATTAATCAGC AGCTACAAAT TATGGCTCAA CAACTAGCGC TGCTGGGTAA TAACAGTCAA CCTGTAACTG TGCCAGCCGT AAACGGACAA AATAACGGTG TCAAATCAGA AAAACCCGTT ACCCAAAGCA ACCAAAAACC AGAACCACAA GATTCTACAG AGAATCTTCC CAAGAAAGTC TTTGGTGCAG GCGCTCGCAT AGAAACCACT CAGACTAAAA CTCTCACACC GCAACAACGC ACTTATTTAG ACAGCATTAT TCAAAGATAT ACACAAAGAA CTCAAAAATC TAAAGAATAT ACTCAAACCA ATCGCCCTCA TCTAGCAGAT CCCAGGAGTG TTTCCGGCTT TAACCCAACG ATGAAAGAGA TGGTTTACCC AATTGTGGTG TCTCGCTCAT CAGGTTCTAA ACTTTGGGAC ATTGACGGCA ATGAATACGT CGATTTAAGT AACGGTTTTG GCCTGAATTT GTTCGGTTGG TCGCCTGCAT TTGTCACAGA AGCCATCGAA GCCCAGCTTA AGCTCGGTAT GGAAATTGGA CCCCAGACCC CATTGGTGGG AGAAGTTGCC AAGCTGATGT GTGAAATAAC CAACTTCGAT CGCGCCGCTT TTTGTAACAC AGGTTCAGAA GCAGTACTAG GAGCGATGCG CCTGGCACGA ACCATTACAG GACGTAACTT AATCGCTATC TTCTCTGGAG GTTATCACGG TATTTTAGAT GAAGTTATTG TTCGTGGTAC AAAAAAACTG CGATCGATTC CCGCAGCCCC TGGTATCCCC CAAGAAAAAG TAGACAATAT CTTAGTAATT GACTACGATG CCCCCGATGC TTTAGATATT CTCAAAAGTC GGGCTGATGA ATTAGCTGGT GTCATGGTGG AATCGGTACA GAGTCGCCGC CCAGAATACC AGCCAAGGGA ATTTCTCCAA CAATTGCGAC AATTCACTGA ACAAGAAGAT ATTGCCTTGA TTTTTGACGA AATTGTCACC GGCTTTAGGA TTCATCCCGG TGGCGCACAA GCTCATTTTG GCATCAAAGC TGACATTGCT ACCTACGGCA AGATTGTAGG TGGTGGTTTA CCTATTGGAG TCATTGCAGG CAAATCGAAA TACATGGATG CGTTGGATGG TGGCTTTTGG CAATACGGAG ATGATTCTGT TCCAGAAGTT GGTGTGACTT ACTTTGCCGG AACTTTTGTT CGTCACCCCC TAGCATTAGC GGCAGCAAAA GCAGTACTCC AACATCTACT AAAGAATGGA CCAAGCTTGC AACAACAGTT AAACGCTAAA ACTGATCAGT TTGTGGTAGA ACTAACGGAT TATTTCCAAC AAATGCAAGC ACCATACACT GTTCACAATT TCGGTTCTCT GTTTATGGTG AAGTCTGCAC CAGAATTCCT CTACGGAGAC TTATTATTCT ATTTGATGCG GGATAAGGGA GTGCATATTT GGGATCATCG ACCATGCTTC CTCACCACAG CTCATTCTGA TGCTGATTTG TCCTTAGCAA TGGCAGCCTT TAAAGAAAGT ATTGCCGAAA TGCAGTCTGC TGGTTTTTTT TCTGCCCCAA CTCCAACAAC TAAAGGTAGT CCAGAATCAT CTAAAAACCT TCGTAACCGT CCACCCCAAC CCGGAGCTAA ATTAGGGCGA GATCCTGAAG GTAATCCATC CTGGTATGTT CCAGATCCTC AACGACCAGG GAAATATTTA CAAGTCAGTA GTGTTTCTTA A
|
Protein sequence | MVNMKTVENQ EPLDGVAIVG MVGRFPGAKN VQEFWGNLCA GKESTTFFQD EELDPSIDPH LVQDPSYVKA RGIIPGGETF DAAFFGINPR EAEVMDPQSR VFLELVYEAL ENAGYDAEAY SGLIGLYAGC GQNTYFANHI CGRQEIIDRV GEFQTMLANE KDFLTTRAAH KLNLKGPAVS ISTACSTSLV AVIQACQSLS NYQCDMALAG GVSMTTPQNS GYIAQEGTML SGDGHCRPFD AKSQGTMFNN GAGLVVLKRV EDAIQDGDRI YAVIRGFGIN NDGADKVSFT APSVDGQAEA IAMAQAYANF HPETISYIEA HGTATSLGDP IEIEALTQAF RIHTNSKQFC AIGSLKSNVG HLVAAAGVAG LIKTALALYH QQIPPSLNFE APNPNIDFAN SPFYVNTQLT PWPQGETPRR AGVSSFGVGG TNAHIVLEEA PAILPSNLSR PSQLLLLSAK TSTALETATV NLQEHLRNNA SINLADVAHT LQRGRKAFNY RRFVVCQNTK EAITTLQSLE PKRVFTRHTQ VRDPEIAFMF PGQGSQYVNM GFNLYSREIV FRQVVDQCAE ILKPILGRDL REIIYPVATD IETASTALQQ TCFTQPALFV VEYALAQLWQ SWGVKPQAMI GHSIGEFVAA CLAGVFSLED ALMLVANRGR LMWELPRGAM LSVRLSAPEV EKRLAGELAI AAINSPSLCV VSGTTEAIAA WQTQLETAEI VCRLLHTSHA FHSPMMEAIV APFAELVGKV KLSPPQIPFV SSVTGDWITP EEATNPMYWA QHLRQTVRFA DGVKTLWQQP ERLLLEVGPR TTTTTLARQQ AKDIKQQITI ASLSDNADNE AEWTALLQAM GQLWLAGVTI NWNNFYQEER RQRIPLPNYP FERQRFWIDP LPHPNRNTNH KPAHYQLEKT QTMSAQTTLI SLLKEIIEET SGLEIASVDA STTFLEMGLD SLSLTQVGLA LKKKFKVKVS LRHLLEVYPN LATLADFLQQ NLSAEALSAL VPSETTSPSA TLSLQEVSTN GSNGKNGKNG STPSPMLLVP PTSTNGNGSK NGSLPQVSLP NVASSALEGV INQQLQIMAQ QLALLGNNSQ PVTVPAVNGQ NNGVKSEKPV TQSNQKPEPQ DSTENLPKKV FGAGARIETT QTKTLTPQQR TYLDSIIQRY TQRTQKSKEY TQTNRPHLAD PRSVSGFNPT MKEMVYPIVV SRSSGSKLWD IDGNEYVDLS NGFGLNLFGW SPAFVTEAIE AQLKLGMEIG PQTPLVGEVA KLMCEITNFD RAAFCNTGSE AVLGAMRLAR TITGRNLIAI FSGGYHGILD EVIVRGTKKL RSIPAAPGIP QEKVDNILVI DYDAPDALDI LKSRADELAG VMVESVQSRR PEYQPREFLQ QLRQFTEQED IALIFDEIVT GFRIHPGGAQ AHFGIKADIA TYGKIVGGGL PIGVIAGKSK YMDALDGGFW QYGDDSVPEV GVTYFAGTFV RHPLALAAAK AVLQHLLKNG PSLQQQLNAK TDQFVVELTD YFQQMQAPYT VHNFGSLFMV KSAPEFLYGD LLFYLMRDKG VHIWDHRPCF LTTAHSDADL SLAMAAFKES IAEMQSAGFF SAPTPTTKGS PESSKNLRNR PPQPGAKLGR DPEGNPSWYV PDPQRPGKYL QVSSVS
|
| |