Gene Ava_4745 details


Gene Information

Locus tag: Ava_4745
Symbol:
ID: 3679632
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5954819
End bp: 5959582
Gene Length: 4764 bp
Protein Length: 1587 aa
Translation table: 11
GC content: 43%
IMG OID: 637720101
Product: Beta-ketoacyl synthase
Protein accession: YP_325237
Protein GI: 75910941
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID:
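
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; the record does not state either convention, but its numbers are consistent with both. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(5954819, 5959582)
print(length)                     # 4764
print(protein_length_aa(length))  # 1587
```

Both results agree with the Gene Length (4764 bp) and Protein Length (1587 aa) fields.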


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAACTC AAGACTTAAC ACCTTTACAA AAAGCTATTA TTGCCCTGAA AGAAGCCCGT 
ACAAAGATTG AAAGCTTAGA ACGGACACAA AGTGAACCAA TCGCCATTGT GGGCATGGGT
TGTCGTTTTC CTGGGGATGC CAACAACCCA GAAAAATTTT GGGAGTTATT GCATCAAGGA
AAAAATGGCA TTACCACAGT ACCGCCCCAA CGGTGGGATA TTGATGCCTA TTACGATGAT
GATCCTGATG TTCCCAATAA AATGTATGCC CGTCATGGGG GCTTTATTAA CAACGTTGAT
CAGTTTGATC CGCAGTTTTT TGGCATTACC CCCAGAGAAG CGATCGCCCT TGACCCCCAA
CAACGGCTAT TGTTAGAAGT AAGTTGGGAA GCCCTAGAAA ATGCGGGAAT TGCCCCACAA
AAGCTGACAG GTACGCAAAC AGGGGTGTTT GTCGGTATTG GTATAGATGA CTATGCTAAA
CGGCAAATTA AACATCACAT TCCCATTGAT GCCTATACGG GATCGGGCAA TGCTTTTTGT
TTTGCGGCAG GACGTTTATC TTATCTTTTG GGATTGCAAG GGCCGAGTTT AGCCATTGAT
ACAGCTTGTT CTACCTCTTT AGTCACCATT CATTTGGCTT GTCAAAGTTT ACGCAATGGT
GAGTGTAACT TGGCTTTAGC CGGGGGCGTG AGTTTAATGC TGTCCCCCGA AGTTACCCTG
TATTTATCTA AAACCCGCGC CCTTTCTCCT GATGGTCGTT GTAAAACCTT TGACCGAGAT
GCCAATGGCT ATGTACGCGG TGAAGGTTGT GGGATGGTAG TTCTCAAACG TCTCAGTAGT
GCCGTTGCTG ATGGAGATCA TATTTTAGCG GTAATTCGTG GCTCGGCAGT TAATCAAGAT
GGTGCTAGCA GTGGGTTGAC GGTTCCTAAT GGGACAGCTC AACAAGCAGT GATTCGTCAA
GCTTTGGCTA ATGCCAAGGT GACACCAGAA CAAATTAGCT ATCTGGAAGC CCACGGGACG
GGAACTGCTT TGGGCGATCC CATTGAAGTG CGGGCTATTG ATAATGTTTT TGGTAAAGGA
CGTTCCCCAA ATCATCCCCT GATTCTCGGT TCGGTGAAAA CCAATATTGG TCACTTAGAA
ATCGCTGCCG GTATGGCAAG TTTGTTGAAA GTGATCTTAT CCTTGCAACA TCAAGAAATT
CCCCCCCATC TTCATTTTCA AGAACTCAAC CCCGATTTAG CTGCTTCGGC CAAGTCTTTA
AAGATTCCTA CCAGCGTTAT TCCTTGGCAA CCTACAGAAC AACCGAGAAT GGCGGGAATT
AGTTCATTTG GGTTAAGTGG AACTAATGCT CATATCATTA TTGAAGAACC ACCCCAGTTA
ACGGTTACTC AGGCGGAAGT CGATCGCCCG CTTCATGTTT TAATGTTATC GGCAAAAAGT
GAAGCTGCTT TACACACCTT AGCTGCTGAT TGGGAAAATT TGTTACGCAA CCATCCCGAA
ACGAATTTTG CCGATTTGGC ATTTAGTGCC AATACCGGAA GGGGAAGCTT TAATCATCGC
TTGGCAATCG TCGCCCAATC AACCGCACAA GCAAGGCAAA ATTTAGCCGC TTTTAATCAA
AAACAACCAT CCTTGAACGT TTTCAGTCAA GAAGTAGAAA AAGGACGACA ACCGAAAATA
GCCTTTTTAT TCACCGGACA AGGATCTCAA TATGTGGGTA TGGGCAGACA ACTCTATGAA
ACCCAACCCA CATTCCGTCA AGCGTTAGAC GAATGCGATC GCTTATTACA ACCTTACTTA
AAAGAATCCC TATTAAGTGT TTTATATCCC CAAACTCCAA CAGCCAACCC CCTAGTAAAC
CAAACAGCTT ATACCCAAAC TGCCTTATTT GCTATTGAGT ATGCTTTGTG TAAACTATGG
CAATCATGGG GCATTCAACC CCAAGGAGTC TTGGGTCATA GCGTAGGTGA ATATGTAGCG
GCTTGCATCG CTGGAGTATA TAGTCTCCAA GAAGGCATAG AATTAATTGC CCAACGGGGA
CAACTAATGC AGGCGTTACC CCAAACAGGG ACGATGGCGG CTGTATTTGC CCCTGTGGAA
ACAGTGGCCA GAGCGATCGC TCCTTACGGA AATGAAGTAA CGATTGCTAC TATTAATAGC
CCGGAAAATG TGGTAATTTC TGGGGTGAAA GCAGCGATCG CTGCGGTACA AGCTGATTTG
ATTGCCCAGG GAATTGATGT CCGTCCTCTG CAAGTCTCCC ACGCCTTTCA TTCTCCCATG
ATGGAACCCA TGTTAGGAGA ATTTAAACAA GTAGCAGCGA AGATTAATTA CCAAACTCCT
CGCATTGATT GGATTTCTAG CGTTACAGGG GCAGAGATTA CCCACAGTAT CGATGCAGAA
TATTGGTGTC AGCAAATACG TGACTGTGTG CAGTTTGCAC CTGCGATCGC CACATTAGCC
CAACAAGGTT ATGATTTGTT GATGGAAATT GGCCCCCATC CTGTTTTAAC TAGATTGGGA
AAACAAACTT TATCTGATCC CCAAATCCTC TGGCTATCAT CTCTGCATCG AGAACAAGAC
AATTGGCAAT CTTTATTACA AAGTGTTGCC ACCTTATCAG TTCATGGAGT AGCACTTGAT
TGGTCTGGGT TTGAGCAAGA TTATGTTCGT CGTCGTCTGC TTGTACCTAC CTATCCTTTC
CAAAGACAAC GGTATTGGTT AGCCGAAACA GAATTTATTC AACCAGAAGT TGTACCTGTT
GCTGCCATTT CCCCAGAAAC ATCAAAAATT GTTGCAGCCA CAGAAACCTT AGAAAGCCAG
ATTCTCTCGT TAGTAGCGAA GATTACGGGG ATGAATCCTC AACAACTCAG TCTGGATGCC
ACTTTAGAAG GGGGTTTAGG CTTAGACTCA ATTATGATGA CCCAATTAAT GAATGGGATT
ATCAAATTTA TTCCCCCAGG ACAAAGAGAA AGTTTCCATC AGGTATTTTC GCTGCGGGAT
TTAATGCAAA TATCCAATTT AGGAGAACTA TTAAAGGTTC TAGAACCCTG GCAAACTGTT
GATTTACAAG AGACAACAGC AGTTGATGTA ATCCCATCTG CTGATATTCC CACAATAGAT
CAGACAAGTG ACGTTGTAGA CATTCTCCAT AGTCAATTAC CCTTATTAGT GAGTTACTGG
AGTCTCAATT CCAATAGCTT GTTTACTAAG GTACAAGTTG CAGGAGATTT TAACCTAGAA
ATTGCTCAAC ACAGTTGGCA AAAACTGATA GATAGACATC CCATGTTAAG GGCGCGGTTT
CACATTCCCC AAGGGGCAAC AAGCTTTGCA GACTATCAGC TGCAAGTGTT GAAAAATCCC
AGCCCTCCGG CTATTCCCCT CAAGGATATC AGACATCTTG CCTCTGAGGA ACAAACACAA
GCGATCGCTC AAGAAGTACA CCATTGGTTA AATTATCGCT GGTCATTAAC CCAATGGCCA
CTACATAAGT TCTCAGTGTT ACAACTATCT GATTCAGTCT ATCAAATGTT CTTGGGGAAT
GAACATTTGA TTGCTGATGG TTTGAGTAAT CATGTAATTA TCCGAGAATT TCTGGAAATA
TACCGCGCTT GTATTGATCA AGATACCCCT GACCTTCCCC CGGCTTTATC AGTCGCTGAT
TATCAAGCCC AGGTGCAAGC AATGAACGCT TGGCAAGATG TTGATGAAGA CCGAGCTTTA
GCAGCATACA ACAACTCTCA ACGTCACACT GCTTACCTGT GGAATCCCCA ACAACAGATT
CGTCAGCAAA CTCCTTTATT TGATAATCAA AAATATATCC TGTCTGCGGA AACCACTGCC
AAGTTAATCA CCAAAACTCG TGAATGGCGA GTACCGATGA ATACCCTCCT CTTAGGGGCA
TTTATTCAAA CTGTGTCCAA ATTGGATACC ACCTCAGAAA AAATTGGTAT TCAAATCCCT
ACAAGTGGTC GGGTGTATCC GGGAGTGGAT GCGTCAGGAG TAATTAGTAG TTTTGCTCAA
AATTTAGCCC TGAGTTTTAC CCCACCCCAA GCCCAACAAG ATTGGCAGAC ATTTCTTACC
GAAATTCAGC AAACGGTACA ACAACATATT GGTAGTGGTT TAGACAGAGC TCAAACCAGA
CAGATGGGAG TAATTTTCCG GGATAGTTTC GTCTTAGAAA ATGGCAGGAT TCCAGACCAC
AGCCTTTCGT TAATCCAAGG AGCTTTAAAA TCTAACTTGT ATTTGCCTTA TACAGGTCAA
ACCCACATTC ATCATTACTA TGGGTCTTTA TCTGTGACTG AGTATCAAGC AGGAGGGATG
AATGCTTCCG GAACAATTGA TATTTTGCAA GAAATTTTTG ATAGTCGTCT GCACTTGTTT
GCTAGCTACG ATAGCAACAC CTTTGATTTG TCTGTAATTG ATAGTTTGAT GAAATCCTAT
CTAGCACAAA TTGAAGAATT AGCTACCTTA CCAATTGAAG AACAAGTTTC CTCAGCGTTT
GTCTCTCCTA TTTTTATCAA TAAAGATATT GGGAAAAACT TACGCCAAAT TACTTCTGAA
ATTTGCCATT GGGCAATTGA AGAATCAGAA ATGAGTGATG ATTTGGAAGC GGATTTAGGA
CTCGATTCTT TAGAGCTAAT TCGTTTGGTG ACTCGCTTAG AAAGTGTGTA TGGCAAAAAT
TATCGTCAAT GCCTCCTGAA TTGCCGCACT TTAGAGGAGA TGGTGGTTGT TTTGAGTGCT
GAATCCCTAG CTATCAGTGC TTAG
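
The record reports a GC content of 43%. A minimal sketch for recomputing that figure from the gene sequence above follows. It assumes the sequence is pasted as printed, in space-separated blocks of ten bases, and that GC content means the share of G and C among all bases. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Small illustration: 6 of the 8 bases are G or C.
print(gc_percent("ATGC GCGC"))  # 75.0
```

Passing the full gene sequence to `gc_percent` and rounding the result gives a value to compare with the GC content field.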
 
Protein sequence
MQTQDLTPLQ KAIIALKEAR TKIESLERTQ SEPIAIVGMG CRFPGDANNP EKFWELLHQG 
KNGITTVPPQ RWDIDAYYDD DPDVPNKMYA RHGGFINNVD QFDPQFFGIT PREAIALDPQ
QRLLLEVSWE ALENAGIAPQ KLTGTQTGVF VGIGIDDYAK RQIKHHIPID AYTGSGNAFC
FAAGRLSYLL GLQGPSLAID TACSTSLVTI HLACQSLRNG ECNLALAGGV SLMLSPEVTL
YLSKTRALSP DGRCKTFDRD ANGYVRGEGC GMVVLKRLSS AVADGDHILA VIRGSAVNQD
GASSGLTVPN GTAQQAVIRQ ALANAKVTPE QISYLEAHGT GTALGDPIEV RAIDNVFGKG
RSPNHPLILG SVKTNIGHLE IAAGMASLLK VILSLQHQEI PPHLHFQELN PDLAASAKSL
KIPTSVIPWQ PTEQPRMAGI SSFGLSGTNA HIIIEEPPQL TVTQAEVDRP LHVLMLSAKS
EAALHTLAAD WENLLRNHPE TNFADLAFSA NTGRGSFNHR LAIVAQSTAQ ARQNLAAFNQ
KQPSLNVFSQ EVEKGRQPKI AFLFTGQGSQ YVGMGRQLYE TQPTFRQALD ECDRLLQPYL
KESLLSVLYP QTPTANPLVN QTAYTQTALF AIEYALCKLW QSWGIQPQGV LGHSVGEYVA
ACIAGVYSLQ EGIELIAQRG QLMQALPQTG TMAAVFAPVE TVARAIAPYG NEVTIATINS
PENVVISGVK AAIAAVQADL IAQGIDVRPL QVSHAFHSPM MEPMLGEFKQ VAAKINYQTP
RIDWISSVTG AEITHSIDAE YWCQQIRDCV QFAPAIATLA QQGYDLLMEI GPHPVLTRLG
KQTLSDPQIL WLSSLHREQD NWQSLLQSVA TLSVHGVALD WSGFEQDYVR RRLLVPTYPF
QRQRYWLAET EFIQPEVVPV AAISPETSKI VAATETLESQ ILSLVAKITG MNPQQLSLDA
TLEGGLGLDS IMMTQLMNGI IKFIPPGQRE SFHQVFSLRD LMQISNLGEL LKVLEPWQTV
DLQETTAVDV IPSADIPTID QTSDVVDILH SQLPLLVSYW SLNSNSLFTK VQVAGDFNLE
IAQHSWQKLI DRHPMLRARF HIPQGATSFA DYQLQVLKNP SPPAIPLKDI RHLASEEQTQ
AIAQEVHHWL NYRWSLTQWP LHKFSVLQLS DSVYQMFLGN EHLIADGLSN HVIIREFLEI
YRACIDQDTP DLPPALSVAD YQAQVQAMNA WQDVDEDRAL AAYNNSQRHT AYLWNPQQQI
RQQTPLFDNQ KYILSAETTA KLITKTREWR VPMNTLLLGA FIQTVSKLDT TSEKIGIQIP
TSGRVYPGVD ASGVISSFAQ NLALSFTPPQ AQQDWQTFLT EIQQTVQQHI GSGLDRAQTR
QMGVIFRDSF VLENGRIPDH SLSLIQGALK SNLYLPYTGQ THIHHYYGSL SVTEYQAGGM
NASGTIDILQ EIFDSRLHLF ASYDSNTFDL SVIDSLMKSY LAQIEELATL PIEEQVSSAF
VSPIFINKDI GKNLRQITSE ICHWAIEESE MSDDLEADLG LDSLELIRLV TRLESVYGKN
YRQCLLNCRT LEEMVVVLSA ESLAISA
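
The protein sequence is the translation of the gene sequence under translation table 11. The sketch below is a plain-Python illustration, not the database's own tooling. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as starts. Alternative start codons are not handled, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, as printed in the record.
first_line = "ATGCAAACTC AAGACTTAAC ACCTTTACAA AAAGCTATTA TTGCCCTGAA AGAAGCCCGT"
print(translate(first_line))  # MQTQDLTPLQKAIIALKEAR
```

The output matches the first twenty residues of the protein sequence. Translating the last gene line (`GAATCCCTAG CTATCAGTGC TTAG`) gives `ESLAISA`, which matches the end of the protein.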