Gene Ava_4747 details

Gene Information

Locus tag: Ava_4747
Symbol:
ID: 3679634
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5963498
End bp: 5967316
Gene Length: 3819 bp
Protein Length: 1272 aa
Translation table: 11
GC content: 45%
IMG OID: 637720103
Product: Beta-ketoacyl synthase
Protein accession: YP_325239
Protein GI: 75910943
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID: [TIGR02813] polyketide-type polyunsaturated fatty acid synthase PfaA
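
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 5963498
END_BP = 5967316
GENE_LENGTH_BP = 3819
PROTEIN_LENGTH_AA = 1272


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp):
    """Amino-acid codons in a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)        # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)     # True
```

Under those assumptions, 5967316 - 5963498 + 1 gives 3819 bp, and 3819 / 3 - 1 gives 1272 aa, matching the listed values.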


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTGACC AACAATTGAA CCCCAAAAAG TCTGTTTTGC AACAAACACC CATTGCCATT 
ATCGGTATGT CAGCCTTGTT TGCTAAAGCA GAGAATCTAC AACAATACTG GGAAAATATA
CTAGCTAAAA TTAACTGTAT TTCTGAAGTT CCAGAAACAG CTTGGAGCCT TAAGGATTAT
TATGATCCAG ACCCCAGTAC CCCTGATAAA ACTTACGCTA GAGCAGGGGG ATTTATCCCG
GAAATCGATT TTGATCCTTT AGAATTTGGA TTACCACCTG ATACTCTTGA AGCTTTGGAT
TCAGCCAATT TACTAGCTTT GGTATTGGCA AAACAAGCGT TAAAAGATGC TGGCTATGGT
GAAGGGAAAG AATTTAATCG AGAAACAACA GGAGTAATTT TAGGTGCTAG TGGACTTTGG
AAAAGTATTA CTCCATTAAC TGCTCGTTTA CAATATCCCC TGTGGGAAAA AGTTTTAAAA
AATAGTGGTC TATCAGCAGA GCAGACCGAA AAAATTATTG AAAAAATGCA GGCTGCTTTT
GTACCGTGGA CAGAAAATTC TTTCCCAGGT ATGTTGCCTA ATGTGGTCGC AGGCAGAATT
ACCAATCGTC TAGATTTGGG AGGAACTAAC TGTGTTGTTG ATGCAGCTTG TGCTAGTTCT
TTAAGTGCTT TAAAAATGGC AATTAGTGAA CTGATTGAAG GTCGCTGTGA CATGATGTTG
AGCGGGGGCT TTGACACAGA TAACTCTATC CTCAACTATA TGTGTTTCAG TAAAACCCCA
GCTTTTTCTA AAAAAGACTA TCTTAATCCC TTTGATGCCA CATCCGACGG CATGATGGTA
GGAGAAGGGT TAGGAATGTT AGTTCTCAAG CGGTTAGCAG ATGCAGAACG AGATGGCGAT
CGCATCTATG CAGTAATTAA AGGTATCGGT ACTGCCAGTG ATGGTCGCTA CAAGAGTATC
TATGCGCCCC GTACAGAAGG ACAAGTGCGA GCTTTACGGC GAGCCTATGA AGATGCAGGC
TTTTCCCCTG CGACTTTAGG ATTAATGGAA GCTCATGGTA CAGGTACGCC AGCTGGAGAT
TTTTGTGAAT TTACCGCGCT TAATCAAGTT TTGACGGAAA ATCAGGCAGG AAGACAACAA
ATCGCCCTTG GTAGTGTTAA ATCTCAAATC GGTCACACCA AAGCTGCTGC TGGCGCTGCC
AGTTTAATCA AAGCTACCCT AGCCTTACAC CATAAAATTC TCCCACCGAC CATTAATGTC
ACCCAACCCA ATCCGAAATT CAAAATAGAA GAGTCGCCCT TTTATATCAA CACAGAAACT
CGTCCCTGGA TACAAAATGG CGACACTCCC AGACGGGCCG GGGTCAGTTC TTTTGGATTT
GGCGGTACTG ACTATCATGT GGTTCTAGAA GAATATACCC CGAAGTTTCC CCAAGGAGGC
GATCGCATCC ATTCCACAGC CCAATCTATT TTATTGTGGG CTGATACGCC ACAACAATTA
TCTCAAAAGT GTCAGGCTAC TCTTGAGCAA CTGCAATCTG ACAACTGCGG CGAACAATAT
CACCAACTGC AACATCACAG CAAAACAGCC CTCATCCCCG AAACATCTGC CCGTGTTGGC
TTTGTGGCTA CCTCTCTAGA AGAAGCCCAA AAATTATTGA AAGCAGCAAT CAAACAACTG
CAAATACAGC CCCAAGCAGA AGCCTGGGAA CATCCCCAAG GGATTTACTA CCGCCAAACT
GGTTTATCTC TCAAGGGCAA AGTTGTAGCG TTGTTCCCCG GTCAAGGGTC TCAATATCTG
AATATGGGGC GAGAAGTTAG CTTGAACTTC CCAGAGATAC AACAAGCTTA TCAAGCTCTG
GATCAGTTGA TGGCTCAGGA TAACTTGCCA GCTTTGTCTG ATATCGCCTT TCCTATACCT
GCCTTTAACC CAGAGGTAAT CAAGCAACAG TCCCAGCAAT TACAGCGCAC AGAAAACGCT
CAACCTGCCA TTGGAGCCTT GAGCCTGGGA TTGTATAAAA TCTTGCAAAA CGTGGGATTT
AAGCCCGATT TTGTGGCTGG TCATAGCTTT GGGGAATTAA CCGCACTTTG GGCGGCTGGG
GTGTTTAGTG ATGAGGATTA TTGTTACCTG ATCAAAGCCA GGGGTCAAGC CATGACCGTT
CCCCAAGGGT CTCATGATTG CGATCGCGGT ACGATGTTAG CCGTCAGTGG TGATGTGGCA
AAAATTAAGC AACTGATCGC CGGCATGGCC AAGGTGCAGA TTGCTAATTA TAATGCTCCA
GAACAAGTAG TACTGTCTGG TTCTAAACCG GAAATTGCCA CCCTGGAGAA AGTATTAAGC
AAACAGGGCT ATACTGTTAC CCCTTTACCT GTTTCTGCTG CTTTTCACAC TCCCTTTGTT
CACCATGCAA GTCAACCCTT TGCAGAGGCG CTCAATCGAG TGACCTTTAA TACTCCTCAA
ATTCCTGTCT ATGCAAACAT GACGGGTAAT GCCTATCCGA CGGAGGCAAC TGCCATCAGG
AAACTGTTGG AAGCCCACCT TCTGAATGCA GTTCAGTTTG CACAGGAAAT CGAAAATCTG
TATGCCCAAG GAGGATATTG CTTTGTTGAA ATTGGGTCCC GACAAATTCT GACCAACTTG
GTTAAGCAGA CATTAGGCGA TCGCCCCCAT GTAGCCATTG CCCTCAACCC CAGTCGCCAA
AAAGACAGTG ATGTCCAACT GCGTCAAGGA GTCATCCAAC TCCGAGTAAT AGGACTGTCC
CTGAAGGATC TTGATGCTTC TCCCCGAAAA TTGCCTGCTC AAAAAAGCAA ATCTAAAGGA
TTATCTATTA AGCTCAATGC CACCAACTAC ATCAGTGACA AAACAAAAGC TGCCTTTGAA
TCTGCTTTCA AACCTGAGCC TGTAGTGCAG CTAGCTTCCA CCCCTCTTGA TCAGGTTGTT
TCTGATGCAG CAACTGTTGT TTTAGATTCT CCTGCTCAAA CTTTCGTTGC TTCCACAAAT
GATGAGGTTC CCATTACCCC TGTCTCTGTG ATGGAAGAGT CGGTATCTCT AGGAGAAATT
TACCATCAAT CTGATTACCA ATTTATTATC TCTAAGGACG AAACAAATAT CATGAATAAC
TCTACTCATA ATACTATCGC CTATCTTTTC CAACAGTTTT ATCAACACCA AAAAGAAATG
CTACAGGTTC ACGAACAATA CGTGAAGACT CAATCCCAGA GTTTTCAAGC ATTTCTACAA
TTATTGCAAC AACAGGAAGT GATGATTCCT CAAAGTTCTA CTATTGCCGA GCAAACAGTT
GTTGTACCAG CACCAGTGGT GCAACTTCCT AAGACCCCAG CCGCACCTAT AGTAGAACCG
CCCCAGACAC CCGTCGTTCC TGTGGCTGAA CTTCCCAAGA CCCCTGTGGC ACCTGTAGTA
GAACCTCCCC AGGTAACTCC TGTAGCTGAA CCCGTTCATA TCTCCCATAT TGCCAGCTAT
TTAGAGCCTC GTTATTCTGC CCCCGCTCCT CAACCCACAC CAGTTGTAAC CGAAACTGCT
TCTCCTGCGG CGAATGAAAG CTACATTAGC CGAGCATTAA TGGAGGTTGT CAGTGATAAA
ACTGGTTATC CTACAGAAAT GTTGGAATTA GAGATGGATT TAGAAGCGGA TTTAGGCGTT
GACTCCATCA AGCGTGTAGA AATTATGGGA ACTCTACATG AATCTTTTCC TCAGTTACCC
AAACTTAGCC CCGAAGAACT CGCGGAAAAG CGTACCCTGG CGCAAATTGT CCAATATTTG
GCAGGACAAA TCATGGTGGC AGAAAAAAAA ATAGCTTAG
 
Protein sequence
MVDQQLNPKK SVLQQTPIAI IGMSALFAKA ENLQQYWENI LAKINCISEV PETAWSLKDY 
YDPDPSTPDK TYARAGGFIP EIDFDPLEFG LPPDTLEALD SANLLALVLA KQALKDAGYG
EGKEFNRETT GVILGASGLW KSITPLTARL QYPLWEKVLK NSGLSAEQTE KIIEKMQAAF
VPWTENSFPG MLPNVVAGRI TNRLDLGGTN CVVDAACASS LSALKMAISE LIEGRCDMML
SGGFDTDNSI LNYMCFSKTP AFSKKDYLNP FDATSDGMMV GEGLGMLVLK RLADAERDGD
RIYAVIKGIG TASDGRYKSI YAPRTEGQVR ALRRAYEDAG FSPATLGLME AHGTGTPAGD
FCEFTALNQV LTENQAGRQQ IALGSVKSQI GHTKAAAGAA SLIKATLALH HKILPPTINV
TQPNPKFKIE ESPFYINTET RPWIQNGDTP RRAGVSSFGF GGTDYHVVLE EYTPKFPQGG
DRIHSTAQSI LLWADTPQQL SQKCQATLEQ LQSDNCGEQY HQLQHHSKTA LIPETSARVG
FVATSLEEAQ KLLKAAIKQL QIQPQAEAWE HPQGIYYRQT GLSLKGKVVA LFPGQGSQYL
NMGREVSLNF PEIQQAYQAL DQLMAQDNLP ALSDIAFPIP AFNPEVIKQQ SQQLQRTENA
QPAIGALSLG LYKILQNVGF KPDFVAGHSF GELTALWAAG VFSDEDYCYL IKARGQAMTV
PQGSHDCDRG TMLAVSGDVA KIKQLIAGMA KVQIANYNAP EQVVLSGSKP EIATLEKVLS
KQGYTVTPLP VSAAFHTPFV HHASQPFAEA LNRVTFNTPQ IPVYANMTGN AYPTEATAIR
KLLEAHLLNA VQFAQEIENL YAQGGYCFVE IGSRQILTNL VKQTLGDRPH VAIALNPSRQ
KDSDVQLRQG VIQLRVIGLS LKDLDASPRK LPAQKSKSKG LSIKLNATNY ISDKTKAAFE
SAFKPEPVVQ LASTPLDQVV SDAATVVLDS PAQTFVASTN DEVPITPVSV MEESVSLGEI
YHQSDYQFII SKDETNIMNN STHNTIAYLF QQFYQHQKEM LQVHEQYVKT QSQSFQAFLQ
LLQQQEVMIP QSSTIAEQTV VVPAPVVQLP KTPAAPIVEP PQTPVVPVAE LPKTPVAPVV
EPPQVTPVAE PVHISHIASY LEPRYSAPAP QPTPVVTETA SPAANESYIS RALMEVVSDK
TGYPTEMLEL EMDLEADLGV DSIKRVEIMG TLHESFPQLP KLSPEELAEK RTLAQIVQYL
AGQIMVAEKK IA
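
The gene and protein sequences above can be spot-checked against each other by translating a few codons. The following is a minimal sketch, not the tool that produced this record. It builds the codon table by hand, on the assumption that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. The names `CODON_TABLE` and `translate` are hypothetical helpers.

```python
# Codon table in the conventional T, C, A, G ordering.
# Assumption: translation table 11 shares these amino-acid assignments
# with the standard code; only the permitted start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate DNA codon by codon; whitespace is ignored, '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base groups of the gene sequence, as listed above.
first_codons = "ATGGTTGACC AACAATTGAA CCCCAAAAAG"
print(translate(first_codons))  # MVDQQLNPKK

# Last nine bases of the gene sequence: two residues, then the stop codon.
print(translate("ATAGCTTAG"))   # IA*
```

The first ten residues (MVDQQLNPKK) and the final two (IA) agree with the protein sequence listed above, and the trailing TAG is the stop codon that the protein length leaves out.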