Gene Ava_4108 details

Gene Information

Locus tag: Ava_4108
Symbol:
ID: 3681496
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5112020
End bp: 5116990
Gene Length: 4971 bp
Protein Length: 1656 aa
Translation table: 11
GC content: 44%
IMG OID: 637719455
Product: Beta-ketoacyl synthase
Protein accession: YP_324603
Protein GI: 75910307
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II; [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID:
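
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions fit the reported numbers but are not stated in the record. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(5112020, 5116990)
print(length)                     # 4971
print(protein_length_aa(length))  # 1656
```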


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.307999
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTACTA ACGATAATGT TGAATTTTCT ACTTTAGTAG ACTTATTAAG ATACAGAGCG 
CAAAATCAAC CTACACAAAC AGCTTATACA TTTCTTGTTG ATGGGGAAAC AGAAAGCATC
TCTTTAACTT ATCAAGAATT AGATCAAAAA GCACGGGCGA TCGCAACTCA ACTTTTGCAA
CGGGGAGTAC CAGGTTCACG GGCTTTGTTA CTTTATCCAC CAGGAATGGA ATTTATCCCT
GCCTTTTTTG GTTGTTTATA TGCTGGCTTT ATTGCTGTTC CGGCTTATCC ACCCCGACGT
AACCAAAAAA TGTCTAGGCT ACAGGCGATC GTATCAGATG CCGAGGCTGT AGTCGCACTG
ACTACATCTA CAGAATTAAC TTCGATGGCC TTACAGTTAG CAGAAAATCC TACCTTGACA
GCGATACCCT GGATCACAAC TGATAATCTC AATGCAAATA TAGCTGAAGA TTGGCAACAA
CCCAATATTA ATAGTGATAC TTTGGCCTTT CTGCAATACA CTTCAGGTTC AACCGGCACA
CCCAAAGGGG TAATGATTAC CCACGGCAAT TTATTGCATA ATTCCCAACT CATCTATAAC
TTTTACCAAC ACACACCCAA TAGCCAAGGT GTAATTTGGT TGCCTCCGTA CCACGATATG
GGATTAATTG GTGGGGTTCT GCAACCTCTG TATGGTGGTT TTCCGGTGAC TTTGATGGCC
CCAGTCGCCT TTTTGCAAAA GCCTTTTCGT TGGTTGCAGG CCATTTCTCA TTACAAAGCA
ACTACCAGTG GCGGCCCTGA TTTTGCTTAT GACTTAGTCT GTCGTCAAAT TACTCCCGAA
CAACTAGCAA GCCTGGATTT GAGTAGTTGG GAAGTGGCGT TTACCGGTGC TGAACCAATA
CGCGCACAAA CTCTAGACCG ATTTGCAGAA ACTTTTGCGC CTTGCGGTTT CCGGAGAGAA
GCTTTCTATC CTTGCTATGG GATGGCGGAA ACTACTTTAA TTGTCTCTGG GGGTTGGAAA
TCTGAAGCTC CCATTGTGCG ACATATAGAT ACCACAGCTT TGTTACAAAA CCAAGTCATA
GATACGACTA CGGCGGCTGG TGGTAAAGCC ATTGTGGGTT GCGGTAAAAG TAGCCCAGAC
CAAACAGTAC TGATTGTCAA TCCTGAATCA TTGACATCTT GTGCAGATGG ACAGGTAGGA
GAAATTTGGG TATCGGGGTC TAGTGTTGCT CAAGGTTACT GGAATCGCCC CGAACAAACA
CAACATACCT TTCATGCTTA CCTAGCAGAT AATACAACTG GCCCTTTTCT GCGGACTGGA
GATTTGGGCT TTTTACAGGA TGGTGAGTTA TTTATTACTG GTCGTCTCAA AGATTTAATT
ATCATTATGG GACGCAATCA TTATCCCCAA GATATTGAAT TTACAGTGGA AAGTTGTCAC
CCAGCACTAC GTCCAGCAGG TGGTGCGGCT TTTGCAGTTG AAGTTAACAA CGTGGAGAAA
TTGGTGATTG TTCAAGAAGT GGAACGTAGC TATCTGCGCA AGCTAAATGC TGATGAAGTC
ATAGGTGCTA TTCGTAAGGC TGTAGCCGAA CATCATGATT TACAAACTCA TACCATCGCT
TTAATTAAAA CGAACAGTTT GCCGAAAACT TCTTCTGGGA AAGTCAGACG TAGTAACTGT
AAAGCTGAGT TAGAAGCTGG AAGTCTCGAC ATAATTGCAC AGTGGAGTGC AGATGCTCAA
AACCATAGTC CATCAGAAAA ATTAATTGCT CCACAGGTAG AAAAAACAGT CAATACCGAA
GCAGTTGCAA CTTGGTTGTT AACAAAAGTG AGCGAACAAT TACAAGTTCC TGCTCAAACA
ATCAACGTTA ACGAACCTCT AGCACAATAT GGTTTGGGTT CTTTAGCAGC AGTGAGAATT
TCCGGAGAAT TGCAGGAATG GTTAGGGCGG GAATTGCCAA CAACGCTGTT GTATGACTAT
CCTTCCATCG CTGCTTTAGC TCAATATTTA GGTGATGGAG TGCCACAACC CAAGGTAGTC
TCACAGCAAC CAACAGACAA TCATGCGATC GCCTCTGGCG GGGCTGCGCC CATCGCGATC
GTGGGTATTG GTTGCCGTTA TCCTGGTGCA AATAATCCGG AAGCCTTTTG GCAATTACTC
CGCAATGGCG TAGATGCTAT TAGTGAAGTT CCTCAACAAC GCTGGGATGT GAATTCTTTT
TATGATCCCA ACCGTGCTAC ACCAGGCAAA ATGAATACTC GCTGGGGTGG TTTCATCTCG
GAAGTAGATC AATTTGATGC CCCATTCTTT GGGATTTCTC CACGGGAAGC CGAGTCTCTC
GATCCCCAAC AACGGTTACT TCTAGAAGTT TGCTGGGAAG CATTAGAAAA TGCAGGTAAA
GCACCGAGTA AGTTAGCCGG AAGCAACACA GGGGTATTTG TTGGTATTAG TAATTTTGAT
TACTCCCAAT TGCTAGCCAA ACAAGTGTCT GGATTAGATG CTTATAGTGG AACTGGTAAC
GCTTTTAGTA TTGCCGCTAA CCGCATATCC TATTTATTAG ATGTACACGG GCCAAGTTGG
GCAGTAGACA CAGCTTGTTC TTCATCCTTA GTTGCAGTTC ACCAAGCTTG TCAAAGTCTG
CGTCAGGGAG AATGCGAAAT GGCTTTAGCT GCTGGTGTGA ACTTAATTCT CACTCCACAG
TTGACAGTGA CATTTTCCCA AGCTGGAATG ATGGCTGGTG ACGGTCGTTG TAAGACTTTT
GATGCTGATG CGGATGGTTA TGTGCGGGGC GAAGGTTGTG GTGTAGTGGT ACTCAAGCGT
CTGAGTGATG CTTATCGTGA TGGCGATCGC ATTTTGGCAG TGATTAAAGG TTCAGCCGTG
AACCAAGACG GACGCAGCAA CGGACTCACC GCCCCCAATG GACTAGCCCA GCAAGCTGTA
ATTCGGCAAG CCATGCAAAA TGCTGGTGTT GCACCTCATG AAATTGGCTA TGTAGAAGCG
CATGGTACAG GCACATTTTT AGGCGACCCC ATCGAGGTTA ATTCCCTGAA AACAGTCCTC
GCACCAGGAC GCGCACCTGG AGATACCTGT GTGATTGGTT CAGTCAAAAC CAATATCGGA
CATCTAGAAG CAGCCGCCGG CATTGCTGGT TTAATTAAAA CTGTCCTCTC ACTACACCAT
GAAGAAATTC CGCCCCATCT GCACCTTAAA CAAGTTAACC CCCACATTTC CTTAGCTGAT
ACGAATTTAT CTATCGCTAC TACATTGTTA CCTTGGAATA GAGGTAATAA ACGGCGGTTG
GCTGGTGTAT CTTCCTTTGG TTTTGGTGGT ACTAATGCCC ATATCATCTT AGAAGAAGCT
CCTCTAGTTA GTCCTAAAGA CACAGTCAGT GATGAACGTC CCTTGCATTT ATTCACACTC
TCAGCTAAAA GTGAAAATGC TTTGCGTGAT TTAGCAAAAG ACTATGAGAA TTATCTAGGT
AACCACCCTG ATGCTTCATT AACAGACATT TGTTTCACCG CCAATACTGG ACGATCGCAT
TTTGATCATC GCTTAGTGGC GATCGCTCAA TCTAATATCC AATTACAAAC CGTACTAGGT
GCTTACGCGA CTGGGAAGAA AACCTCACAA TTGGTCAGTG GTAAAATCGA AAACAAACAG
CAACCGAAAA TCGCCTTTTT ATTCACAGGT CAAGGTTCTC AATATATCAA CATGGGTCGC
CAACTGTATG CAACCCAGCC GACCTTCCGC GCCGCCATTG ACCAATGTGA GCAAGTTTTG
CGGTCATACT TAGATCAACC ACTGCTGTCA GTTTTGTATC CTGACACAGA AAGCAGCTGC
ATCGATGAAA CCGCTTATAC CCAACCTGCG ATATTTGCTG TAGAGTATGC CCTAGCGCAA
TTGTGGAAGT CTTGGGGTGT AGAACCAACA GCTGTCATTG GTCATAGTTT CGGTGAATAT
GTCGCCGCTT GTATTGCTGG AGTATTCAGT TTAGAAGCAG GACTGAAGTT AGTTACCGAG
CGATCGCGCT TAATGCAAGC ACTCAAATCA ATGGGTGCAA TGGCTGCGGT GTTTGCTAGT
GAGGAACAAG TGCAAGCCGC GATCGCTGAA TACAGTCCAG AGGTGACAAT CTCTGCTGTA
AACGCTCCTG ATAATATCAC CATCTCTGGT ACAGTCGCCA AAATTGCAGC AGCGATCGCC
AAGTTCACAG CCCAAGGAAT TGAGACTCGA CGCTTGAATG TATCCCATGC TTTTCATTCG
CCTTTAATGG ATGAAATGCT TGATGCTTTT GAACAAGCAA CCAACCTAGT CAACTTCCAA
GCACCACAGA TTCCTTTCGT TTCCAATGTC ACCGGCAATT TCCTTCCAGT CGGACAAGTA
CCGGATGCTC AATACTGGCG ATCGCATACA CGCCAGTCCG TCAGATTTAT GGACGGATTG
AATACTTTAT TAGCTGAAGG TTACGAATTA TTCATAGAAA TTGGTGCTAA ACCAGTTCTT
TGTAGTCTGG GTAAACGTTG TCACGGAGGC GAAAATTCTG TTTGGTTGCC TTCTTTGAAT
GCCAAGCGAG ATAATTGGCA AACATTACTA GAAAGTTTAT CAACATTGTA TCTACGGGGA
GTAGAGATTG ACTGGGCAGG ATTTGAAAGT GATTATTCCA GTCAGCTAGT TGCACTTCCT
ACTTATGCCT TTCAAAGAAA ACGTTACTGG ATTCAATCGC CAAATAGGGT TGACAATATC
CCAGTAAATC ACCACAGCAA TAATTCTCAA CCTGTTACTT CCGCTAGTAG TGATAGTGCA
AAACTACCAT CTCAGGCCTT AGTAGAAAAA ATGCTCAAAC AACAACTACA GATTATGTCT
CAGCAATTGG AAGTGTTGCG TACTCATAAT TTGACCAAAG GTCAATTCTT GCCATTGCAA
CATGGACACT TACCAAAGCC ACCCAAGCTG AACCATCAGC CTAGTATTTA G
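
The gene sequence is laid out in space-separated blocks of 10 bases, so it needs to be joined before any length or GC-content check against the values reported in the record. The following is a minimal sketch of that clean-up and of a GC percentage calculation; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGATTACTA ACGATAATGT TGAATTTTCT ACTTTAGTAG ACTTATTAAG ATACAGAGCG"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))
```

Running the same two functions on the full sequence text would allow a comparison with the reported gene length and GC content.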
 
Protein sequence
MITNDNVEFS TLVDLLRYRA QNQPTQTAYT FLVDGETESI SLTYQELDQK ARAIATQLLQ 
RGVPGSRALL LYPPGMEFIP AFFGCLYAGF IAVPAYPPRR NQKMSRLQAI VSDAEAVVAL
TTSTELTSMA LQLAENPTLT AIPWITTDNL NANIAEDWQQ PNINSDTLAF LQYTSGSTGT
PKGVMITHGN LLHNSQLIYN FYQHTPNSQG VIWLPPYHDM GLIGGVLQPL YGGFPVTLMA
PVAFLQKPFR WLQAISHYKA TTSGGPDFAY DLVCRQITPE QLASLDLSSW EVAFTGAEPI
RAQTLDRFAE TFAPCGFRRE AFYPCYGMAE TTLIVSGGWK SEAPIVRHID TTALLQNQVI
DTTTAAGGKA IVGCGKSSPD QTVLIVNPES LTSCADGQVG EIWVSGSSVA QGYWNRPEQT
QHTFHAYLAD NTTGPFLRTG DLGFLQDGEL FITGRLKDLI IIMGRNHYPQ DIEFTVESCH
PALRPAGGAA FAVEVNNVEK LVIVQEVERS YLRKLNADEV IGAIRKAVAE HHDLQTHTIA
LIKTNSLPKT SSGKVRRSNC KAELEAGSLD IIAQWSADAQ NHSPSEKLIA PQVEKTVNTE
AVATWLLTKV SEQLQVPAQT INVNEPLAQY GLGSLAAVRI SGELQEWLGR ELPTTLLYDY
PSIAALAQYL GDGVPQPKVV SQQPTDNHAI ASGGAAPIAI VGIGCRYPGA NNPEAFWQLL
RNGVDAISEV PQQRWDVNSF YDPNRATPGK MNTRWGGFIS EVDQFDAPFF GISPREAESL
DPQQRLLLEV CWEALENAGK APSKLAGSNT GVFVGISNFD YSQLLAKQVS GLDAYSGTGN
AFSIAANRIS YLLDVHGPSW AVDTACSSSL VAVHQACQSL RQGECEMALA AGVNLILTPQ
LTVTFSQAGM MAGDGRCKTF DADADGYVRG EGCGVVVLKR LSDAYRDGDR ILAVIKGSAV
NQDGRSNGLT APNGLAQQAV IRQAMQNAGV APHEIGYVEA HGTGTFLGDP IEVNSLKTVL
APGRAPGDTC VIGSVKTNIG HLEAAAGIAG LIKTVLSLHH EEIPPHLHLK QVNPHISLAD
TNLSIATTLL PWNRGNKRRL AGVSSFGFGG TNAHIILEEA PLVSPKDTVS DERPLHLFTL
SAKSENALRD LAKDYENYLG NHPDASLTDI CFTANTGRSH FDHRLVAIAQ SNIQLQTVLG
AYATGKKTSQ LVSGKIENKQ QPKIAFLFTG QGSQYINMGR QLYATQPTFR AAIDQCEQVL
RSYLDQPLLS VLYPDTESSC IDETAYTQPA IFAVEYALAQ LWKSWGVEPT AVIGHSFGEY
VAACIAGVFS LEAGLKLVTE RSRLMQALKS MGAMAAVFAS EEQVQAAIAE YSPEVTISAV
NAPDNITISG TVAKIAAAIA KFTAQGIETR RLNVSHAFHS PLMDEMLDAF EQATNLVNFQ
APQIPFVSNV TGNFLPVGQV PDAQYWRSHT RQSVRFMDGL NTLLAEGYEL FIEIGAKPVL
CSLGKRCHGG ENSVWLPSLN AKRDNWQTLL ESLSTLYLRG VEIDWAGFES DYSSQLVALP
TYAFQRKRYW IQSPNRVDNI PVNHHSNNSQ PVTSASSDSA KLPSQALVEK MLKQQLQIMS
QQLEVLRTHN LTKGQFLPLQ HGHLPKPPKL NHQPSI
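
The protein sequence can be related back to the gene sequence by translating codons. The sketch below assumes the usual NCBI codon ordering (bases T, C, A, G) and uses the standard amino acid assignments, which translation table 11 shares; table 11 differs mainly in which codons may act as start codons, and that is not handled here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TTT, TTC, TTA, TTG, TCT, ... order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in the record.
print(translate("ATGATTACTA ACGATAATGT TGAATTTTCT"))  # MITNDNVEFS
print(translate("CCTAGTATTTAG"))                      # PSI
```

Both outputs match the start (MITNDNVEFS) and the end (PSI) of the protein sequence listed above.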