Gene Ava_4834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4834 
Symbol 
ID3679332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp6076081 
End bp6081021 
Gene Length4941 bp 
Protein Length1646 aa 
Translation table11 
GC content45% 
IMG OID637720191 
ProductBeta-ketoacyl synthase 
Protein accessionYP_325326 
Protein GI75911030 
COG category[H] Coenzyme transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0001] Glutamate-1-semialdehyde aminotransferase
[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.560935 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0357231 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGAATA TGAAAACAGT GGAAAACCAA GAGCCTCTTG ATGGTGTTGC AATTGTTGGC 
ATGGTAGGAC GATTCCCAGG TGCGAAAAAT GTCCAGGAAT TTTGGGGAAA TCTTTGTGCA
GGGAAAGAGT CAACTACTTT CTTTCAAGAT GAGGAGTTAG ATCCCAGCAT AGATCCGCAC
CTCGTACAAG ATCCGAGTTA TGTGAAAGCT AGAGGGATAA TTCCTGGGGG AGAAACCTTT
GATGCTGCCT TCTTTGGAAT TAATCCCAGA GAAGCCGAAG TGATGGACCC CCAAAGTCGG
GTGTTTTTAG AACTAGTTTA TGAAGCCCTG GAAAACGCCG GTTACGATGC AGAAGCATAC
AGTGGCTTAA TTGGTTTGTA TGCTGGCTGT GGTCAAAATA CCTATTTCGC TAATCATATC
TGCGGTCGCC AGGAAATTAT CGATCGCGTC GGTGAATTTC AGACAATGCT GGCCAATGAA
AAGGATTTTT TGACTACCCG CGCTGCCCAT AAGCTCAACC TCAAAGGCCC GGCTGTTAGT
ATCAGTACTG CTTGTTCTAC TTCTTTGGTG GCAGTCATTC AAGCCTGTCA AAGCTTAAGT
AACTATCAGT GTGATATGGC TTTAGCTGGT GGTGTGTCCA TGACTACACC ACAAAACAGT
GGTTATATCG CCCAAGAAGG AACTATGTTA TCTGGGGATG GTCACTGTCG TCCTTTTGAT
GCCAAATCTC AAGGCACAAT GTTTAATAAT GGTGCGGGAT TAGTAGTATT AAAGCGTGTA
GAAGATGCAA TACAAGATGG CGATCGCATC TATGCAGTCA TTCGAGGCTT TGGTATAAAC
AATGACGGTG CTGACAAGGT AAGTTTCACC GCACCTAGTG TAGATGGGCA AGCCGAAGCC
ATAGCAATGG CTCAAGCCTA TGCTAACTTT CATCCAGAAA CGATTTCTTA CATTGAAGCT
CATGGTACTG CCACATCTCT AGGCGACCCC ATTGAGATTG AAGCTTTGAC CCAAGCCTTT
CGCATCCATA CCAATAGCAA ACAATTTTGT GCGATCGGTT CTCTCAAAAG TAATGTGGGA
CATTTAGTAG CGGCGGCTGG TGTTGCAGGT TTGATTAAAA CTGCCCTTGC CCTTTATCAT
CAGCAGATTC CACCCAGTTT AAACTTTGAG GCTCCAAATC CCAATATTGA TTTTGCCAAT
AGTCCTTTTT ATGTCAACAC GCAATTAACT CCTTGGCCGC AAGGAGAAAC GCCCAGGCGT
GCTGGTGTGA GTTCCTTTGG TGTGGGTGGG ACTAATGCCC ACATTGTCCT AGAAGAAGCA
CCAGCGATTC TACCTTCTAA CTTATCCCGT CCCAGCCAAT TATTATTGCT TTCAGCCAAA
ACCAGTACAG CTTTAGAGAC AGCTACAGTA AATTTGCAGG AGCATTTACG AAACAATGCC
TCTATCAACT TAGCTGATGT TGCCCATACC CTACAAAGGG GACGTAAAGC CTTTAACTAT
CGGCGCTTTG TGGTTTGTCA AAATACTAAA GAGGCCATCA CCACGCTACA GTCTTTAGAA
CCCAAGCGCG TGTTTACTCG CCACACACAA GTCCGTGATC CCGAAATTGC CTTCATGTTT
CCCGGACAAG GTTCACAATA CGTCAATATG GGATTTAATC TCTACAGCCG CGAAATTGTG
TTTCGGCAAG TAGTAGATCA GTGTGCAGAA ATTCTCAAAC CAATATTGGG TAGAGATTTG
CGAGAAATTA TTTATCCAGT AGCCACGGAT ATTGAAACTG CATCTACAGC ACTGCAACAA
ACCTGCTTTA CCCAGCCTGC ATTATTTGTA GTTGAATATG CCCTGGCGCA ATTGTGGCAA
AGTTGGGGAG TGAAGCCCCA AGCAATGATC GGTCACAGTA TTGGGGAATT TGTGGCTGCT
TGTCTTGCCG GTGTCTTTAG TTTAGAAGAT GCCTTAATGT TGGTAGCTAA TCGTGGTCGA
TTGATGTGGG AGTTACCAAG GGGGGCAATG TTATCTGTGC GTTTGTCCGC CCCAGAGGTA
GAAAAACGAT TGGCAGGAGA ATTAGCGATC GCCGCCATCA ACAGCCCTTC TTTATGTGTA
GTATCGGGAA CCACAGAAGC GATCGCCGCC TGGCAAACAC AACTAGAGAC AGCAGAAATT
GTCTGTCGTC TGTTACACAC ATCCCACGCA TTTCATTCTC CCATGATGGA GGCGATCGTT
GCTCCCTTTG CTGAGTTAGT CGGGAAAGTT AAATTATCGC CACCCCAAAT TCCCTTTGTT
TCCAGTGTTA CAGGTGATTG GATTACCCCA GAAGAAGCCA CCAATCCCAT GTATTGGGCC
CAGCATTTGC GCCAGACAGT CAGATTTGCT GATGGTGTCA AAACTTTATG GCAGCAGCCA
GAACGCCTCT TACTAGAAGT AGGGCCACGC ACAACCACCA CCACCCTAGC ACGGCAACAA
GCTAAAGATA TTAAACAACA AATAACCATC GCCTCTCTCA GCGACAATGC CGATAACGAG
GCAGAATGGA CAGCCCTACT GCAAGCAATG GGGCAACTAT GGTTAGCAGG AGTCACCATC
AACTGGAACA ACTTCTATCA AGAAGAAAGA CGACAACGTA TTCCTCTACC TAACTATCCC
TTTGAACGTC AACGCTTTTG GATTGATCCT CTACCCCATC CCAACCGTAA TACCAATCAC
AAACCTGCCC ATTATCAACT AGAAAAAACT CAAACTATGT CAGCCCAAAC TACACTCATT
TCACTGCTAA AAGAAATTAT TGAAGAAACC TCTGGACTAG AAATTGCCAG CGTTGACGCA
TCAACAACAT TTCTGGAAAT GGGGTTAGAC TCTTTGTCTC TCACACAAGT TGGACTGGCA
TTGAAGAAAA AATTCAAAGT TAAAGTTTCA CTGAGACATT TACTAGAAGT TTATCCAAAT
TTAGCAACAC TGGCTGATTT TCTCCAGCAA AACCTATCTG CCGAAGCTTT ATCTGCATTA
GTTCCTTCCG AAACAACATC ACCATCCGCC ACCTTAAGCC TGCAAGAAGT TTCGACAAAC
GGCTCAAATG GCAAGAATGG CAAGAATGGT TCTACCCCAT CCCCAATGTT GCTAGTACCT
CCAACTTCAA CAAATGGAAA TGGATCAAAG AATGGCTCCT TACCCCAAGT TTCTTTGCCC
AATGTAGCAT CCAGCGCCCT CGAAGGAGTG ATTAATCAGC AGCTACAAAT TATGGCTCAA
CAACTAGCGC TGCTGGGTAA TAACAGTCAA CCTGTAACTG TGCCAGCCGT AAACGGACAA
AATAACGGTG TCAAATCAGA AAAACCCGTT ACCCAAAGCA ACCAAAAACC AGAACCACAA
GATTCTACAG AGAATCTTCC CAAGAAAGTC TTTGGTGCAG GCGCTCGCAT AGAAACCACT
CAGACTAAAA CTCTCACACC GCAACAACGC ACTTATTTAG ACAGCATTAT TCAAAGATAT
ACACAAAGAA CTCAAAAATC TAAAGAATAT ACTCAAACCA ATCGCCCTCA TCTAGCAGAT
CCCAGGAGTG TTTCCGGCTT TAACCCAACG ATGAAAGAGA TGGTTTACCC AATTGTGGTG
TCTCGCTCAT CAGGTTCTAA ACTTTGGGAC ATTGACGGCA ATGAATACGT CGATTTAAGT
AACGGTTTTG GCCTGAATTT GTTCGGTTGG TCGCCTGCAT TTGTCACAGA AGCCATCGAA
GCCCAGCTTA AGCTCGGTAT GGAAATTGGA CCCCAGACCC CATTGGTGGG AGAAGTTGCC
AAGCTGATGT GTGAAATAAC CAACTTCGAT CGCGCCGCTT TTTGTAACAC AGGTTCAGAA
GCAGTACTAG GAGCGATGCG CCTGGCACGA ACCATTACAG GACGTAACTT AATCGCTATC
TTCTCTGGAG GTTATCACGG TATTTTAGAT GAAGTTATTG TTCGTGGTAC AAAAAAACTG
CGATCGATTC CCGCAGCCCC TGGTATCCCC CAAGAAAAAG TAGACAATAT CTTAGTAATT
GACTACGATG CCCCCGATGC TTTAGATATT CTCAAAAGTC GGGCTGATGA ATTAGCTGGT
GTCATGGTGG AATCGGTACA GAGTCGCCGC CCAGAATACC AGCCAAGGGA ATTTCTCCAA
CAATTGCGAC AATTCACTGA ACAAGAAGAT ATTGCCTTGA TTTTTGACGA AATTGTCACC
GGCTTTAGGA TTCATCCCGG TGGCGCACAA GCTCATTTTG GCATCAAAGC TGACATTGCT
ACCTACGGCA AGATTGTAGG TGGTGGTTTA CCTATTGGAG TCATTGCAGG CAAATCGAAA
TACATGGATG CGTTGGATGG TGGCTTTTGG CAATACGGAG ATGATTCTGT TCCAGAAGTT
GGTGTGACTT ACTTTGCCGG AACTTTTGTT CGTCACCCCC TAGCATTAGC GGCAGCAAAA
GCAGTACTCC AACATCTACT AAAGAATGGA CCAAGCTTGC AACAACAGTT AAACGCTAAA
ACTGATCAGT TTGTGGTAGA ACTAACGGAT TATTTCCAAC AAATGCAAGC ACCATACACT
GTTCACAATT TCGGTTCTCT GTTTATGGTG AAGTCTGCAC CAGAATTCCT CTACGGAGAC
TTATTATTCT ATTTGATGCG GGATAAGGGA GTGCATATTT GGGATCATCG ACCATGCTTC
CTCACCACAG CTCATTCTGA TGCTGATTTG TCCTTAGCAA TGGCAGCCTT TAAAGAAAGT
ATTGCCGAAA TGCAGTCTGC TGGTTTTTTT TCTGCCCCAA CTCCAACAAC TAAAGGTAGT
CCAGAATCAT CTAAAAACCT TCGTAACCGT CCACCCCAAC CCGGAGCTAA ATTAGGGCGA
GATCCTGAAG GTAATCCATC CTGGTATGTT CCAGATCCTC AACGACCAGG GAAATATTTA
CAAGTCAGTA GTGTTTCTTA A
 
Protein sequence
MVNMKTVENQ EPLDGVAIVG MVGRFPGAKN VQEFWGNLCA GKESTTFFQD EELDPSIDPH 
LVQDPSYVKA RGIIPGGETF DAAFFGINPR EAEVMDPQSR VFLELVYEAL ENAGYDAEAY
SGLIGLYAGC GQNTYFANHI CGRQEIIDRV GEFQTMLANE KDFLTTRAAH KLNLKGPAVS
ISTACSTSLV AVIQACQSLS NYQCDMALAG GVSMTTPQNS GYIAQEGTML SGDGHCRPFD
AKSQGTMFNN GAGLVVLKRV EDAIQDGDRI YAVIRGFGIN NDGADKVSFT APSVDGQAEA
IAMAQAYANF HPETISYIEA HGTATSLGDP IEIEALTQAF RIHTNSKQFC AIGSLKSNVG
HLVAAAGVAG LIKTALALYH QQIPPSLNFE APNPNIDFAN SPFYVNTQLT PWPQGETPRR
AGVSSFGVGG TNAHIVLEEA PAILPSNLSR PSQLLLLSAK TSTALETATV NLQEHLRNNA
SINLADVAHT LQRGRKAFNY RRFVVCQNTK EAITTLQSLE PKRVFTRHTQ VRDPEIAFMF
PGQGSQYVNM GFNLYSREIV FRQVVDQCAE ILKPILGRDL REIIYPVATD IETASTALQQ
TCFTQPALFV VEYALAQLWQ SWGVKPQAMI GHSIGEFVAA CLAGVFSLED ALMLVANRGR
LMWELPRGAM LSVRLSAPEV EKRLAGELAI AAINSPSLCV VSGTTEAIAA WQTQLETAEI
VCRLLHTSHA FHSPMMEAIV APFAELVGKV KLSPPQIPFV SSVTGDWITP EEATNPMYWA
QHLRQTVRFA DGVKTLWQQP ERLLLEVGPR TTTTTLARQQ AKDIKQQITI ASLSDNADNE
AEWTALLQAM GQLWLAGVTI NWNNFYQEER RQRIPLPNYP FERQRFWIDP LPHPNRNTNH
KPAHYQLEKT QTMSAQTTLI SLLKEIIEET SGLEIASVDA STTFLEMGLD SLSLTQVGLA
LKKKFKVKVS LRHLLEVYPN LATLADFLQQ NLSAEALSAL VPSETTSPSA TLSLQEVSTN
GSNGKNGKNG STPSPMLLVP PTSTNGNGSK NGSLPQVSLP NVASSALEGV INQQLQIMAQ
QLALLGNNSQ PVTVPAVNGQ NNGVKSEKPV TQSNQKPEPQ DSTENLPKKV FGAGARIETT
QTKTLTPQQR TYLDSIIQRY TQRTQKSKEY TQTNRPHLAD PRSVSGFNPT MKEMVYPIVV
SRSSGSKLWD IDGNEYVDLS NGFGLNLFGW SPAFVTEAIE AQLKLGMEIG PQTPLVGEVA
KLMCEITNFD RAAFCNTGSE AVLGAMRLAR TITGRNLIAI FSGGYHGILD EVIVRGTKKL
RSIPAAPGIP QEKVDNILVI DYDAPDALDI LKSRADELAG VMVESVQSRR PEYQPREFLQ
QLRQFTEQED IALIFDEIVT GFRIHPGGAQ AHFGIKADIA TYGKIVGGGL PIGVIAGKSK
YMDALDGGFW QYGDDSVPEV GVTYFAGTFV RHPLALAAAK AVLQHLLKNG PSLQQQLNAK
TDQFVVELTD YFQQMQAPYT VHNFGSLFMV KSAPEFLYGD LLFYLMRDKG VHIWDHRPCF
LTTAHSDADL SLAMAAFKES IAEMQSAGFF SAPTPTTKGS PESSKNLRNR PPQPGAKLGR
DPEGNPSWYV PDPQRPGKYL QVSSVS