Gene Ava_4750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4750 
Symbol 
ID3679637 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5970419 
End bp5975236 
Gene Length4818 bp 
Protein Length1605 aa 
Translation table11 
GC content40% 
IMG OID637720106 
ProductBeta-ketoacyl synthase 
Protein accessionYP_325242 
Protein GI75910946 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR02816] PfaB family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.607154 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACGTT CTCGACTTGC TATTGTTGGC TTGGATTGCT TTTTAAATCA GCATTTTAAT 
TCAGCAATCT TCTCGGCATT AATTTATGAT GGCAAGACCA TCAAAGACAT TAATCATCTT
GATAATAAGG TAGAATCTTT CACTATCTTA TCGAAAATCT CTCATCAAGC TCTGCAAGAC
TGCCAAGTTA ATTGTCATAA AAATGTTGCT GTCATTACTG CCCATAACGA AACTGTAAAT
CCTCATAATT CGGCAGAAAA TTTGGGGAAA AATCTATCGA ATTTATGGAA TTTTACAGCA
ATTACATTTA ATTTAGATGG GCAGAAAGAT TGTTTACTGA AAGCGTTAGA AATTGCCCAA
ATCCTGTTAA CTGCAAAGGA AGTTGATACG GTAATTATTA GTGTAGTCAA TTCTGAGCTT
AATACTTATA GCGTTAATGG TGGAGCCATT ATTGTAAAAG ATTATGCAGT GGCTCAAAAA
GATGGCGATC GCATTTATGC GGTACTAGAA AGCTTCACCC GCATTGCAGA TACCCTCTCT
CCTTCATCAG TAGCCATTCA ACAAAGTTGT GAAAAAGCTT TGCACCAAGC CAACATCAGC
CCTCCAGAAG TTGGCTATAT TGAAGTTTGC CACCAAAACC TAGCAGAAGA AAATAGCCCA
GAATTTATCG GCTTAATTAA TGCTTACGCT ACGCATCAAC ATGACCTTAC TTGCGCTGTT
GGTAGTATAA AGGCAAATCT GGGTGATTTG GGTATTGCGA CGGGAGTCAT TAGCTTAATT
AAAACAGCCC TATGTGTTTA TCATCGCTAT TTGCCAGCAT ACCCCCAATG GCAGCAACCC
CAACACCCGG AAATCTGGCA AAATAGTCCC TTTTATGTGC CTACAAGTTC TTATCCTTGG
TTTTTGTCAA AGAACCAGGA AAAGCGCCAA GCTGCTGTGA ATTTGGTTGC AGAAGATGGT
ACTTATGGTC ATATTATTCT CTCAGAAGAC ACGACTCAAA CCAGCCGCAG CCCTAGCTAT
TTACCCTACG CTTCGTTTTA CCTTTTTCCC ATCGCTGCTG ATGAGGCTTC ATCTTTGCAG
ATACAACTAG ATGATTTAGC TGCCAAAATT GAAACTAGTT TATCTTTACC TCAGACTGCC
CAAGAAAATT TTAGCCAATT TCAACGTAAT TCCCCATCTC CTTACGTTAT AGCGATTGTT
GGGGAGAACA AAGAAACCCT ACAACGGGAA ATTAAACAAG CGCAAAAAGG TATTGAGAAA
GCATTTAGGG AAGGAAAATC TTGGAAAAGC CCCAAGGGAA GTTATTTTAC TCCCAAACCC
ATTGGTAAAC AGGGAAAAAT CGCTTTTGTT TATCCAGGCG CTTTTAATTC CTATTTAGGT
ATGGGACGCA ATCTGTTCCA ATTGTTTCCC CAATTATGGA ACCGCATCGC CAGTTTAGTT
AAAGATCCCA ATCAATTCTT ACAAGCAAAA CACCTGTACC CCAGAAGCCA ATATTCCCTA
TCGCAGCGAG ATTTAGAAGC CTTAGAAACT GAGTTTATCG CCAATCCCTT ATCCTTACTA
GAAACCGGAA CTGGTTTTGC GGTAATGTTC ACGGACATTA TGCAGCAATA TTTCCAGATT
AAACCCCAAG CAGCCTTTGG TTACAGCATG GGGGAAAGTA CCATGATGTA TAGTTTGGGA
GTTTGGGCTA ATGCTGATGC AGGAAGTCAA TATATCCATT CTTCCCCACT GTTCCGCGAT
CGCTTAATTG GGACAAAACA AACAGTAGCA GAATATTGGG GCGAACCAAC TGGGGAAAAT
TTATGGAGTA GTCATGTGTT AATGGCTGCA CCAGCAGCAG TCAAACAATG CTTAGAACAA
GAACCACGAG TTTATCTTAC CCACGTTAAC GCACCCCAAG AAGTGGTGAT TGCTGGCGAT
CCCCAAGGCT GTTTACGGGT GATTGAACGC TTACAATGTG ATGCTTTTCG TTCTCCGTCG
GATATGGTGC TGCACTGTGA AGCTATGCAG TCAGAATTTG CCGCATTTAT GCAGCTGAAC
ACTGTCAATA TGGGAACACC ACCAGACACA GTATTTTATT CCTCTGCTAA TTATCAACCA
ATTCCTTTAG AACAGTTAGC GATCGCGCAA AACTTATCAC AAGGAGTCTG TCAACCCTTG
GAATTTCCCC GTTTAATTGA TCGTGCTTAT CAAGATGGGG TCAGAATTTT CCTAGAGTTA
GGATCAGGCG GTAGCTGTTC TCGTTGGATT ACAGAAACTC TCCAAACACA AGATCACCTC
GCTATGTGCA TCAATCGTCG AGGTGCAGAT GATTTGGCTA CCATTGTTAA AATGTTAGCC
CAATTGGTAA GCCATAAAGT AGACCTGGAT TTATCACCCC TCTATCTTTC CCAGGAAACG
CCAACCGAAA TTACCACCTT GAAGGAAACA GTTCCCTCCT TGGTAGCTTC TCCTCCTTGT
CTATTTAACG AAGATGATAT TTTAGAGTTT ATTGAGGGGA AAGTATCAAA GGTCTTTGGT
GAAAGCTATC AAGAAATTGA TAACTACAGT AGACGAGTAA GAATGCCCTC ACCTCCCTTT
TTATTTGTGA GTCGTGTAAC TCAGTTAGAA GGGAAATTAG GTGATTATAA ATCAGGATTT
ATTGAAACAG AATATGATAT TCCCCAAGAT GCTTGGTATG CAATCGATGG TCAGTTAACT
GTCGGCATTT GTAAAGAAGC TGGTCATGGA CTGTTGATGT TATTAAGTTA TCTGGGAACA
GATTTTGAAA ATCAAGGTAA ACGTTCTTTC CGCTTATTAG ATTTATCAGC AACTTTCCTT
TTTGAACAGC CAGAAACCAT AAAAACATTG AAGTGTCGAG TCAAGATTAC TTCTTCTGTG
AAAACCGAAA AAAGTTTACT CGTCTTTTTC CAAGGAGAAG CTTTAATTGG CGATCAAGTG
TGGATGAAAC TCCATGACGG CTGTGCAGGA CTCTTTTCCG ATGAAGAATT AGAACAAGGG
CAAGGAATTG TTATTTCCGA AAGTGAAGCA AGAGAACGTC AGCGAATTAC AAAACAACAC
TTTACACCTC TACTAACTTG TTCTAAATCA AATTTCACAT CAGAAGAAAT TCTTGCCTTA
ACTATAGGTG ATTTGGGCGA GTGTTTTGGG GAAGAATATC AACAAAATTC CCTAAATCCA
TCCTTAAGAT TACCACCAGC AAAACTACTC ATGTTAGATG ATGTGATGAT GGTTAATCCA
CAGGGAGGAG TAGCCGGACT AGGGTTAGCC ATTGGGTCAA AAGAAGTCAC ACCGGAAGAT
TGGTATTACT TCTGTCATTT TCGCAATGAT CCGACCATGC CGGGGAATTT AATGATTGAA
GGATGTATCC AATTAGTGCA GTTTTATTGT TTATTTTTAG GGTTACAAAC TCGAACCAAA
GATGCTCGTT TTCAAATCAT TCCTGGCAAA ACTCAAGCAG CACGTTTTCG AGGACAAGTT
ACCCCCCAAA CAGGCACATT AATGTATCAA ATGGAGGTAT TAGAACTTGG ACTCTCTCCT
CAACCATACG CTGTGGCTAA TGTGGATGTT ATTTTTGGCG GAAAAACCAT TGCTACTATT
AAAAATATTG GTGTTCAGCT TGTTGAAAAA CCACTTGCCA TCAAAAATTC TCTCACTGAG
ATTAATCATC AACCCGTTTT ATTCAATGAA GAACAACTAA AGCAATTCGC CAAAGGCTCA
GTTGCGGCTT GTCTTGGTTC AGAATTTGAT ATATATGAAA ATCGTCAATC TGTTCGCCTA
CCTAATGGAG AGTTTCAGTT AGTTAGTCGA GTTTTAGAAA TAGAAGGCAA ACGTCATGAA
CTGCAAAAAC CTTCCCAAAT AATTACAGAA TATGATGTGA AACCTGATGC TTGGTTTTAC
GAACATAATG CTTACCCCAC TTTGCCCTAC TGCACCTACA TTGAAATTGC TGGACAACCT
TGCATTTTCT TAGGGGTTTA TATGGGGGCA ACCTTATTAT CTCCTGATGA TGACCTGCAT
TTTCGCAATT TAGATGGGCA AGGGACAATT CTTAAAGAAA TTGACCTCAG AAATAAAACT
ATTACTGATA AAGTCCGTCT TCTATCTACG ACTGCTGTGA AAGGGGCAAT AATTCAAAAA
TATGAGTTTG AATTATCCTG TGAAGGAGAA CCATTTTATC GCGGTAATAT GGTATTTGGT
GACTTTAGCA CCGCCGTTTT AGCGAATCAA GTGGGGTTAG ATGGGGGAAA ACGCCTCAAA
CCTTGGTATC AAGAACATGA AACTTCTGCT TCTGATTTAA CAACAATTGC CTTAAAAGAC
CCCAATTGGC GGCAGAAACT CTATCAAATT AATCCGAATA AACCCCATTA TCGCCTCTCG
GAAAAATATT TAGATTTTCT GGATGAAATG TTCATCATTG AAGATAGTGG AAACTATCAA
AAAGGGTATA TATATGCTAG AAAATCAATT ACCCCTCAAG ATTGGTATTT TCCTTTTCAT
TTTTATCAAG ATCCGGTCAT GCCTGGAGCC TTGGGAGTGG AATCAATTAT CCAAGCGATG
CAGGCCTATG CTTTGCAATT AGACTTAGGT AAATCATTCA AAAATCCTCG ATTTGGTCAA
GCTATTAATC ACGAAATCAC TTGGAAATAT CGGGGACAAA TTACGCCAGA AAATCATTTG
ATGTCTTTGG AAGTTCATAT TTCTAATATT GAAGTTGCGT CTGACCGCAT CACAATTATT
GGTGATGCCA GTTTGTGGAA AGAAGACTTA AGAATTTATG AAATTAAAGA TATTGCTCTG
TGCTTAGTAG AAGCATGA
 
Protein sequence
MERSRLAIVG LDCFLNQHFN SAIFSALIYD GKTIKDINHL DNKVESFTIL SKISHQALQD 
CQVNCHKNVA VITAHNETVN PHNSAENLGK NLSNLWNFTA ITFNLDGQKD CLLKALEIAQ
ILLTAKEVDT VIISVVNSEL NTYSVNGGAI IVKDYAVAQK DGDRIYAVLE SFTRIADTLS
PSSVAIQQSC EKALHQANIS PPEVGYIEVC HQNLAEENSP EFIGLINAYA THQHDLTCAV
GSIKANLGDL GIATGVISLI KTALCVYHRY LPAYPQWQQP QHPEIWQNSP FYVPTSSYPW
FLSKNQEKRQ AAVNLVAEDG TYGHIILSED TTQTSRSPSY LPYASFYLFP IAADEASSLQ
IQLDDLAAKI ETSLSLPQTA QENFSQFQRN SPSPYVIAIV GENKETLQRE IKQAQKGIEK
AFREGKSWKS PKGSYFTPKP IGKQGKIAFV YPGAFNSYLG MGRNLFQLFP QLWNRIASLV
KDPNQFLQAK HLYPRSQYSL SQRDLEALET EFIANPLSLL ETGTGFAVMF TDIMQQYFQI
KPQAAFGYSM GESTMMYSLG VWANADAGSQ YIHSSPLFRD RLIGTKQTVA EYWGEPTGEN
LWSSHVLMAA PAAVKQCLEQ EPRVYLTHVN APQEVVIAGD PQGCLRVIER LQCDAFRSPS
DMVLHCEAMQ SEFAAFMQLN TVNMGTPPDT VFYSSANYQP IPLEQLAIAQ NLSQGVCQPL
EFPRLIDRAY QDGVRIFLEL GSGGSCSRWI TETLQTQDHL AMCINRRGAD DLATIVKMLA
QLVSHKVDLD LSPLYLSQET PTEITTLKET VPSLVASPPC LFNEDDILEF IEGKVSKVFG
ESYQEIDNYS RRVRMPSPPF LFVSRVTQLE GKLGDYKSGF IETEYDIPQD AWYAIDGQLT
VGICKEAGHG LLMLLSYLGT DFENQGKRSF RLLDLSATFL FEQPETIKTL KCRVKITSSV
KTEKSLLVFF QGEALIGDQV WMKLHDGCAG LFSDEELEQG QGIVISESEA RERQRITKQH
FTPLLTCSKS NFTSEEILAL TIGDLGECFG EEYQQNSLNP SLRLPPAKLL MLDDVMMVNP
QGGVAGLGLA IGSKEVTPED WYYFCHFRND PTMPGNLMIE GCIQLVQFYC LFLGLQTRTK
DARFQIIPGK TQAARFRGQV TPQTGTLMYQ MEVLELGLSP QPYAVANVDV IFGGKTIATI
KNIGVQLVEK PLAIKNSLTE INHQPVLFNE EQLKQFAKGS VAACLGSEFD IYENRQSVRL
PNGEFQLVSR VLEIEGKRHE LQKPSQIITE YDVKPDAWFY EHNAYPTLPY CTYIEIAGQP
CIFLGVYMGA TLLSPDDDLH FRNLDGQGTI LKEIDLRNKT ITDKVRLLST TAVKGAIIQK
YEFELSCEGE PFYRGNMVFG DFSTAVLANQ VGLDGGKRLK PWYQEHETSA SDLTTIALKD
PNWRQKLYQI NPNKPHYRLS EKYLDFLDEM FIIEDSGNYQ KGYIYARKSI TPQDWYFPFH
FYQDPVMPGA LGVESIIQAM QAYALQLDLG KSFKNPRFGQ AINHEITWKY RGQITPENHL
MSLEVHISNI EVASDRITII GDASLWKEDL RIYEIKDIAL CLVEA