Gene Ava_4742 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4742 
Symbol 
ID3679629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5948134 
End bp5951568 
Gene Length3435 bp 
Protein Length1144 aa 
Translation table11 
GC content43% 
IMG OID637720098 
ProductBeta-ketoacyl synthase 
Protein accessionYP_325234 
Protein GI75910938 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.419577 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACCTA TTGCAATTAT TGGTATTGGT TGCCGTTTTC CTGGTGCGGA TAGTCCGCAA 
GCATTTTGGC AATTGTTGTC TCAAGGAGTT GATGCGATTA CAGAAATACC GGCTGATCGT
TGGAACATTG ATGAGTTCTA CGATCGCAAT CCAGAAACTC CAGGAAAAAT GAACTCCCGT
TACGGCGGGT TTCTCTCCCA GGTCGATCGC TTCGATCCCC ATTTTTTCGG TATTTCTCCG
CGAGAAGCCT TATTAATGGA TCCCCAACAA AGGCTGTTAT TAGAAGTAGC TTGGGAAGCT
TTAGAAGATG CGGGAATTGT CCGCGAACAA CTGACTGGCT CAAAAACTGG CGTATTTGTC
GGCATTTCCA CCAATGATTA TAGCCGCATC CATCCTGAGT TTGATAGCAA TCCTCAAGGT
TACGATCTCA CCGGTAACTG TATTAATATT GCTGCGGGTC GTCTTTCCTA TCTGTTTAAC
TTACGGGGGC CGAGTTTAGC TGTAGATACG GCTTGTTCCT CTTCTTTGGT GGCTGTGCAT
TTAGCTTGCC AAAGTATCTG GAATCAAGAG TCTAGCATGG CGATCGCCTC TGGAGTTAAT
TTGGTTCTTT CTCCCATTGG TAATATTGCG TTAAGTAAAC TGAAAGCCCT TTCTCCAGAT
GGACGCTGTA AAACCTTTGA TGAAAGTGCC AATGGTTATG TGCGGAGTGA AGGGGCTGGC
TGTATTATTC TCAAACCTCT TGCCCAAGCA TTAGCAAATC ATGACCCCAT CTATGCAGTC
ATCCGGGGTA GTGCCATTAA CCATGATGGT CGCAGTAAAG GATTAACTGT TCCCTACGGG
CCAGCCCAAG AAGCCTTGAT ACGTCAAGCA TTGCAACAAG CACAAGTCCA GCCCAAAGAA
ATTAGTTATG TAGAACTACA TGGAACCGGC ACACCCTTGG GCGATCCGAT TGAAGCTATG
GCTATAGGTG CAGTCTTAGG GGAAGGACGA GACCCAGACC ATCCTTGTTT GGTAGGAGCA
GTTAAAAGTA ATATAGGACA TCTAGAAGCA GCTGCGGGGA TTGCTAGCAT CATTAAAATG
GCTCTTGCCC TCAAATATCA ACAAATTCCT CCCAGCCTGC ATTTTCATCA ACCTAATCCC
TACATTCCTT TTGATCAGTT ACCTTTACGG GTGCAGACTA GTCTCATTCC TTGGCCAAAA
AGTCAGTATG GTGCGAAAGC CGGAGTTAGT TCTTTCGGAT TTTCGGGAAC TAATGCCCAT
GTGATTTTAG AAGCATATTC ATCATTCAGC CCTACCGAGC AACCAACTAG CAAACTTCCC
CATTTATTAC CTTTATCAGC CCATACACCC GCAGCAGTGC AAACCCTAGC GCTAGGGTAT
CAAGACTTGA TTAACGCACA AAAACTGACT CCAGAATTTG TGCAGAATCT TTGCTATAGC
GCTAGTGTCA GACGCACCCA TCAGTCCTAT CGTAGTGCCG TTGTTGTCAA TTCTCCAGAA
GACTTACCAT CTTGTTTACA AGCCCTGACA ACCGCAGATA TTACCACCCA AGTAAAGCAG
GCAAAACGCA AACACAAAGT TGCTTTTGTG TTTTCTGGAC AAGGCCCCCA GTGGTGGGCG
ATGGGACGGC AATTATTAGC CCAAGAACCT GTTTTTCGGG CAGTGATTGA GGAATGTGAT
ACTTTAATTC AAAAGTATGC CCAATGGTCA TTATTAGCAG AGTTTGCCGT TCCTGAGTCT
CACTCTCGCT TCCAAGAAAC CGAAATTGCT CAACCTGCCT TGTTTGCATT GCAGGTTGGA
TTAGCCCATT TATGGCGTGC TTGGGGAATT GAACCCAAAG CCGTGGTAGG ACATAGTTTA
GGAGAAGTCG CTGCTGCCCA TTTCGCCGGG GTTTTGAGTT TAGAAGATGC TATATATTTA
ATTTGTCACC GGGGACGGTT AATGCAGCAA GCCACTGGTA ATGGCAAGAT GCTAGCAGTG
GAATTACCAG TTACTGAAGT TGAACCCTTA TTAGCAGCTT GGGCTGGTAA ATTGGAAATT
GCCGCCATTA ATAGTCCCAC GGCAACGGTA ATTTCTGGAC AGTCTCAGGC ATTGGAAGTA
TTTGTCACCC AACTGCAACA GCAACACCCC GATATTTTTT ATAAAGAATT ACCAGTTAAT
TATGCCTTCC ATAGCCAACA AATGGCTCCC TTTGCTGAGG CTTTAGTTGA GAAATTATCA
CACATCCAAC CCCAAACAGG TAGCCTAGCA ATTTTCTCCA CAGTGACAGG AGATAAACAA
GCTGGTCAGA AATTTAATGG TGATTATTGG GGTCAGAACC TCCGCCACAC AGTTTGCTTT
GCGCCAGCTT TGACAGCTTT GATTCAATCG GGCTATACCC AATTTATAGA AATTAGTCCA
CATCCCGTAT TATCAGGATA TATCAATGCT TGTTTGAAAA AGCAAGAAGT TGATGGGGTG
GTCTTACCAT CTTTAAAACG GGGTTTTGGA GAACGGGCAA CCCTGTTGAA AAGTTTGGGG
ACACTTTACA CCCTTGGTCA CGCAGTAAAT TGGCAGTCTT TATATCCTGA TGGCTGCCAA
ATGGTGGATT TACCCCTTTA TCCTTGGCAA AGAGAATCTT ATTGGATCAG TGAATCTCAA
CCCCAATTTC AAAAAGCACA ACCAGCCTCC TCTTTACTCA ATTTATTACT AGCAGGAAAA
ACAGAACAAC TCACCCAAGA ATTAAGCAAC CATCACCAGT TATCTCCTGA AGCTAAACAA
TTGATTCCCC AATTAATAGA ATTATTAGCC ACAGGAAAAT CTACAGCAAA AATTACTCAA
GATTTAAGTA ATGCCCGTTA TGGAATTGAA TGGCAACTTA GCCCTTTAAC TGTAGATAAT
AAAATCTCTC AGGCTGAAAA TTGGTTAATT TTTAGTGATA ACCAAGGACT AGGAAAGGAA
TTAGCTGCTG TAGTTAATGA CTCTTGTATT TTAGTTTCAT CGGGTGAGAG TTACGAAAAA
TTATCATCCA GTCATTATCA AATTAATCCG CATCAAGCTG CTGATTTTCA AAAATTATTA
ACGGATATTT CTCAAACCGT AACCAAAGTA GTTTATCTAT GGGGGTTAGA AAATTCTCAA
ACTCAATGCT ATAGTTTGCT GTATTTAGTA CAAGCCTTAG CTAAAATTAA CGGCAAAATT
GCACCTAAAC TGTGGATTGG TACACAACAA GCTCAAGCCG TCACCACCAG TTGTCATCCT
GACTTTTCAG ATCATGTGGA AAAGCTAACG CCAGACCTAA CCCCCCAGCC CCCTTCCCTA
CTAGGGAAGG GGGAGAAATC AAAGCCTCTC TCCTTGCAGG ATACTGTTGG CGAATTGATA
GAGGGCAAAA CCAATCATCA CAAAATAGAA AATCAGGAAT ACTATGAGGC AGTGGGAGCA
ATTCGAGCAA AGTGA
 
Protein sequence
MEPIAIIGIG CRFPGADSPQ AFWQLLSQGV DAITEIPADR WNIDEFYDRN PETPGKMNSR 
YGGFLSQVDR FDPHFFGISP REALLMDPQQ RLLLEVAWEA LEDAGIVREQ LTGSKTGVFV
GISTNDYSRI HPEFDSNPQG YDLTGNCINI AAGRLSYLFN LRGPSLAVDT ACSSSLVAVH
LACQSIWNQE SSMAIASGVN LVLSPIGNIA LSKLKALSPD GRCKTFDESA NGYVRSEGAG
CIILKPLAQA LANHDPIYAV IRGSAINHDG RSKGLTVPYG PAQEALIRQA LQQAQVQPKE
ISYVELHGTG TPLGDPIEAM AIGAVLGEGR DPDHPCLVGA VKSNIGHLEA AAGIASIIKM
ALALKYQQIP PSLHFHQPNP YIPFDQLPLR VQTSLIPWPK SQYGAKAGVS SFGFSGTNAH
VILEAYSSFS PTEQPTSKLP HLLPLSAHTP AAVQTLALGY QDLINAQKLT PEFVQNLCYS
ASVRRTHQSY RSAVVVNSPE DLPSCLQALT TADITTQVKQ AKRKHKVAFV FSGQGPQWWA
MGRQLLAQEP VFRAVIEECD TLIQKYAQWS LLAEFAVPES HSRFQETEIA QPALFALQVG
LAHLWRAWGI EPKAVVGHSL GEVAAAHFAG VLSLEDAIYL ICHRGRLMQQ ATGNGKMLAV
ELPVTEVEPL LAAWAGKLEI AAINSPTATV ISGQSQALEV FVTQLQQQHP DIFYKELPVN
YAFHSQQMAP FAEALVEKLS HIQPQTGSLA IFSTVTGDKQ AGQKFNGDYW GQNLRHTVCF
APALTALIQS GYTQFIEISP HPVLSGYINA CLKKQEVDGV VLPSLKRGFG ERATLLKSLG
TLYTLGHAVN WQSLYPDGCQ MVDLPLYPWQ RESYWISESQ PQFQKAQPAS SLLNLLLAGK
TEQLTQELSN HHQLSPEAKQ LIPQLIELLA TGKSTAKITQ DLSNARYGIE WQLSPLTVDN
KISQAENWLI FSDNQGLGKE LAAVVNDSCI LVSSGESYEK LSSSHYQINP HQAADFQKLL
TDISQTVTKV VYLWGLENSQ TQCYSLLYLV QALAKINGKI APKLWIGTQQ AQAVTTSCHP
DFSDHVEKLT PDLTPQPPSL LGKGEKSKPL SLQDTVGELI EGKTNHHKIE NQEYYEAVGA
IRAK