Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3822 |
Symbol | |
ID | 4242273 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 5887337 |
End bp | 5891401 |
Gene Length | 4065 bp |
Protein Length | 1354 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638108756 |
Product | beta-ketoacyl synthase |
Protein accession | YP_723339 |
Protein GI | 113477278 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3208] Predicted thioesterase involved in non-ribosomal peptide biosynthesis [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACCTA GCTCCGAAAC TATTCAACTC TCAAATCAGC AACGTCTTTT ATTAGCAGTT AAACAAGCAA CTGCTAAATT AAATGAAATA GAAACAGCAA CCACCGAACC AATAGCAATT ATTGGTACTG CTTGTCGTTT TCCAGGGGGT GTTGATACTC CAGAGTCTTA CTGGGAATTT CTCAGGGAAT CAATAGATGG CAGAGTAGAG ATTCCTAAAG AGCGATGGGA TAATGACCTC TACTACAACC CCGACCCAGA AGCACCAGGT CAAATATACG TTCGTCATGG TTATTTTCTC CAACAACCAG TAGACCAGTT TGACCCCGCT TTTTTCAGTA TTTCTGGGGT AGAAGCAAAT AAGATGGACC CATCACAACG ACTTTTGTTG GAGGTAACTT GGGAAGCATT AGAAAATGCA GGTATTTCAC CCAGGAGTTT GAAAAATACT GATACTGGAG TATATATAGG TCAATGTTTT AATGATTATG CCTTGATTGG TATAAATGCT TTACCTAATA ACTTTGGTGA TTTTTATTTA GGTCTAGGTA CAGCGATGAA TACTTCATCG GGTCGGGTGG CTTATGCTTT TGGGTTACAA GGACCAACTT TTACTCTGGA TACAACTTGC TCCTCGTCCT TAGTTACTTT GCACATAGCT TGTCAGAGTT TGCGGAGTGG AGAATCTAAC CTTGCTCTGG TGGGTGGAGT AAATTTAATG CTTCATCCCA ATGTTACTCA TGGATTCTGT AAAGGTCGAG CATTATCCCC TGATAGTCGT TGTAAGACTT TTGATGCTAG TGCTGATGGT TATGCTAGGG GAGAGGGTTG TGGTATTATT GTTGCCAAAC GTCTTCGGGA TGCTGTGGCA GATGGCGATC GCATTTTGGC TCTGGTGAAA GGTTCTGCTA TTAACCATGA TGGCCCTAGT AGTGGACTGA CAGTTCCTAA TCAACAGGCA CAAAAGAAGG TAATTCGGCA AGCTTTGGTA AATGCTAAGG TAGACCCGTT AGCAGTTGAC TATGTTGAAT GTCACGGTAC GGGAACTTCT CTGGGAGATC CTCTGGAGGT CAAAGCTATT GATGAGGTTT ATTGTCGAGA AAGAAGTAAG GATGACCCTC TGGTTTTGGG TGCGGTTAAA AGTAATGTGG GGCACTTAGA AGCTGCTGCC GGGGTTGCAG GTTTAATTAA AATTATTTTG GCTCTGCAAA ACAAGGAAAT TCCACCTAAT CTTCATTTTA ATCAACCTAA CCCTCAAATT GATTGGGATA AAATTCCGGT GCAGGTGGCA ACAAGTGTTG TACCTTGGGA AAAACCAGGT AAACCTCTTT TGGCAGGAAT TAGCGGATTT GGTATGAGTG GGACTAATGC TCATGTAATT TTACAGGAAG CACCAGAACA AGTTAAAAGT AAAAATGATA CAGATAGATC CCTTCATTTA TTAACTCTGT CAGCTAAAAC TAAAAAAGCT TTGGAGAAGT CGGTTGTTCG TTATCAAAAT TATCTAGCAC AAGAGAATAA TGGGAATGAG TTAGCTGATA TTTGTTATAC AGCTAACACA GGACGTACCC ATTTCAATCA TAGGTTAGCA GTTATCGCAG CTAATCAAGC AGAATTATTG GAGAAATTAA GCGCTGGTTT AACTGGGGAG GAAATATTCT CTGGACAGGT ATCTAGTAGC AGTTTACCAA AGGTGGCTTT CTTATTCACT GGTCAAGGTT CCCAATACGT TAATATGGGT CGGAGATTAT ATGAGCAAGC ACCAGTTTTC CGACAGGCGA TCGACCAATG TAATCAAATT TTCACTACAA TTGTTGAGGA GCAAGCAGAA TCAGAGGAAA TGTCTCTGTT AGATGTCATA TATTCTGATA CTACAGAAGA TTCTGATTCA TCTCCACTAC ACCAAACTGC TTATACTCAA CCAGCACTAT TTTCTATTGA ATATGCTTTG GCTAAATTAT GGCAGTCTTG GGGAATTCAA CCAGATGTAG TCATGGGCCA TAGTGTGGGT GAATATGTTG CAGCAACAAT AGCAGGAATT TTTAGTTTAG AAGACGGATT AAAGTTAATT ACTGCTAGGG GACAATTAAT GCAACAGTTA CCTTCTGGTG GTGAAATGGT TTCTGTTATG GCCTCAGAGT CAAAAATACG CCCTCTCCTA AAAAATTACA CAGATCAAGT GGCTATGGCA GCGATTAATG GGCCAGCAAG TGTGGTGATT TCTGGAGAGT CAGAAGGGGT AAGAGCGATC GCTACTAAGT TAAAGTCGGA AAGAATTAAA ACGAAACAAC TGCAAGTATC CCATGCTTTT CATTCACCTC TAATGGAACC AATGTTATCT GAGTTTGAAG CTATAGCTAA TCAAATTGAT TATAGTATAC CCAGGATACC GATAATATCT AACGTCACTG GGGCAAAGGC AGATAGTAGT ATTACTACTG CTAAATATTG GGTAAATCAT GTCCGACAAC CAGTTAAGTT TGCCCAAAGT ATGAATGCTT TACACCAACA GGGAGTTGAT ATTTTCCTAG AAGTCGGAGC AAAACCAATA TTATTAGGTA TGGGCCGTCG GTGTTTACCA GAAGATGTAG GTGTGTGGTT ACCATCATTA CGCCCTAATG TGGATGAATG GCAGCAAATA CTTTCTAGTC TATCAGAATT ATACGTTCGG GGAGCCAAAA TAGATTGGTC AGGGTTTGAT GGAGATTATC AGCGACAAAA AGTGACTTTA CCTAATTATC CTTTCGAACG TCAACGGTAT TGGATAGAAA GTGAGAAGTC AGGGTTATCA GGATCGACTG TTTCTCAAAT AACAGAAACG GCAACAGAAA CTTTAACTAA GCAACTGGCA GAAACTGGTA ATCTTTCAGC AAGCGAACTG AAGTTAGTGC CAAAAATACT GGATTTGCTC AAACAACAAC AGTTAAAATC TGTTACAGAG TCGGACCAAA AGTATACAGA ATTAGAAGAC TTAGTTTCCC CTTCGTCAAT AATAGATATT GAGAAGTTTC AAGCTGCTTC GAGAAAAGAA CAGAAGTTGA TGATAGTAGA GTATTTGCAA GAATTGGCAA TGAAGGTACT GCAACTGAAT ACTTCTGAGG TTTTAGACCC CAATGAATCT GTTCTAGAAC TAGGTTTTGA TTCTTTGAGC GTTGTAGAAC TTCGGAGCAA AGTCGAAAAA CAACTAGCAG TAACTATTCC TGCCAGCTTA ATTTTGCAAG GTCCTAGCAT TATGGAATTA GCAGAAGCAT TGGTTGAACA ATTAACTAAT AGCGGCTCAT CTGACCAAGG TCCCGTTAAG AGCAAAAAAG GAAGTGGTTG GATTGCATAC CACAAACCTA AACTTAATGC AAGTACCCGT TTATTCTGTT TTCACCCATG GGGTGCTAGT GCTTCTATGT ATCAACAATG GTCTGATGCT TTGCCTCCAG AAATAGAAGT TTTACCCATT CAACTACCAG GTAGGCAAAG GCGTATTCAG GAAAAACCAT TTACAGATTT TGCAAGTCTA ATAGAGGTTT TAGCAGATTT CCTTTCTCCT TATTTGGATA AACCTTTTGC TTTCTTTGGT CATAGCATGG GAGGGTTCAT TGCTTTTGAG CTTGCCTATT TTCTGGAAAA ACAATATAAT TTGAAACCAA GACACTTATT TTTGAGTGGT GTTGTTCCAC CATCAGACAA TACTTTCTTA GAAAAAATAG GATCTCTTTC AGAAACAGAA CGACTTAATT ATCTTCTAGA AATTTCAGAA ATTCCAGAAA GTATTACTGA AGATTCATCC CTTTTCCATG AGTTAATGAA TATCTTTAAG GCAGATTTTC AACTGTTACA AAGTTATCGT TATCTAGAGA AAAAGCCACT AGATTTCCCA ATCTCCAGTT TTAGTGGAGT AGATGATTAT ACGATTAGCG ATCGCCAACT TAATAATTGG TCTAAGTATA CTACCAGCAA CCTAAAAATA GATAGGATAC CTGGTAAACA TATGTTTATG TTCTTGAAAG ATAGCCAAAA ATTACTTTTG GAGCTGATTT CTCAAGAACT TTTACCACAA TTAATTGCTC AATAA
|
Protein sequence | MEPSSETIQL SNQQRLLLAV KQATAKLNEI ETATTEPIAI IGTACRFPGG VDTPESYWEF LRESIDGRVE IPKERWDNDL YYNPDPEAPG QIYVRHGYFL QQPVDQFDPA FFSISGVEAN KMDPSQRLLL EVTWEALENA GISPRSLKNT DTGVYIGQCF NDYALIGINA LPNNFGDFYL GLGTAMNTSS GRVAYAFGLQ GPTFTLDTTC SSSLVTLHIA CQSLRSGESN LALVGGVNLM LHPNVTHGFC KGRALSPDSR CKTFDASADG YARGEGCGII VAKRLRDAVA DGDRILALVK GSAINHDGPS SGLTVPNQQA QKKVIRQALV NAKVDPLAVD YVECHGTGTS LGDPLEVKAI DEVYCRERSK DDPLVLGAVK SNVGHLEAAA GVAGLIKIIL ALQNKEIPPN LHFNQPNPQI DWDKIPVQVA TSVVPWEKPG KPLLAGISGF GMSGTNAHVI LQEAPEQVKS KNDTDRSLHL LTLSAKTKKA LEKSVVRYQN YLAQENNGNE LADICYTANT GRTHFNHRLA VIAANQAELL EKLSAGLTGE EIFSGQVSSS SLPKVAFLFT GQGSQYVNMG RRLYEQAPVF RQAIDQCNQI FTTIVEEQAE SEEMSLLDVI YSDTTEDSDS SPLHQTAYTQ PALFSIEYAL AKLWQSWGIQ PDVVMGHSVG EYVAATIAGI FSLEDGLKLI TARGQLMQQL PSGGEMVSVM ASESKIRPLL KNYTDQVAMA AINGPASVVI SGESEGVRAI ATKLKSERIK TKQLQVSHAF HSPLMEPMLS EFEAIANQID YSIPRIPIIS NVTGAKADSS ITTAKYWVNH VRQPVKFAQS MNALHQQGVD IFLEVGAKPI LLGMGRRCLP EDVGVWLPSL RPNVDEWQQI LSSLSELYVR GAKIDWSGFD GDYQRQKVTL PNYPFERQRY WIESEKSGLS GSTVSQITET ATETLTKQLA ETGNLSASEL KLVPKILDLL KQQQLKSVTE SDQKYTELED LVSPSSIIDI EKFQAASRKE QKLMIVEYLQ ELAMKVLQLN TSEVLDPNES VLELGFDSLS VVELRSKVEK QLAVTIPASL ILQGPSIMEL AEALVEQLTN SGSSDQGPVK SKKGSGWIAY HKPKLNASTR LFCFHPWGAS ASMYQQWSDA LPPEIEVLPI QLPGRQRRIQ EKPFTDFASL IEVLADFLSP YLDKPFAFFG HSMGGFIAFE LAYFLEKQYN LKPRHLFLSG VVPPSDNTFL EKIGSLSETE RLNYLLEISE IPESITEDSS LFHELMNIFK ADFQLLQSYR YLEKKPLDFP ISSFSGVDDY TISDRQLNNW SKYTTSNLKI DRIPGKHMFM FLKDSQKLLL ELISQELLPQ LIAQ
|
| |