Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3821 |
Symbol | |
ID | 4242272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 5881443 |
End bp | 5887172 |
Gene Length | 5730 bp |
Protein Length | 1909 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638108755 |
Product | beta-ketoacyl synthase |
Protein accession | YP_723338 |
Protein GI | 113477277 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | [TIGR00128] malonyl CoA-acyl carrier protein transacylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCAGC CAGAAGAACC ACAACAATAT AATGATAATG AAATTGCTAT CATAGGAATG TCTGGACGGT TTCCAGGGGC AAAAAACGTG GAGGATTTTT GGGATAATCT CAAGAATGGT GTTGAATCTA TCTCATTATT ATCAGATAAA CAGTTGAGCA AGTCTGGTGT TGCACCCGAA ATACTTAATA ATCCCAACTA TGTTAAAGTT AATTCTATGG TCTCAGACAT CGATATGTTT GATGCTAATT TTTTTAACTA CAGTCCTAGA GAAGCAGAAG AAATAGACCC GCAACAACGA CTTTTTTTAG AGTGCGCCTG GGAAGCAATA GAAAGTAGTG GTTATAACCC AGAAAACTAT GAAGGTTCAA TCGGGGTATA TGCTGGAGGT GGACTGCCTA CCTATTTAAT GTATAACCTG GAAGACCAAG ATTTTATTCT ACTGGGTAAT CGCTACTTTA CCCAAATGGT AGGTAATGAC AAAGATTACT TAGCAACCCG CACCGCTTAC AAACTAAATC TTACTGGACC AGCGCTCAAT ATTCAAACAG CTTGCTCCAC TTCATTAGTT GCAGTACATT TAGCATGTCA AAGTTTGCTC AATGGAGAAT GTGACATGGC TCTGGCAGGT GGTGTTTCGA TCCAAATCCC TCAAAATGTG GGATACTTAC ATCAGGAAGG ATTGATCGGC TCTCACGATG GTCATTGTCG AGCCTTTGAT GCCAGAAGTT CGGGAACAGT TTTTGGTAAT GGTGTAGGAG TTGTGTTACT TAAACCACTG CAAGATGCTA TAGCAGATGG AGATTGTATT CATGCTGTAA TTAAGGGTTC GGCGATTAAT AATGATGGTT CTTTGAAACT GGGTTATACT GCTCCCAGTG TTGATGGTCA AGCTGCAGTT ATTTCTGAAG CTCAAGCTGT TGCTAGTGTA ACTCCAGAAA CAATTACATA TATAGAAGCT CATGGCACGG GTACGGAACT AGGAGATCCG ATTGAGATTG AAGCTTTAAC AAAAGCCTTT TCAGAGCATA CTGATAAAAA ACAATTTTGC GCCATAGGTT CCCTAAAAAC AAATGTGGGA CACATGAATA CGGCGGCAGG GATAGGAGCA TTGATTAAAA CTGTTTTGAC TCTCAAACAT AATTTAATAC CTCCCAGTCT ACACTACGAA AAACCTAATC CCCAGATTAA TTTTAGCGAT AGTCCGTTTT ACGTCAACAG TACTTTATCT GAATGGAAAC GCAACGGCAC TCCTCTACGA GCAGGAGTTA GTTCTTTTGG TATAGGTGGA ACTAACTCTC ACGTTGTATT GGAAGAAGCT CCCAGTCAGG TTAAAGAAGA AGATGATTTG CAACGTCCCG TTCATATATT AACTTTGTCA GCTAAGACTC CAACAGCTTT AGCAGATTTA GTAGATAGTT ATCATCATTA TCTAGAAACT AATCTAGATT TGGATCTAGG AGATGTTTGT TACACTGCAA ATACAGGAAG AGTTCATTTT AACCACAGAT TAGCAGTTGT TGCTGAGAAA CAAACAGAAT TAACTGAAAA ACTGCGACAG TTTAAATTAG AAGATAAAGT AGAGGGTATC TGTTCTGGGA AACTGTTAAT CAACGCTACT GCTCCTAAAG TCGCCTTCCT ATTCACAGGT CAAGGTTCCC AATATGTTAA TATGGGCAAG GAGTTATACG AACAAGCACC AGTATTCAGA GCGGCACTTG ATGAATGTGA GGAAATATTA GAGGAACTGG GTGTAAATTC TATATTAGAA ATTATTTATC CTGAAGGTGG AGAGACATCA CCCCTAGACC AAACAGCTTA TACCCAACCT GCTATATTTG CCATAGAATA CGCCCTAGCT AAATTATGGG AATCTTGGGG AATTAAACCA GATGTAGTGA TGGGTCATAG TGTAGGTGAA TATATAGCAG CAACAGTAGC TGGAATATTT AGTTTAGAAG ATGGTCTGAA ACTCATTGCA GCAAGGGGAC GGTTAATGCA ACAGTTACCC GCAGGTGGAG AAATGGTAGC TGTGATGGCA TCAGAAGCAA AAGTGAAAAA GCTCCTCAGA CCTTATGCAG AAAAAGTAGC TATCGCAGCA ATAAACGGAC CAAAAAGTGT AGTTATTTCT GGTGAGGCGA CAGCAGTCAG GGAAATTGTC AGCAGTTTAG AATCAGAAAA AATTAAAACC AAACAATTAC AGGTATCTCA CGCCTTCCAT TCACCATTGA TGGAACCAAT GTTGGCAGAA TGGGAAGCAG TAGCTAAGGA ACTAACTTAT CATCAACCAG AGATTCCAGT CATATCTAAT GTCACGGGAA CAATAGCAGA TAAAAGCATC ACAACAGCCA AATATTGGGT AGATCATGTA CGTCTACCAG TCAGGTTTGC TGAAGGTATG GATGCTTTAC AGAAACAGGG CGTTGAAATT TTCTTAGAAG TCGGACCGAA ACCAATATTG TTGGGTATGG GTCGCCGATG TCTGCCTAAA GATGTAGGTG TTTGGTTGCC ATCATTACGT CCTGAGCACA TCCCTCTCCA GTCCTCCTTA GAAAGCGCGA GGTCAGAGGA TTTTTCACAG ATGCTTTCTT CTCTGGGAGA GTTATATGTG CGGGGAGTAA AGGTAGACTG GGTAGAATAT GATGGAGAGT ATAGTCGTCA GAAAGTAATA TTACCTACTT ACCCATTCCA ACGACAAAGA TACTGGATAG ATAACTATTC GCGGGGGAGA AATTCTGTTT CAGTCAGTAG CAATAATGGT TTTGTCACTG AGGAGGGAAC ATCAAATCCT TTATTAGGTC GTAAATTACT TTTACCCTTT GCACAACAAT TCCGCTTTCA GACTAAATTA ACTCCTGAGT TTCCTTCTTA TGTGAAAGAC CACAGATATT ACGGCAAAAT AGTTGTAGCT GGTGCGTCTC ACATTGTTCT TGGTCTCCTT GCTGGTAAAG AAGCACTAAA CAGTGATTCT TGCGTTGTCG AAAATATTGA ATTTTTGCAG ATATTGGGAG CAGATGAAAC TTCTAGTCGT ACAGTGCAAA TACTTCTGGA TCAAGAAAAA GAAACAGAAT ATACTTATCA ATTGATAAGT TGTGAAGCAG GAACAGAATA CGACCCATCA ACAGTTTGGA CGGTTCACGC TAAGGCAAAA GTTCGCTCTG TTAATAACTC AGAGTTACCG AAAGAAAATA TTGATATTGC AGCAGTAAAA AACCGATGTA TTCAGTCTCT TTCCAGTGAT GAATATTACT CAACTGCACT GGAACCCATG AAAGGAGACT TTCACCTAGG ACCGACATTT CAATGGACAG AACAAGCTTG GATAAGTAGT ACAGAAGGCT TAGTCAAAGT AAAAGCAGGT GAAAACAATG AAGAGATGCA GGAATATTGG CCTCATCCGG GTATGGTTGA TTCTGGAATT GTGCCAGTAG CTCTGTTGTC AATACTTAGG GAGTTGTCTC AGAATACTGA GAATAATGGT AATGGCAAAA AAAATGCCAT TGATAGTGAA ATGGAAATTC CTACCTACGC ATTTGCTGGA GCTAAGAGCT TTAAGTTTTT TGATAATTTT GACATTGATG ATGATTTGTG GTATTACACC AAAATTGACG AGTCAAGTTC TTATGATCGA GGAGAACTTC TAGGTGAGAC TTACTTGCTC AATGGAGATG GCAAAGTATT AGCAGAATAT AGTGGTATAG ACTTCCGTCA ATTAAGTCGC AAGTTATTGT TGAGGAGCTT TGGACTAGAT TTCAGTCAGT GGTACTATCA AACAGAATGG CAACCTGCTG CTTTGATGCC TGGTGAGACA CAGGAAACAG GAACTTATCT GCTATTTTGC CCTACAGGAG AATCAAATAG TAAGTTAAAA GAATGGAGCG ATAGCCTGAA CTCCCAATTA TTAGACCAAG GTCATCAATG TATTGTTGTT TATCCAGCAG ATAGTTACAA AAAGTTATCC TCTGAAGGCA AAAAACAGAT AATTCAATTA TCTCCAACAC AACCAGAACA TTTCCAAAAG TTACTTGATG AGGTAAATGA ATTAACAGAA GAGTTACCTT TGAAGGGTGT TATTCACCTG TGGAGTTTAG ATACAGATGT TGAGGAGTTG ACAAAAACAG AAGAGATAAT TTGTGGTAGT ATCATTAACT TCTTGCAAGG GATGCGATCG CTAGAAAAAT TACCTCCTCT TTTGTTAGTG ACTCAAGGAG CTTCATTAGA AGTCAGAAGC AAGGAGTCAG GAGTCAGTAT TAACGCTCAA CCGCAACAAG CTCTTTTATG GGGGGTAGGC AAAGTCATAA CTATGGAGTA TCCGCAGTTA GACTGTCGTT GTTTAGACCT AGACCCTAAT GCTGATGAAC AAGAAACTTT AAAAGTTTTA CTTGATGAGG TCGCTAACCA TCAGACATCA ACCAGTGTGG AAAATCAAAT CCGCTATTGT CAAGGAAAAC GTCAAGTAGC AAGACTAACT CAACCCAGGT TAGATACTGA TGCTAAGTTA GCTATTCAGA CGGAAGCAAG TTATTTGATT ACTGGTGGTT TAGGTGCTTT GGGGTTAGAA GTGGCTCAAT TACTCGTACA GCAGGGCGTT AAGAGTATAG CCTTAGTAGG GCGCAACTCT CCCTCAGAAA CCGCCCAGGA AAGTATCCAA GAATTAGAAG CAGCAGGAAC TCAAGTTTCA GTATTCTTAG GGGATGTCTC TGTGGAAAAA GATATGGTTA ATATTTTCCA AAAGATACAG ACATCCTTAC CTCCCCTTAA AGGTGTAATT CATGCTGCCG GAGTTTTAGA TGACGGTTTC ATACAACAAA TGACTTGGCA ACAATTCACA AAAGTTACTG CTCCCAAAGT TACTGGAACT TGGAATCTAC ATAAATTAAC AAAAGATATA CCATTAGATT TCTTTGTTTG CTTCTCCTCC CTTGCATCAG TGTTAGGTTC CCCTGGTCAA AGTAACTATG CCGCAGCTAA CGCCTTTATG GATGCAGTGG CCCAATATCG TCAGAATTTA GGATTACCAG GATTGAGTAT TAACTGGGGA CCTTGGGCAA ATGTTGGCAT CGCCGCTCGG ATGGGGGCAC AACAACAAGG TCGTCTACAA AGTCAAGGTT TGCAAGGGAT TCAAACAGAG CAAGGGTTAC AAGCTTTAGA AGAAGTATTG GCGACCGATG AAGCACAAAT AGCAATATTT AATATTGACT GGCCACACTT GTTGAGTCAG TTTGGACAGA TGACTCCAGC ATTTCTATCG GAAATAGCAA GTCAGCATCC ACTCCAAGGT AAAGCAAATC AGGGACCTAA ACAACGGGAG TTGTTAGAAC AAATGAAGGT GGCAACAACT GACCAACGAC AAAGGTTAAT GGTTGATTAC TTGATCGGTG TTGTGGCTAA AGTATTGAGG CGGGGTAAAA ATGATTTACC AGACCCAGAG GAAGGATTCT TTAATTTAGG AATGGATTCG CTGATGGCAT TGGATTTTGG GCAAATGATT CAGGTTGATT TAGGTATTAC TTTGTCCTCA ACTTCTACTT TTGAGTATCC AAATATTCAG GCATTAGCTG AGTATCTAGA GGAAATAATT CCGAAAGTGG ATGAAAAAGA GGCTGAGTAT GAAACAGATG ATACTGCTAT TGATGCAGAA AGTCTAATTA CAGAGATTAG TCAGCTTTCA GAAGACAAAA TGGATGAAGC TATTGATGAA GCTCTGACTC AACTTTATCA GTTTATATAG
|
Protein sequence | MNQPEEPQQY NDNEIAIIGM SGRFPGAKNV EDFWDNLKNG VESISLLSDK QLSKSGVAPE ILNNPNYVKV NSMVSDIDMF DANFFNYSPR EAEEIDPQQR LFLECAWEAI ESSGYNPENY EGSIGVYAGG GLPTYLMYNL EDQDFILLGN RYFTQMVGND KDYLATRTAY KLNLTGPALN IQTACSTSLV AVHLACQSLL NGECDMALAG GVSIQIPQNV GYLHQEGLIG SHDGHCRAFD ARSSGTVFGN GVGVVLLKPL QDAIADGDCI HAVIKGSAIN NDGSLKLGYT APSVDGQAAV ISEAQAVASV TPETITYIEA HGTGTELGDP IEIEALTKAF SEHTDKKQFC AIGSLKTNVG HMNTAAGIGA LIKTVLTLKH NLIPPSLHYE KPNPQINFSD SPFYVNSTLS EWKRNGTPLR AGVSSFGIGG TNSHVVLEEA PSQVKEEDDL QRPVHILTLS AKTPTALADL VDSYHHYLET NLDLDLGDVC YTANTGRVHF NHRLAVVAEK QTELTEKLRQ FKLEDKVEGI CSGKLLINAT APKVAFLFTG QGSQYVNMGK ELYEQAPVFR AALDECEEIL EELGVNSILE IIYPEGGETS PLDQTAYTQP AIFAIEYALA KLWESWGIKP DVVMGHSVGE YIAATVAGIF SLEDGLKLIA ARGRLMQQLP AGGEMVAVMA SEAKVKKLLR PYAEKVAIAA INGPKSVVIS GEATAVREIV SSLESEKIKT KQLQVSHAFH SPLMEPMLAE WEAVAKELTY HQPEIPVISN VTGTIADKSI TTAKYWVDHV RLPVRFAEGM DALQKQGVEI FLEVGPKPIL LGMGRRCLPK DVGVWLPSLR PEHIPLQSSL ESARSEDFSQ MLSSLGELYV RGVKVDWVEY DGEYSRQKVI LPTYPFQRQR YWIDNYSRGR NSVSVSSNNG FVTEEGTSNP LLGRKLLLPF AQQFRFQTKL TPEFPSYVKD HRYYGKIVVA GASHIVLGLL AGKEALNSDS CVVENIEFLQ ILGADETSSR TVQILLDQEK ETEYTYQLIS CEAGTEYDPS TVWTVHAKAK VRSVNNSELP KENIDIAAVK NRCIQSLSSD EYYSTALEPM KGDFHLGPTF QWTEQAWISS TEGLVKVKAG ENNEEMQEYW PHPGMVDSGI VPVALLSILR ELSQNTENNG NGKKNAIDSE MEIPTYAFAG AKSFKFFDNF DIDDDLWYYT KIDESSSYDR GELLGETYLL NGDGKVLAEY SGIDFRQLSR KLLLRSFGLD FSQWYYQTEW QPAALMPGET QETGTYLLFC PTGESNSKLK EWSDSLNSQL LDQGHQCIVV YPADSYKKLS SEGKKQIIQL SPTQPEHFQK LLDEVNELTE ELPLKGVIHL WSLDTDVEEL TKTEEIICGS IINFLQGMRS LEKLPPLLLV TQGASLEVRS KESGVSINAQ PQQALLWGVG KVITMEYPQL DCRCLDLDPN ADEQETLKVL LDEVANHQTS TSVENQIRYC QGKRQVARLT QPRLDTDAKL AIQTEASYLI TGGLGALGLE VAQLLVQQGV KSIALVGRNS PSETAQESIQ ELEAAGTQVS VFLGDVSVEK DMVNIFQKIQ TSLPPLKGVI HAAGVLDDGF IQQMTWQQFT KVTAPKVTGT WNLHKLTKDI PLDFFVCFSS LASVLGSPGQ SNYAAANAFM DAVAQYRQNL GLPGLSINWG PWANVGIAAR MGAQQQGRLQ SQGLQGIQTE QGLQALEEVL ATDEAQIAIF NIDWPHLLSQ FGQMTPAFLS EIASQHPLQG KANQGPKQRE LLEQMKVATT DQRQRLMVDY LIGVVAKVLR RGKNDLPDPE EGFFNLGMDS LMALDFGQMI QVDLGITLSS TSTFEYPNIQ ALAEYLEEII PKVDEKEAEY ETDDTAIDAE SLITEISQLS EDKMDEAIDE ALTQLYQFI
|
| |