Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_3021 |
Symbol | |
ID | 7104508 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 3125659 |
End bp | 3128922 |
Gene Length | 3264 bp |
Protein Length | 1087 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643476048 |
Product | Erythronolide synthase |
Protein accession | YP_002373161 |
Protein GI | 218247790 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | [TIGR00128] malonyl CoA-acyl carrier protein transacylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACGA GTAAGCAACA AGAAGCTATT GCGATTATTG GAATGGGTTG TCGCTTTCCT GGGGCAAAAA ATCCATCAGA ATTTTGGCAA GTAATTCAAA CAGGAGTTGA TACAATAAAG GAAGTTCCTA GGAATCGATG GGATATTGAT CAATACTATG ATTCAAATCC AGATTCACCT GGAAAAATGA ACACTTGTTG GGGTGGTTTT TTAGAAAAAG TTGACGAATT TGAACCCAGT TTTTTTAATA TTTCTCCTCG TGAAGCAGAA CGGATTGATC CCCAACAGCG ACTTTTATTA GAGGTAGTTT GGGAAGCCTT AGAAAATGGA GGAGTTATAC CTGAAACTCT TAGTGGAAGT AATACTGGGG TTTTTATAGG ATTAACTAAT CAAGACTATC ATCGATTACT TTATCAAGAA ATTGATCAGT TAGATGCTTA TTATGGAACA GGGACTTCTG CGTCTATTGC TGCCAATCGA ATTTCTTATT TTTTAAATTT ACAAGGACCA AGTTTTACGG TAGATACGGC TTGTTCATCA TCCTTAGTTG CGATTCATTT AGCTTGCCAA AGTTTACACC AAAAAGAATC AAATTTGGCT ATAGCAGGGG GAGTCAATCT GATTTTGACT CCCGAACAAA CTATTACTTT TAGTAAAAGT CGAATGATGT CTCCTGATGG CCGTTGTAAG ACCTTTGATG CGAAAGCAAA TGGATATGTG CGAAGTGAGG GATGTGGGGT GGTTATCCTC AAGCGTCTTG AAGATGCGTT GCAACAAGGT GATGACATTC AAGCTATTAT TAGAGGATCG GCAGTGAACC AAGATGGTTT GAGTCAGGGA TTAACTGCCC CTAATAGTCT GGCGCAACAA CGGGTAATCC GTCAAGCCTT GCACAATGCA CAGATAGAAA GCGATCGCAT TAGTTATGTG GAGACTCACG GGACAGGAAC GGCTTTAGGC GACCCAATTG AGGTAAAATC GCTTAAGGCA GTCTTAATGG GCGATCGTAC TGCGGATCAG CCTTGTTGGT TGGGCTCGGT TAAAACAAAT ATTGGTCATT TAGAAGCAGC AGCAGGAGTA GCAGGGTTAA TTAAACTCGT GTTATGTTTA CAAAATCAAG AGATTCCGCC CATTTTGCAT TTTAATCAAT TAAATCCCTA TATTTCTTTT AAAAATACCT CTTTTGTAAT TCCTACGACT TCGCAAACTT GGATCGTTAA CGATCAAACG AGGGTGGCGG GGATCAGTGG GTTTAGTTTT GGGGGAACTA ATTGTCATTT AATTGTTGAG GAAAGTCCTA GGGTTAAGGG AACAGGAATA AACAACAAAA ATAAAACAGA AATTGATCGA CCTGTTCATA TTTTAACCTT ATCAGCTAAA ACAGAAGACA GTTTAAACGA ATTAGTCAAA AATTATTATA ATCATTTACA ATCTGAAGGA AATTTATCCC TTTCTGATAT TTGTTTTAGT GCCAATATCG GTCGATCTCA ATTTGAGCAT CGTTTGGCTA TTATTTCTCA ATCAAGGGAA CAGTTAAAAG AACAATTACA AGTGTTACAA GAAGAAAAAA AGACTAGCGG ATATTTTGCA GGTAAAGTTC TAGAGGAAAC CTATCCTAAG ATTGCTTTTT TGTTTACAGG ACAAGGTTCT CAATATATAG GAATGGGACA AGAGTTATAT CAAAAATCTC CTTTATTTCG TCAGATAATT GATCAATGTG ATCAGATTTT ACGCAATGAG TTAGATGAGT CTTTGATCAA TATTATTTAT CCAGAACAGG GAACACAAAC GAGAGAAAAT ACTAATCTTT TGAATCAAAC AATTTATACT CAACCTGCTT TATTTACGAT TGAATACGCT TTAGCGAAAT TATGGCAGAG TTGGGGAATT GAACCATCAG TTGTGATGGG TCATAGTGTG GGTGAATATG TGGCTGCTTG TCTTGCGGGA GTATTTAGTT TAGAAGATGG ATTAAAATTA GTTGCTGCGC GAGGAAGGTT AATGCAAAGT TTGCCCGAAG ATGGTGGTAT GGTAGCGATT TTAGCGACTG TTGAGCAAGT TAATCAAGTC ATTAACTCCT ATCAAAATGA AGTAGCGATC GCGGCTATTA ATGGGTCAGA TAGTTTAGTT ATTTCTGGGA AAAAAGAAGC TATTCAAAGC ATTATTAAAA CTCTAGAAAG TCAAGGAATA AAAACAAAAT TATTATCGGT TTCTCATGCT TTTCATTCTC CCTTAATGGA ACCTATTTTA GAGGATTTTC GGAACATTGC AGGCACAATT TCCTATAGTT CTCCAACAAT TCAGATTATT TCTAACGTAA CAGGTAAATC TATTACAGAA GAAATAATGA CTCCTGATTA TTGGGTCAAT CATCTGCGAT CACCTGTTCA GTTTCTAGAC AGTATGCAGG AATTACTCAA CAAAAACTAT GAGGTTTTTT TGGAAATTGG ATCGAAACCA ATTTTATTGG GAATGGGTCG AAATATTGTT GAAAAAATGG ATAATTATTC AACGAACCAA ACTTGGTTGC CAAGTCTTCG TCCTGGTCGT TCCGACTGGG AACAACTGTT AGAAAGTTTG GGACAATTAT TCATTAAAGG ATCAGTGATT GATTGGCATA AATTCGATCA AGATTATCAG CGAAACCGGG TAATTATTCC TACCTATCCT TGGGTGCGAT CGCGTTATTG GATTGATAAA AATAATTTGA ATCAGAGACA AAAAAACAAG AGTATTACTT TACCCTTGTC CCAGAGAAAA AACCAATCTA TTAACACCCA TAAAGTGGTT AGGTTAAGTA ATTTAAGACA GCAACTTGAA CAAAATTATC AAAGTTCTGC TTTTGTTCAG CAATTAGAAA AGACTTTTGA GGCAGAAAGA TTACCCAAAA TTATTAACTA TTTACAACAA GAAATTAGGA CTATTCTTGG GTTAAATAAG ACACAAATAC CACCTCACAC AATGGGATTT TTTCAAATGG GAATGGATTC TTTAATGGCT GTGGAATTTA GAAACAAATT AGAAGCGACT TTTTCTATTA CTTTGCCGAC TACTTTAGCT TTCAATTACC CTAATATTGA AGCTTTATCA AACTATATTT TAGAAAAAAT CAACCGCAGA TTCAAACTAG AAGAAAAATC TCACAGCAAT GGTCAATCAG CTATCAATAA ACTCGAACAA CTAGCACAAG AAATTGAAGC TTATCCAGAG GAAGAAATTG AACGGTTATT AATCGAGAAA ATTACGACTA TTTTAGAGAA GTAA
|
Protein sequence | MTTSKQQEAI AIIGMGCRFP GAKNPSEFWQ VIQTGVDTIK EVPRNRWDID QYYDSNPDSP GKMNTCWGGF LEKVDEFEPS FFNISPREAE RIDPQQRLLL EVVWEALENG GVIPETLSGS NTGVFIGLTN QDYHRLLYQE IDQLDAYYGT GTSASIAANR ISYFLNLQGP SFTVDTACSS SLVAIHLACQ SLHQKESNLA IAGGVNLILT PEQTITFSKS RMMSPDGRCK TFDAKANGYV RSEGCGVVIL KRLEDALQQG DDIQAIIRGS AVNQDGLSQG LTAPNSLAQQ RVIRQALHNA QIESDRISYV ETHGTGTALG DPIEVKSLKA VLMGDRTADQ PCWLGSVKTN IGHLEAAAGV AGLIKLVLCL QNQEIPPILH FNQLNPYISF KNTSFVIPTT SQTWIVNDQT RVAGISGFSF GGTNCHLIVE ESPRVKGTGI NNKNKTEIDR PVHILTLSAK TEDSLNELVK NYYNHLQSEG NLSLSDICFS ANIGRSQFEH RLAIISQSRE QLKEQLQVLQ EEKKTSGYFA GKVLEETYPK IAFLFTGQGS QYIGMGQELY QKSPLFRQII DQCDQILRNE LDESLINIIY PEQGTQTREN TNLLNQTIYT QPALFTIEYA LAKLWQSWGI EPSVVMGHSV GEYVAACLAG VFSLEDGLKL VAARGRLMQS LPEDGGMVAI LATVEQVNQV INSYQNEVAI AAINGSDSLV ISGKKEAIQS IIKTLESQGI KTKLLSVSHA FHSPLMEPIL EDFRNIAGTI SYSSPTIQII SNVTGKSITE EIMTPDYWVN HLRSPVQFLD SMQELLNKNY EVFLEIGSKP ILLGMGRNIV EKMDNYSTNQ TWLPSLRPGR SDWEQLLESL GQLFIKGSVI DWHKFDQDYQ RNRVIIPTYP WVRSRYWIDK NNLNQRQKNK SITLPLSQRK NQSINTHKVV RLSNLRQQLE QNYQSSAFVQ QLEKTFEAER LPKIINYLQQ EIRTILGLNK TQIPPHTMGF FQMGMDSLMA VEFRNKLEAT FSITLPTTLA FNYPNIEALS NYILEKINRR FKLEEKSHSN GQSAINKLEQ LAQEIEAYPE EEIERLLIEK ITTILEK
|
| |