Gene PCC8801_3021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3021 
Symbol 
ID7104508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3125659 
End bp3128922 
Gene Length3264 bp 
Protein Length1087 aa 
Translation table11 
GC content36% 
IMG OID643476048 
ProductErythronolide synthase 
Protein accessionYP_002373161 
Protein GI218247790 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID[TIGR00128] malonyl CoA-acyl carrier protein transacylase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACGA GTAAGCAACA AGAAGCTATT GCGATTATTG GAATGGGTTG TCGCTTTCCT 
GGGGCAAAAA ATCCATCAGA ATTTTGGCAA GTAATTCAAA CAGGAGTTGA TACAATAAAG
GAAGTTCCTA GGAATCGATG GGATATTGAT CAATACTATG ATTCAAATCC AGATTCACCT
GGAAAAATGA ACACTTGTTG GGGTGGTTTT TTAGAAAAAG TTGACGAATT TGAACCCAGT
TTTTTTAATA TTTCTCCTCG TGAAGCAGAA CGGATTGATC CCCAACAGCG ACTTTTATTA
GAGGTAGTTT GGGAAGCCTT AGAAAATGGA GGAGTTATAC CTGAAACTCT TAGTGGAAGT
AATACTGGGG TTTTTATAGG ATTAACTAAT CAAGACTATC ATCGATTACT TTATCAAGAA
ATTGATCAGT TAGATGCTTA TTATGGAACA GGGACTTCTG CGTCTATTGC TGCCAATCGA
ATTTCTTATT TTTTAAATTT ACAAGGACCA AGTTTTACGG TAGATACGGC TTGTTCATCA
TCCTTAGTTG CGATTCATTT AGCTTGCCAA AGTTTACACC AAAAAGAATC AAATTTGGCT
ATAGCAGGGG GAGTCAATCT GATTTTGACT CCCGAACAAA CTATTACTTT TAGTAAAAGT
CGAATGATGT CTCCTGATGG CCGTTGTAAG ACCTTTGATG CGAAAGCAAA TGGATATGTG
CGAAGTGAGG GATGTGGGGT GGTTATCCTC AAGCGTCTTG AAGATGCGTT GCAACAAGGT
GATGACATTC AAGCTATTAT TAGAGGATCG GCAGTGAACC AAGATGGTTT GAGTCAGGGA
TTAACTGCCC CTAATAGTCT GGCGCAACAA CGGGTAATCC GTCAAGCCTT GCACAATGCA
CAGATAGAAA GCGATCGCAT TAGTTATGTG GAGACTCACG GGACAGGAAC GGCTTTAGGC
GACCCAATTG AGGTAAAATC GCTTAAGGCA GTCTTAATGG GCGATCGTAC TGCGGATCAG
CCTTGTTGGT TGGGCTCGGT TAAAACAAAT ATTGGTCATT TAGAAGCAGC AGCAGGAGTA
GCAGGGTTAA TTAAACTCGT GTTATGTTTA CAAAATCAAG AGATTCCGCC CATTTTGCAT
TTTAATCAAT TAAATCCCTA TATTTCTTTT AAAAATACCT CTTTTGTAAT TCCTACGACT
TCGCAAACTT GGATCGTTAA CGATCAAACG AGGGTGGCGG GGATCAGTGG GTTTAGTTTT
GGGGGAACTA ATTGTCATTT AATTGTTGAG GAAAGTCCTA GGGTTAAGGG AACAGGAATA
AACAACAAAA ATAAAACAGA AATTGATCGA CCTGTTCATA TTTTAACCTT ATCAGCTAAA
ACAGAAGACA GTTTAAACGA ATTAGTCAAA AATTATTATA ATCATTTACA ATCTGAAGGA
AATTTATCCC TTTCTGATAT TTGTTTTAGT GCCAATATCG GTCGATCTCA ATTTGAGCAT
CGTTTGGCTA TTATTTCTCA ATCAAGGGAA CAGTTAAAAG AACAATTACA AGTGTTACAA
GAAGAAAAAA AGACTAGCGG ATATTTTGCA GGTAAAGTTC TAGAGGAAAC CTATCCTAAG
ATTGCTTTTT TGTTTACAGG ACAAGGTTCT CAATATATAG GAATGGGACA AGAGTTATAT
CAAAAATCTC CTTTATTTCG TCAGATAATT GATCAATGTG ATCAGATTTT ACGCAATGAG
TTAGATGAGT CTTTGATCAA TATTATTTAT CCAGAACAGG GAACACAAAC GAGAGAAAAT
ACTAATCTTT TGAATCAAAC AATTTATACT CAACCTGCTT TATTTACGAT TGAATACGCT
TTAGCGAAAT TATGGCAGAG TTGGGGAATT GAACCATCAG TTGTGATGGG TCATAGTGTG
GGTGAATATG TGGCTGCTTG TCTTGCGGGA GTATTTAGTT TAGAAGATGG ATTAAAATTA
GTTGCTGCGC GAGGAAGGTT AATGCAAAGT TTGCCCGAAG ATGGTGGTAT GGTAGCGATT
TTAGCGACTG TTGAGCAAGT TAATCAAGTC ATTAACTCCT ATCAAAATGA AGTAGCGATC
GCGGCTATTA ATGGGTCAGA TAGTTTAGTT ATTTCTGGGA AAAAAGAAGC TATTCAAAGC
ATTATTAAAA CTCTAGAAAG TCAAGGAATA AAAACAAAAT TATTATCGGT TTCTCATGCT
TTTCATTCTC CCTTAATGGA ACCTATTTTA GAGGATTTTC GGAACATTGC AGGCACAATT
TCCTATAGTT CTCCAACAAT TCAGATTATT TCTAACGTAA CAGGTAAATC TATTACAGAA
GAAATAATGA CTCCTGATTA TTGGGTCAAT CATCTGCGAT CACCTGTTCA GTTTCTAGAC
AGTATGCAGG AATTACTCAA CAAAAACTAT GAGGTTTTTT TGGAAATTGG ATCGAAACCA
ATTTTATTGG GAATGGGTCG AAATATTGTT GAAAAAATGG ATAATTATTC AACGAACCAA
ACTTGGTTGC CAAGTCTTCG TCCTGGTCGT TCCGACTGGG AACAACTGTT AGAAAGTTTG
GGACAATTAT TCATTAAAGG ATCAGTGATT GATTGGCATA AATTCGATCA AGATTATCAG
CGAAACCGGG TAATTATTCC TACCTATCCT TGGGTGCGAT CGCGTTATTG GATTGATAAA
AATAATTTGA ATCAGAGACA AAAAAACAAG AGTATTACTT TACCCTTGTC CCAGAGAAAA
AACCAATCTA TTAACACCCA TAAAGTGGTT AGGTTAAGTA ATTTAAGACA GCAACTTGAA
CAAAATTATC AAAGTTCTGC TTTTGTTCAG CAATTAGAAA AGACTTTTGA GGCAGAAAGA
TTACCCAAAA TTATTAACTA TTTACAACAA GAAATTAGGA CTATTCTTGG GTTAAATAAG
ACACAAATAC CACCTCACAC AATGGGATTT TTTCAAATGG GAATGGATTC TTTAATGGCT
GTGGAATTTA GAAACAAATT AGAAGCGACT TTTTCTATTA CTTTGCCGAC TACTTTAGCT
TTCAATTACC CTAATATTGA AGCTTTATCA AACTATATTT TAGAAAAAAT CAACCGCAGA
TTCAAACTAG AAGAAAAATC TCACAGCAAT GGTCAATCAG CTATCAATAA ACTCGAACAA
CTAGCACAAG AAATTGAAGC TTATCCAGAG GAAGAAATTG AACGGTTATT AATCGAGAAA
ATTACGACTA TTTTAGAGAA GTAA
 
Protein sequence
MTTSKQQEAI AIIGMGCRFP GAKNPSEFWQ VIQTGVDTIK EVPRNRWDID QYYDSNPDSP 
GKMNTCWGGF LEKVDEFEPS FFNISPREAE RIDPQQRLLL EVVWEALENG GVIPETLSGS
NTGVFIGLTN QDYHRLLYQE IDQLDAYYGT GTSASIAANR ISYFLNLQGP SFTVDTACSS
SLVAIHLACQ SLHQKESNLA IAGGVNLILT PEQTITFSKS RMMSPDGRCK TFDAKANGYV
RSEGCGVVIL KRLEDALQQG DDIQAIIRGS AVNQDGLSQG LTAPNSLAQQ RVIRQALHNA
QIESDRISYV ETHGTGTALG DPIEVKSLKA VLMGDRTADQ PCWLGSVKTN IGHLEAAAGV
AGLIKLVLCL QNQEIPPILH FNQLNPYISF KNTSFVIPTT SQTWIVNDQT RVAGISGFSF
GGTNCHLIVE ESPRVKGTGI NNKNKTEIDR PVHILTLSAK TEDSLNELVK NYYNHLQSEG
NLSLSDICFS ANIGRSQFEH RLAIISQSRE QLKEQLQVLQ EEKKTSGYFA GKVLEETYPK
IAFLFTGQGS QYIGMGQELY QKSPLFRQII DQCDQILRNE LDESLINIIY PEQGTQTREN
TNLLNQTIYT QPALFTIEYA LAKLWQSWGI EPSVVMGHSV GEYVAACLAG VFSLEDGLKL
VAARGRLMQS LPEDGGMVAI LATVEQVNQV INSYQNEVAI AAINGSDSLV ISGKKEAIQS
IIKTLESQGI KTKLLSVSHA FHSPLMEPIL EDFRNIAGTI SYSSPTIQII SNVTGKSITE
EIMTPDYWVN HLRSPVQFLD SMQELLNKNY EVFLEIGSKP ILLGMGRNIV EKMDNYSTNQ
TWLPSLRPGR SDWEQLLESL GQLFIKGSVI DWHKFDQDYQ RNRVIIPTYP WVRSRYWIDK
NNLNQRQKNK SITLPLSQRK NQSINTHKVV RLSNLRQQLE QNYQSSAFVQ QLEKTFEAER
LPKIINYLQQ EIRTILGLNK TQIPPHTMGF FQMGMDSLMA VEFRNKLEAT FSITLPTTLA
FNYPNIEALS NYILEKINRR FKLEEKSHSN GQSAINKLEQ LAQEIEAYPE EEIERLLIEK
ITTILEK