Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_18741 |
Symbol | bcsA |
ID | 4777782 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1635891 |
End bp | 1636985 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640087383 |
Product | chalcone synthase (CHS) |
Protein accession | YP_001017881 |
Protein GI | 124023574 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3424] Predicted naringenin-chalcone synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.406295 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCTGA CCATTCGCGG CATGGGAACT GCAGTGCCGG AACAACAAGT CAATCAGATC GATTCCATGG CGCTTGCAGA GTTCGTGAGT GCCGACAGTC CCGAACGAGC GGCACTGGTC CGCCGGATTC AGCGACGAAC TCAGGTTCAG AACAGGGGCA GCGTTCTGCT GGGAGACGAC GCGAGTGAGC CGATCAGCCA GCGTTTGCCC TTCTACGGGG CCAAAAGCCC CACTACTGCC GCAAGGATGA ACGAGTTTGA TCACCATGCT TCAGCGCTGG CAATCAATGC ATGCCGTCTT GCCCTCGAGG AGGCCGAGCT AGCCGCTGAT GTGATCACTC ATCTTGTGAC CGTTTGCTGT ACCGGATTCA AGGCTCCGGG TGTTGACCTT GCTCTGATCG ACCAACTTGA ACTTGACCCC GGTGTTCAGA GAACCCATGT GGGATTCATG GGCTGTCACG GTGCCTTGAA CGGTCTACGT GTGGCGCGGG CGTTCGCTGA GGCTGATGCC AACGCTGTAG TGCTGCTGTG TTCTGTCGAA CTGTGCAGTC TCCATCTTCA TTACGGCTGG GACCCCGAGA AAGTGGTAGC CAACGCCCTG TTCGCGGATG GAGCAGCAGC TTTGGTTGCG TCTCAAGCTT TGGCGCAATC GAAGGAATCT CTGATATTGA AGGCTTCTGG CTCCACCGTG ATTCCGAACA CCAGCAGTCT GATGCATTGG CAGGTCGGAG ATCATGGCTT CGCTATGGGT TTATCACCAC AGGTGCCAGA GGCGATAGCT GGTGCATTGC AACCTTGGCT GAGCAGTTGG TTGAGGGGCC ATGGATCCGA TCTGACTGAA ATCCGCAGCT GGGCCCTCCA CCCTGGGGGG CCGAGGATCC TGCAGGCCTG CGCCGACGCC TTGGCCCTCA AGCATGAGCA CCTGGCTGAA TCCCGGTCCG TCCTTCAAGC TCACGGCAAC ATGTCTTCGG CCACCGTTCT TTTCATTCTT GAACGCATGC GTCATAAGGC TTGCAGCGGA CCTTGCCTGG CACTCGCCTT TGGTCCCGGT CTTTGTGCCG AGGTCGCGTT GTTCGATCTT TGCAACAGCA ACTGA
|
Protein sequence | MPLTIRGMGT AVPEQQVNQI DSMALAEFVS ADSPERAALV RRIQRRTQVQ NRGSVLLGDD ASEPISQRLP FYGAKSPTTA ARMNEFDHHA SALAINACRL ALEEAELAAD VITHLVTVCC TGFKAPGVDL ALIDQLELDP GVQRTHVGFM GCHGALNGLR VARAFAEADA NAVVLLCSVE LCSLHLHYGW DPEKVVANAL FADGAAALVA SQALAQSKES LILKASGSTV IPNTSSLMHW QVGDHGFAMG LSPQVPEAIA GALQPWLSSW LRGHGSDLTE IRSWALHPGG PRILQACADA LALKHEHLAE SRSVLQAHGN MSSATVLFIL ERMRHKACSG PCLALAFGPG LCAEVALFDL CNSN
|
| |