Gene P9303_18741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_18741 
SymbolbcsA 
ID4777782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1635891 
End bp1636985 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content58% 
IMG OID640087383 
Productchalcone synthase (CHS) 
Protein accessionYP_001017881 
Protein GI124023574 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3424] Predicted naringenin-chalcone synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.406295 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTCTGA CCATTCGCGG CATGGGAACT GCAGTGCCGG AACAACAAGT CAATCAGATC 
GATTCCATGG CGCTTGCAGA GTTCGTGAGT GCCGACAGTC CCGAACGAGC GGCACTGGTC
CGCCGGATTC AGCGACGAAC TCAGGTTCAG AACAGGGGCA GCGTTCTGCT GGGAGACGAC
GCGAGTGAGC CGATCAGCCA GCGTTTGCCC TTCTACGGGG CCAAAAGCCC CACTACTGCC
GCAAGGATGA ACGAGTTTGA TCACCATGCT TCAGCGCTGG CAATCAATGC ATGCCGTCTT
GCCCTCGAGG AGGCCGAGCT AGCCGCTGAT GTGATCACTC ATCTTGTGAC CGTTTGCTGT
ACCGGATTCA AGGCTCCGGG TGTTGACCTT GCTCTGATCG ACCAACTTGA ACTTGACCCC
GGTGTTCAGA GAACCCATGT GGGATTCATG GGCTGTCACG GTGCCTTGAA CGGTCTACGT
GTGGCGCGGG CGTTCGCTGA GGCTGATGCC AACGCTGTAG TGCTGCTGTG TTCTGTCGAA
CTGTGCAGTC TCCATCTTCA TTACGGCTGG GACCCCGAGA AAGTGGTAGC CAACGCCCTG
TTCGCGGATG GAGCAGCAGC TTTGGTTGCG TCTCAAGCTT TGGCGCAATC GAAGGAATCT
CTGATATTGA AGGCTTCTGG CTCCACCGTG ATTCCGAACA CCAGCAGTCT GATGCATTGG
CAGGTCGGAG ATCATGGCTT CGCTATGGGT TTATCACCAC AGGTGCCAGA GGCGATAGCT
GGTGCATTGC AACCTTGGCT GAGCAGTTGG TTGAGGGGCC ATGGATCCGA TCTGACTGAA
ATCCGCAGCT GGGCCCTCCA CCCTGGGGGG CCGAGGATCC TGCAGGCCTG CGCCGACGCC
TTGGCCCTCA AGCATGAGCA CCTGGCTGAA TCCCGGTCCG TCCTTCAAGC TCACGGCAAC
ATGTCTTCGG CCACCGTTCT TTTCATTCTT GAACGCATGC GTCATAAGGC TTGCAGCGGA
CCTTGCCTGG CACTCGCCTT TGGTCCCGGT CTTTGTGCCG AGGTCGCGTT GTTCGATCTT
TGCAACAGCA ACTGA
 
Protein sequence
MPLTIRGMGT AVPEQQVNQI DSMALAEFVS ADSPERAALV RRIQRRTQVQ NRGSVLLGDD 
ASEPISQRLP FYGAKSPTTA ARMNEFDHHA SALAINACRL ALEEAELAAD VITHLVTVCC
TGFKAPGVDL ALIDQLELDP GVQRTHVGFM GCHGALNGLR VARAFAEADA NAVVLLCSVE
LCSLHLHYGW DPEKVVANAL FADGAAALVA SQALAQSKES LILKASGSTV IPNTSSLMHW
QVGDHGFAMG LSPQVPEAIA GALQPWLSSW LRGHGSDLTE IRSWALHPGG PRILQACADA
LALKHEHLAE SRSVLQAHGN MSSATVLFIL ERMRHKACSG PCLALAFGPG LCAEVALFDL
CNSN