Gene P9303_11831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_11831 
Symbol 
ID4776408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1038744 
End bp1040375 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content62% 
IMG OID640086692 
Productzeta-carotene desaturase-like protein 
Protein accessionYP_001017197 
Protein GI124022890 
COG category[S] Function unknown 
COG ID[COG3349] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.253964 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAAG GTTTTCCGGG TAAAAAGCAA TCTCATGCGG TTGTCATCGG GGCCGGCTGG 
GCTGGCTGGG GTGCGGCCAA GGCACTCTGC GAAGCGGGTG TTCGCGTGAC CCTGATGGAT
GGGATGGCGG ATCCAACCGG CAGCCAGCCC CTCACCACGC CAAGCGGCAA ACCGTTCGAG
GCAGGCACCC GGGGGTTCTG GAAGGATTAT CCCAACATCA ATGCCCTGAC GGCAGAGCTC
GGGCTCGGCT CGATCTTCAC GGAGTTCACC ACTAGTGCGT TCTGGTCACC CGAAGGCCTG
GAGGCGACGG CTCCTGTCTT CGGTGACGCT CCGCTGTGGC CGAGTCCCCT GGGTCAGGTT
GCTGCAACGA TCAATAACTT CAAGCGCCTG CCTGTTGCTG ATCGGCTCAG CATCGCCGGC
CTGCTCTACG CCATGCTCGA TCTGAACCGC AGCGATGCGG TATACAGGAG CTACGACGCG
ATCGATGCGC TGACGCTGTT TAGACAGCTC AGGATCAGTG ATCGCATGAT CGATGATTTC
CTGCGGCCCA CGTTGCTGGT GGGGCTGTTT AAGCCGCCGG AGGAGCTGTC GGCCGCTGTC
ACGATGGAGC TCCTCTACTA CTACGCCCTG GCGCATCAGG ATTCCTTCGA TGTGCGTTGG
ATCAGGTCGA AAAGCATCGC CGAACAGCTG ATTGCGCCGC TCAGTGAGCG GCTGCAGGAG
CAGCATCAGC TCGAGGTGCT GGGCGGCACC CTGGCCACCC GGCTGAACAT CTCGCCGGAG
ACTCAGGCCA TCTGCTCGGT GGGAACCCGT TCCGTAACAA CTGGGAGCAC CGGTTTAATC
GAGGATGTCG ATGCCGTGGT GCTTGCCGTG AGTGCCAAAG GAATGGGTGC CTTGATGGCG
CAATCCCCGC AGTGCGGCGC GCTGGCGCCG GAGCTTGTGC GTGCCGCCAC GCTCGGATCG
ATCGATGTGG TGTCGATCCG TCTGTGGCTG GATCGCACCG TGCCGGTCGC CGATCCCGCC
AATGTGTTCT CACGTTTCAG CGCACTGAGA GGCGCCGGCG CCACCTTCTT CATGTTGGAT
CAACTGCAGC GGGAGTCGGA GCAGGCTCTC TGGGGTGATC AGCCAGCGCA GGGTTCGGTG
ATCGCCAGCG ATTTCTACAA CGCCTCGGCC ATCGTCGAGC TGAGCGATCA GGAGATCGTC
GACTGCCTGA TGCAGGATCT GCTGCCCATA GCGCAGCCTG CTTTCAGGGG GGCCAGGGTC
GTGGATCAGG AGGTGCGGCG TTATCCGGGT TCGGTGTCTC TCTTCTCGCC GGGAAGTTTC
AGCAAGCGGC CACCAATGGA GACGTCACTG GCTTCGGTGG TCTGCGCCGG CGACTGGGTG
CGGATGGGCG AGAAAGAACA TGGTGCTAAA GGCCTTTGTC AGGAACGCGC CTACGTGTGT
GGTCTGGAAG CGGGCAACTC ACTGCTCAGG CGCGGGATCG TGAGGGGCGC CGACCTGCCC
ACGGCCCTGC AGCACTCTGT GATCCCCATC CGCGCTGATG AACCGCAGGT GCTGCTCGGG
CGTGCCCTTA ACAAGCTGGT GATGGATCCC CTCGAGGCCT TCGGGATCCA GTGGCCTTGG
TTGGCTAGCT AG
 
Protein sequence
MAEGFPGKKQ SHAVVIGAGW AGWGAAKALC EAGVRVTLMD GMADPTGSQP LTTPSGKPFE 
AGTRGFWKDY PNINALTAEL GLGSIFTEFT TSAFWSPEGL EATAPVFGDA PLWPSPLGQV
AATINNFKRL PVADRLSIAG LLYAMLDLNR SDAVYRSYDA IDALTLFRQL RISDRMIDDF
LRPTLLVGLF KPPEELSAAV TMELLYYYAL AHQDSFDVRW IRSKSIAEQL IAPLSERLQE
QHQLEVLGGT LATRLNISPE TQAICSVGTR SVTTGSTGLI EDVDAVVLAV SAKGMGALMA
QSPQCGALAP ELVRAATLGS IDVVSIRLWL DRTVPVADPA NVFSRFSALR GAGATFFMLD
QLQRESEQAL WGDQPAQGSV IASDFYNASA IVELSDQEIV DCLMQDLLPI AQPAFRGARV
VDQEVRRYPG SVSLFSPGSF SKRPPMETSL ASVVCAGDWV RMGEKEHGAK GLCQERAYVC
GLEAGNSLLR RGIVRGADLP TALQHSVIPI RADEPQVLLG RALNKLVMDP LEAFGIQWPW
LAS