Gene P9211_12331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_12331 
SymbolpsbD 
ID5730366 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1108607 
End bp1109683 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content45% 
IMG OID641285601 
Productphotosystem II PsbD protein (D2) 
Protein accessionYP_001551118 
Protein GI159903774 
COG category 
COG ID 
TIGRFAM ID[TIGR01151] photosystem II, DI subunit (also called Q(B))
[TIGR01152] Photosystem II, DII subunit (also called Q(A)) 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.153207 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATAG CTGTTGGTGG CGCCCAAGAA AGAGGATGGT TTGACGTCCT TGATGACTGG 
CTTAAGCGCG ACCGATTCGT ATTTGTTGGT TGGTCTGGTC TTCTACTTTT TCCTACTGCC
TATTTGGCAA TAGGGGGGTG GTTCTTAGGC ACAACCTTTG TTACTTCCTG GTACACACAT
GGTGTAGCCA GCTCTTACCT AGAAGGTTGT AATTTTCTTA CAGCTGCAGT TAGCACCCCT
GGTGATGCCA TGGGTCATAG CCTTCTATTC CTTTGGGGAC CTGAGGCACA AGGCAGTCTT
GTTCGTTGGT TACAACTTGG CGGTCTATGG AACTTTGTAG TTCTCCATGG AATATTTAGC
CTTATAGGCT TCATGCTTCG TCAATTTGAA ATCGCAAGAC TTGTTGGAAT TCGTCCCTAC
AACGCTCTTG CATTTTCTGC TGTAATTGCC GTTTATACAG CTTGCTTCCT TATATACCCA
CTAGGTCAGC ACAGCTTTTT CTTTGCTCCT TCTTTTGGGG TAGCAGCAAT TTTCCGCTTC
ATCCTCTTTA TTCAAGGTTT CCATAACATC ACTCTTAACC CATTTCACAT GATGGGAGTT
GCGGGAATTC TTGGAGGTGC GCTTCTTTGT GCAATTCATG GAGCCACTGT GCAGAACACT
CTGTATGAGG ACACGAGTAT TTATACGGGT GGCAAAGCTC AAAGTACTAC TTTTAGAGGT
TTTGACCCAA CTCAAGAAGA AGAGACCTAC TCCATGGTTA CTGCTAATCG CTTTTGGAGT
CAGATCTTTG GAATTGCCTT CTCAAACAAG CGCTTCCTTC ATTTCTTAAT GCTCTTCGTA
CCTGTAATGG GTATGTGGTG CGCTGCTATT GGCATCGTTG GTTTAGCCCT AAACCTAAGG
GCCTATGACT TTGTTAGCCA AGAAATTCGC GCTGCTGAAG ACCCTGAGTT TGAAACGTTC
TATACCAAAA ACATTCTTTT AAATGAAGGT ATGAGAGCCT GGATGTCTTC TGTGGATCAG
CCACACGAAA ACTTTGTATT CCCTGAGGAG GTACTCCCAC GTGGAAACGC CCTTTAA
 
Protein sequence
MTIAVGGAQE RGWFDVLDDW LKRDRFVFVG WSGLLLFPTA YLAIGGWFLG TTFVTSWYTH 
GVASSYLEGC NFLTAAVSTP GDAMGHSLLF LWGPEAQGSL VRWLQLGGLW NFVVLHGIFS
LIGFMLRQFE IARLVGIRPY NALAFSAVIA VYTACFLIYP LGQHSFFFAP SFGVAAIFRF
ILFIQGFHNI TLNPFHMMGV AGILGGALLC AIHGATVQNT LYEDTSIYTG GKAQSTTFRG
FDPTQEEETY SMVTANRFWS QIFGIAFSNK RFLHFLMLFV PVMGMWCAAI GIVGLALNLR
AYDFVSQEIR AAEDPEFETF YTKNILLNEG MRAWMSSVDQ PHENFVFPEE VLPRGNAL