Gene P9301_13491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_13491 
SymbolpsbD 
ID4912435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp1126362 
End bp1127438 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content41% 
IMG OID640160938 
Productphotosystem II PsbD protein (D2) 
Protein accessionYP_001091573 
Protein GI126696687 
COG category 
COG ID 
TIGRFAM ID[TIGR01152] Photosystem II, DII subunit (also called Q(A)) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGATCG CAGTTGGTAG CGCCCCACAA AGAGGATGGT TTGATGTCCT CGATGATTGG 
TTGAAGCGCG ACCGCTTTGT ATTTATTGGT TGGTCCGGAC TACTTCTACT TCCTTGTGCA
TACCTTGCTA TAGGTGGTTG GTTTGTCGGA ACAACATTTG TTACCTCTTG GTACACACAC
GGAGTTGCAA GTTCATACCT TGAAGGTTGT AACTTCTTAA CAGCAGCTGT AAGCACCCCT
GGTGATGCCA TGGGACACAG TCTTCTATTT TTATGGGGTC CTGAAGCCCA AGGTAGTTTC
GTAAGATGGC TACAACTTGG TGGTCTTTGG AACTTCGTTG CATTACATGG AGTATTTGGC
CTAATTGGTT TTATGCTTCG TCAGTTTGAA ATTGCTGGCC TTGTTGGAAT TAGACCATAC
AACGCACTAG CTTTCTCAGC AGTAATTGCA GTATTCACAA GTATTTTCCT TATTTATCCT
TTAGGACAGC ATAGTTGGTT CTTCGCACCT TCATTCGGTG TTGCAGCAAT CTTCCGTTAC
ATCCTATTCA TTCAAGGTTT TCACAATATC ACTTTAAACC CATTCCATAT GATGGGAGTT
GCTGGAATTC TTGGTGGTGC TCTACTTTGC GCTATTCATG GAGCTACAGT TCAAAATACT
TTGTATGAAG ATACAAGTAT TTACACAGAT GGTAAGGTTC AAAGTTCAAC ATTTAGAGCT
TTTGATCCAA CTCAAGAAGA AGAAACCTAT TCAATGATTA CAGCGAATAG ATTTTGGAGT
CAAATCTTCG GTATTGCTTT CTCAAACAAG CGTTTCTTAC ATTTCTTGAT GCTATTTGTA
CCTGTTATGG GTATGTGGAC ATCTTCTATT GGTATTGTCG GCTTAGCACT AAACTTGAGA
GCTTATGACT TCGTAAGCCA AGAAATTCGT GCAGCAGAAG ATCCAGAATT TGAAACTTTC
TATACAAAAA ATATACTTTT GAACGAAGGT ATGCGAGCAT GGATGTCTTC TGTGGATCAA
CCACACGAAA ACTTTGTATT CCCTGAGGAG GTTCTTCCAC GTGGAAACGC CCTTTAA
 
Protein sequence
MTIAVGSAPQ RGWFDVLDDW LKRDRFVFIG WSGLLLLPCA YLAIGGWFVG TTFVTSWYTH 
GVASSYLEGC NFLTAAVSTP GDAMGHSLLF LWGPEAQGSF VRWLQLGGLW NFVALHGVFG
LIGFMLRQFE IAGLVGIRPY NALAFSAVIA VFTSIFLIYP LGQHSWFFAP SFGVAAIFRY
ILFIQGFHNI TLNPFHMMGV AGILGGALLC AIHGATVQNT LYEDTSIYTD GKVQSSTFRA
FDPTQEEETY SMITANRFWS QIFGIAFSNK RFLHFLMLFV PVMGMWTSSI GIVGLALNLR
AYDFVSQEIR AAEDPEFETF YTKNILLNEG MRAWMSSVDQ PHENFVFPEE VLPRGNAL