Gene A9601_13341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_13341 
SymbolpsbD 
ID4718053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1108626 
End bp1109702 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content40% 
IMG OID640079053 
Productphotosystem II PsbD protein (D2) 
Protein accessionYP_001009725 
Protein GI123968867 
COG category 
COG ID 
TIGRFAM ID[TIGR01152] Photosystem II, DII subunit (also called Q(A)) 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGATCG CAGTTGGTAG CGCCCCACAA AGAGGATGGT TTGATGTCCT TGATGATTGG 
TTGAAGCGCG ACCGCTTTGT ATTTATTGGT TGGTCCGGAC TACTCCTACT TCCTTGTGCA
TATCTTGCTA TAGGTGGTTG GTTCGTCGGA ACAACATTTG TTACCTCTTG GTACACACAT
GGAGTTGCAA GCTCATACCT TGAAGGTTGT AACTTTTTAA CAGCAGCTGT AAGTACCCCT
GGTGATGCCA TGGGACACAG TCTTCTATTT TTATGGGGTC CTGAAGCCCA AGGTAGTTTC
GTAAGATGGC TACAGCTTGG TGGTCTTTGG AACTTCGTTG CATTACATGG AGTATTTGGC
CTTATTGGTT TTATGCTTCG TCAGTTTGAA ATTGCTGGCC TTGTTGGAAT TAGACCTTAC
AACGCTTTAG CATTCTCAGC AGTAATTGCA GTATTTACAA GTATTTTCCT TATTTATCCT
TTAGGACAGC ATAGTTGGTT CTTCGCACCT TCATTCGGTG TTGCAGCAAT CTTCCGTTAC
ATTCTGTTCA TTCAAGGTTT TCATAATATT ACTTTAAATC CATTTCACAT GATGGGTGTT
GCTGGAATTC TTGGTGGTGC TCTACTTTGC GCTATTCATG GAGCTACAGT ACAAAACACT
TTGTATGAAG ATACAAGTAT TTATACAGAT GGTAAGGTTC AAAGTTCAAC ATTTAGAGCT
TTTGACCCAA CTCAAGAAGA AGAAACTTAT TCAATGATTA CAGCGAATAG ATTCTGGAGT
CAAATCTTCG GTATTGCTTT CTCAAACAAG CGTTTCTTAC ATTTCTTGAT GCTATTTGTA
CCTGTAATGG GTATGTGGAC ATCTTCAATT GGTATCGTCG GCTTAGCACT AAACTTAAGA
GCTTATGATT TCGTAAGCCA AGAAATTCGT GCAGCAGAAG ATCCAGAATT TGAAACTTTC
TATACAAAAA ATATACTTTT GAACGAAGGT ATGCGAGCAT GGATGTCTTC TGTGGATCAA
CCACACGAAA ACTTTGTATT CCCTGAGGAG GTTCTTCCAC GTGGAAACGC CCTTTAA
 
Protein sequence
MTIAVGSAPQ RGWFDVLDDW LKRDRFVFIG WSGLLLLPCA YLAIGGWFVG TTFVTSWYTH 
GVASSYLEGC NFLTAAVSTP GDAMGHSLLF LWGPEAQGSF VRWLQLGGLW NFVALHGVFG
LIGFMLRQFE IAGLVGIRPY NALAFSAVIA VFTSIFLIYP LGQHSWFFAP SFGVAAIFRY
ILFIQGFHNI TLNPFHMMGV AGILGGALLC AIHGATVQNT LYEDTSIYTD GKVQSSTFRA
FDPTQEEETY SMITANRFWS QIFGIAFSNK RFLHFLMLFV PVMGMWTSSI GIVGLALNLR
AYDFVSQEIR AAEDPEFETF YTKNILLNEG MRAWMSSVDQ PHENFVFPEE VLPRGNAL