Gene P9211_03501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_03501 
SymbolpsbB 
ID5730768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp329417 
End bp330973 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content47% 
IMG OID641284698 
Productphotosystem II PsbB protein (CP47) 
Protein accessionYP_001550235 
Protein GI159902891 
COG category 
COG ID 
TIGRFAM ID[TIGR03039] photosystem II chlorophyll-binding protein CP47 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.234957 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATTGC CCTGGTATCG GGTGCACACT GTCGTTATTA ACGATCCTGG CCGACTCTTG 
GCCGTGCACC TCATGCACAC TGCTTTGCTA GCCGGCTGGG CCGGCTCCAT GGCCCTATAT
GAATTGGCCA TATTTGATCC TTCTGATCCA GTCTTGAACC CTATGTGGCG CCAAGGCATG
TATGTCATGC CGTTCATGGC CCGCTTAGGT GTCACAAGCA GTTGGAAGGG TTGGGACATC
ACTGGAGGTG TCGGCTCATT CGACTTTGAT TCATTAGGGT TCTGGGGAAA AGCTCTACCT
TATTCAACTT TTGAAGGGGT CGCTGTAGCA CATATCCTCT TCAGTGGCCT ATTAATGCTT
GCTGCCATCT GGCATTGGAC CTATTGGGAT CTAGAGCTCT GGGAAGACTC CCGAACAGGA
GAGCCAGCCT TAGATCTACC AAGAATTTTC GGAATCCATT TACTTCTTGC AGGGCTTACT
TGCTTTGGTT TTGGCGCATT CCATCTATCA GCCGTTGGCA TGTGGGTCTC AGACTCATAT
GGCCTAGGAG GTCACGTAGA AAAAGTCGCT CCTGTTTGGG GTGCAGACGG CTTCAACCCC
TTTAGTGCTG GAGGAATTGT CGCTAACCAT ATTGGAGCTG GTCTTTTAGG AATTATTGGT
GGAGTTTTCC ATATCACCAA CCGTCCTGGA GAAAGGCTCT ATAGGAATTT AAGGATGGGA
AGCCTAGAAG GTGTTCTCGC AAGTGCGCTT GCTGCAGTCC TATTTGTCTC GTTTGTGGTT
GCTGGAACTA TGTGGTATGG ATCTGCTACC ACACCAATTG AGCTATTTGG TCCTACCAGA
TACCAATGGG ATTCTGGGTA CTTCAAAACT GAAATAAATA GAAGGGTGCA AGCCTCTATA
AATGAGGGTG CTTCTAAAGA AGAAGCTTAT GCAGCAATCC CTGAGAAGCT AGCTTTCTAT
GACTATGTAG GAAATAGCCC TGCAAAAGGA GGATTATTTA GAGCTGGTGC TCTAGTTAAT
GGCGATGGTG TCCCAACTGG CTGGCAAGGT CACGTTTCAT TCTCAGATAA AGAAGGAAAT
GAGCTTGAAG TCAGAAGAAT GCCAAACTTC TTTGAGAACT TCCCAGTAAT TCTTGAAGAC
AAAGATGGAA ATGTCAGAGC TGACATTCCA TTCCGTAGAG CAGAAGCCAA GTATTCCTTT
GAACAAACAG GCATTACTGC AACAGTTTAT GGTGGTGAAC TAAGTGGGCA AACCTTTAGT
GACCCTGTAG TTGTTAAGCG TCTTGCTCGC AAAGCACAAC TTGGTGAGTC CTTTAAGTTC
GATAGAGATC GCTACAAATC AGATGGGGTC TTCCGAAGTG GCCCAAGAGC ATGGTTTACT
TATGCTCATG CTTGCTTTGG GTTGCTCTAC TTATTTGGGC ACTGGTGGCA TGCTGCCAGA
ACTCTATATC GAGATACCTT TGCTGGAATT GATCCAGACC TTGGCGACCA GGTCGAGTTT
GGTCTCTTCA AGAAACTTGG AGATGAATCC ACACGACGCG TCCCAGGGCG TGCTTAA
 
Protein sequence
MGLPWYRVHT VVINDPGRLL AVHLMHTALL AGWAGSMALY ELAIFDPSDP VLNPMWRQGM 
YVMPFMARLG VTSSWKGWDI TGGVGSFDFD SLGFWGKALP YSTFEGVAVA HILFSGLLML
AAIWHWTYWD LELWEDSRTG EPALDLPRIF GIHLLLAGLT CFGFGAFHLS AVGMWVSDSY
GLGGHVEKVA PVWGADGFNP FSAGGIVANH IGAGLLGIIG GVFHITNRPG ERLYRNLRMG
SLEGVLASAL AAVLFVSFVV AGTMWYGSAT TPIELFGPTR YQWDSGYFKT EINRRVQASI
NEGASKEEAY AAIPEKLAFY DYVGNSPAKG GLFRAGALVN GDGVPTGWQG HVSFSDKEGN
ELEVRRMPNF FENFPVILED KDGNVRADIP FRRAEAKYSF EQTGITATVY GGELSGQTFS
DPVVVKRLAR KAQLGESFKF DRDRYKSDGV FRSGPRAWFT YAHACFGLLY LFGHWWHAAR
TLYRDTFAGI DPDLGDQVEF GLFKKLGDES TRRVPGRA