Gene P9303_18681 details

Gene Information

Locus tag: P9303_18681
Symbol:
ID: 4777504
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1629607
End bp: 1630683
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 54%
IMG OID: 640087377
Product: hypothetical protein
Protein accession: YP_001017875
Protein GI: 124023568
COG category:
COG ID:
TIGRFAM ID: [TIGR01151] photosystem II, DI subunit (also called Q(B))
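
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1629607
end_bp = 1630683

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1077
print(protein_length)  # 358
```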


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.228862
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACCA CCATTCGCAG TGGTCGCCTC AGTAGCTGGG AATCCTTCTG CAATTGGGTC 
ACCTCCACCA ATAACCGCAT TTATGTGGGT TGGTTCGGTG TCCTGATGGT TCCCACGCTG
TTGGCAGCTG CCATCTGCTT CACCATTGCC TTTATTGCGG CACCGCCGGT TGATATCGAC
GGTATCCGTG AGCCTGTTGC TGGCTCTTTC CTCTACGGCA ATAACATCAT TTCTGGTGCT
GTTGTTCCTT CCAGCAACGC CATCGGCCTG CACTTTTATC CCATTTGGGA AGCTGCTTCT
GTTGATGAGT GGCTTTACAA CGGCGGTCCA TACCAGCTTG TTGTGTTCCA CTTCCTGATC
GGCATCTGTT GCTGGTTGGG TCGCCAATGG GAACTCTCCT ATCGCTTGGG CATGCGCCCT
TGGATCTGTG TGGCCTACAG CGCACCGCTG TCGGCAGCCT TTGCTGTGTT CCTGATCTAT
CCAGTAGGTC AGGGTTCCTT CTCGGATGGC ATGCCTTTAG GCATCTCTGG AACCTTCAAC
TTCATGTTGG TGTTCCAGGC AGAGCACAAC ATCCTCATGC ATCCCTTCCA CATGATTGGT
GTGGCGGGCA TGTTTGGCGG CAGTTTGTTC TCCGCCATGC ACGGTTCACT GGTGACCTCA
TCGTTGATTC GTGAAACCAC CGAAACCGAG TCTCAGAACT ACGGCTATAA GTTCGGCCAA
GAGGAAGAGA CCTACAACAT CGTGGCTGCC CACGGTTATT TCGGTCGTCT GATCTTCCAG
TACGCCAGCT TCAATAACAG CCGCAGTCTG CACTTCTTCC TGGCTGCCTG GCCGGTGATC
TGCATCTGGA TCACCTCACT GGGTATTAGC ACCATGGCCT TCAACTTGAA CGGCTTTAAC
TTCAACCAGT CGGTTCTCGA TGCCCAAGGC AGAGTTGTAC CAACCTGGGC TGATGTGCTC
AACCGCAGCA ACCTCGGTAT GGAAGTGATG CACGAGCGTA ATGCTCATAA CTTCCCTCTC
GACCTGGCAG CTGCTGAGTC CACACCTGTG GCTCTGATTG CTCCTGCGAT TGGCTGA
 
Protein sequence
MTTTIRSGRL SSWESFCNWV TSTNNRIYVG WFGVLMVPTL LAAAICFTIA FIAAPPVDID 
GIREPVAGSF LYGNNIISGA VVPSSNAIGL HFYPIWEAAS VDEWLYNGGP YQLVVFHFLI
GICCWLGRQW ELSYRLGMRP WICVAYSAPL SAAFAVFLIY PVGQGSFSDG MPLGISGTFN
FMLVFQAEHN ILMHPFHMIG VAGMFGGSLF SAMHGSLVTS SLIRETTETE SQNYGYKFGQ
EEETYNIVAA HGYFGRLIFQ YASFNNSRSL HFFLAAWPVI CIWITSLGIS TMAFNLNGFN
FNQSVLDAQG RVVPTWADVL NRSNLGMEVM HERNAHNFPL DLAAAESTPV ALIAPAIG
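
As a quick consistency check between the two sequences, the sketch below translates the first 30 bases of the gene sequence and compares the result with the start of the protein sequence. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (the two differ in permitted start codons, not in codon meanings). `CODON_TABLE` and `translate` are hypothetical helper names introduced only for illustration.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
# Assumption: translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base groups used in the
    sequence listing can be pasted in unchanged.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence.
first_bases = "ATGACCACCA CCATTCGCAG TGGTCGCCTC"
print(translate(first_bases))  # MTTTIRSGRL
```

The same function can be applied to the full gene sequence to compare against the complete protein sequence.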