Gene P9211_11571 details


Gene Information

Locus tag: P9211_11571
Symbol:
ID: 5731121
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1056212
End bp: 1057297
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 42%
IMG OID: 641285525
Product: hypothetical protein
Protein accession: YP_001551042
Protein GI: 159903698
COG category:
COG ID:
TIGRFAM ID: [TIGR03041] chlorophyll a/b binding light-harvesting protein
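
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, but one that reproduces the reported 1086 bp) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    # The terminal stop codon does not code for an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(1056212, 1057297)
print(length)                  # 1086
print(protein_length(length))  # 361
```

Both values agree with the Gene Length and Protein Length fields of the record.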


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0991489
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGACCT ATGGAAATCC AGAAGTTACC TATGGATGGT GGGCTGGTAA TTCTGTGGTT 
ACCAACCGCT CTGGCCGATT TATTGCCTCT CATATAGGCC ATACGGGCTT GATCTGCTTT
GCGGCTGGTG GCAGCACTCT TTGGGAGCTA GCCAGATATA ACCCAGAAAT ACCTATGGGT
CATCAAAGCT CTCTATTTCT GGCACATCTT GCTTCTATTG GGCTTGGCTT TGATGAAGCT
GGAGTTTGGA CAGGGGTTGG TGTTGCAACC ATTGCAATTT TTCACCTTAT CTTCTCAATG
GTTTATGGAG GAGGAGGACT TCTTCATGCA ATCCTATTTG AGGAGAATGT AGAAGATAGT
GAAGTCTTAC AAGCTAAGAA ATTTAAGCTT GAGTGGAATA ACCCTGATAA TCAGACCTTC
ATACTTGGCC ACCACCTTAT ATTTTTTGGT GTTGCATGTA TTTGGTTTGT TGAATGGGCG
AGAATTCACG GAATTTATGA TCCTGCAGTA GGAGCAATTC GCCAGGTTAA TTACAATCTT
GACTTGTCAA TGATTTGGGA AAGGCAGTTT GACTTTCTTG CTATTGATAG TCTTGAAGAT
GTTATGGGTG GCCATGCTTT CTTAGCTTTT GTTGAGATTA CTGGGGGTGC TTTTCATATA
GTTGCAGGCT CAACCCCTTG GGAAGATAAA AGACTAGGTG AATGGAGTAA GTTTAAAGGA
GCAGAACTTC TTTCAGCTGA AGCAGTACTT TCTTGGTCAC TAGCTGGCAT TGGTTGGATG
GCTATTGTTG CAGCTTTCTG GTGTGCTTCT AATACCACTG TTTATCCAGA AGCTTGGTAT
GGTGAGCCAC TTGAATTTAA ATTTTCAGTT TCACCATATT GGATAGATAC TGGAGATTTA
TCTGATGCGA CTGCTTTTTG GGGGCATTCC ACTAGAGCTG CCTTGGCTAA TGTGCATTAT
TATCTAGGCT TCTTCTTCCT TCAAGGTCAT TTCTGGCATG CCCTTAGAGC CTTAGGCTTT
GACTTCAAGA GTGTCACTAG TGCTATAGGA AATGAAAAGA CAGCCACCTT TACTATTAAA
TCTTGA
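
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be stripped of whitespace before any computation. The following is a minimal sketch of that cleanup together with a GC-content calculation; `clean_sequence` and `gc_percent` are hypothetical helper names, and how the record's own GC figure was rounded is not stated in the source.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above: six blocks of 10 bases.
first_row = "ATGCAGACCT ATGGAAATCC AGAAGTTACC TATGGATGGT GGGCTGGTAA TTCTGTGGTT"
print(len(clean_sequence(first_row)))  # 60
```

Passing the full sequence text to `gc_percent` gives a value that can be compared with the GC content field of the record.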
 
Protein sequence
MQTYGNPEVT YGWWAGNSVV TNRSGRFIAS HIGHTGLICF AAGGSTLWEL ARYNPEIPMG 
HQSSLFLAHL ASIGLGFDEA GVWTGVGVAT IAIFHLIFSM VYGGGGLLHA ILFEENVEDS
EVLQAKKFKL EWNNPDNQTF ILGHHLIFFG VACIWFVEWA RIHGIYDPAV GAIRQVNYNL
DLSMIWERQF DFLAIDSLED VMGGHAFLAF VEITGGAFHI VAGSTPWEDK RLGEWSKFKG
AELLSAEAVL SWSLAGIGWM AIVAAFWCAS NTTVYPEAWY GEPLEFKFSV SPYWIDTGDL
SDATAFWGHS TRAALANVHY YLGFFFLQGH FWHALRALGF DFKSVTSAIG NEKTATFTIK
S
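
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the usual compact NCBI layout (bases ordered T, C, A, G). It assumes that translation table 11 assigns the same amino acids as the standard code and differs only in permitted start codons, so it does not model alternative initiation. `translate` is an illustrative helper, not a library function.

```python
BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG; '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence give the first block of the protein.
print(translate("ATGCAGACCT ATGGAAATCC AGAAGTTACC"))  # MQTYGNPEVT
```

The final codons of the gene, `TCT TGA`, translate to a serine followed by a stop, which matches the protein ending in `S`.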