Gene P9211_11601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_11601 
Symbol 
ID5730367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1058285 
End bp1059334 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content44% 
IMG OID641285528 
Producthypothetical protein 
Protein accessionYP_001551045 
Protein GI159903701 
COG category 
COG ID 
TIGRFAM ID[TIGR03041] chlorophyll a/b binding light-harvesting protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0985613 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGACCT ACGGAAACCC AGATGTCACC TACGGGTGGT GGGCTGGTAA TTCTGGAGTC 
ACCAACCGCT CAGGCAAATT CATTGCTGCT CATGCAGCTC ATACTGGACT TATTGCCTTT
GGGTGCGGTG CAGCCACACT TGTCGAACTA GCTGGCTTTG ACGCTTCCCT GCCAATGGGA
CATCAAAGCT CTCTCTTTCT TGCTCACTTA GCATCAGTCG GCATTGGTTT CAATGATGCT
GGAGTTTGGA CAGGTGTAGG TGTGGCAAAT ATTGCAATAC TTCACTTGAT TCTCTCCATG
GTTTATGGGG GAGGAGGACT TTTGCACTCT GTTTATTTCA CAGGAGATAT GCAGCAGTCA
GAAGTACCAC AAGCTCGAAA ATTTAAATTG GAATGGGATA ACCCAGACAA CCAAACTTTT
ATTCTTGGTC ACCATTTGCT TTTCTTTGGT GTTGCGAATA TTTGGTTTGT TGAATGGGCC
AGGATCCATG GAATTTATGA TCCTGCTATT GATGCAATAC GCCAAGTCAA CTACAACCTT
GACCTTACCC AGATTTGGAA CCATCAATTT GATTTTCTAG CTATTGATAG CCTTGAGGAT
GTAATGGGTG GACATGCCTT CTTAGCTTTC TTCCAGCTCG GAGGAGGTGC TTTCCATATA
GCAACAAAGC AAATTGGCAC TTATACAAAA TTCAAAGGCA AAGGTTTACT GTCCGCTGAA
GCAATACTTT CTTGGTCACT AGCAGGTATT GGCTGGATGG CATGTGTTGC TGCTTTCTGG
GCTGCAACAA ACACAACTGT TTATCCAGAA GCTTGGTATG GAGAAGTTCT TCAGTTTAAA
TTTGGAGTAG CTCCTTATTG GATAGACACA GTTCCTGGAG GAACTGCGTT CTGGGGCCAT
ACCACTAGGG CTGCTTTGGT TAATGTGCAT TACTACTTTG GATTTTTCTT TATTCAAGGA
CATTTATGGC ATGCATTAAG AGCTATGGGC TTTGACTTCA AGCGATTGAG AGATACAAAT
GGTCCTTTCG GAGTTCCAAG GACTCTTTAA
 
Protein sequence
MQTYGNPDVT YGWWAGNSGV TNRSGKFIAA HAAHTGLIAF GCGAATLVEL AGFDASLPMG 
HQSSLFLAHL ASVGIGFNDA GVWTGVGVAN IAILHLILSM VYGGGGLLHS VYFTGDMQQS
EVPQARKFKL EWDNPDNQTF ILGHHLLFFG VANIWFVEWA RIHGIYDPAI DAIRQVNYNL
DLTQIWNHQF DFLAIDSLED VMGGHAFLAF FQLGGGAFHI ATKQIGTYTK FKGKGLLSAE
AILSWSLAGI GWMACVAAFW AATNTTVYPE AWYGEVLQFK FGVAPYWIDT VPGGTAFWGH
TTRAALVNVH YYFGFFFIQG HLWHALRAMG FDFKRLRDTN GPFGVPRTL