Gene Information |
Locus tag | P9211_11571 |
Symbol | |
ID | 5731121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1056212 |
End bp | 1057297 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641285525 |
Product | hypothetical protein |
Protein accession | YP_001551042 |
Protein GI | 159903698 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03041] chlorophyll a/b binding light-harvesting protein |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0991489 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCAGACCT ATGGAAATCC AGAAGTTACC TATGGATGGT GGGCTGGTAA TTCTGTGGTT ACCAACCGCT CTGGCCGATT TATTGCCTCT CATATAGGCC ATACGGGCTT GATCTGCTTT GCGGCTGGTG GCAGCACTCT TTGGGAGCTA GCCAGATATA ACCCAGAAAT ACCTATGGGT CATCAAAGCT CTCTATTTCT GGCACATCTT GCTTCTATTG GGCTTGGCTT TGATGAAGCT GGAGTTTGGA CAGGGGTTGG TGTTGCAACC ATTGCAATTT TTCACCTTAT CTTCTCAATG GTTTATGGAG GAGGAGGACT TCTTCATGCA ATCCTATTTG AGGAGAATGT AGAAGATAGT GAAGTCTTAC AAGCTAAGAA ATTTAAGCTT GAGTGGAATA ACCCTGATAA TCAGACCTTC ATACTTGGCC ACCACCTTAT ATTTTTTGGT GTTGCATGTA TTTGGTTTGT TGAATGGGCG AGAATTCACG GAATTTATGA TCCTGCAGTA GGAGCAATTC GCCAGGTTAA TTACAATCTT GACTTGTCAA TGATTTGGGA AAGGCAGTTT GACTTTCTTG CTATTGATAG TCTTGAAGAT GTTATGGGTG GCCATGCTTT CTTAGCTTTT GTTGAGATTA CTGGGGGTGC TTTTCATATA GTTGCAGGCT CAACCCCTTG GGAAGATAAA AGACTAGGTG AATGGAGTAA GTTTAAAGGA GCAGAACTTC TTTCAGCTGA AGCAGTACTT TCTTGGTCAC TAGCTGGCAT TGGTTGGATG GCTATTGTTG CAGCTTTCTG GTGTGCTTCT AATACCACTG TTTATCCAGA AGCTTGGTAT GGTGAGCCAC TTGAATTTAA ATTTTCAGTT TCACCATATT GGATAGATAC TGGAGATTTA TCTGATGCGA CTGCTTTTTG GGGGCATTCC ACTAGAGCTG CCTTGGCTAA TGTGCATTAT TATCTAGGCT TCTTCTTCCT TCAAGGTCAT TTCTGGCATG CCCTTAGAGC CTTAGGCTTT GACTTCAAGA GTGTCACTAG TGCTATAGGA AATGAAAAGA CAGCCACCTT TACTATTAAA TCTTGA
Protein sequence | MQTYGNPEVT YGWWAGNSVV TNRSGRFIAS HIGHTGLICF AAGGSTLWEL ARYNPEIPMG HQSSLFLAHL ASIGLGFDEA GVWTGVGVAT IAIFHLIFSM VYGGGGLLHA ILFEENVEDS EVLQAKKFKL EWNNPDNQTF ILGHHLIFFG VACIWFVEWA RIHGIYDPAV GAIRQVNYNL DLSMIWERQF DFLAIDSLED VMGGHAFLAF VEITGGAFHI VAGSTPWEDK RLGEWSKFKG AELLSAEAVL SWSLAGIGWM AIVAAFWCAS NTTVYPEAWY GEPLEFKFSV SPYWIDTGDL SDATAFWGHS TRAALANVHY YLGFFFLQGH FWHALRALGF DFKSVTSAIG NEKTATFTIK S
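The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon (the gene sequence above ends in TGA). The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database tooling.

```python
# Values copied from the record above.
START_BP = 1056212
END_BP = 1057297
GENE_LENGTH_BP = 1086
PROTEIN_LENGTH_AA = 361


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the 1086 bp span corresponds to 362 codons, of which 361 encode amino acids.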