Gene Information |
Locus tag | P9211_11601 |
Symbol | |
ID | 5730367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1058285 |
End bp | 1059334 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641285528 |
Product | hypothetical protein |
Protein accession | YP_001551045 |
Protein GI | 159903701 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03041] chlorophyll a/b binding light-harvesting protein |
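The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the numbers in this record) and that the final codon is an untranslated stop. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1058285
end_bp = 1059334
gene_length_bp = 1050
protein_length_aa = 349

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons; the final stop codon is not translated.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(remainder == 0)                                # True
print(expected_protein_length == protein_length_aa)  # True
```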
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0985613 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGACCT ACGGAAACCC AGATGTCACC TACGGGTGGT GGGCTGGTAA TTCTGGAGTC ACCAACCGCT CAGGCAAATT CATTGCTGCT CATGCAGCTC ATACTGGACT TATTGCCTTT GGGTGCGGTG CAGCCACACT TGTCGAACTA GCTGGCTTTG ACGCTTCCCT GCCAATGGGA CATCAAAGCT CTCTCTTTCT TGCTCACTTA GCATCAGTCG GCATTGGTTT CAATGATGCT GGAGTTTGGA CAGGTGTAGG TGTGGCAAAT ATTGCAATAC TTCACTTGAT TCTCTCCATG GTTTATGGGG GAGGAGGACT TTTGCACTCT GTTTATTTCA CAGGAGATAT GCAGCAGTCA GAAGTACCAC AAGCTCGAAA ATTTAAATTG GAATGGGATA ACCCAGACAA CCAAACTTTT ATTCTTGGTC ACCATTTGCT TTTCTTTGGT GTTGCGAATA TTTGGTTTGT TGAATGGGCC AGGATCCATG GAATTTATGA TCCTGCTATT GATGCAATAC GCCAAGTCAA CTACAACCTT GACCTTACCC AGATTTGGAA CCATCAATTT GATTTTCTAG CTATTGATAG CCTTGAGGAT GTAATGGGTG GACATGCCTT CTTAGCTTTC TTCCAGCTCG GAGGAGGTGC TTTCCATATA GCAACAAAGC AAATTGGCAC TTATACAAAA TTCAAAGGCA AAGGTTTACT GTCCGCTGAA GCAATACTTT CTTGGTCACT AGCAGGTATT GGCTGGATGG CATGTGTTGC TGCTTTCTGG GCTGCAACAA ACACAACTGT TTATCCAGAA GCTTGGTATG GAGAAGTTCT TCAGTTTAAA TTTGGAGTAG CTCCTTATTG GATAGACACA GTTCCTGGAG GAACTGCGTT CTGGGGCCAT ACCACTAGGG CTGCTTTGGT TAATGTGCAT TACTACTTTG GATTTTTCTT TATTCAAGGA CATTTATGGC ATGCATTAAG AGCTATGGGC TTTGACTTCA AGCGATTGAG AGATACAAAT GGTCCTTTCG GAGTTCCAAG GACTCTTTAA
|
Protein sequence | MQTYGNPDVT YGWWAGNSGV TNRSGKFIAA HAAHTGLIAF GCGAATLVEL AGFDASLPMG HQSSLFLAHL ASVGIGFNDA GVWTGVGVAN IAILHLILSM VYGGGGLLHS VYFTGDMQQS EVPQARKFKL EWDNPDNQTF ILGHHLLFFG VANIWFVEWA RIHGIYDPAI DAIRQVNYNL DLTQIWNHQF DFLAIDSLED VMGGHAFLAF FQLGGGAFHI ATKQIGTYTK FKGKGLLSAE AILSWSLAGI GWMACVAAFW AATNTTVYPE AWYGEVLQFK FGVAPYWIDT VPGGTAFWGH TTRAALVNVH YYFGFFFIQG HLWHALRAMG FDFKRLRDTN GPFGVPRTL
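The relationship between the two sequences above can be illustrated with a minimal translation sketch. It relies on two points: translation table 11 uses the same sense-codon and stop-codon assignments as the standard genetic code (it differs in which codons may act as start codons), and the gene sequence shown here already reads in the coding direction (it begins with ATG and ends with TAA), so no reverse complement is taken despite the "-" strand. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example, not part of any database tool.

```python
from itertools import product

# Codon assignments in the conventional TCAG ordering (shared by tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-direction DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in the record) is ignored.
    An alternative start codon would need special handling; the gene in this
    record starts with ATG, so none is applied here.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCAGACCT ACGGAAACCC AGATGTCACC"))  # prints MQTYGNPDVT
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed in the record once the spaces between 10-residue blocks are removed.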