Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_17721 |
Symbol | |
ID | 4778975 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1547562 |
End bp | 1548668 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640087279 |
Product | light-harvesting complex protein |
Protein accession | YP_001017779 |
Protein GI | 124023472 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03041] chlorophyll a/b binding light-harvesting protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAAACCT ACGGGAAAAC AGACGTTACC TACGCCTGGT ACGCGGGTAA CAGTGGAGTG ACCAACCGAT CAGGCCGATT CATTGCCTCG CATATTGGCC ATACAGGATT AATTTGCTTC GGCGCTGGTG CCAACACCCT ATTTGAGCTA GCTCGTTACG ATTCTGCATT GCCAATAGGT GACCAAGGAT TTGTAGTTCT TCCTCACCTA GCAGGACTTG GTATTGGTGG CATAGAGAAC GGTGTGATTA CAGACTCCTA TGGGATGCTT GTCGTCGCAG TCTTCCATCT AATCTTTTCG GCCGTTTATG CCGGCGGAGC GATGCTTCAC TCCTTTCGAT ACAAGGAGGA CCTGGGAGAA TACCCGCAAG GATCCAGACC CAACAAATTT GATTTTAAAT GGGATGATCC AGACAGGCTC ACCTTCATAC TTGGACATCA CCTGCTATTC CTAGGTCTTG GCTGTGTTCA ATTTGTTGAA TGGGCTAAAT ACCATGGAAT TTATGACCCA GCAATGGGTG TTGTACGTAA GGTTGAATAC AACCTTGACT TGTCAATGGT TTGGAATCAC CAGATTGATT TCCTTACGAT TAACAGTTTG GAAGATGTGA TGGGCGGTCA TGCATTCTTG GCCTTCTTCT TGAGTGCTGG TGCTGTTTGG CATATTTTCA GCAAGCCATT TGGGGAATAC ACTGAATTCA AAGGAAAAGG ACTCTTATCT GCTGAATTTG TTCTTTCTAC CTCATTAGCA GGTGCAGCCT TTATTGCTTT CGTGGCAGCC TTCTGGGCTT CTATGAACAC TACAATTTAT CCAACTGATC TGTATGGAGG TCCTCTCAAT ATCGAATTGA ACTTCGCTCC ATATTTCTCA GATACGGATC CATTGTTTGG TGGAGACGTA CACTCAGCCC GTTCATGGCT GTCAAACTTC CATTTCTACC TTGGATTCTT CTATCTTCAG GGTCATTTCT GGCATGGATT GAGAGCGATG GGCTTTGACT TCAAGCGTGT TGAAAAATTG TTCGATCAGC TAGAAAGCAA CGAAATTAGT CTTAACCCAG GTAAAAGTAC GACCGTGCCA TCAACATCGA CAGATAACGC CACATAA
|
Protein sequence | MQTYGKTDVT YAWYAGNSGV TNRSGRFIAS HIGHTGLICF GAGANTLFEL ARYDSALPIG DQGFVVLPHL AGLGIGGIEN GVITDSYGML VVAVFHLIFS AVYAGGAMLH SFRYKEDLGE YPQGSRPNKF DFKWDDPDRL TFILGHHLLF LGLGCVQFVE WAKYHGIYDP AMGVVRKVEY NLDLSMVWNH QIDFLTINSL EDVMGGHAFL AFFLSAGAVW HIFSKPFGEY TEFKGKGLLS AEFVLSTSLA GAAFIAFVAA FWASMNTTIY PTDLYGGPLN IELNFAPYFS DTDPLFGGDV HSARSWLSNF HFYLGFFYLQ GHFWHGLRAM GFDFKRVEKL FDQLESNEIS LNPGKSTTVP STSTDNAT
|
| |