Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_10131 |
Symbol | |
ID | 4777925 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 923704 |
End bp | 924756 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640086522 |
Product | light-harvesting complex protein |
Protein accession | YP_001017027 |
Protein GI | 124022720 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03041] chlorophyll a/b binding light-harvesting protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.379961 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAAACTT ATGGAAATCC TAACCCCACC TATGGGTGGT GGGCAGGAAA TGCTGGAACT ACAAATCGAT CAGGAAAATT CCTGGCAGCG CATATTGCCC ACACAGGTTT GATGGCCTTC TGGGCAGGAT CGTTCACCTT ATTTGAACTT TCCCGTTATG ACCCCTCAGT ACCCATGGGG CATCAGCCTC TGGTTGCCCT CCCTCATCTG GCAACTCTTG GAATAGGCGT CGGTGATGGT GGTGTCATAA CTGATACTTA CCCAATTGTT GTGACGGCAG TCCTGCACCT GGTTCTATCC ATGGTTTATG CAGCTGGCGG ACTTATGCAC TCCCTCCTAT TCAATGGAGA CATTGGCGAA ATGGGAGTTA AGTGGGCTAG AAAGTTTGAC TTTAAGTGGG ATGATCCAGA CAAGTTGACC TTTATTCTTG GTCATCACTT ATTCCTATTA GGCCTTGGCA ACGTTCAATT TGTTGAATGG GCTAAGTATT ACGGCTTGTA TGACAATGCA GAAGGGGTAG TACGAACTGT AGTACCAAAC CTGAACATTG GCATGGTTTG GAATGCTCAG TTTAATTTCC TAGCTATTAA TAGCCTGGAA GATGTAATGG GCGGCCATGC CTTCCTCGCA TTGTTCATGA TGTCTGGTGG CCTTTGGCAC ATTGTTACCA AGCAAGCTGG TGAATACACC ACGTTTAAAG GTAAAGGAAT CCTTAGTGCT GAAGCTCAGC TCTCATGGGC CCTTGCAGGT GTAGGTTGGA TGGCTTTAGT TGCTGCCTTC TGGTGTGCTT CTAACACAAC CATTTATCCA GATACATTCT TCGGTGAAGT TCTAGATCTC AAATTCAGTA TCTCCCCTTA TTGGGTTGAC ACTGCAAATC TTCCTGAAGG AACCTACACA TCACGTGCTT GGTTGACCAA CATCCACTAC TACTTGGGAT TCTTCTATAT CCAAGGTCAT CTGTGGCATG CCCTCAGAGC TCTCGGATTT GATTTCAAGA GAGTTTCCAA TGCTATCGGC AATGCCGACA GCGCAACAAT TACTCTGAAC TGA
|
Protein sequence | MQTYGNPNPT YGWWAGNAGT TNRSGKFLAA HIAHTGLMAF WAGSFTLFEL SRYDPSVPMG HQPLVALPHL ATLGIGVGDG GVITDTYPIV VTAVLHLVLS MVYAAGGLMH SLLFNGDIGE MGVKWARKFD FKWDDPDKLT FILGHHLFLL GLGNVQFVEW AKYYGLYDNA EGVVRTVVPN LNIGMVWNAQ FNFLAINSLE DVMGGHAFLA LFMMSGGLWH IVTKQAGEYT TFKGKGILSA EAQLSWALAG VGWMALVAAF WCASNTTIYP DTFFGEVLDL KFSISPYWVD TANLPEGTYT SRAWLTNIHY YLGFFYIQGH LWHALRALGF DFKRVSNAIG NADSATITLN
|
| |