Gene Information |
Locus tag | P9211_07351 |
Symbol | |
ID | 5730733 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 639264 |
End bp | 640319 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641285098 |
Product | light-harvesting complex protein |
Protein accession | YP_001550620 |
Protein GI | 159903276 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03041] chlorophyll a/b binding light-harvesting protein |
|
|
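The coordinates, gene length, and protein length listed above should agree with one another. The following is a minimal sketch of that consistency check. It assumes the start and end positions are 1-based and inclusive (the record does not say so) and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the record; variable names are illustrative.
start_bp = 639264
end_bp = 640319
listed_gene_length_bp = 1056
listed_protein_length_aa = 351

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not counted in the protein.
coding_codons = span_bp // 3 - 1

print(span_bp == listed_gene_length_bp)           # span matches the listed gene length
print(span_bp % 3 == 0)                           # span is a whole number of codons
print(coding_codons == listed_protein_length_aa)  # codons minus stop match the protein length
```

With the values in this record all three checks hold: 1056 bp is 352 codons, which is 351 residues plus one stop codon.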
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.112937 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGACAT ATGGAAATCC AGACGTCACC TACGGGTGGT GGGCTGGTAA TTCTGGGGTT ACTAATCGCT CAGGCAAATT CATCGCTGCT CATGCCTCAC ATACTGGCTT GATAGCTTTC TGGGCTGGTG CCTTCACTTT ATTTGAATTA GCGCGTTTCG ACCCATCAGT ACCTATGGGT CACCAACCTT TGATCGCTCT TCCTCATTTA GCGACATTAG GAATTGGTTT TGATGAGACA GGTACTTTTG TAGGTGGAAG CACTGTTATT GCAGTTGCTG TTGTTCACCT TGTAGCTTCT ATGGTTTATG GGGCTGGCGG ACTTCTTCAT TCACTTCTCT TTGCAGGTGA TATGCAGGAC TCTCAAGTAG CTCAGGCCAG AAAATTCAAA TTGGAATGGG ATAACCCAGA TAATCAAACT TTTATCTTGG GTCATCATCT TATTTTCTTG GGTATAGCTA ATATCCAGTT TGTTGAGTGG GCCAGAGTTC ACGGTATCTG GGATTCAGCA GCAGGTGCTG TCCGTCAGGT TGAATACAAT CTCAACTTGT CCGCAATTTG GAATCACCAG TTCGACTTCC TTTCTATAAG TAATCTTGAG GACGTCATGG GAGGGCACGC TTTCTTGGCA TTCCTAATGA TTTCTGGCGG TGCTTTCCAT ATTGCTACTA AGCAAGTTGG AGAATACAGC AAGTTCAAAG GTTCAGGATT ACTTTCTGCA GAAGCAGTAC TTTCTTGGTC ACTAGCTGGT ATTGGTTGGA TGGCTATTGT TGCAGCCTTC TGGTGTGCAA CAAACACCAC CGTTTACCCA GTTGAGTATT TTGGTGAGGT TCTAGAGCTT AAGTTTGGAG TTTCGCCTTA TTGGGTGGAT ACTGTAGCTC TCGCTGAAGG TGCCCATACA TCTAGAGCTT GGTTGACGAA TGTCCATTAC TATCTTGGAT TCTTCTATAT TCAAGGACAT TTATGGCATG CGTTGCGTGC AATGGGCTTT GATTTCAAAC GAGTAGCATC TGCTGTAAGT AATATTGGTT CTGCAGATAT TACCTTGAAT GATTGA
|
Protein sequence | MQTYGNPDVT YGWWAGNSGV TNRSGKFIAA HASHTGLIAF WAGAFTLFEL ARFDPSVPMG HQPLIALPHL ATLGIGFDET GTFVGGSTVI AVAVVHLVAS MVYGAGGLLH SLLFAGDMQD SQVAQARKFK LEWDNPDNQT FILGHHLIFL GIANIQFVEW ARVHGIWDSA AGAVRQVEYN LNLSAIWNHQ FDFLSISNLE DVMGGHAFLA FLMISGGAFH IATKQVGEYS KFKGSGLLSA EAVLSWSLAG IGWMAIVAAF WCATNTTVYP VEYFGEVLEL KFGVSPYWVD TVALAEGAHT SRAWLTNVHY YLGFFYIQGH LWHALRAMGF DFKRVASAVS NIGSADITLN D
|
| |
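The sequences above are displayed in space-separated blocks of 10 characters, so the spacing has to be removed before any computation. The following is a minimal sketch of how the listed GC content could be recomputed from the gene sequence. The helper names `strip_blocks` and `gc_fraction` are hypothetical and are not part of the database. The demo input is only the first three blocks of the gene sequence, so its result is not expected to equal the 44% reported for the full gene.

```python
def strip_blocks(seq: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    bases = strip_blocks(seq)
    if not bases:
        return 0.0
    return sum(base in "GC" for base in bases) / len(bases)


# First three blocks of the gene sequence above, used only as a short demo input.
fragment = "ATGCAGACAT ATGGAAATCC AGACGTCACC"
print(len(strip_blocks(fragment)))
print(round(gc_fraction(fragment), 2))
```

Pasting the full gene sequence in place of `fragment` gives a value to compare against the GC content listed in the record.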