Gene Information |
Locus tag | P9211_14531 |
Symbol | |
ID | 5731026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1307492 |
End bp | 1308559 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641285831 |
Product | hypothetical protein |
Protein accession | YP_001551338 |
Protein GI | 159903994 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03041] chlorophyll a/b binding light-harvesting protein |
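The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends with a stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1307492, 1308559)
print(length, protein_length(length))  # prints "1068 355"
```

Under these assumptions the result agrees with the Gene Length (1068 bp) and Protein Length (355 aa) fields in the record.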
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00358475 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGACCT ACGGTAATCC AGATGTCACT TATGACTATT GGGCTGGTAA TGCTTCTGTT ACCAACCGAT CTGGTCGATT TATTGCCTCG CATGCAGCGC ATACAGGCAT GATCGCTTTT GGGGCTGGTT CAAACACACT TTTTGAACTA TCACGTTTTG ACCCCTCTTT ACCTATGGGT GACCAAGGGC TTATCTTCCT TCCCCACTTG GCATCTATAG GTATTGGTTT TGACGAAGCA GGAGTTTGGA CTGGTGCAGG AGTATTAACT ATTGCAATTG TTCATCTCAT CCTCTCCATG GTTTATGGAG CTGGTGGCTT AATGCATGCC ATTTATTTCC CAGATGACAT GCAGAAAAGC AGTGTGGCTC AAGCAAGAAA GTTCAAACTA GAATGGGATA ACCCAGATAA TCAAACTTTT ATTCTTGGTC ACCACTTAAT TCTATTTGGG ATTGCTTGTG CTTGGTTTGT TGAATGGGCA AGGATTCATG GAATATATGA CCCTGCAATT GGCGCAGTAA GACAAGTCAA TTACAATCTT GACTTATCAA TGATTTGGGA AAGACAGGTT AATTTCTTAA CCATCGACAG CCTTGAAGAT GTTATGGGAG GTCATGCCTT CTTAGCATTT GTTGAGATTA TTGGTGGTTG TTTTCATGCA ATAGCTGGTT CAACAAAATG GGAAGACAAG CGCCTTGGTT CTTACGACAA ACTCAAGGGT GCAGGTTTAC TTTCTGCTGA AGGCATTCTT TCTTTCAGTC TTGCTGGTAT AGGTTGGATG GCTATTGTTG CTTCTTTCTG GGTTTCACAA AACACGACTG TTTTTCCTGT TGAGTTCTAT GGAGAACCTT TGAACCGTGC ATTTGTAGTA GCGCCAGCTT TTGTTGATTC TATTGATTAC AGCAATGGAA TAGCTCCATT GGGTCATTCT GGACGTTGTT GGACTGCAAA CTTCCATTAC ATTGCAGGAT TCTTTGCATT GCAAGGACAC CTTTGGCATG CACTTCGTGC AATGGGCTTC AATTTCAAGG ATATTGGAGC AAAACTAAGG TCTGCACCAT CAACTTAG
|
Protein sequence | MQTYGNPDVT YDYWAGNASV TNRSGRFIAS HAAHTGMIAF GAGSNTLFEL SRFDPSLPMG DQGLIFLPHL ASIGIGFDEA GVWTGAGVLT IAIVHLILSM VYGAGGLMHA IYFPDDMQKS SVAQARKFKL EWDNPDNQTF ILGHHLILFG IACAWFVEWA RIHGIYDPAI GAVRQVNYNL DLSMIWERQV NFLTIDSLED VMGGHAFLAF VEIIGGCFHA IAGSTKWEDK RLGSYDKLKG AGLLSAEGIL SFSLAGIGWM AIVASFWVSQ NTTVFPVEFY GEPLNRAFVV APAFVDSIDY SNGIAPLGHS GRCWTANFHY IAGFFALQGH LWHALRAMGF NFKDIGAKLR SAPST
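The sequences above are printed in space-separated blocks of ten characters, with the gene sequence shown in coding orientation (it starts with ATG and ends with the stop codon TAG). The sketch below shows one way to strip the block spacing, compute a GC fraction, and translate the CDS. It assumes the amino-acid assignments of NCBI translation table 11, which match the standard code; the helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino-acid assignments in TCAG codon order (standard code, shared by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned, non-empty sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(cds: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
head = clean("ATGCAGACCT ACGGTAATCC AGATGTCACT")
print(translate(head))  # prints "MQTYGNPDVT"
```

Passing the full Gene sequence field through `clean` and `translate` should allow a comparison against the Protein sequence field, and `gc_fraction` can be compared with the GC content field after rounding.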