Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_14281 |
Symbol | |
ID | 5730686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1289835 |
End bp | 1290932 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 641285805 |
Product | hypothetical protein |
Protein accession | YP_001551313 |
Protein GI | 159903969 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03041] chlorophyll a/b binding light-harvesting protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00517015 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAAACCT ATGGGAATCC AAACCCCACT TACGGGTGGT GGGTTGGTAA TTCTGTAGTA ACCAACAAGT CAAGTCGATT TATAGGCTCG CATGTTGCGC ATACAGGATT GATTGCATTT ACCGCTGGCG CAAACACACT TTGGGAACTT GCCCGTTACA ACCCTGATAT CCCTATGGGA CATCAAGGAA TGGTAAGCAT CCCTCATTTG GCATCTATTG GCATTGGTTT TGACCAAGCT GGAGTATGGA CCGGGCAAGA TGTTGCTTTC ATTGGCATCT TTCACCTGAT TTGTTCATTT GTATATGCCC TGGCTGGACT ATTGCACTCA ATAGTTTTCA GTGAAGACAC TCAAAACTCA TCAGGCCTTT TTGCTGAAGG TCGTCCCGAG CATCGTCAAG CGGCTAGATA CAAGCTTGAA TGGGATAACC CAGATAACCA AACCTTTATT CTTGGACACC ATTTGATTTT CTTTGGTGTT GCATGTATTT GGTTTGTTGA ATGGGCAAGA ATTCATGGTA TTTACGATCC TGCTATTGGT GCAGTTCGCC AGGTTGAGTA CAACTTGAAC TTGAATGCTA TTTGGAACCA TCAATTTGAC TTCTTGACTA TAGATAGCCT TGAAGATGTA ATGGGAGGCC ATGCATTCTT GGCTTTTGCT GAGATTCTTG GTGGAGCTCA CCACATTGCA ACCAAGATGG GTTCTGGAGC TCTTGGAGAA TATACTGAAT TCAAAGGTAA GAATGTTTTG TCAGCTGAGG CCGTTCTTTC TTGGTCTTTA GCTGGTATTG GCTGGATGGC AATTATTGCT GCATTCTGGT GCGCTACTAA CACAACTGTT TACCCTGAAG CTTGGTATGG CGAACCTCTT GCTATCAAAT TTGGAATTTC TCCTTATTGG ATAGACACAG GAAACATGGA TGGTGTTGTT ACCGGTCACA CATCTCGTGC ATGGCTGACT AATGTTCATT ATTATCTTGG ATTCTTCTTT ATCCAAGGTC ATTTATGGCA TGCAATTCGT GCATTGGGCT TTGACTTCAA GCGAGTTACA AATGCTATCG GTAACTTAGA CAATCAAAAA ATTACTCTTA ATGGTTGA
|
Protein sequence | MQTYGNPNPT YGWWVGNSVV TNKSSRFIGS HVAHTGLIAF TAGANTLWEL ARYNPDIPMG HQGMVSIPHL ASIGIGFDQA GVWTGQDVAF IGIFHLICSF VYALAGLLHS IVFSEDTQNS SGLFAEGRPE HRQAARYKLE WDNPDNQTFI LGHHLIFFGV ACIWFVEWAR IHGIYDPAIG AVRQVEYNLN LNAIWNHQFD FLTIDSLEDV MGGHAFLAFA EILGGAHHIA TKMGSGALGE YTEFKGKNVL SAEAVLSWSL AGIGWMAIIA AFWCATNTTV YPEAWYGEPL AIKFGISPYW IDTGNMDGVV TGHTSRAWLT NVHYYLGFFF IQGHLWHAIR ALGFDFKRVT NAIGNLDNQK ITLNG
|
| |