Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_07201 |
Symbol | hslO |
ID | 4717424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 640410 |
End bp | 641318 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640078434 |
Product | Hsp33-like chaperonin |
Protein accession | YP_001009113 |
Protein GI | 123968255 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1281] Disulfide bond chaperones of the HSP33 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGATA GGATAGTTCG GGCTACTGCA GCCAATGGAG GAATAAGATT AGTTGCGGTC TTAACAACAG AATCTTCTTT AGAAGCAAAA AAAAGACACG GCCTTTCTTA CTTAACCACC TGTATCTTAG GCAGAGCATT TAGTGCTTCA CTGCTTTTAG CAAGCTCGAT GAAAATAATG CATGGGAGAG TCACTTTAAG AGTTAGATCT GACGGACCTT TAAAGGGATT ACTAGTTGAT GCAGGAAGAG ACGGGAAAGT TAGGGGTTAT GTAGGTAATC CTAATTTAGA ATTGGACCTA GTCAAAATAG ATAATGATAA ATATTCTTTT GATTTCACAA AAGCACTAGG TACAGGATAT TTAAATGTAA TTAGAGATAG TGGATTTGGA GAACCCTTTA CAAGCACTGT TGAATTAGTA AATGGGAATA TTGCTGAAGA CTTAGCTTCA TATTTATATC ATTCAGAGCA AACTCCCTCT GCTGTATTTA TTGGAGAAAA AATTCAAAAT AAAAGTGTTA TTTGTAGTGG TGGCTTATTA GCTCAAGTTT TACCTAAAAA AGATACTGAC CCTCTGCTAA TCTCACTACT TGAAGAAAGA TGCAAAGAAA TTAATTCTTT CAGCGAAGAT CTATTTAAGT CAAAACATAA TCTTCTTGAG TTAATTAGAA ATATATTTCC CGATATTGAC GATAAATCAA TCTCTGAAAA AGCTCGTTCT CAAGAAGTGA GTTTTAAATG CAAGTGTTCC AAACAAAGAA GTTTAAATGC GATGAAAATG CTTGATAAGA GCGAGTTAGA GGACATCCTC AAGAAAGATG GCAAAGCAGA GTTGGTTTGT GAATTTTGTA AAAATAAATA TCTTATAAAT TTTGAAGAAA TTAAATCTAT GATAGAAAAT CAATCATAA
|
Protein sequence | MQDRIVRATA ANGGIRLVAV LTTESSLEAK KRHGLSYLTT CILGRAFSAS LLLASSMKIM HGRVTLRVRS DGPLKGLLVD AGRDGKVRGY VGNPNLELDL VKIDNDKYSF DFTKALGTGY LNVIRDSGFG EPFTSTVELV NGNIAEDLAS YLYHSEQTPS AVFIGEKIQN KSVICSGGLL AQVLPKKDTD PLLISLLEER CKEINSFSED LFKSKHNLLE LIRNIFPDID DKSISEKARS QEVSFKCKCS KQRSLNAMKM LDKSELEDIL KKDGKAELVC EFCKNKYLIN FEEIKSMIEN QS
|
| |