Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_49043 |
Symbol | |
ID | 5000865 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 255025 |
End bp | 257014 |
Gene Length | 1990 bp |
Protein Length | 536 aa |
Translation table | |
GC content | 55% |
IMG OID | 640416286 |
Product | predicted protein |
Protein accession | XP_001416598 |
Protein GI | 145344145 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02343] T-complex protein 1, epsilon subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0141345 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGGCGGCCCG CGACGGAACC ATGTCGCTGG CGTTCGATGA GTTCGGGCGA CCGTTCATCA TCATCAAGGT GCGAGGGCGA CGAACGAGCC GTCGCGGTAT CCGCGCGCGC GGCGGACGAA CGAACGGACG CGCGCGCGCG ACGTGCGCGC GCGATGGTGA AGGACTCGCG CGATGCGATC GGGACACTGA CGCTGGAACG CGTTTGTCGT TTACGCGATG CAGGAACAAG GAGCGAAAAC GCGCGTGCGC GGGATCGATG CTCAGAAAGC GAACATCGCG GCGGCGAAGA GCGTCGCTCG AACGCTTCGG TCTTCGCTCG GGCCAAAGGT GAGATGAAGA CGCGGCGACG CGCGCGACGT TTGACTTTTT CGTAAAGCGA AGCGAGACGC GATTCATTCG CGCGGCGACT GACGAGAGGC GACGCGATGT CGCGAAATAT CGTAGGGTAT GGATAAGATT TTACAGTCTG GCGACGGCGA CATCACGATA AGTGCGTGTG CGATTTCGAA ACTCGAGGCG AAGCGCTCGC GAGGTTTGAC TTTCGAACGC GATGACTGAC GGTCATTTTA TCTCGCAGCA AATGATGGAG CGACTATTTT GGATCAAATG GAAGTCGAAC ATGAGATCGG TAAGCTCATG GTTGAGCTGT CCAAGTCGCA GGACTACGAG ATCGGTGATG GGACGACCGG CGTGGTGGTT TTAGCGGGCG CGCTGTTGGA GCAAGCCGAG TCACTTCTTG ACCGCGGTAT CCATCCCCTT CGAATCGCGG AAGGCTACGA GATGGCGTCC AAGGTGGCGA CAAAGGAGCT GGCAAGAATC AGCGAAAAGT TCGAATTCGA TGCGGAGAAT ATCGAACCGT TAATTCAAAC GTGTATGACG ACGCTGAGTA GTAAGATTGT GAACCGGTGC AAGCGCGAAA TGGCAGAGAT TTGCGTGAAG GCTGTGATGG CGGTGGCCGA CTTAGAACGT AAGGATGTGA ACTTAGACTT GATTAAGGTT GAAGGCAAGG TGGGCGGCAA GCTAGAAGAT ACCATGTTGG TGAACGGTAT CGTGTTGGAC AAGGACATCA GCCACCCGCA GATGGCGAAG GAGATTAAGG ACGCAAAGAT CGCCATCTTG ACTTGCCCTT TTGAGCCGCC AAAGCCGAAG ACGAAGCACA AGATTGAAAT CGACACGGCG GAAAAGTACG AGGAGCTCCG TCAGCAAGAA GAAAAGTACT TTAACGACAT GGTGAAGCAA TGCAAGGACT GTGGCGCGAC GCTCGTCATT TGCCAGTGGG GTTTTGATGA CGAGGCAAAT TCGATGCTCA TGCAGCAAAA GCTTCCCGCC ATTCGCTGGG TCGGTGGTGT CGAGCTTGAG CTTTTGGCTA TCGCCACTGG CGGTCGTATC GTACCGCGAT TCACGGAGTT AACGCCAGAG AAACTCGGTT CAGCGGGCAT GGTGAAGGAG GTTTCGTTTG GAACGACTAA GGAACGCATG GTAATCATTG AAGACTGCGC CGCGAGCAAA GCGGTCACAG TCTTCGTGCG CGGCGGCAAC AAAATGATGG TTGATGAAAC GAAGCGTTCT CTGCATGACG CGATCTGTGT TGCTCGCAAC TTGGTTCGAT CGAACAACAT TGTGTACGGC GGCGGTTCTG CTGAAGTTGC GTGCGCAATC GCCGTCGAGG AGGAAGCCGA CAAGATTCCA AGCGTCGAGC AATACGCCAT GCGCGCCTTC GCGGACGCTC TTGACGCTGT TCCGAACGCT TTGGCCGAAA ACAGCGGCCT TCCTCCGATC GAGAGCGTGG CCACGATCAA GGCGCAGCAA TTGAAGGACA AGAATCCGTT CCTCGGCGTC GACTGCAAAG AAATTGGCAC CAACGACATG AAGTCTCAGG GCGTGTTCGA GACTCTCATC GGCAAGCAGC AACAAATTTT GCTCGCCACG CAAGTCGTCA AGCTCATTCT CAAGATTGAC GACGTAATCT TAGCGGGAGA AGGGCAGTAG
|
Protein sequence | MSLAFDEFGR PFIIIKEQGA KTRVRGIDAQ KANIAAAKSV ARTLRSSLGP KGMDKILQSG DGDITITNDG ATILDQMEVE HEIGKLMVEL SKSQDYEIGD GTTGVVVLAG ALLEQAESLL DRGIHPLRIA EGYEMASKVA TKELARISEK FEFDAENIEP LIQTCMTTLS SKIVNRCKRE MAEICVKAVM AVADLERKDV NLDLIKVEGK VGGKLEDTML VNGIVLDKDI SHPQMAKEIK DAKIAILTCP FEPPKPKTKH KIEIDTAEKY EELRQQEEKY FNDMVKQCKD CGATLVICQW GFDDEANSML MQQKLPAIRW VGGVELELLA IATGGRIVPR FTELTPEKLG SAGMVKEVSF GTTKERMVII EDCAASKAVT VFVRGGNKMM VDETKRSLHD AICVARNLVR SNNIVYGGGS AEVACAIAVE EEADKIPSVE QYAMRAFADA LDAVPNALAE NSGLPPIESV ATIKAQQLKD KNPFLGVDCK EIGTNDMKSQ GVFETLIGKQ QQILLATQVV KLILKIDDVI LAGEGQ
|
| |