Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_19801 |
Symbol | |
ID | 5005192 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009367 |
Strand | - |
Start bp | 361119 |
End bp | 362950 |
Gene Length | 1832 bp |
Protein Length | 534 aa |
Translation table | |
GC content | 56% |
IMG OID | 640420613 |
Product | predicted protein |
Protein accession | XP_001421126 |
Protein GI | 145353663 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02347] T-complex protein 1, zeta subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 0.289192 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0151262 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCTCA AGTCCATCAA CCCCAACGCC GAGCTCATGA ACGCCGATGC CGCGCTGTTC ATGAACATCA ACGCCGCCAA GGGTCTCCAA GACGTGATGA AGACGAATTT AGGCCCGAAA GGGACGATCA AGATGCTCGT CGACGGTGCT GGAGGTGCGT GTACGATCGT TTCGTCGATA CATCGCGGTT TCGCGCGTCA CGGCGGCGCG TGAGAGATGC GCGGTGACGG TGATCTGTTG GATTTCGCGC GAACGCGGCG CGGCGTTGAC TCTGTTGACC GAACGACTGA CGGCGATGTT GAAATCGATG TGCGTAGGTT TGAAGCTCAC GAAGGATGGG AACGTGCTGC TGAGAGAGAT GCAGATTCAA AACCCGACGG CGATCATGAT CGCGAGGACG GCGGTGGCGC AGGATGATAT CACCGGGGAC GGGACGACGA CGACGGTGCT CGTCATCGGA GAGTTGTTGA AGCAAGCGGA GCGGTATTTG AATGAGGGTT TGCATCCGAG AGTGATCGTG GAGGGATTCG ATGTGGCGAA ACGCGAGTCG CTGAAGTTTT TGGAAACGTT CAAGCGCGCG GCGCCGGGTG TGGAAGCGCC GGATCGAGAA ATGTTGTTGT GCGTGGCGCG CACGGCGTTG CGAACAAAGT TGCGGGAAGA GTTGGCTGAT AAGTTGACGA CGATCGTCGT CGACGCTGTG TTGTGCATCG CCAAACCGGA AGAGTCGATC GATTTGCACA TGGTGGAGAT CATGACCATG AAGCATCAGA CGGATGACGA AACCAAATTG ATTCAGGGTT TGGTCCTCGA CCACGGGGCG AGACATCCGG ATATGAAGAG ATACGTGGAG GACGCGTTCG TGTTGACGTG TAACATCAGC CTGGAATACG AACGATCGGA GGTGAACTCG ACGTTCATGT ACACCGATGC GGAACAGCGC GAGAAGATGG TCGCCGCCGA GCGCGCGTAC ACCGATGAAA CCGTGCGCAA GGTGATCGCG CTCAAGAAGC AGGTGTGCGA CGGCAACGAC AAGGGTTTCG TCGTCATCAC ACAAAAGGGG ATCGACCCAA TCTCTCTGGA CATGCTCTGT AAGGAAGGCA TTATGGGACT TCGTCGCGCC AAGCGCCGAA ACATGGAGCG TCTCGTCCTC GCATGTGGTG GTCAATGCAT CAACTCTGTT GAGGAACTCT CGCCGGAAAT TTTGGGCCAC GCCGGCGAAG TGTACGAGTA CGTCTTGGGC GAGGAAAAGT ACACTTTTGT GGAAAAAGTC GTCAACCCGA CGTCGTGCAC GGTACTTTTA AAAGGCTCGA ACGATCACAC CATCGCACAG CTCAAGGATG CGGTGCGAGA TGGCTTGCGG GCGGTGAAGA ATGTGTTGAC CGACAAGGCC GTGGTTCCGG GCGCGGGTGC GTTTGAGATG GCCTTGAACA AGCACTTGAA GGAGAACGTC ACCAAGATGG TCGAAGGTCG CGCGAAACGC GGCGTCGAGG CGTTTGCGGA AGCCATGCTC GTGGTCCCGA AGACACTCGC GGAGAACAGC GGTTACGATC CGCAAGACGC CATCATCGAC ATGCAAGAGG AGCACGACAG AGGCAACGTC GTCGGTTTCG ATATCAGCAT CGGCGAGCCC TTCGACCCCA CCATGAGTGG TATCTACGAC AACTTTCTCG TCAAACAGCA AATTCTGCAC TCTGCGCCCA TCATCGCCAC GCAGTTACTC TGCACGGATG AGGTTCTCAG AGCGGGCGTG AACATGCGCA AGCGATGATC GACGCGAGTC AGAAACTTAT AGAATTTAGT GGTGGTGCGT ACGCACGTAC GTAGTAGCAG CG
|
Protein sequence | MSLKSINPNA ELMNADAALF MNINAAKGLQ DVMKTNLGPK GTIKMLVDGA GGLKLTKDGN VLLREMQIQN PTAIMIARTA VAQDDITGDG TTTTVLVIGE LLKQAERYLN EGLHPRVIVE GFDVAKRESL KFLETFKRAA PGVEAPDREM LLCVARTALR TKLREELADK LTTIVVDAVL CIAKPEESID LHMVEIMTMK HQTDDETKLI QGLVLDHGAR HPDMKRYVED AFVLTCNISL EYERSEVNST FMYTDAEQRE KMVAAERAYT DETVRKVIAL KKQVCDGNDK GFVVITQKGI DPISLDMLCK EGIMGLRRAK RRNMERLVLA CGGQCINSVE ELSPEILGHA GEVYEYVLGE EKYTFVEKVV NPTSCTVLLK GSNDHTIAQL KDAVRDGLRA VKNVLTDKAV VPGAGAFEMA LNKHLKENVT KMVEGRAKRG VEAFAEAMLV VPKTLAENSG YDPQDAIIDM QEEHDRGNVV GFDISIGEPF DPTMSGIYDN FLVKQQILHS APIIATQLLC TDEVLRAGVN MRKR
|
| |