Gene Information |
Locus tag | OSTLU_34049 |
Symbol | |
ID | 5000580 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | - |
Start bp | 507159 |
End bp | 509088 |
Gene Length | 1930 bp |
Protein Length | 542 aa |
Translation table | |
GC content | 57% |
IMG OID | 640416001 |
Product | predicted protein |
Protein accession | XP_001416968 |
Protein GI | 145344912 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02340] T-complex protein 1, alpha subunit |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.013996 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.104894 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACGAC AATCCGACGT CTTCTTCGGC GAGCGCGAGA GCGGACAGGA CGTGCGACAG ACGAACGGTG CGGCGCGCGG ACGACGACGA GCGACGAAGC GCGAGCGAAC GAGGGACGCG ATCTCGAACG CGATGGGACG AGGACGACGA TCGACGATCG ACGATCGACG GGACGCGCGA CTTCGAGCGG GGAGACGGAG ACGGAGACGG AGACGGAGAC TGACGACGAC GCGCGCGCGA CGCGGGATGC GAACAGCGAC GGCGGTCATG GCGGTGGCGA ACATCGTGAA GACGTCGCTC GGGCCGGTGG GACTGGACAA GGCGCGTGAC GAAGACGCGA AGGCGTTGAC TTTGAGGACG AGACGCGGAG AATCGACGAA ACGATGCGCG CGCGCGGTGA GGGACGAGGA CTGACGGAAT GAAAGGGTTT GACGCGTAGA TGCTCGTGGA TGATATCGGT GACGTGACGA TCACGAACGA TGGGGCGACG ATTTTGAAAC TTTTAGAGAT TGAACACCCG GCGGCGAAGA TCTTGGTCGA GCTCGCCGAG TTGCAGGATC AGGAAGTGGG AGATGGAACG ACTTCGGTGG TGATTCTCGC CGCGGAGCTT TTGAAGCGGG CGAATGAGTT GGTGAGGAAT AAGATTCACC CGACGAACAT CATCGCTGGG TTCAGGTTGG CGATGCGAGA AAGCGTCAAG TACGTCGAAG GTAAGTTGGC GAGAGACGTG GAGACGCTCG GGAAGGAAGC CTTGTTGCAG TGCGCAAAGA CGAGCATGAG CTCGAAAATC ATCGGTGCCG AAGAAGATTT CTTTGCGGAT TTGGTCGTCG ATGCGTGCAC GAGCATCAAG ACGTACAACG ACATGGGCGA CGTCAGGTAT CCCATCAAGG CGATCAACAT TTTGAAGGCG CACGGGAAGA GCTTGAAGGA GTCATCGGTG TTGCACGGAT ACGCCCTTAA CCTCGGTCGT GCGGCGGAAG GGATGCCAAA GTTAGTCAAG AATGCAAAGA TTGCGTGCAT CGACTTCAAC TTGCAGAAGA CAAAGATGTT GATGGGGATT CAAGTGCTGG TGAACGACCC GAAGGAACTC GAAAAGATTC GCGAGCAAGA GTTTGAAATC ACCGCCAATC GCATCAAGAT GATCTTAGCC GCCGGTGCCA ATGTCGTGCT CTGTTCTAAG GGCATCGATG ATATGGCGCT CAAGTACTTC GTCGAGGCTG GGGCTATCGC CTGTCGTCGC GTCAATCGTG ATGATTTGCG CCGCATCGCC AAGGCGACGG GGGCGCAAGT GATGCTGTCT CTGTCCGACA TGGATGGTGG AGAAACTTTC GACGAGTCCA TGCTTGGCAC TGCGGGCGAA GTGGTGGAGC AACGCGTGGC TGATGATGAT ATGGTCGTCA TCAAGGACTG CGCGAGCACC CAATCGTGTA CAATTCTCTT GCGAGGCGCA AACGATTATA TGCTCGACGA GATCGACCGC TCGGTGCACG ACGCACTGTG CATCGTGAAG AGGACGTTGG AAAGTGGCAA GGTTGTCGCT GGTGGTGGCG CCGTCGAAGC TGCGTTGAGC ATTTATCTGG AGAATATGGC GACTACTCTG GGTAGCCGGG AGCAGCTCGC CATCGCCGAG TTTGCCAACG CGCTCTTGGT CATCCCAAAG GTACTCTCTG TCAACGCTGC GAAGGATTCC ACCGATCTCG TGGCCAAGCT TCGAGCTATT CATCATCAAG CACAGAGTCA AGGTAACGAA GAGCTCGCCG GGATGGGCTT GGATCTCGTC AAGGGCGAAC TTCGCGACAA 
CATCGCCAGC GGTGTCCTCG AGCCGGCGTT GAGCAAGGTG AAGAGCATCC AGTTTGCCAC TGAAGCTGCG ATTACGATTC TCCGCATCGA CGACTTGATT CAACTCGAAC CCGAACAGGA GGGTCAGTAG
|
Protein sequence | MRRQSDVFFG ERESGQDVRQ TNATAVMAVA NIVKTSLGPV GLDKARMLVD DIGDVTITND GATILKLLEI EHPAAKILVE LAELQDQEVG DGTTSVVILA AELLKRANEL VRNKIHPTNI IAGFRLAMRE SVKYVEGKLA RDVETLGKEA LLQCAKTSMS SKIIGAEEDF FADLVVDACT SIKTYNDMGD VRYPIKAINI LKAHGKSLKE SSVLHGYALN LGRAAEGMPK LVKNAKIACI DFNLQKTKML MGIQVLVNDP KELEKIREQE FEITANRIKM ILAAGANVVL CSKGIDDMAL KYFVEAGAIA CRRVNRDDLR RIAKATGAQV MLSLSDMDGG ETFDESMLGT AGEVVEQRVA DDDMVVIKDC ASTQSCTILL RGANDYMLDE IDRSVHDALC IVKRTLESGK VVAGGGAVEA ALSIYLENMA TTLGSREQLA IAEFANALLV IPKVLSVNAA KDSTDLVAKL RAIHHQAQSQ GNEELAGMGL DLVKGELRDN IASGVLEPAL SKVKSIQFAT EAAITILRID DLIQLEPEQE GQ