Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_34150 |
Symbol | |
ID | 5000635 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 581224 |
End bp | 583122 |
Gene Length | 1899 bp |
Protein Length | 541 aa |
Translation table | |
GC content | 58% |
IMG OID | 640416056 |
Product | predicted protein |
Protein accession | XP_001416701 |
Protein GI | 145344357 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02346] T-complex protein 1, theta subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00879554 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTACG GGCTGCAGGC CATGTTAAAG GTGCGGGAGA CGCGCGCGCG ATCGCGTCAA CGTCCGCCGC GATGGCGATG CGATGGCGTG ATGATTAAGA CGCGAGCGTC GCGCGACGCG CGCGCGGGAC GGGCGCGCGC GACGTCGAAA CGGTCGCGAT CGATGTTTTG ACGACGTCGC GACGTCGCGA ACGTCGTTCG CGATGCGGCG CGTTGGTGAC GATCGAAGAT CATCAATCGC GCGTCGCGCG CGCGAAGACC CAGAGTCAAC GAGACTGACT GACGGACGAA CGCACGAACG TAGGATGGAC ATAAACACCT GAGTGGACTC GATGAGGCGG TGATACGAAA CATTGAAGCG TGTAAACAGT TGTCGAAGAT CACGCGGACG TCCCTTGGGC CGAATGGCAT GAATAAGATG GTCATCAACC ACTTGGAACG TTTATTCGTG ACGAGCGACG CGGCGGTGAT CGTGCGAGAG TTGGAGGTGG CGCACCCGGC GGCGAGGTTG ATTGTGATGG CGGCGCAATC GCAAGAGAGA GAGATGGGCG ACGGAACGAA TTTTGTGGTG AGTTTTGGGG GTGAGCTGTT GGGATTGGCG GAGGAGCTGG TGCGCGAGGG GTTGCACCCG AGCGAAATCA TCGAAGGATA CGAAAAGGCG GCGGCGAAGG CGTTGGAGTG GATGCAAGAG CTCGTGATTC CGGGCAGTGA AGTTTTGGAC GTGCGTGATG TGAAGGCGAC GGCGGGTCGA ATCAAGGGCA CGCTTAGCTC AAAGCAGCAC GGGTTCGAGG ACAAGTTGTC CATGGTCGTG GCCGAGGCGA GCGTCGACGT GTTGCCCAAG AATCCTTTGA ATTTCAACGT CGACAACGTT CGCACGACGA AGATTCCGGG GAGTTCTTTA TCCGATTGCA CCGTCGTTCA AGGTATGGTC ATTCGACGAG GCGTTGAAGG GACGATTCGA TCGCAGAAGA ACGCCAAGGT GGCGGTGTTC GGATGCGCTG TCGACACGTC GACGACGGAA ACCAAGGGGA CAGTTTTAAT TTCATCAGCG AGCGAACTCG AGGCGTACAG CAAGGGCGAG GAGGCGAAGA TGGAAGAGTA CATTAAAAAC ATCGCCGACA GCGGTGCTAA AGTGATCGTT TCCGGTCAGT CGTTCGGGGA AATGGCCATT CACTTCATCG AAAGATACGG TTTGATGGCA ATCAAAATTC CGTCCAAGTT TGAACTTCGA CGCTTTTGCC GCGCGACGAA CGCGCGAGGT TTAGTCAAAC TCGATCGCCC GGAGGCAGAT GAGCTCGGGT TCGCGTCTAG CATCGAAGTT CGAGAAATCG GTGGCACGCA ATGCATCGTG TTGTCCCAAG ATGATCACAC GTCTCGCGTC GCCACCGTCA TTCTGCGTGG TTCAACGGAG AGCGCTCTCG ACGACATGGA ACGCGCCGTG GACGACGGCG TCAACGCATT CAAAGCGCTC ACGAAAGACT CGCGCACGCT TCCTGCGGGA GGTGCAACGG AAATCGAACT CGCACACAGG CTCGCCGCGT ACGGCCGCAA ACAAACTGGA TTAGATCAGT ACGCCATTCA AAAGTTTGCG CAGGCGCTCG AAATCGTTCC CAGAACGCTC GCCGAAAACG CCGGCGCGAA CGCTACGGAC AGCGTCTACA ACCTCTACGC CGCGCATGCA AATGGCGAGG TAAACGCCGG GATCGATATC ACTGGCGACA ACTCTTACGT CGATCTCGGC GCCACGCAAG GCATTTACGA CGTGTTCCTC GTCAAGTACT GGGCGCTCAA GTACGCCGTC GACGCCGTGT GCACGGTGCT TCGCGTCGAC ACCATCATCA TGTCCAAGTT CGCGGGCGCC GGCGGCGGCG CCGCCCCTCC AGGCGGTGAA GAAGACTAA
|
Protein sequence | MPYGLQAMLK DGHKHLSGLD EAVIRNIEAC KQLSKITRTS LGPNGMNKMV INHLERLFVT SDAAVIVREL EVAHPAARLI VMAAQSQERE MGDGTNFVVS FGGELLGLAE ELVREGLHPS EIIEGYEKAA AKALEWMQEL VIPGSEVLDV RDVKATAGRI KGTLSSKQHG FEDKLSMVVA EASVDVLPKN PLNFNVDNVR TTKIPGSSLS DCTVVQGMVI RRGVEGTIRS QKNAKVAVFG CAVDTSTTET KGTVLISSAS ELEAYSKGEE AKMEEYIKNI ADSGAKVIVS GQSFGEMAIH FIERYGLMAI KIPSKFELRR FCRATNARGL VKLDRPEADE LGFASSIEVR EIGGTQCIVL SQDDHTSRVA TVILRGSTES ALDDMERAVD DGVNAFKALT KDSRTLPAGG ATEIELAHRL AAYGRKQTGL DQYAIQKFAQ ALEIVPRTLA ENAGANATDS VYNLYAAHAN GEVNAGIDIT GDNSYVDLGA TQGIYDVFLV KYWALKYAVD AVCTVLRVDT IIMSKFAGAG GGAAPPGGEE D
|
| |