Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_14769 |
Symbol | |
ID | 5001012 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 783474 |
End bp | 786191 |
Gene Length | 2718 bp |
Protein Length | 905 aa |
Translation table | |
GC content | 63% |
IMG OID | 640416433 |
Product | predicted protein |
Protein accession | XP_001416761 |
Protein GI | 145344483 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00506073 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.208503 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGACGC GCGCGAAATC GATCGCCGCG GCGACGCTCG GCGTCGTTTT CGTCGCGCTG TGCGCTGTGC GCGCTGATGG ATTCGCAATC GAGCCGCGGT GCGTCGACGA CGCGCGACGC TACGACGACG TCACGCGACG CGCGCTGACG ACGCGGGCGA CGTGCGCGGA CATCGCGCGC GGCGCGGAGG CGCTCGAAGC GCTCGGAGCG TCGACGTGCG GTGCGGTGGG CGCGCGCGCG CGATTTCTGG CGGCGAAGCT CAACGCGGCG ACGGGCGCGT GGGACGACGC GAAGCGCGCG CTCGCGGGCG ACGAGACGAG CGAATTATGG AAAACAATAG ATGAGATGTC GCGCGCGGCG CGGGTGGGGT CGGAGGCGGG AAGACGAGGG GATTGGGTGA AGGCGTACGC GGCGGCGAAC GCGGCGGTTG AGGCGAGCGC GTGCGCTTTG GATCCGAGGA CGTTTCAAGC GCGCGCGCGC GCGGCGTTGA AGTTGGCGAT GCACGGAAGG GCGTTCGTGG ATGCGAAACG CGCGCTCGCG CTGGGGGGGG GGAAGTCAGA GGCGTACGAG ACGATGGCGA CGGCGTTGAG CGAGTTGGCG GATTCCGCCG AGCGGTTAGC CGACGCCGAG ACGCTGGCGC GGCGTTGTTT GCGGTACAGC CCGGACAACG TCGAGTGTTT GGTAGTTAGA AAAAATATTA GGCGAGCGCT GTTGGTTTGG CGCGAAGCGA GCGATGCGGA ATCGCTCGGG GATTGGAGTT CGGCGATCGA CGCTCTGAGG GAGCTGCGAA ACGCGACGCG CGGCGGCGCG TTTGAGGCGT TGAAATTCGA CGCGCTCGTC GCGTCGTGTC GCGTGAATGG TAAACGAGAA CGCGCGCGTT GGGAGACCAG CACGACGAGA AAGAGGGTTC CCGCCAAACT CGTCGCGGAT GCGATCGATC AGTGCACAGA CGCGTTGTCC GAGCTCATGT CGCGCTGCGG CGAAAGTCGC GTCGACGACG TCCCGAATTC GTATTACGCT CGCGCGTGGA TGCGCGCGCT GAGCGCGAAC GTAGATGGTG CGATGGCCGA CGTAGCGGGG ATTGAACGTT CGGTAGATGT ATCGTCCGAC GATTGGAAGG CGAACGTTGA GGCGTTGCGA AAAGCCATCG AAGAGGCGCG CGAAGCAAAC GCTCCAAAGG ATTTATACGC GATTCTGGGA TTGACGCGCG AAGATGCACA AGCGGAAGAT TGGCTTCGTG TTTTGAAACG TGCGTACAGA AAGCTGGCAC TACTATTACA TCCCGATAAA AATCCGGCGG TCGATAAAGA GGAGGCGGAG GAGAAATTCG ATGAACTCGT CAAGGCGTAC AAGATTCTGT CAAGCGAGAC GCTTCGTCGC GAATACGACG AGACCGGCAA GGTGAACTTG GGCGTTGACG CGCAAGCGAC GAACGACTGG TTCGACAGTC ACGCCGATAG CCGTCGCACG AATGACGGTC AACCGCCAAA CGATGGACTT AACGAGGATG ATTACATCTT CCGGTTTGAT AAGCGCGACG CCGGGGCGGA TGGCCGAGCG GCTGGTCAGT ACGTGCACAA AGAAACGGGC GAACGGGTAT TCGGTGAACG CGACGTGCGC CCAGAAGAAG ATCAACAGGA CGACGTCTGT GCGAAGAAAA AAGGATACTG CATCGCCGGA CGAGGCGGCG CCGAGGCTCC GTCGCGCGCC AAGCACGTTC CGGGGGTCGA ATCGCTCGAG GTGAAAATCG TTACGCCGAA TCTGCTCCCG GGAGATACCG TGGCGGCGAG GCTCGTTCGA AACATCTTTG GACTTCACAA GCTTGAGTTC ATCTTTGCTT TTGACGTCGA GTTACCGAGC GAAGAAGTTC GCGACAAGAC TTTTCAAGAC GCAGCGAAAA CGCGTTTGAG AAGACTCGTG CGTACCCTGC ACGCGTCCCT CGTCGGCCGC GACGGGGCGT CGACGCTCAC GAGCATGATT GAAGAGCACA TCGTGAGTTC GTCGTCTTCA GAAGACGATT CGATCGTAGA CTACGTGGCA TCCGCGATGA GCGGTCGCGC ACCACGCGTC GGATCGCCTC ACGAGACGCG GGATCTTCTG GATCGCTTTG CGTCGCGCGC CACGCGCGAG ATGAGACGGC TCGGCGCCTT GGGACTCAAT TCCGACGATC GCACCGATAA TTTCGTGCGC GTCGCGCGTG ACTTGATGTT CAACCCCTTG GTATCGTCTC ACTTGCTCTC ATTCACCGCC GACGACGCCG AAGACGCCTC ACAGGTTCGA AGTCTCGTCG CCCTCTGGCG CGACGACGTC CCCAGCCGCG TCCTCGTCGA CGGCGACGTC CTCTCCTACG AAATCAAATT CGTCGATTCC GACAAGTCTC CCGACGCCGT GCGAGCGATG TGCGCCACCT TAGACTTCGA AACCACCGCG GGCGACCGCC TCAGCGCCTT CCCCGACGCC ATCGATCAAT TCGGCCTCGC CGCCGCCGTC GCCGCCGTCA ATTTCCGCGA CGTCCTCGCG CGAGACGCCG TAAACGGTTG GATCTCGCGT CGCATCCCCA TCCCTTCCGG CCTCATCGGC GCGCGCGTTT CGTCCTGGTT CATCGTCGCC GCCCGCATCG ACGCCGGTCG CGCGTCCACG CGCGTCCGCG AGGTCAAAAT CATCAACGCC TTCGCGGACG CCGACGACTT CGCCATCCTC GAGCCGCGCG GACCCTGA
|
Protein sequence | MSTRAKSIAA ATLGVVFVAL CAVRADGFAI EPRCVDDARR YDDVTRRALT TRATCADIAR GAEALEALGA STCGAVGARA RFLAAKLNAA TGAWDDAKRA LAGDETSELW KTIDEMSRAA RVGSEAGRRG DWVKAYAAAN AAVEASACAL DPRTFQARAR AALKLAMHGR AFVDAKRALA LGGGKSEAYE TMATALSELA DSAERLADAE TLARRCLRYS PDNVECLVVR KNIRRALLVW REASDAESLG DWSSAIDALR ELRNATRGGA FEALKFDALV ASCRVNGKRE RARWETSTTR KRVPAKLVAD AIDQCTDALS ELMSRCGESR VDDVPNSYYA RAWMRALSAN VDGAMADVAG IERSVDVSSD DWKANVEALR KAIEEAREAN APKDLYAILG LTREDAQAED WLRVLKRAYR KLALLLHPDK NPAVDKEEAE EKFDELVKAY KILSSETLRR EYDETGKVNL GVDAQATNDW FDSHADSRRT NDGQPPNDGL NEDDYIFRFD KRDAGADGRA AGQYVHKETG ERVFGERDVR PEEDQQDDVC AKKKGYCIAG RGGAEAPSRA KHVPGVESLE VKIVTPNLLP GDTVAARLVR NIFGLHKLEF IFAFDVELPS EEVRDKTFQD AAKTRLRRLV RTLHASLVGR DGASTLTSMI EEHIVSSSSS EDDSIVDYVA SAMSGRAPRV GSPHETRDLL DRFASRATRE MRRLGALGLN SDDRTDNFVR VARDLMFNPL VSSHLLSFTA DDAEDASQVR SLVALWRDDV PSRVLVDGDV LSYEIKFVDS DKSPDAVRAM CATLDFETTA GDRLSAFPDA IDQFGLAAAV AAVNFRDVLA RDAVNGWISR RIPIPSGLIG ARVSSWFIVA ARIDAGRAST RVREVKIINA FADADDFAIL EPRGP
|
| |