Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_36075 |
Symbol | |
ID | 5000144 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | - |
Start bp | 318845 |
End bp | 320212 |
Gene Length | 1368 bp |
Protein Length | 447 aa |
Translation table | |
GC content | 53% |
IMG OID | 640415565 |
Product | predicted protein |
Protein accession | XP_001416368 |
Protein GI | 145343519 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00709127 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGAGG TAGAAGTCGC GGTGAGCGCG TGCACGGTGA AGGAGGCCAA ATGCTCGACG TGTAAAACTG GTTACAGTGA CGTCGTGCTT TTACAAGCAT TCGATTGGTT GTCGACGAAG CGTTCGTTAC ACAGCGACAA GTCTTACTAT CGCAGAATTG AGGATCGAGT GCCGATGATA GCGGATTTCG GGTTTACGCA CGTTTGGTTG CCTCCACCGT CGTTATCGGT AGACGAGCAC GGTTACATGC CATCGGAAAT TTACAACTTG GACGGTAGCG AGTACGGCGA TGAAGCGGAG CTTAAATCGT TGGTACAGGC TCTGAAAAAA GCCGGAATAG TAGCCGTGTG CGACATCGTC ATCAACCATC GTTGCGCTGA GTACGCTTCG GATGGCCGCT TCATCTCGTT TGCGGACGAA GTAACGCCGA GCGGGAGACG AATAAATTGG GGAGCTTACG CCATCGTCGG CGACGATCCA TTTTTTCGCG AAGGTCAAGG AGCCAACGAT AGTGGCGACT CGATCGAAAT CGCCCCTGAT CTCGACCACA CAAACGCCGA GATTCGCGAA GCGATCATCG AGTGGTTGAA CTGGTTGAAA GATGACATCG GTTTCAGCGG ATGGAGGTTC GATTTCGTCC AAGGCTACGC TCCGAATTTC GTGAGAGAGT ATGTGGAGAA AACGGTTGGA TTTGAGCAAT TCTGCGTCGG CGAGAACTGG GTCGGGATGA CGTGGTCGGG AAGCTTTCTC GAGTACAATC AAGACAAGCC GAGACGCGTG CTCGTGGATT GGTTGAACGC CGCAGACGAA TGCGCGGCGT TGTTCGACTT CGTGACCAAG GGAATTCTAC AAGAAGCAGT CAAGCGAGTA GAGTTTTGGC GGCTACGAGA CCAGCAAGGC GGCATGCCTG GGCTTGCCGG CTGGGTACCG CAAAGTGCTG TGACATTTCT CGACAACCAC GATACCGGAT ACCCGCAGAA TCACTGGCCG TTTCCACTCG ATCGTCTCGG TTTGGGTTAC GCGTACACGC TTCTGCATCC CGGCATTCCC TGCGTGTTTG GCCCGCACAT TTGGTGCTGC GACGAAAACT TGGGTTGGTC CTAATCGCTA ACATCAGAAA TTCGAGCTTT GTTGAGCTGC CGTAAGCTCG CCAACGTGTG CTGCGAGAGC AGAGTTGACA TCAAAATCGC CGAGAGCGAT TTATACGTCG CGGTCATCGA TGACAAAATC ATCGTCAAGC TAGGGCCGAG ATACGACGTT CCGGGTGAAA TACTCGCTCA AATCGCAGAG TTCGAGCTCG CAACGCACGG CGACGATTAC GCGGTGTGGA TTCGAAAAGA GTTACTCGAA CAACCGTTCG AGGAGTGA
|
Protein sequence | MDEVEVAVSA CTVKEAKCST CKTGYSDVVL LQAFDWLSTK RSLHSDKSYY RRIEDRVPMI ADFGFTHVWL PPPSLSVDEH GYMPSEIYNL DGSEYGDEAE LKSLVQALKK AGIVAVCDIV INHRCAEYAS DGRFISFADE VTPSGRRINW GAYAIVGDDP FFREGQGAND SGDSIEIAPD LDHTNAEIRE AIIEWLNWLK DDIGFSGWRF DFVQGYAPNF VREYVEKTVG FEQFCVGENW VGMTWSGSFL EYNQDKPRRV LVDWLNAADE CAALFDFVTK GILQEAVKRV EFWRLRDQQG GMPGLAGWVP QSAVTFLDNH DTGYPQNHWP FPLDRLGLGY AYTLLHPGIP CVFGPHIWCC DENLEIRALL SCRKLANVCC ESRVDIKIAE SDLYVAVIDD KIIVKLGPRY DVPGEILAQI AEFELATHGD DYAVWIRKEL LEQPFEE
|
| |