Gene Information |
Locus tag | OSTLU_41292 |
Symbol | |
ID | 5002202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | + |
Start bp | 526157 |
End bp | 527857 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | |
GC content | 56% |
IMG OID | 640417623 |
Product | predicted protein |
Protein accession | XP_001418266 |
Protein GI | 145347631 |
COG category | [R] General function prediction only |
COG ID | [COG1752] Predicted esterase of the alpha-beta hydrolase superfamily |
TIGRFAM ID | |
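The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and that the final codon of the unspliced CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(526157, 527857)
print(length)                     # compare with the Gene Length field
print(protein_length_aa(length))  # compare with the Protein Length field
```

Under these assumptions the results match the 1701 bp and 566 aa reported in the record.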
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.278724 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGCGG AGGTGGCGCT GTTGAGCGTG AACGCGATAC TGTTGGGATT GGTGCATTTA GCGATCGTGC TGAACAAGCA CGTGCTGCCG TCGTGGTTTC ATCGGGCGAA GGTGGCGCTG CGCGAGAGCG CGGGCGGCGT CGACCATCGC GGGCGCGCGA ACGCGCACTT CGTCGAGCAT CGGCAAGAGA CGTTGTCCGA GCTCGAGCGG GCGAAGGACT ACCGAGAGTG GATCGCGGTG GCGAATAAGC TCGACGCGTT TCCCGCGGAC GTCGGTGAGG GTGGGTCGCG TTGGAAGGCG GATGAAAAAT CGGACGTGTA CGACAGGCAG CTCGTCAAGG TATATTTGAA CACGATGAGG ACCGCGCGCG AGCGCGACGA TCTCACGGCG CTCGGGTTGT GCTTGCGAAC GGTGTTACAC CGAAACTTTG CCGGGATCGA TAGGTTGCTC CACCTGAGAT TTTCTCGAGT CGGAACGAAG AAACTCGTGC AAGATTTCAA CGACGAAATC GTCGCCGTCA TCAAGCACAT AGCCGAAACC GCAGATGACG AGCAGGCGAT GGAGATGCTT CAAGTGTTAA AAGAGTCGTA CCGTTCGCTC GGACGCACGG CGCTGTGCTT ATCTGGCGGC GGCGCACTGG CGATGTATCA TTTCGGAGTC TTGCGAACGT TGTTGCAAGA GGGACTGTGC CCGCAAGTCA TCTCGGGTAC GAGTGGTGGC TCGATCGTCG CCGCGTTTTT GAGCTGTCAT TCGCCCGAAG ATATCTTGCA AGCGATTCGC CCGGACGTGT CCACTCGTTA CGGGCGCCGA TGGTTCCCAC GTCCGCTGAA AATGGCGCTA CACTTTTTGA AGCACGGCGT TTTGATGGAC GCCGAGGGGT TTTCGAAGAC CACAAAGGCA TACTTTGGCG ACACGACGTT TGAGGAAGCG CTAGCGATAT CCGGAAGAGC AGTCTCTATT CAAGTGTCCA TTGGCTCACA AACCGGTTAC GTTCTCAATC ACCTGACTTC TCCCAACGTG CTCATTCGCA CCGCGGTGTG CGCGTCGTGC GCGCTACCTG GTTTGATGCG CCCTGTGGAG ATTCTCGCCA AAGACAAGCA CGGAAATTTA GTGCCATTTC ATCCACCCGA TGTGAAGTCG TACGATGGCA CCATCACCCA AGACATCCCG TCAGCACGCA TGACGGAGCT TTTCAACTGC AACAACTTCA TCGTTTCGCA GGTGAACCCG CACCTAAACT TCGTGCTACA CCTCGCCGAG GAATCACACG GGCGACGGCA GAAGACCGCG CGTAGTTATC AGCGTCGTAA TGCGGTTCAG AAGCTGTTGC GTGTCGCCAA CTTCTTGCTG CTGAACATCA AGTACTCTAT TCAAAAACTT CTCGAAGTTG ATTTGCTCGA CATTCGGTTC GTTCGCACGC TGCAAGGAGT GTTGATGCAA GACTTCAGAG GCCACATCAC CATTCTGCCA TCGCTTCGTT GGACGGATTA CTCCAGAATC TCCAGCAACC CAACTGAGAA GGATATGGAT CGTTACATCT CGCGAGGCGA GCAATCCACT TGGCCGCATG TCGAGTCGAT TCGCTACACG ATGAAAATTG AAACCACGCT CGTCGACAGC ATCCGCGCTT TGTCCAAGCG CGTCGACGCT ACGGGCGAAA AGATCAAACG AACGAACTCA CATGGGTTCT TAGAACATTA A
|
Protein sequence | MAAEVALLSV NAILLGLVHL AIVLNKHVLP SWFHRAKVAL RESAGGVDHR GRANAHFVEH RQETLSELER AKDYREWIAV ANKLDAFPAD VGEGGSRWKA DEKSDVYDRQ LVKVYLNTMR TARERDDLTA LGLCLRTVLH RNFAGIDRLL HLRFSRVGTK KLVQDFNDEI VAVIKHIAET ADDEQAMEML QVLKESYRSL GRTALCLSGG GALAMYHFGV LRTLLQEGLC PQVISGTSGG SIVAAFLSCH SPEDILQAIR PDVSTRYGRR WFPRPLKMAL HFLKHGVLMD AEGFSKTTKA YFGDTTFEEA LAISGRAVSI QVSIGSQTGY VLNHLTSPNV LIRTAVCASC ALPGLMRPVE ILAKDKHGNL VPFHPPDVKS YDGTITQDIP SARMTELFNC NNFIVSQVNP HLNFVLHLAE ESHGRRQKTA RSYQRRNAVQ KLLRVANFLL LNIKYSIQKL LEVDLLDIRF VRTLQGVLMQ DFRGHITILP SLRWTDYSRI SSNPTEKDMD RYISRGEQST WPHVESIRYT MKIETTLVDS IRALSKRVDA TGEKIKRTNS HGFLEH