Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_26082 |
Symbol | |
ID | 5004265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | + |
Start bp | 199346 |
End bp | 200285 |
Gene Length | 940 bp |
Protein Length | 272 aa |
Translation table | |
GC content | 59% |
IMG OID | 640419686 |
Product | predicted protein |
Protein accession | XP_001419933 |
Protein GI | 145351119 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4870] Cysteine protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0135575 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCAGCG ATCTCACCGC GGAAGAGTTT GCGGCGCGGT ATTTGGGTCA CGTGCGGTTG TCGAGCGAGG AGAGGGAGAA ACGGAAGGCG AGGGGAGGGG AGACGCTCGA GACGTTGCCG GTGGAACATT TGCCGGAGGA ATTCGATTGG CGATTTAAGG GCGCGGTGAC GCGGGTGAAG GATCAAGGAC AGTGTGGATC GTGCTGGACG TTTTCAACGA CGGGTGCGAT CGAGGGCGCG CACTTCATCA GCACCGGAAA GCTCGTTGAA TTGAGCGAAC AACAACTCGT GGACTGCGAC GTCGGTTGCG ATCCAGACGT GCCGAACGCG TGCGATTCTG GCTGTAACGG CGGCTTGCCG TCGAACGCGA TGGAGTACAT CGTAGAGCAC GGAGGCATCG ATACCGAGAA GTCGTACCCG TACGTCGGCG AGAAGGGCGA GTGCAAGGCG AAGAAAGGCA AACTCGGCGC GACGCTCAAG AACTTCTCCT TCGTCAGCGA CGACGAAAAA CAAATGGCGG CGGCGTTGGT CAAATACGGC CCACTGTCCA TCGGGATCAA CGCTGCGTGG ATGCAAAGTT ACATCGGCGG CGTCGCCTGC CCTTGGCTCT GCGACGCCGA GTCGCTCGAT CACGGCGTCC TCATCGTCGG CTACGGCTCG AGCGGCTTCG CCCCGGTTCG ATGGGCGCCC GAACCGTACT GGATCGTCAA GAATTCTTGG AGTCCGGCGT GGGGCGAGGG TGGATACTAC CGCATATGCA AAGACAAAGG TTCGTGCGGC ATCAACAACA TGGTCGTCGC CGCGCACGGC GTCGACTGAT CGCGCTTCGC GTTCGCCTCG CGTCTTCATG CTTCTCTCGC GCCTCGTCGC CTTCATCGAC GACGTCGCGC GCGTTTCTCG CATTTTCATC TCTGTATCTC ATCATTCTAA AACTATCGTT
|
Protein sequence | MFSDLTAEEF AARYLGHVRL SSEEREKRKA RGGETLETLP VEHLPEEFDW RFKGAVTRVK DQGQCGSCWT FSTTGAIEGA HFISTGKLVE LSEQQLVDCD VGCDPDVPNA CDSGCNGGLP SNAMEYIVEH GGIDTEKSYP YVGEKGECKA KKGKLGATLK NFSFVSDDEK QMAAALVKYG PLSIGINAAW MQSYIGGVAC PWLCDAESLD HGVLIVGYGS SGFAPVRWAP EPYWIVKNSW SPAWGEGGYY RICKDKGSCG INNMVVAAHG VD
|
| |