Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_16088 |
Symbol | |
ID | 5002631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | + |
Start bp | 283074 |
End bp | 284510 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | |
GC content | 61% |
IMG OID | 640418052 |
Product | predicted protein |
Protein accession | XP_001418661 |
Protein GI | 145348449 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4870] Cysteine protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00877055 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTGCGAT ACGAGGCGCA CAACGCGGAT TCGGCGCACA AGTTTCAGTT GAAAGAGACG AAGTTTAGCG ATCTCACGGA GGAAGAATTC GCCGCGAGGG TGTTGACGTA TAAACCGAGG CGACAGTTTG GTGAAACGAT GCTGGGGAAC TCGGAAGATG AGGTTAGCTC GACGTCCGCG CGCGTGGGCT ACGAGGCGGC GGCGTTGAAG TCGCCCGCGG AGATCGCTGT GGCACGGAAG CATTCGCAGC GCGCGCGCGA CCGCACTCAA AGGCGCGAGC GAAACCGGCT GCGTGGACAT GTCGAAGACC ACGGAGAAGT CGACGATAGT GGACCGTTGG GTGATGCGGA CCCATCGATT CCGGCCGCCT TTTCGTGGCG CACGCCGCCA GATGGATACG GAAACGTCGT CGGTGTGGTG CACGATCAAG AGGACTTGTG CGCGTCGTGC TGGGCGTTTG TCACCGCGGA TTCCATCGCG AGCCGAATCG CAATCATAAA CAAGGGCGAC GACGCCCCGG CGTTAAGCGT GAAGCAGCTC ATGGCATGCG ATGCTGTTGA TCACGGGTGT TCGACCGGGA ACATGTACAC CGCGTACGAA TGGATCGGGC AATACGGCGG TATCAGCTCC AAGGCGGATT ACAACGCGAA AGTACCAGGT GACCGAGACG ACGCTCCGGA TGCCAAGTGC GACGCGTCCG TCAAAAAAGT TTACGATACG CCGGCTATGT GTGATTTAGC GCAAGTTGCC GGCGAAGAAC CGCTTTATCG TGCGATCTTC GAGCGAGGTC CCGTCGCCGT TGGCATCAAC GCAAACAAAC TGCAGGCATA CGGCAGCGGC GTCATCATGT TGGATGACTG TAAGCCACTT GGTCGTGGTA TTGAGTCCAT CAATCACGCC GCACTTGTCG TTGGGTGGGG CACGACGGAC GACGGGGTCA AATACTGGGA AATTAAAAAC TCTTACGGCC CAGAGTGGGG CGACGAAGGA TTCTTCAGGC TCGAGCGCGG TCGCATCGGC GAACACAAGT TCGGCACTTG CGGTCTTCTC TTTGAATCCG TCTACCCGGT CGTCACGAAG GCTGGCGATG CGACGTCGAC TGACGCTCCG TGTGTGAAGG GATCAGTCCA AAAGCAAACG TACTATAGAA ACGAGACGCT CAATCCGGGC TTGGGCGACG ATGACGTTGA GGACGAGGCG CGCATCGGCG TCGCGCAGCG CCGCGCGCAC CGAGCCCACG GTCACGCACA CCGCGCGCGA AAAGCTCGCC TAGGCGACGC CGCTTCGTCG CACCTGACGA CGCACACCGA AAACGTCGTC GCCGCGGCCG CCGCTCTGGC CTCGATCGCC GTCCTCGTCG CCGCGGTCGC TCATCGCCGC CGAGCGCGCC GGGACGCCGT CCCCGAATCC GCCGCGCTCC TCGCCGCCGA GCCTTGA
|
Protein sequence | MVRYEAHNAD SAHKFQLKET KFSDLTEEEF AARVLTYKPR RQFGETMLGN SEDEVSSTSA RVGYEAAALK SPAEIAVARK HSQRARDRTQ RRERNRLRGH VEDHGEVDDS GPLGDADPSI PAAFSWRTPP DGYGNVVGVV HDQEDLCASC WAFVTADSIA SRIAIINKGD DAPALSVKQL MACDAVDHGC STGNMYTAYE WIGQYGGISS KADYNAKVPG DRDDAPDAKC DASVKKVYDT PAMCDLAQVA GEEPLYRAIF ERGPVAVGIN ANKLQAYGSG VIMLDDCKPL GRGIESINHA ALVVGWGTTD DGVKYWEIKN SYGPEWGDEG FFRLERGRIG EHKFGTCGLL FESVYPVVTK AGDATSTDAP CVKGSVQKQT YYRNETLNPG LGDDDVEDEA RIGVAQRRAH RAHGHAHRAR KARLGDAASS HLTTHTENVV AAAAALASIA VLVAAVAHRR RARRDAVPES AALLAAEP
|
| |