Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31795 |
Symbol | |
ID | 5001778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | + |
Start bp | 544398 |
End bp | 547376 |
Gene Length | 2979 bp |
Protein Length | 992 aa |
Translation table | |
GC content | 54% |
IMG OID | 640417199 |
Product | predicted protein |
Protein accession | XP_001417805 |
Protein GI | 145346665 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.163312 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGTACT ACGTGGCGGA GAGCCAGAAG CCGAGGGAAC ACGCGGCGCT GGCGCTGGCG GTGGACGCGG GGTCGGTGTT CGAGGGCGAG GGCGAGCGAG GGGCGGCGCA CGTGGTGGAA CACCTGGCGT TTCGATGCAC GGAATCGTAC GAACACTTTG CGATTGTGAA CTTTTTAGAG TCGATCGGGG CGGAATTCGG TGCGTGCTCG AACGCGTACA CGAGCATGGA TGAGACGGTG TACGAGTTGA CGATTCCGAC GCAAAAGGCG GAAGTGTTGG CGACGTCGAT GCATATTTTG AGCGAATTCG CGAGCGCGGT GCGGATATCG AACGAGGATG TGGCGTGCGA ACGAGGGTCC GTGATGGAAG AATGGCGTTT AGGACGGGAC GCGCGAGGAC GCGCGGCGGA GGCGTATTGG AAGACGTTGA TGGAGGGGTC GTTGTACGCC GAACGCTCGC CCATCGGATT GGAGGACTTT ATACAAAACG CCGACCCGCA GGTTTTGCGA GACTTTTACG CCAAATGGTA CCGACCTGAA CGCATGGCGG TGATTGCGGT TGGAGATTTT CAAGACCTGG ACGACGTCGT GAGCCTGATC GAGAGCACGT TTCAAGACTT GAAGCCGAAA GAAGGGCAGC CCGCGGAGAA TCCAGTCATG GAACGACCAA AAAACTCCGC GATGGAGCAT TCCGAACCGC GCGTCGTGAC GCACGTCGAT CGCGAGTTGA AGCAGACGGC GGTGACAGTG ACGTTCAAGT ACGCGAGTAT TCCGGTGGAC ACTCCGCGCG GGTATTATTT GAAGACGGTC GAGGATATTT ACAAGACGGC GCTCGATAAT CGATTGTATC GCATGATGCG TCAACCAAAG CCCCCATTTT TCAGCGCGGG TGGCATCATC GAGGACGCGA CGAGGACGAC GACTTTACTC AGCGTGCAGG CGACGTGCGC GGAGAGCCGT GCGAGCACAG GTTTAGAGGC GTTACTTCGC GAACTCGCGC GTATTCGATT GCACGGAATT TCGGAGCAAG AGTTGAAAAT CGCCAAGTCG CGCATGCTCG CCGATACAGA GCAATTGTAC GCAGAACGCG AGCAGACATA TTGTGAGTCT GTTCGCGATG AGCTAGTGTG CCATTTCTTG CGCGGTGATC TCGTCATCGG AGCTGAAGAC GAGGCGGCTC TTGCCAAGGC GTGCATTGAG CGCGTGTCAC AAGAAGACGT GTTGGCGTTT GCGCGTCAAT TGAACGTGCG TAACTCGTGC GTGATTCGCG TACAAGAAGG TAGAAAGCGT ACAAGTGAAG ATGATTTGCG AGAAGCGATC GAGAATGTCC GCTTGAGGGA AATTGAGGGT GCAATTGACC AAAGCGAAGT GTTTGATATT CCCGAGGTAT TGATGGACGC GACTTCATTG ACTTCTGGCA CCATCGTCGG CTCGCGAGAG TTACCGGCGT TGGAGGTGAA TGAGATCACC CTGAATAACG GTATGCGCAT CGCCATTCGC GTGACTGATT TTCTTGACGA TCAAGTCCTC ATACGTGGTG TCGCACGAGG TGGCCTTTCG GAGGTAGCGC AGATTGATTA CATCGATGCG ATGTGCTCAA ACATGGTCGC CAGCGAACTT GGCATCTACG GCCATCGACC GGATGTCTAC GACGGTATCA TAGCGGGTCT AAGATCGGAC GTGCACGCCA ACGTAACCAT GTATCGCCGT AATATTGAAG GTGAAACATC ACCAGTGGAC ATCGAAAGCG CGCTGCAGTG CATTCATCTT TTGTTCACGC ACGACGTGAG CACGACGAAT GATCCGGAAG TTTTAGAGAC ACTGATGCAG ATGCAGGAGG AAAAAATTAG AAATCAAAGT CGAGACCCCG AAAGTAAATA TAGCGAGGTC GTTCGCTCGC TCGTCTACGG CGAGTCGTAC CATAGTCAGC GAATTACCGT CAAGTCGTTG CGTGAGATGG ACAGCAAAAA GGCTTGCGCG TTCTTCGACG CTTGCTTCTT GGATCCGTCT GAATTTACCA TGGTTTTCGT CGGAGCGATC GATTCGAAGA CACTCGTTCC GCTCATCGAA AAGTATCTCG GCTCGATTCC GCCGGCGTCA CCCACCAAGG TTCTCAAGGC CTTTGAAGGT ATTAGTCAAC GCAAACGCAG CTTGACACCG TTCCTGCTGA AGTTCCCGAC GCGCGTCATC TCGCGCACTG TGCGAGCGCA CATGCGGGAA GGGATGTCTA AGGCGTCGAT TACGTTTCCC GTGCGCATAC AAAATCCGGA CTTTCACAAC AGTCGCGGAC GTTCGACGCT CTTGGGCGGA AAAGAGTTGA CCGTGGCAAA GTTTAAGACG GTCATGACGG CGGCAATCAT CGAGAGACGA TTGCTGGCTT TGCTGAGATT TGAATACGGC GAGATTTACA CCTGCCACGC GGATGCATCG TTCGGCTACC AAGACCCGGA TGTCGCTGGT GAAATGTACT CGGGCGATAT CATGGTATCA TTCTCGTGCG CTCCGGAGCG AGGCGCTCAC CTCGCGGCAC ACGCCCGAGA AGTCGTGAGA CATCTTCGCG AACACGGTCC GACGGAGGAA GACGTGCACG CCGTTCGCGA ATGCGAAATT CGAGACTTTG AAGTCAGTCG ACAAGAGAAC ACATTTTGGC GCGAGTATAT CACCGAACTC TATAAATCGC GGATGATGCA CAAGAGTATT CTGAACGGCG ATATCGAAGC GCTATATCGA ATGACTGAAG AAGTGCGAGA GGAAGTTATC GAGTCCCTCT CCCCGGCGGT GATTCGCGAG CATTTACAAT GCGTCATGAG CATGAATAAT TCCGTTACCG TCGTTCTCAA GCCGCAGCGA TCGCTCTTGC GACGCATCTT CGTTCCATCG TTCGAAACCC GCGGAGAGGC GATTTACTCC GCGGTTTACT TATCAGGAAT CGCGCTCACT GCGAGCGCGA TATATGCGAG ATGCCACAAG AAGGACTGA
|
Protein sequence | MAYYVAESQK PREHAALALA VDAGSVFEGE GERGAAHVVE HLAFRCTESY EHFAIVNFLE SIGAEFGACS NAYTSMDETV YELTIPTQKA EVLATSMHIL SEFASAVRIS NEDVACERGS VMEEWRLGRD ARGRAAEAYW KTLMEGSLYA ERSPIGLEDF IQNADPQVLR DFYAKWYRPE RMAVIAVGDF QDLDDVVSLI ESTFQDLKPK EGQPAENPVM ERPKNSAMEH SEPRVVTHVD RELKQTAVTV TFKYASIPVD TPRGYYLKTV EDIYKTALDN RLYRMMRQPK PPFFSAGGII EDATRTTTLL SVQATCAESR ASTGLEALLR ELARIRLHGI SEQELKIAKS RMLADTEQLY AEREQTYCES VRDELVCHFL RGDLVIGAED EAALAKACIE RVSQEDVLAF ARQLNVRNSC VIRVQEGRKR TSEDDLREAI ENVRLREIEG AIDQSEVFDI PEVLMDATSL TSGTIVGSRE LPALEVNEIT LNNGMRIAIR VTDFLDDQVL IRGVARGGLS EVAQIDYIDA MCSNMVASEL GIYGHRPDVY DGIIAGLRSD VHANVTMYRR NIEGETSPVD IESALQCIHL LFTHDVSTTN DPEVLETLMQ MQEEKIRNQS RDPESKYSEV VRSLVYGESY HSQRITVKSL REMDSKKACA FFDACFLDPS EFTMVFVGAI DSKTLVPLIE KYLGSIPPAS PTKVLKAFEG ISQRKRSLTP FLLKFPTRVI SRTVRAHMRE GMSKASITFP VRIQNPDFHN SRGRSTLLGG KELTVAKFKT VMTAAIIERR LLALLRFEYG EIYTCHADAS FGYQDPDVAG EMYSGDIMVS FSCAPERGAH LAAHAREVVR HLREHGPTEE DVHAVRECEI RDFEVSRQEN TFWREYITEL YKSRMMHKSI LNGDIEALYR MTEEVREEVI ESLSPAVIRE HLQCVMSMNN SVTVVLKPQR SLLRRIFVPS FETRGEAIYS AVYLSGIALT ASAIYARCHK KD
|
| |