Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_45399 |
Symbol | |
ID | 5001416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 152357 |
End bp | 155269 |
Gene Length | 2913 bp |
Protein Length | 916 aa |
Translation table | |
GC content | 54% |
IMG OID | 640416837 |
Product | predicted protein |
Protein accession | XP_001417159 |
Protein GI | 145345314 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1025] Secreted/periplasmic Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTGACGC CCGAACGCGA CGAGAAGCAG TACAGAAGAT GCCGCCTGAG CAATGGTTTG GAAGTGTTGC TGATTTCAGA CGCGACGCTG TGCGGCGCGG ACGACGACGG CGCCGGCGCG GGCGCGAGCG AGGACGATGG TTCGGCCAGC GAGGATGCGT CGGGGATGGA TCACGACGAA GACACGGATG GGGAGGAGGA GGAGGGTGAT GACGATTCCG ACGACGGCGG CGGCTCGAGC GGCGGCGGAA TGAAACTGGC GGCGTGTTCG ATCGCGTTCG ACGTCGGCTA CTTCGCGGAT ACGGACGAGT GCGATGGGTT GTCGCACTTT CTCGAACACA TGGTGTTTAT GGGAAGCGAA AAGTACCCGG GGGAGAATTT CTTCGGCGAG TGGTTGAACG AGCACTGGGG CTCGGACAAC GCGTCGACGG ATAGCGAGAA TACGATTTTT TATTTCGAAT GCAATCCGAA GAATCTACGA GAGGCGCTGG AGATATTCAG TGGGTTCTTT GTCAATCCGT TGGTGAAGCT GGACAGCGTC GATCGCGAGG TGACGGCGGT GGAGAGCGAG TTCGAGCGCG TGGTGAATAA CGATACGGTG CGCGCCGAGT TGTTGTTATC GTCGCTGGCG GCGAAAGGGC ACCCATACAC GAAATTTGGA TGGGGTAACC GCGCGTCGTT GACACAAAGC CCGCCGTATA AGGAAGGTCG CATGCGTGAC GTGTTGTTGG AGCACTGGAG AAGGCATTAT CACGCCAAGC GCATGTCAAT TGCGCTCGTC GGGGCGGAGG ATTTGGACGA ACTTGAGAGT TGGATCGTAG AGATTTTCGG TGACATGCGC GACGACGGGG ACGAAGTGAT TGATTTAAAC ATCGCGCATT CTTCACCGTA TGCGAACGCG GTACCCATCC GCGTGTTAAC GGCGCAAGTG AAGGATGGTC AACACGTTTC TATCACGCAC GAACTCCCGG CTTGGACGCA AAAGAATTAC AAGCACAAGT CGGCGACTTA CATGGAGACC TTGATCGGAC ACGAAGGACA CGGTTCTTTG TTTGCTGAGC TGAAGCGTCG AGGATGGGCG AGCGATTTGC GTTCGGGTGT GGGCGCAGGT GGGATTGATT CGTCTACCGC CGGCGCGCTC TTTGGCACTA CGATCAAGCT CACGGATGAT GGTTTGACTC ACGTCGACGA TGTCATCGGG CTGTTTTTCG CGTACGTCAA TATGCTTCGC GCCAAAGGCC CGCAGGAGTG GTTTTGGAAT GAAATCAAGC AATTGGCTGA TATTGACTTT AGATTTCGCG AACCCGAAGA CGCGAGCGAG TACTCTGAAC GTTTGGTGGC AGATATTCGA AAGTACGCCC CCGAAGACAT CTTGCGCGGA GCGGATTTAT TTGAAACGTA CAAGCCCGAA GAAATTCGAG AAATTATTGA TTTGATGACG CCTCAAAAGG CCATCATCGT CGTGCAAAAT CATGCGTGGA ACGGCGAAGG CGAGAACGTT GAGCACGAGC GATGGATTAA CTTTCCATAC AAAAAGGAGG CTTTAGATTC GGCGCTGTTG GAGACTTGGG CAAACGCAGA CGCGGGCGAG CGCTTACATT ATCCATCACC GAACCCATAC ATTGCGAGTG ATTTCAGGTT ACGATCACCG GCGAGCGAAC ACAAAGACGC GTTGTTTTCA CCCACGATCG TGCACGATTG CAAAGTCAGT CGCATCTGGC ATCGTCTTGA TGACCGATTC AACCAGCCAC GGTCGTGCAT GTACTTTCAA GTATCGCTGC CGCACGTACC TGAGGGTGCA TTCGGTATGA TGTTGATCCA GCTTTTCGTC GCCATGGTGG AAGATTGCGT GAACGAGTCC GTGTATTACC CTGCGCATCT TGCTGGAATG GAAGTCGACA TCGGCGCGTC GGCTTCTTAT TCTGGTTTCG TGCTCTCACT CGAAGGTTTG AGCGACAAGC TCGGCGAGGT CGCGTTGTCG TATTTCAAAA CGATGACTTC GTTAAAGATC GACGCTGATC GATTCGAAAA GCGCAAAGAA GAAAGATTGC GAGACGTCCA TAATCTGTGC TTGAATCCCG CTAGGCACGC AAAGCGCGCG CTCGAGGTAT TGCTCAAGCA AAAGGACGCG ACGCAAGAGG ACAAAGCAAA TGCACTCCAG GAGATGACCG CAGCGGATTT GCAAGCGTTT GCGGATGGAA TTTGGCAGCA TGCGCACGTC GAAAGTCTGA TGATTGGCAA CTTGACAAAG GATGAAGCCT GCGACGTCGG CGAACGCATT CGCGCATGCC TTCCAGGCGC GCCGATTCCT GATAACAGCT GGCCGGAAAC TCGCATCGCG CGCGTACCTC AGGGTGCGCA CTTGTTTAGC ATCAAGGCAA TCAACGCTGA CGAGACGAAT AACGTGGTGC TGTACTATTT TCAGCTCGGT GAGAGCACGT GGCGAGGTCG GGCGTTCATC ATTTTGATGC AATCGCTCAT GCACGAAAAA CTCTTCGACC AGCTCAGAAC GAAGGAGACT CTCGGCTATA GCGTGAGCTG CTCGTTTGAC TCGACGCATG AAATCTTGGG TTATCGAGTT TCCGTCGAGT CCGCGTTCCA CCCGCCGCAT TTCGTTTCGA GTCGCATGGC AGCCTTCTTG CGATCATTTC CCGAGATCCT TGACAACATG GATGACGCTT CTTATGAAAA GACTCGCCAA AGCGTCGTCG ATGATATATT GGCAGACGAT GTCAACTTAC GCGAAGAAGC CATACGACAC TGGGCGCATC TCGTGAATCA AAAGTACCAG TTTCATCGCG GTCGTCATGT GGCGCAAATA ATCTCCGAAA TCTCAAAGCG AGAGGCGGCA GATTGGTGCC GAGAGTTCAT TCAGCCATTC GCACCTGGAA GTAGACACGT GAGCGTGCAC ATT
|
Protein sequence | MDHDEDTDGE EEEGDDDSDD GGGSSGGGMK LAACSIAFDV GYFADTDECD GLSHFLEHMV FMGSEKYPGE NFFGEWLNEH WGSDNASTDS ENTIFYFECN PKNLREALEI FSGFFVNPLV KLDSVDREVT AVESEFERVV NNDTVRAELL LSSLAAKGHP YTKFGWGNRA SLTQSPPYKE GRMRDVLLEH WRRHYHAKRM SIALVGAEDL DELESWIVEI FGDMRDDGDE VIDLNIAHSS PYANAVPIRV LTAQVKDGQH VSITHELPAW TQKNYKHKSA TYMETLIGHE GHGSLFAELK RRGWASDLRS GVGAGGIDSS TAGALFGTTI KLTDDGLTHV DDVIGLFFAY VNMLRAKGPQ EWFWNEIKQL ADIDFRFREP EDASEYSERL VADIRKYAPE DILRGADLFE TYKPEEIREI IDLMTPQKAI IVVQNHAWNG EGENVEHERW INFPYKKEAL DSALLETWAN ADAGERLHYP SPNPYIASDF RLRSPASEHK DALFSPTIVH DCKVSRIWHR LDDRFNQPRS CMYFQVSLPH VPEGAFGMML IQLFVAMVED CVNESVYYPA HLAGMEVDIG ASASYSGFVL SLEGLSDKLG EVALSYFKTM TSLKIDADRF EKRKEERLRD VHNLCLNPAR HAKRALEVLL KQKDATQEDK ANALQEMTAA DLQAFADGIW QHAHVESLMI GNLTKDEACD VGERIRACLP GAPIPDNSWP ETRIARVPQG AHLFSIKAIN ADETNNVVLY YFQLGESTWR GRAFIILMQS LMHEKLFDQL RTKETLGYSV SCSFDSTHEI LGYRVSVESA FHPPHFVSSR MAAFLRSFPE ILDNMDDASY EKTRQSVVDD ILADDVNLRE EAIRHWAHLV NQKYQFHRGR HVAQIISEIS KREAADWCRE FIQPFAPGSR HVSVHI
|
| |