Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_51888 |
Symbol | |
ID | 5006425 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009373 |
Strand | - |
Start bp | 22453 |
End bp | 23799 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | |
GC content | 65% |
IMG OID | 640421846 |
Product | predicted protein |
Protein accession | XP_001422415 |
Protein GI | 145356391 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 0.0830441 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAGA CGCACGCGCA CGACGCGCCG ACGGAGGCGC GCGCGACGGC GCCGCCGACG ACGTCGGTGA CGGTGCTCGC GAACGGGGCG ACGATCGCGA GCGAGAACAC GCCGGGAGCG ACGCTGGCGT GCGGGGCGTA CGTGGACTGC GGGAGCGCGC GCGAGGACGC GCCGTGGAAG CGCGGATTCT CGCACGCGCT GGAGCGCGCG GCGTTCAGGG CGACGAAACA TCGAAGTGGG TTCAGGGTGA CGCGAGAGTG CGAGACGATC GGGGCGAATC TGAGCGCGAG CGCGAGCAGG GAACAGTTTT GCTTCGCGGC GGATGCGCTG AAGACGCGCG CGGCGGAGAC GGTGGAATTG TTGCTCGATT GCGCGCTGAA TCCGGCGTTG GAGAATCACG AGATCGAACG AGTGGTGGAG AATCTGAAGA CCGAGGTGAA GGAGTTGAAC GAGAACCCGC AGGCGTTGTT GATGGAGGCG ACGCACGCGA CGGCGTACGC GGGGGGCTTG GGGCACGCCC TCGTGGCGCC GAGCGGGGAT CTGAGTCACA TCACGGGCGA CGCTCTGAGA GAGTTCGTGC GAGAGAACTT CACCGCTCCG CGCGTCGTGC TCGCGGCGAG CGGGTGCGAA CACGACGAGC TCGTGCGAAT CGCGGAGCCG ATGTTGGCGA CGCTTCCGAG CGGCGAGGGT TCGCCCGAGA CGCCGACGAC GTACGTGGGG GGTGATTTTA GACAAAAGAG CGATTCCCCG ATCACGTCCA TCGTGCTCGG GTTTGAGTTC AAGGGTGGCT GGCGCGACAC CAAGGCCTCG ACCGCGATGA CGGTGCTGAC GATGTTGCTC GGCGGCGGCG GGTCGTTTAG CGCCGGGGGG CCGGGGAAAG GCATGTACTC GCGCCTTTAC ACTCGCGTGT TGAACAGATA TTCTTGGGCG CAAAACTGCA CGGCGTTCCA CAGCATCTTC AACGACACCG GGATCGTCGG GATCTCCGCC ATGGCGAACA GCGCGCACAC CGGTGACATG GTGAAGGTGA TGGCGGGCGA GCTTCAAGCC GTCGCCGCGA GCGGGGGCGT GAGCCCGCAA GAGCTCGAAC GCGCCAAGAA CGCCACGGTG AGCTCGATCT TGATGAACTT GGAGTCCAAG GCTGTCGTCG CGGAAGACAT CGGGCGACAA ATGCTGACTT ACAAGTACCG CAAGAGTGCG GCGGACTTCA TCGCCGAAGT GCGCGCGGTG AGCGCGCAAG ACGTGCAAAA AGTCGCGAGC GACTTGCTCG CGAGCGCGCC CACGGTGGCC ATGACCGGCG AGCTCCACGC CGCGCCGCGT TACGAAGACA TTAAGGCGAT GTTTTAA
|
Protein sequence | MSETHAHDAP TEARATAPPT TSVTVLANGA TIASENTPGA TLACGAYVDC GSAREDAPWK RGFSHALERA AFRATKHRSG FRVTRECETI GANLSASASR EQFCFAADAL KTRAAETVEL LLDCALNPAL ENHEIERVVE NLKTEVKELN ENPQALLMEA THATAYAGGL GHALVAPSGD LSHITGDALR EFVRENFTAP RVVLAASGCE HDELVRIAEP MLATLPSGEG SPETPTTYVG GDFRQKSDSP ITSIVLGFEF KGGWRDTKAS TAMTVLTMLL GGGGSFSAGG PGKGMYSRLY TRVLNRYSWA QNCTAFHSIF NDTGIVGISA MANSAHTGDM VKVMAGELQA VAASGGVSPQ ELERAKNATV SSILMNLESK AVVAEDIGRQ MLTYKYRKSA ADFIAEVRAV SAQDVQKVAS DLLASAPTVA MTGELHAAPR YEDIKAMF
|
| |