Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_49610 |
Symbol | |
ID | 5002031 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | + |
Start bp | 234469 |
End bp | 236301 |
Gene Length | 1833 bp |
Protein Length | 576 aa |
Translation table | |
GC content | 56% |
IMG OID | 640417452 |
Product | predicted protein |
Protein accession | XP_001417715 |
Protein GI | 145346481 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00404596 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000402343 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | CGCCCCTCGC CCTTCCGCGT CGCGCGCGGA CGAACGCGAA CGCGTTCGCG CGATATGTCC GCGCTCGATG TCTGCGTCGC CGCGTGCGGC GCCGCCGCGG ACGCGACGTG CGCGACGGCT TGCTTCACCG CGTCCGGCTG GGAGATTCCC GCCGAACCGT TCCTCCCGAG GACGCCGAGC CTGATCCTGG GGTGCTGCCT CATCCTGCTC TCCGCCCTGT TCAGCGGCCT CACGCTCGGG CTGATGTCGC TCGATCCCGT GGGGCTGGAG ATCATCGCCG AGGGCGGCGA CGCCGAGGAG CGCGAGTACG CGAAGCAAAT CATTCCGGTG AGGAAAAACG GAAACCTGTT GCTGTGCACG CTGCTGCTCG GAAACACGGC GGTGAACTCC ATGATATCGA TTTTGATGGC GAGCGTGACG AATGGGATCA TGGGCTTGTT GGTGTCGACG CTGAGCATCG TGATTTTGGG GGAGATTACG CCGCAGGCGC TGTGCTCGCG GCACGGGTTG TACATCGGGG CGAAGACGAT TTGGATCATG AAGTTTTTCA TAATGTTACT GTTCGTCGTC GCGTGGCCGA TATCGCTCGT GCTCGATCGC ATACTCGGGG TCGACATAGG GACCTTTCAC ACGACGGAGG AGTTGAAGCA CTTGGTGCGC GTGCACGTGG AGAAGCCGCA AGGCCAGGAG GAATCGGGGT TGAATCAACA AGACGCCACG ATGCTCACGG GGGTTTTGGA GTACAAGCAC ATGACGGTGG CGGACGTGAT GACGGATCTA GACAAGGTTT ACATGATTGA ACTGAACACG AAAATGTCTT TCGCCGTGTT GATGGATATT TACAAGAGCG GGTTCACACG CATTCCCGTG TACGAGGGCA CTCGCTCAAA CATCGTGGGG ATTTTGTTCA CGAAAGATTT GATTCTCATC GACCCAGACG ATGAAATCGA ATTGTCCGCA ATCTTAGCGT TCCACGGCGG TAAGAATGGT GGGTACATTC GCTATGTTAG CGATAACACG ACTTTGGACA AGGTGTTCCT CGAGTTCAAG ACGGCTCGCA TGCACTTGCT ATGCGCGCAC TCCGAAGACG GGCCGCCGCG CAAGGATGGA TCAAACGCTC AAGTCACGGG TATAATCACG CTCGAAGATG TGCTCGAAGC GCTCATCAAG GACGAAATTA TCGACGAGAC GGACAACTTG ATTGACGTAA ACGAGCCAAC GTCAATCGTG GAAAGGCGAG TGACGTTTCG CGGCGCCGAT CCGACCAAGT TTATGAGCGT CTTCGAACAC AAGATGAACG AAGAAGAGAA ACTCGGCGAG AATGAAGTGA GTGCGATCGT CGCGTTCTTA TCGTCGAACG TGGCGGAGTT TAAAACTCTC GGCGAATACC ACAAAGTGCT GCGCAAACTC ATCGAAACAT CAAATGTCGT AGAAAACGAT GACACGAGCA GTAGCGATAG CGAAAATAGT ACGATGGGGA CACCGGGCGT GCACAGGGGA CGCGAATACG ACGAAGACCT CTTGTACAGA GCTGGAGAGC CATCAGACGT TTTTACGCTC GTCCTTCAAG GTCAAGTCAA AATCTTCGCC GGCTCCGAAG ACTTTGAGTC TGAGCTCGGT CCTTGGTCGT ACATAGGACA AAATGCGCTC ATCACAGACC CGTACGTTCC TGATTTCCGC GCGTACAGTT GCGGTGGAAC GAGGGTGTTG AAGATTGCTC GTGCGGACTA TAAAGCCGCG CTGGCGAGCG CGGCGGTGAA AGCCATGGGC GCGGGCGCGA AGAAAAGAGT GCAGCTCGTG GGATCGAAAA GCTTTTCCGA GACCGAGCGC TGA
|
Protein sequence | MSALDVCVAA CGAAADATCA TACFTASGWE IPAEPFLPRT PSLILGCCLI LLSALFSGLT LGLMSLDPVG LEIIAEGGDA EEREYAKQII PVRKNGNLLL CTLLLGNTAV NSMISILMAS VTNGIMGLLV STLSIVILGE ITPQALCSRH GLYIGAKTIW IMKFFIMLLF VVAWPISLVL DRILGVDIGT FHTTEELKHL VRVHVEKPQG QEESGLNQQD ATMLTGVLEY KHMTVADVMT DLDKVYMIEL NTKMSFAVLM DIYKSGFTRI PVYEGTRSNI VGILFTKDLI LIDPDDEIEL SAILAFHGGK NGGYIRYVSD NTTLDKVFLE FKTARMHLLC AHSEDGPPRK DGSNAQVTGI ITLEDVLEAL IKDEIIDETD NLIDVNEPTS IVERRVTFRG ADPTKFMSVF EHKMNEEEKL GENEVSAIVA FLSSNVAEFK TLGEYHKVLR KLIETSNVVE NDDTSSSDSE NNLLYRAGEP SDVFTLVLQG QVKIFAGSED FESELGPWSY IGQNALITDP YVPDFRAYSC GGTRVLKIAR ADYKAALASA AVKAMGAGAK KRVQLVGSKS FSETER
|
| |