Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_28580 |
Symbol | |
ID | 5006496 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009373 |
Strand | + |
Start bp | 107245 |
End bp | 108495 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | |
GC content | 65% |
IMG OID | 640421917 |
Product | predicted protein |
Protein accession | XP_001422393 |
Protein GI | 145356345 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 49 |
Plasmid unclonability p-value | 0.270066 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00000000135874 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGCGAGA GGGAAAAACG GTTGGCGGGT CGCGTCGCGT GGCCGAGGGC GAACGGGACG CGCGCGAAGG CGAAGGCGAG GACGCGCGCG AGCGCGACGA AGGGCGAGGG TTCGAGCGGC GCGGCGCGAG CGAACGGCGA ACGCGCGGCG AAGACGGACG CGGTGAAGGA TGCGGAGGAA GACGTTTCGA ATAATCGACC CGGTGAACGC GGGTTTACGG ATGAAGAGAT CGCCGCGTCG CGCGCGGCGC TCGAGGAGGA ATTGCGGCGC GGCGGCGCGG ACGAAGACTC GTTGTTGGGG TTGCGCACCA TCGGCGCCGC GGGGGCGCGC GAGGGGTGGG ACGACGAGGA GGTGGCCGCG TTCGCGGAGC ACGCGACGCG ATACGGCGAC GATCTCTTTC GCTTGCGCGC CAAGCTGCCG AAGAAATCGA TGCGCGACGT CGTGAACTAT TACTACAACG TGTGGCAGGT CGGGTTCGCG AATTACGGTC GCGTCGACGT CGAGGGCGCG GAGGAACCGG CGCCGCGCGA GCGAGCGCGA CGCGGACCGG CGCCCAAGTA CACCGTCGAA CAGGTGCGAC GGGAGAAGGA TGAAAAGTCG CTGCGCGGAT TCGTGGATTG GATTCGAGGC GTCGCGATCA ACACCAAGCG CGCGATGCTA AACGTTCACC GAGCGCCGAC GACGGCGCGC GTAAAGGGCC ACATGATGAC GCGCTGGCGC ACCGTGACGC GGAGCGAGGA CGCGGACGAC GGCGTCGCGA GAGAGGCGTA CTTGAAGGAT TTGAAGCGAC GCATGACGGC GGCGAGGTTC ACGAAGGAGG AACAAGAGGC GGCGGCGAGA ATGAAGACGA AAGCGAAATC GTCCAAAGCG TCGTCCGCAA AGGTGACGAA GACCGCGTCG AGTGGTGACG CGGCGACGAA GACGAAGAAG ACGGCGTCCG CGCCCGCGGA CGGCGCGCCG ACGCCGAAGA AACGAAAACG CCGAATCGAC GATGGTCAGC CGAAGATTTG TCGAAATTGC CGAGCGATGG AAACGAAACA GTGGCGCTTA CCCGTCGAGG GCGCGGGCGT GCTTTGCAAC GCGTGCGGGT TACGCGATAG AAAACAAGCG AAGAAGAACG AAGCCAGCGC CGCGGGCGAG ACGGAGCCTA CGCCAAAGGA AAATAAGACG CCCGATCGGG GGAAGGATGG TTTGAAGAAG AAACGCTCTC CGGGATTGAA ACCGACGCCC GACCGCAACT TTCAGCTTTA G
|
Protein sequence | MSEREKRLAG RVAWPRANGT RAKAKARTRA SATKGEGSSG AARANGERAA KTDAVKDAEE DVSNNRPGER GFTDEEIAAS RAALEEELRR GGADEDSLLG LRTIGAAGAR EGWDDEEVAA FAEHATRYGD DLFRLRAKLP KKSMRDVVNY YYNVWQVGFA NYGRVDVEGA EEPAPRERAR RGPAPKYTVE QVRREKDEKS LRGFVDWIRG VAINTKRAML NVHRAPTTAR VKGHMMTRWR TVTRSEDADD GVAREAYLKD LKRRMTAARF TKEEQEAAAR MKTKAKSSKA SSAKVTKTAS SGDAATKTKK TASAPADGAP TPKKRKRRID DGQPKICRNC RAMETKQWRL PVEGAGVLCN ACGLRDRKQA KKNEASAAGE TEPTPKENKT PDRGKDGLKK KRSPGLKPTP DRNFQL
|
| |