Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_93862 |
Symbol | |
ID | 5005901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009370 |
Strand | - |
Start bp | 197017 |
End bp | 198738 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | |
GC content | 60% |
IMG OID | 640421322 |
Product | predicted protein |
Protein accession | XP_001422002 |
Protein GI | 145355506 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 56 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.213448 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACC GCGAGCTCCA TCTCGCGCTC GAAAGAGTCG GCCCGCGGTT TACGCTCGGT GACTTACTCC TCTCCGAAGG CCCCGCGCTG GATGATTCCG CGGACGCTCT GCGCCAACGC GTCATCTTTG CCGTCCATCT GAGCGGTGGT CATCTCGATG TCGACGAAGA AACCCGAGAA GTCATCTACG TAGTCCCCGT GCGCGTGCGC GCGGCCATCT TAGCCCGTGA TGTGAGCGAA CGCGCGCGCC GAGCGCGTCG GGCGGCGTGG CGAACGACGA TGCGCGCGGT GAGGGCCGCG TTTGGGTCTT TCCTCGTCGT CTCGGCGGCG TTGACGGCGC TCGCAGTCAT CGCGTTGGTC GTCATCGCGC TTTCTCGAGG CCACCAGGGA CGAGGAGGAG GTGGAGGGTC GACGCCGGTG TTACCCACGT ACGTCGGGCA CGGACGAGGG GTGAACGCGG ATTTCTGGTA CTATCTGTGG ATGCGCGATT TGATTGAACT AGCGTATTGG AACGACGTTA TGCGATTCGA ACGAGCGCGC GCGTTTGATC GCGCGCATGG CGTCGCGGAG GGCGTTCCAG TGAGTAAACC GCGCGTTGGC GGCAGCGGAC ACGGTGGCGG TGGCGGTGGC GGTGACGATG GAGGAAGTGG GGATCCCGCA GGGCGAGCGA ACGCGCCGCC ACCGACAAAC GTCGGCCCGC GAAGAGGCGG CGACGGCGTT GATGGCGATG AAGAAGAGGA AGAAGACGAT TGGCTCGACC GCGATCGTGA GTTGTCATTT TTTGAAAGCA TATTCGCGTT TGTTTTCGGA AGAGGTGATC CGAACGATAA TTTGGAAACC AGGCGATGGC GCGCTGTCGG GGCGCTTTTG CGAGTGAACA AAGGCTGCGT GTTCGCCGAG CAAGTGGCGC CGTTTTTAGA CACGTATCTC CTCACGAAAG AGGATCATAG CGAAGTTCGA AACGGGTTAT TTGCCGTGGT TTTCGACCTT GTCGCGCACG CGCGACGACT TTTCAGAAGG AAGGCGGATG CCGAGCGAGA CGTTCGAAGG ATGCACGAGG GCTACATGCT CGAGGTCTTG ACGCGTTTCG GTGGATTTGC CGAAGCTTCG GATGCTGGAG AGCTCATATA CGTGTTCCCA TCTTTGCAAG TGACCGCTCG CGCGGTCGAA CCCTCTTCGT CGCGACTGAT GCCGTCGCGC AGCGTTCAGG CTCCGACGCC GCCGCCAATT TACGAGCGAG TCCGACCTCT GTGGGAGAGC GGTGCGAAGA TGCCGCTTGT TGTCGCCCTG GGATTTTTGA ACGTAGCACT TATTTTCATC TTCCGTGCCG CCGGTGGTAT GGACTTCAAG CCTCCACGCC AAAGTCAACT TCCACGAAGA GCCGAACAAA CGATGGGACG TCGTGCGGGA AGATTCCGTG ACGCCGCGCC GACGACGAGC ACCGTCCCGA TCGATGACTA CGGCGAACCA CCGTTGGTGA TTCTTATCCT CGAGCTGTTC CCCAAACTGC TCAAACTCCT CATGCCGCTC TTGCTCGTGT ACGCGGGCAT TTTCTTCCTC GTGCCGACGT CACGAGCGCT GTACATCGCC GTCGAGAATC GCCAAATCAA GCGACGAAAC GACGTGAGAA AGAGGCGTGC GCAGGAAATA TTATCAACTA GCGTCCAAAT GATCGACAAG CAGTCGCGCG CAAGGGGCAA GCAAGCTTTA GAAGTGGTGT AA
|
Protein sequence | MSDRELHLAL ERVGPRFTLG DLLLSEGPAL DDSADALRQR VIFAVHLSGG HLDVDEETRE VIYVVPVRVR AAILARDVSE RARRARRAAW RTTMRAVRAA FGSFLVVSAA LTALAVIALV VIALSRGHQG RGGGGGSTPV LPTYVGHGRG VNADFWYYLW MRDLIELAYW NDVMRFERAR AFDRAHGVAE GVPVSKPRVG GSGHGGGGGG GDDGGSGDPA GRANAPPPTN VGPRRGGDGV DGDEEEEEDD WLDRDRELSF FESIFAFVFG RGDPNDNLET RRWRAVGALL RVNKGCVFAE QVAPFLDTYL LTKEDHSEVR NGLFAVVFDL VAHARRLFRR KADAERDVRR MHEGYMLEVL TRFGGFAEAS DAGELIYVFP SLQVTARAVE PSSSRLMPSR SVQAPTPPPI YERVRPLWES GAKMPLVVAL GFLNVALIFI FRAAGGMDFK PPRQSQLPRR AEQTMGRRAG RFRDAAPTTS TVPIDDYGEP PLVILILELF PKLLKLLMPL LLVYAGIFFL VPTSRALYIA VENRQIKRRN DVRKRRAQEI LSTSVQMIDK QSRARGKQAL EVV
|
| |