Gene Information |
Locus tag | OSTLU_18137 |
Symbol | |
ID | 5005268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | - |
Start bp | 466696 |
End bp | 467691 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | |
GC content | 52% |
IMG OID | 640420689 |
Product | predicted protein |
Protein accession | XP_001421476 |
Protein GI | 145354405 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.00371385 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGATT CAGTCGCGCA CGCCTCACCG ACATGGAGAA CGAACTCATT AGACCCGTCG CTGCAACCGG ACTGGATGAA CGGCACGGCG GAGTCAGAGG CGGTCAAAAC AGCGGAGGAT GATCTCGAGC GCTCGTACAA GAACAAGCCA TTCTTCAAGG ACCCATGCTA CGAGAAATTC GCCAGATCAC GAGACGAGTA TGACAAACCG ACGCCCGACG GCCTCGAGGA GTTCAAGCGC CTCTTACGGG CGAGTTATCT TCGCACCGTA GAGCGCATCG ATAAAGATAT CCGAGTTCTG GAAGCGCAGC TACAAGACGA GAATATCTCC GAAACGCGAC GGTCGGTTAT CCCGCAGCAC ATCGAGGACA CTCGTGCGAA ATTAAAAGAG AAAGACAATT CATTGAACGC GATGTGCGCC GATGTGGATG CGGTTATCGC AGATTTGAAA GGCTTGCCCG ATAAAATTGC CGACATAAAA GAACACGCCT CTGAACAACT CCAGCAACAC AAAAGAAAAA GGACTAGTGT ATCTTTGGAA CGCAACCTCG AGGAGTTGAG CGATGCGTTG CGCAGCTACG AGGCGAAAAT GGAACAACAG AGGCTCACAC TGTGGAATTA TCGTGGGAAT TATGGGTCAT GTCCACCCCA GTATGTTGGC CACCGAAGTG GCTCAAGTAG CTTCATTGTT CATCCTGTGG GTGTCATTGG CTGCGATGGC AGTAACCTCA GCGTTGATGG ATACGTTAAC TATGATGGTT ATGGTCAGTG TAACGTACCA AAGTGGCAGC CTGAATGGGG ACCTGAGAAG CAATTTTGTG CGTGCAAAAG CTTCCAAGAT TGGTACAACA AAGTTGGCGC CCAAGACCTT CTTTACAGCG CGCTTCGTGG CGTAGAGGGC GCGCTAAAGG CGCTCAGTGG TTCGGGAGAT GACACGTCTT CGCTTCGACG GCGCATGGGC GATGCCAATT TGGACGAGGC AAGCACGGGG AGTTAA
Protein sequence | MTDSVAHASP TWRTNSLDPS LQPDWMNGTA ESEAVKTAED DLERSYKNKP FFKDPCYEKF ARSRDEYDKP TPDGLEEFKR LLRASYLRTV ERIDKDIRVL EAQLQDENIS ETRRSVIPQH IEDTRAKLKE KDNSLNAMCA DVDAVIADLK GLPDKIADIK EHASEQLQQH KRKRTSVSLE RNLEELSDAL RSYEAKMEQQ RLTLWNYRGN YGSCPPQYVG HRSGSSSFIV HPVGVIGCDG SNLSVDGYVN YDGYGQCNVP KWQPEWGPEK QFCACKSFQD WYNKVGAQDL LYSALRGVEG ALKALSGSGD DTSSLRRRMG DANLDEASTG S
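The length and GC fields in this record can be checked against the sequence itself. The following is a minimal sketch in Python, not part of the original record. It assumes the CDS is unspliced and ends in a single stop codon (the record lists "Is gene spliced | No", and the gene sequence ends in TAA). The function names are hypothetical helpers introduced here for illustration.

```python
def clean(seq: str) -> str:
    """Remove the spaces separating the 10-character display blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def expected_protein_length(seq: str) -> int:
    """Codon count minus one, assuming an unspliced CDS with one stop codon."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return len(s) // 3 - 1


# Arithmetic from the record: 996 bp gives 996 // 3 - 1 = 331 aa.
```

Passing the "Gene sequence" value to `expected_protein_length` and `gc_percent` gives numbers to compare with the "Protein Length" and "GC content" fields above.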