Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_17956 |
Symbol | |
ID | 5005448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | + |
Start bp | 2749 |
End bp | 4392 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | |
GC content | 53% |
IMG OID | 640420869 |
Product | predicted protein |
Protein accession | XP_001421176 |
Protein GI | 145353772 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 83 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACAAC AAGAGATCCT TGCATTGGAA CGTGCGACCG CGAAGAAACT GCGTAAACAC AGGCGGAGGA CGAAAGTCAC AAGCAAAAGC AAAGCACGAA TCTTATCCTC CGAAACGTCA AGAATGGAGC TTCCTGAATC CACGGTTTTG ACGTTGACGT CGGCCAAAGA GTCAGCGAAG AGGATCGTAT CAATCAGGGA CGATATCGAG GTGGTGATTC ACGACGTCCA GACCAGCGGC GGCGTGTACA AATCATCGCA TAAAAAAACG TTCGTGCAAA TAGCGTGGAA GGAAAAACGA AATGCAACTC GCAAGGTCAA GACACGGAGA ATAGAGGGTA CAGAGAATCC ATCTTGGGAC GCGCAAGGGA AGTTCATCTT TCAGAATTGT CGGGCGAATG CGGCTGGCAG CAAATTCGTC ATCAAACTAA AGGAAGTGCG CTGGTTTAAA AGAGAAACAG TCGTTGGTTA CGCAAAGATT CCGAGTACTT TGGTTCCATT GGATGGGACG AAGTTGAAGC TTCGCATCGC GCTCGTGACG AAAAGGCACA GAGCCAAGGC TGTGCTGAGC ATGACAATCG GAAAAGCGAC GTTTGAGGAG CCAGCGCTGG GGGAGGGTGG ACACTTTCCC GTCGTCACCG GCGCCGAACA AGAACTCGTC TCATTCGAAG GAAACTTTCC CTCCAGGCGC GAAAAATTAG ACTTGATGGT TGTTAACGTG ATGAGCGCTC GCGGCGTATT CGACGCCGAT GGGTTCGGTA CGAGCGACGT GTTCATCAGA CTCGGTTTCG ACACTACGCC CATCGAAGAA CGCTACCAAA CTACGATTAA GTACCGCACT CGCAATCCTG AGTGGAACGA GCGTTTCTTG TTGCGTGTAC CAAGCGTCGA TGCAGCTAGG GGCGAACCCA AAGCAATCGT GTTCACGGTG TGGGACAAGG ACAGATTCTC GCCGAGCGAC TTCCTTGGGG CCGCCGCTAT TCCACTCGAC CGTGTTTCGA CTACTGGCAG CGTCGCGGAT TTGGACATGG ATTTGAAGGC TAGAATTGCG CCGGACCTCG CGGGTGAGGT GTGCTTCATC CATCCACGCG CACCCGCAGA TTTGGGTAAA CTTCGAGTCA AGGTTTCGGC GCTCATCAGT GACGCCGCCG AAACAATCGC AAAGACAGTC AATTTAGGAA GGATAGATCC CACTGAAGGC ACAAGTCTGG CTACGAGCGT ACACGTCGCC GTCATCGCGG CGAGAAAGTT GTTACATGTT GACACCAAAG GGTCCTGCGA TGCTTTCGCG TACGTGCGAA TGGACAATGC GCCCAAGAAT GAATTCTGCA GGACAGATAC AATCGCAAAC ACGCTCCATC CCGTGTGGAA CAACGGCATG GGAAAGACCT GTTCTCTCAT CGCTCGTCCG GGCTCTGGAG ATGTCTTGTT TCAATTGTAC GATCGCAACC TGCTTAGCAA AACGCTCATG GGAACTGCGT CGGTGTCGTT GGCGTCTCTG CCTCCAGATG GTTCATGGAC ACAAATCGCA ACTCCAGTTT ATGGCCAGGA CAAGAATCGA AACACTCTCG TCGGCGGTTC GGACAGTTCC ATGGCATGGA ACGCTCCAGA GAGAGTGAAG GGTGAAACTC ATCGTTCGCC TTAG
|
Protein sequence | MRQQEILALE RATAKKLRKH RRRTKVTSKS KARILSSETS RMELPESTVL TLTSAKESAK RIVSIRDDIE VVIHDVQTSG GVYKSSHKKT FVQIAWKEKR NATRKVKTRR IEGTENPSWD AQGKFIFQNC RANAAGSKFV IKLKEVRWFK RETVVGYAKI PSTLVPLDGT KLKLRIALVT KRHRAKAVLS MTIGKATFEE PALGEGGHFP VVTGAEQELV SFEGNFPSRR EKLDLMVVNV MSARGVFDAD GFGTSDVFIR LGFDTTPIEE RYQTTIKYRT RNPEWNERFL LRVPSVDAAR GEPKAIVFTV WDKDRFSPSD FLGAAAIPLD RVSTTGSVAD LDMDLKARIA PDLAGEVCFI HPRAPADLGK LRVKVSALIS DAAETIAKTV NLGRIDPTEG TSLATSVHVA VIAARKLLHV DTKGSCDAFA YVRMDNAPKN EFCRTDTIAN TLHPVWNNGM GKTCSLIARP GSGDVLFQLY DRNLLSKTLM GTASVSLASL PPDGSWTQIA TPVYGQDKNR NTLVGGSDSS MAWNAPERVK GETHRSP
|
| |