Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_14671 |
Symbol | |
ID | 5000584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | - |
Start bp | 571347 |
End bp | 572870 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | |
GC content | 58% |
IMG OID | 640416005 |
Product | predicted protein |
Protein accession | XP_001416985 |
Protein GI | 145344946 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.000514062 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACGTC GTCGCCTCGC GCGCGCGTGC GCGCTGGCGG TCGCATGCGC GCTGATGGTT CGCTCGTCGC GCGCGGACGC GCCTCGAGCG CACGCGTTCG CCATAGGAGG CGACGCGAAC GACACCGCGC TGGCGCTGCC GAAAACCGTC GCGCGCGTGC GATTCGTCGA GATCGAGATG AAGCTCATGA GCGCCAGCGC AGATTACGTC CTCGGTGTGT TTCCCGAAGA CACGTCGTCG TATTGCGAGG GCTCGAGCGT GAAAGCGTCG TGGCGACGGT TCGCGCGGGC GATCGACGGC GCGGGACGAC TGCGAGACGT CGTCGTGGCG GAGGCGACGC ACGAAGGGAC GGATGTGTCG AGAATCGCGG AGGTGAAGGC GCTCTCGAAA CCAACGGATC GACACGGGAT GCGCGCGAGA GCGAGGAAGG CGGGGTGCGG GTTCATCGCG CTGGCGATAA GTAACGAGAT GCACATGTTC GAAGGGGACG CGACCGACGG CGCCGCGTTG GCGAATTGGC TGGTGAAGTA CGTTCCAGAG AATCGATTCG AGGAGGTGGA GGAGGTGCGC TCGAGACACG AAGTGTTAGA GTTTGTCGAA CGTAATGAGA CGGTAGTGCG GGCGATTTTA TTGGAGGAGG TGCCGTGGTT GAAAGTGTTG GCAAAATCGT TCGACGGGAA GGTGGCGTAC GCCAGGGCGT CGTCGCTCGT CGGATCGAGG TACATCATGG GATTGCCATT TCACGAAGCA GGGGTCGCCG GGTTGTTATT CGTGCCGAAG ATGGAAAGCA TAGAAGACGA TGATCATGCG AAACGAGTAG AGGTGAGATT GATTCGTCGA CAAAACGAAA CGCTTAACTA TTACGAAATC GCCGAGACTT TGATTGAGGC CGAGCGCGAA CTCGGATACG CAGCGATGAT GGAATCGACG ATCGATGTCG TATGCGCAAA CGCCGTTTGG GAGCTCGCTG CGAACTCAAT TCATCCCGAG TATGCGCAAG CGGTGGAAGG TCAAGAAGAG CAAGCTTACG AGTTGATGCC GCCGCCAAGC GCGTGGGAAT TCATCAAAGC CGATTTGCTC GATTTGCTCG ACGGTCACGG TAAGGGTGAC AAAACTCTCG TCCCTGAGCA ATGGTTGTGC GGTCTCGCGG GACTCACCGC GGCTCATCGT GATATGGCGA ACGAATACAC CGAAGCCGCT GGCTCATTAG AGGAGCTCGA AACGCTTCAA AAGGAAAACC TCGCGTTGAG ACAACGCAAC GCGATGCTTG AGCGCGAAGT CGCGCAGTGT GACTCGTCAT CTCCAAAGGC ACCGCGAGGA TCGGCGCGTT TGCCGAAATC AAAACCCGCG GGTTGGACGT TCGATGCCAA ACAAAAGAAA CCGCCGCAGC CTCCGAAACC AACACCACCA ATTCCCGAAG ACAACGTTAT GGAAGACGCC AACGAAGATG CTGACAAAGA AGATGAAGAC CTTTTGGGCG AAGAAGGCAC GAGTGCGAAT CCGAGCGTTC GAGACGAACT TTAG
|
Protein sequence | MRRRRLARAC ALAVACALMV RSSRADAPRA HAFAIGGDAN DTALALPKTV ARVRFVEIEM KLMSASADYV LGVFPEDTSS YCEGSSVKAS WRRFARAIDG AGRLRDVVVA EATHEGTDVS RIAEVKALSK PTDRHGMRAR ARKAGCGFIA LAISNEMHMF EGDATDGAAL ANWLVKYVPE NRFEEVEEVR SRHEVLEFVE RNETVVRAIL LEEVPWLKVL AKSFDGKVAY ARASSLVGSR YIMGLPFHEA GVAGLLFVPK MESIEDDDHA KRVEVRLIRR QNETLNYYEI AETLIEAERE LGYAAMMEST IDVVCANAVW ELAANSIHPE YAQAVEGQEE QAYELMPPPS AWEFIKADLL DLLDGHGKGD KTLVPEQWLC GLAGLTAAHR DMANEYTEAA GSLEELETLQ KENLALRQRN AMLEREVAQC DSSSPKAPRG SARLPKSKPA GWTFDAKQKK PPQPPKPTPP IPEDNVMEDA NEDADKEDED LLGEEGTSAN PSVRDEL
|
| |