Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_43047 |
Symbol | |
ID | 5005451 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | + |
Start bp | 455738 |
End bp | 457363 |
Gene Length | 1626 bp |
Protein Length | 399 aa |
Translation table | |
GC content | 49% |
IMG OID | 640420872 |
Product | predicted protein |
Protein accession | XP_001421294 |
Protein GI | 145354020 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02167] bacterial surface protein 26-residue repeat |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0199168 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.21868 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATATAG CGCAGTGGGA TACGAGCTTG ATCACGGATA TGAATCACTT GTTCTACAAC AAAATTTCAT TCGACGCCAA TATCAGCACG TGGAACACGG CGAAAGTGAC AAAAATGGAT AGCATGTTTT CTCAAGCACG TGCATTTAAC CAGAGGATTA ATGAATGGAA TACATCGAGC GTGGTGACAA TGGAAGCTAT GTTCCGCTAC GCCGAGTCCT TTGACCAGCC CATTGGTGAG TGGAATACCG GAAGAGTCAC CAATATGAAA GACATGTTTT CGCAGGCGTC TGAATTCAAT CAGAATATCG GTAATTGGGA TACATCGAGT GTGACGACGA TGGATACTAT GTTTCGCTAC ACCGAGTCCT TTAACCAGCC CATTAGTGAG TGGAATACCG GAAGAGTCAC CAATATGAAA GACATGTTTT CGCATGCGTA CGCATTCAAT CATTCAATCG GTGACTGGAA CACGGGTGCC GTGGAGAATA TGGAGCTTAT GTTCCAAAAT ACCGGCGTTT TCAACCAGGA CATCAGCAGT TGGAACACGG CGGCGGTGAC TCAGATGGAC TCAATGTTTT ACGGTGCTTC AGCCTTCAAT TATGACATCA CTGGTTGGAG TTCAGCAAGC CTTACGTCGT CGACGGACAT GTTTCTCTCC GCGACGGCTT GGCTGGCGAG GTACAAGAAC ACCGCCTCCC CAGGTAGCAA AAATGGGCCA TCGTCGAACT GGCTCGGTCC CGGTCCGTTC ACGACCAAGT CGGCGTTGGA GACCGCTACA TGGAACTGCT TGGCGGGGAG CGCCGATGCC GATGGCAACT GCGACTGTAC GACCGTCGAT TGTGGCGCGG CGGTGTATGA ATCGATTTCG AACTGGAACA CGAGTTTGAT CACGGACATG ACCGGTTTGT TTGAAGGTAA GTCTTCATTC AACGCGAATA TCGGCGGATG GAACACGGGG AGCGTCACAA ACATGCACCA GATGTTTAAC GGCGCGACTG CTTTCAACCA AGACATCGGT TCTTGGGATG TCTCGGATGT AGCGGACATG TATCAAATGT TCGGTAGCGC GTCCTCGTTC AATCAAAACA TTGGGTCGTG GAACACAGGC AGTGTGACGG ATATGAGCGG TATGTTCATG AATGCATCTG CGTTTAACCA GCCCGTTGGT GACTGGAACA CAGGCAGCGT CGTGGATATG AGCGGCATGT TTGAGAGCGT GTCTGCGTTT AACCAGCCCA TTGGTGAGTG GAATACCGGA AGAGTCACCA ATATGAAAGA AATGTTTTCG CAGGCGTCTG AATTCAATCA GAATATCGGT AATTGGGATA CATCGAGTGT GACGACGATG GATACTATGT TTCGCTACAC CGAGTCCTTT AACCAGCCCA TTAGTGAGTG GAATACCGGA AGAGTCACCA ATATGAAAGA CATGTTTTCG CATGCGTACG CATTCAATCA TTCAATCGGT GACTGGAACA CGGGTGCCGT GGAGAATATG GAGCTTATGT TCCAAAATAC CGGCGTTTTC AACCAGGACA TCAGCAGTTG GAACACGGCG GCGGTGACTC AGATGGACTC AATGTTTTAC GGTGCTTCAG CCTTCAATTA TGACATCACT GGTTGG
|
Protein sequence | MYIAQWDTSL ITDMNHLFYN KISFDANIST WNTAKVTKMD SMFSQARAFN QRINEWNTSS VVTMEAMFRY AESFDQPIGE WNTGRVTNMK DMFSQASEFN QNIGNWDTSS SALETATWNC LAGSADADGN CDCTTVDCGA AVYESISNWN TSLITDMTGL FEGKSSFNAN IGGWNTGSVT NMHQMFNGAT AFNQDIGSWD VSDVADMYQM FGSASSFNQN IGSWNTGSVT DMSGMFMNAS AFNQPVGDWN TGSVVDMSGM FESVSAFNQP IGEWNTGRVT NMKEMFSQAS EFNQNIGNWD TSSVTTMDTM FRYTESFNQP ISEWNTGRVT NMKDMFSHAY AFNHSIGDWN TGAVENMELM FQNTGVFNQD ISSWNTAAVT QMDSMFYGAS AFNYDITGW
|
| |