Gene Information |
Locus tag | OSTLU_87131 |
Symbol | |
ID | 5001572 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 843077 |
End bp | 844438 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | |
GC content | 62% |
IMG OID | 640416993 |
Product | predicted protein |
Protein accession | XP_001417368 |
Protein GI | 145345760 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
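The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a stop codon, which contributes no residue. Neither convention is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 843077
END_BP = 844438
REPORTED_GENE_LENGTH = 1362    # bp
REPORTED_PROTEIN_LENGTH = 453  # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == REPORTED_GENE_LENGTH)        # True
print(computed_protein_length == REPORTED_PROTEIN_LENGTH)  # True
```

Under these assumptions, 844438 - 843077 + 1 gives 1362 bp, and 1362 / 3 - 1 gives 453 aa, which agrees with the reported values.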
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.00012796 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGACGG GGACGAAGGT GGCGCTCGAC GACGCGCGCG CGGCGTGCGT CGCGCTCGCG CTCGAGGAGG CGGACGCGGT CGGGAAGATC ACGCTCACGG GACCGGGGCG AGGGACGGCG GCGGAGGCGC ATCGAAGGGT GGAGTTTCGA TGGGTGACGA TTCGGGGGCG GACGGCGCTG CAGCGGACGC GGTACGACGA ACGACAGGCG TTCACGAGTA ATCACGCGAT CGAGGGCGAA GGAGGGCTCG TTTCGAAGAG CGCGAGCGGG GATGAGGACG CGATCGGGGC GAGGGAGGCG CTGGAGGAGG CGCTGAGGGC GGGATATAAA CATTGGCGCG TCGAACACGC GCGCGGCGGT TACAACGTGA CGGCGAACGC GAAGAAGGCT CGGGCGACGA TTTCTAGGGA CAACGCGAGT AAGGGGACGC TGATAGACGG TAGTGCGAGA ACGCAGACGA TCGTCGTCGG CCCACAAGGG CACGACCGCG AGAAGTCGAG GTTACTGACG GGGGAGGACC CGTTTTTGCG ATACGTCGGC GTCGTCGCCA AGGATGGAAC CATCAAGGCG AGCAAGCGGG ACAAGTACAA GCAAGTGGAG GAGTTTCTGA AGATTTTGAA CGTTGCTTAC GACACCGCGA CGTCGGCGGG ACACATGAAG GGCGGCGATG AGACTCGCCC GCTTCGTGTG TGCGATTTAG GATGCGGGAA CGCGTATCTC ACGTTTGGGG CGTACTCGCT CTTGAGCTCG AAGAGACGCG TGCCTACAAA CGTAGTGGGC GTCGACGTGA AGCGCCAAGC GCGCGAACAT AACTCGCGGG TGGCCAAAGA GCTAGGTTGG GATGCGTCGA TGAGATTCAT TGAAGGCACG ATCGCCGACG CCGACGTGAC TTTCGTAGAC GGTTCAGAGG AGGACGCGTT CACCGACGTA GTGCTCGCTT TGCACGCGTG CGACACGGCG ACGGATGAGT CCATCGTTCG AACGGTGCGT TGGTGCGCTC CACTGGCGCT CATCGCGCCG TGCTGTCATC ACGATCTCCA AGTGCGTTTG AAAAGCGCAC CTCATGTTGC TTTCCCGCCG ATGGCGAGGC ACGGCATCCT CAGCGAACGT CTCGGTGACG TTCTCACGGA CGCATTCAGA GCTCACATTT TGCGACTGCT GGGATATCGC GTCGACGTCA TGGAATTCGT AGGAGGGGAG CACACGCCTC GAAATACTCT CATTCGAGCG ATCCGCACGA ACGCGTCGGC GTCGAAGGCG GCGTGGGAAG AGTATGATCA CATGTGTTCA ACGTGGGGCG TCACGCCTTT TCTCGCGGAC GCCCTCGCCG AGGAGTTGGC GGTGGCGCGT CGCGCGATCT GA
|
Protein sequence | MPTGTKVALD DARAACVALA LEEADAVGKI TLTGPGRGTA AEAHRRVEFR WVTIRGRTAL QRTRYDERQA FTSNHAIEGE GGLVSKSASG DEDAIGAREA LEEALRAGYK HWRVEHARGG YNVTANAKKA RATISRDNAS KGTLIDGSAR TQTIVVGPQG HDREKSRLLT GEDPFLRYVG VVAKDGTIKA SKRDKYKQVE EFLKILNVAY DTATSAGHMK GGDETRPLRV CDLGCGNAYL TFGAYSLLSS KRRVPTNVVG VDVKRQAREH NSRVAKELGW DASMRFIEGT IADADVTFVD GSEEDAFTDV VLALHACDTA TDESIVRTVR WCAPLALIAP CCHHDLQVRL KSAPHVAFPP MARHGILSER LGDVLTDAFR AHILRLLGYR VDVMEFVGGE HTPRNTLIRA IRTNASASKA AWEEYDHMCS TWGVTPFLAD ALAEELAVAR RAI
|
| |
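The sequence fields are printed in blocks of ten characters, so the spaces have to be removed before any computation. The following is a minimal sketch of how the GC content and the protein sequence could be recomputed from the gene sequence. The Translation table field in this record is empty, so the standard genetic code is assumed here. All function names are illustrative and are not part of any database tool.

```python
from itertools import product

# Standard genetic code (an assumption), listed in the usual TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGCCGACGG GGACGAAGGT GGCGCTCGAC"
print(translate(first_blocks))  # MPTGTKVALD
```

The ten residues produced from the first three blocks match the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the same comparison against the reported GC content and the complete protein sequence.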