Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31385 |
Symbol | |
ID | 5001543 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 781786 |
End bp | 783493 |
Gene Length | 1708 bp |
Protein Length | 462 aa |
Translation table | |
GC content | 59% |
IMG OID | 640416964 |
Product | predicted protein |
Protein accession | XP_001417356 |
Protein GI | 145345734 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.000357166 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAGTC CGACGCGACG ACGCGCGACG GTTGAGTCGA CGACGTCGTT CGAGCACTGT CGAGCGGGGA CGCCGACGGG ACGAGCGCAG GTGCGCGCGC GCGACGGCGA GGCGAGGCGA GGCGCGCGCG CGCGGAATTC GCGTCGCGAA AACCGTTCGC GTCGCGCGCG GTGATAAGCG CGGTTCGACG CGCGCGCGGG ACGCGCGGGA CGCGCGGATG GCGATGGCGA TGGCGATCGA GGGGATGGAT GATCGTCGAG CGAATCGACG CGCGAGACTG ACGATGTTGA CGGTTCGATA GGTCGATCGA TTCATCCCGA GCCGTAGCGC GTTGGATTTG GACGTGGCGC ACTACAATTT GTCCCGAGAG GGCGGGGAAT CGGAGGTGGA TGATGCGGTA AAGGAGATCA AGTCTCCGGC GAAGGTGCGC GCGTCGACGC GCTCGAAGCG CGCGCGGACG AACGCGTGGA GTGAATGAAT GAGTTCGTTG AGACTGACGA CGAAATCGAT CGCGCGATGA TATGATGCAC AGGAAGCGTA TAAGAAAAGT CTTGCGGATA ACTTCCACGT GGATAACGGA AGCGATTCGG CGAAGATTCT CGCGTTCAAG TCAAAGGCGC CGGCGCCGCC GAGCGGGTTG GAAAACTCGG CGCGCGGTGT CTACACGAAC AACTCCGCGG GAGTGAAGGC GAAGAAGACG TTCCGTCAGA TTCCCAGCGC TCCCGAACGC ATCTTAGATG CGCCGGAGTT GATCGATGAC TACTATTTGA ACCTTATCGA CTGGGGGTCG TCGAACCAAG TCGCGGTGGC GTTGGGGTGC ACCGTGTACA TGTGGAACGC GGATACTGGG GCTATCAACC AATTGTGCCA GACCAACCCG GATGACGAAG ATGATTACAT CACCTCCGTC AACTGGGGTG CGGACGGTAA GCACATTGCG GTGGGTACGA ACAGTGCGGA GGTTCAAATT TGGGACGCGG CGCAGTGCAA GAAGGTGCGT ACGTTGCGAG GTCACGCCGC GCGGGTGGGT GCGGTCTCGT GGAACGGTTC GCAGCTTGCA ACGGGTAGTC GTGATAACAA CATCATGATT CACGACGTTC GCATTCGCGA GCATTGCACC TCGACGCTCC AGGTTCACCA GCAAGAGGTT TGTGGCTTGA AGTGGAGCCC GAGTGGCAAT CAGCTCGCGT CTGGCGGTAA CGACAACTTG TTGCACATCT TTGATGCGAG CTCCATCGGC AATCAACAAG CGTTGCACAG ATTAGATGCG CATCAAGCTG CCGTTAAGGC TCTCGCCTGG TGTCCGTTCC AGTCCAACTT GCTCGCTTCG GGCGGCGGTA CCGCCGACCG TTGCATCAAG TTTTGGAACA CGAACACCGG CGCCATGCTC AACTCTGTGG ACACGCACTC GCAAGTGTGC TCGTTGCAGT GGAACACGCA TGAGCGGGAG CTTTTGTCGT CGCACGGTTA CAGCCAAAAC CAGTTGTGTT TGTGGAAGTA TCCGACGATG ACCAAGATGG CCGAGTTGAC GGGTCACCAA GCGCGAGTGC TTCACATGGC GCAGTCTCCG GACGGTACCA CGGTGGTATC GGCGGCCGCG GATGAGACTT TGCGATTCTG GAAGTGCTTC GATAACGCTA GCGAGAAGAC CAAGAAGGTG CGCGATTCCA ATGACTCATC TGTTTTGCGC AGGTTCAATT TCCGCTAA
|
Protein sequence | MLSPTRRRAT VESTTSFEHC RAGTPTGRAQ VDRFIPSRSA LDLDVAHYNL SREGGESEVD DAVKEIKSPA KEAYKKSLAD NFHVDNGSDS AKILAFKSKA PAPPSGLENS ARGVYTNNSA GVKAKKTFRQ IPSAPERILD APELIDDYYL NLIDWGSSNQ VAVALGCTVY MWNADTGAIN QLCQTNPDDE DDYITSVNWG ADGKHIAVGT NSAEVQIWDA AQCKKVRTLR GHAARVGAVS WNGSQLATGS RDNNIMIHDV RIREHCTSTL QVHQQEVCGL KWSPSGNQLA SGGNDNLLHI FDASSIGNQQ ALHRLDAHQA AVKALAWCPF QSNLLASGGG TADRCIKFWN TNTGAMLNSV DTHSQVCSLQ WNTHERELLS SHGYSQNQLC LWKYPTMTKM AELTGHQARV LHMAQSPDGT TVVSAAADET LRFWKCFDNA SEKTKKVRDS NDSSVLRRFN FR
|
| |