Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_49676 |
Symbol | |
ID | 5001729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | + |
Start bp | 460865 |
End bp | 462395 |
Gene Length | 1531 bp |
Protein Length | 499 aa |
Translation table | |
GC content | 56% |
IMG OID | 640417150 |
Product | predicted protein |
Protein accession | XP_001417778 |
Protein GI | 145346608 |
COG category | [F] Nucleotide transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0207] Thymidylate synthase [COG0262] Dihydrofolate reductase |
TIGRFAM ID | [TIGR03284] thymidylate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.21688 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGATG GAAAGTTTCA AGTCGTCGTC GCGGCGACGC GCGAGGGAGG GATCGGGAGG GAGAACGCGC TGCCGTGGCG ACTCGCCGGG GACATGGGAT ACTTTAAAAA GATTACGAGC GAGACGCGGG ACGGGGACGC GATGAACGCG GTGGTGATGG GGAGAAAGAC GTGGGAATCG ATTCCGGGGA AATTCAGACC GCTGCCGGGA AGGTTGAACA TCGTGCTGAG TCGAAGCGGG GGATTGGCGG AGGCGAACGA CGAGAATAAT AACGGCGCGG AGACGCTGCC GGAGGGGGTG CTGGTGCGTA AGTCCATCGA TGACGCGCTG AGCGCGATTT CGTCGAGCGA AAAGAGGATT GAGAAGACGT TTGTGATCGG TGGGGCGCAA ATTTACGAAG AGGCGCTTCA AAGCGAAAAG TGCGAAGCCG TGCACCTCAC CGAGGTGGAG GGCGAATTCG AATGCGACGC CTTCATCCCG AAGATCGACG CCACCAAGTT CAAGCTCTAT GGACAGTCCA AGCCCATGAT TGAGAAAGGC ACGAGGTTTC AATTTTTGAC GTACGTCACT GCCGACGCCG AGAGCGGAAA GTTCCGCCCG AAGGCCGACG AGGTTCTGCC CGCGGGTTGC TCGATCAAGC ACGAAGAATA CCAGTACTTG GAAATGATTC GCGAAATCAT CGATCAAGGC GCGGTGAAGG GCGATCGCAC CGGCACTGGG ACGATTTCTA CGTTTGGCAA TCAAATGCGT TTCGATCTTC GCCGATCGTT TCCACTTCTC ACGACCAAGC GCGTCTTCTG GCGCGGCGTC GCGGAAGAGT TGCTGTGGTT CGTCGCCGGC GAGACGAACG CGAACAAGTT GGCCGAGAAA AAGATCAACA TCTGGGATGG TAACGGAAGT CGCGAGTACT TGGATTCTAT TGGTTTGACT GAACGCGAAG TCGGTGATCT CGGTCCGGTG TACGGATTCC AGTGGAGACA CTTCGGCGCC GAGTACACGA ACATGCACGC CGACTACACT GGCAAGGGCG TGGACCAACT CGCTGAGGTC ATTCACAAGA TCAAGAACAA CCCGAACGAT CGTCGCATTT TACTCACGGC GTGGAATCCG GCGGCGTTGA AGGAGATGGC GTTGCCACCG TGCCACATGT TCTGCCAGTT TTACGTTGCC AACGGCGAGT TGAGCTGCCA AATGTACCAA CGCTCGTGCG ACATGGGTCT GGGCGTTCCT TTCAATATCG CTTCGTATTC CTTGCTCACG TGCATGATTG CGCAAGTTTG CGGTTTGAAA CCTGGTGATT TTGTGCACTG CTGCGGAGAC ACGCACGTAT ACTCGAACCA CGTGGAGCCG CTCGAAAAGC AGCTCGCGTG CGAGCCGCGA CCGTTTCCGA TTTTGAAAAT CAACCCGGAA AAGAAGGATA TCGACTCCTT CACCTTTGAC GACTTCGAGA TCGTCGGTTA CGATCCCCAC CCTAAAATTG AGATGAAAAT GGCCGTCTAA CGGCGCCCCG CGTACGTTGT GTCAGTATCA A
|
Protein sequence | MNDGKFQVVV AATREGGIGR ENALPWRLAG DMGYFKKITS ETRDGDAMNA VVMGRKTWES IPGKFRPLPG RLNIVLSRSG GLAEANDENN NGAETLPEGV LVRKSIDDAL SAISSSEKRI EKTFVIGGAQ IYEEALQSEK CEAVHLTEVE GEFECDAFIP KIDATKFKLY GQSKPMIEKG TRFQFLTYVT ADAESGKFRP KADEVLPAGC SIKHEEYQYL EMIREIIDQG AVKGDRTGTG TISTFGNQMR FDLRRSFPLL TTKRVFWRGV AEELLWFVAG ETNANKLAEK KINIWDGNGS REYLDSIGLT EREVGDLGPV YGFQWRHFGA EYTNMHADYT GKGVDQLAEV IHKIKNNPND RRILLTAWNP AALKEMALPP CHMFCQFYVA NGELSCQMYQ RSCDMGLGVP FNIASYSLLT CMIAQVCGLK PGDFVHCCGD THVYSNHVEP LEKQLACEPR PFPILKINPE KKDIDSFTFD DFEIVGYDPH PKIEMKMAV
|
| |