Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_87976 |
Symbol | |
ID | 5003417 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 265424 |
End bp | 266887 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | |
GC content | 62% |
IMG OID | 640418838 |
Product | predicted protein |
Protein accession | XP_001419113 |
Protein GI | 145349380 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase [COG0807] GTP cyclohydrolase II |
TIGRFAM ID | [TIGR00505] GTP cyclohydrolase II [TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0473982 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCGAT GCGCGTCGGC GCGCGCGCCC GCGCGCGACG CGCGACGACG CGACGCGCGC GCGTGGACGC GTCGCGCGCG GACCCTCGCG ACGACGACGC GACGACGCGA CGGCGCGCGC GCGACGCGAC ACCGCGCGAC GACTCGAGCG CGCGCGACGA CCGGGGATGA TCCCGACGCG CCCACGGCCG GGTTCGCGGC CGTCGCGGAC GCGCTCGAGG ACGTGGCGAA GGGGAAATTC GTCGTCGTGC TGGACGACGA GGACAGGGAG AACGAGGGGG ACTTGATTGG GGCGGCGGAT AAGATGACGG CGGAGTCGCT GGCGTTCATG ATTCGACACA CGAGCGGGTT GGTGTGCGTG AGTCTTGAGG ATTCGAGGGC GGACGCGCTG GATCTGCCGC TGATGGTGGA TTCGCAGAGT AATAAGGATG CGATGAAGAC GGCGTTTACG GTGAGCGTGG ATTTGGCGAC GAGCACGACG GGGATATCGG CGAGCGAAAG AGCGATGACG ATCAACGCGC TGGGATCGGA TGAGACGACG GCGGCGGCGT TCGTGAGACC GGGACACGTG TTTCCGTTGA GATATCGCGC GGGTGGGGTG CTGAAACGGG CGGGACACAC CGAGGCGGCG GTGGACTTGG CGCGCATGGC GGGGTCGTCC CCCGTGGGCG TGTTGTGCGA AATCGTCAAC GACGAGGATG GGTCCATGGC GCGGTTGCCT CAGCTCAAAG TTTTCGCGGA GAAGCATGGG TTGAAGATGG TTCTCATCTC TGATATGATT CGGTATCGTC GCGCGCGAGA AAAGATGGTT GAGCGCACCG CCGTCGCGCG GTTGCCCACC GAGTACGGCA ACTTTACGTG CGTGTCTTAC AAGAACACTC TCGATGGTCA CGAGCACGTG GCTTTTCTGT ACGGCGAGCA CGAAGGCGAC GTATCCGGTG CTGTGGGTGA AGACATGCTC GTGCGCGTGC ACAGCGAGTG TTTGACTGGG GATATTTTCA AAAGCGCGAG ATGCGACTGC GGCAACCAGC TCGACATGGC GATGCGACGC ATCGCCGGCG AGGGCAAAGG GTGCATCGTG TACTTGCGCG GACAAGAAGG TCGTGGTATC GGTCTCGGTC ACAAATTGCG CGCGTACAAC TTACAGGATG AAGGACGCGA CACGGTTCAG GCGAACGAGG ATCTCGGCTT TCCCGCAGAC ACGCGCGAGT ACGGCGTCGG CGCGCAAATT TTACAAGATC TCGGCGTCAC GTCTCTTCGC TTGATGACCA ATAATCCGGC CAAATACAAC GGTTTGAGCG GTTACGGTTT GAAAGTCACC GGTCGCGTGC CGCTTTTCGC TCCAGTCACG ATGGAAAACA AGAGATACAT CGACACGAAG AGAATGAAGA TGGGTCACTT GTTCGAGATG CTCGAAGGCG TCGAACCGTC GCAAGCGGAG TCCGAACAAA AGCCGTCGCG ATGA
|
Protein sequence | MSRCASARAP ARDARRRDAR AWTRRARTLA TTTRRRDGAR ATRHRATTRA RATTGDDPDA PTAGFAAVAD ALEDVAKGKF VVVLDDEDRE NEGDLIGAAD KMTAESLAFM IRHTSGLVCV SLEDSRADAL DLPLMVDSQS NKDAMKTAFT VSVDLATSTT GISASERAMT INALGSDETT AAAFVRPGHV FPLRYRAGGV LKRAGHTEAA VDLARMAGSS PVGVLCEIVN DEDGSMARLP QLKVFAEKHG LKMVLISDMI RYRRAREKMV ERTAVARLPT EYGNFTCVSY KNTLDGHEHV AFLYGEHEGD VSGAVGEDML VRVHSECLTG DIFKSARCDC GNQLDMAMRR IAGEGKGCIV YLRGQEGRGI GLGHKLRAYN LQDEGRDTVQ ANEDLGFPAD TREYGVGAQI LQDLGVTSLR LMTNNPAKYN GLSGYGLKVT GRVPLFAPVT MENKRYIDTK RMKMGHLFEM LEGVEPSQAE SEQKPSR
|
| |