Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_17535 |
Symbol | |
ID | 5004685 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | + |
Start bp | 145125 |
End bp | 146561 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | |
GC content | 62% |
IMG OID | 640420106 |
Product | predicted protein |
Protein accession | XP_001420600 |
Protein GI | 145352542 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0351] Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase [COG0352] Thiamine monophosphate synthase |
TIGRFAM ID | [TIGR00097] phosphomethylpyrimidine kinase [TIGR00693] thiamine-phosphate pyrophosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.274913 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTCGCG GTAAAGTCCT CGTCGTCGCC GGCTCCGACT CGGGCGGCGG CGCCGGCGTC CAAGCCGACG TCAAGGCGAT CCTCGCCCAC GGCGCCTTCG CCACGACCGC CATCACCGCG CTCACCGCTC AAAACACCAC GGGCGTGCAC GGCGTCCACG CCGCACCGTT GGAGTTCATC GAGGCGCAGA TCGAGGCCGT GGTGACGGAT CTACCGCCAG ACGCGTGCAA GACGGGGATG CTGGCCAACG CGGCGACGAC GCGCGCCGTC GCCGACGCGA TCGAACGACA TCGATTGACC AACGTCGTCG TGGACACGGT GATGCTGGCG AAAGGCGGCG CGAGCCTGCT CGAAGCGGAG GCGCTGGAGG TGATGCGGGA TCGATTGGCG CCGCTCGCGA CGGTGATCAC GCCAAACGTC CCGGAGGCGG CGGCGCTGCT AAACTTGAGC GAGGAAGAGT TCGTGATGGA GACGATGGCG ACGCGGGCGA AGGAGTTAGG GAAGTTAGGG TGTCAGTGGG TGTTACTTAA GGGTGGGCAC GTCAAGGATG ACGCGGAGAT GTCGGTGGAT TATTTGTACG AGGCGAACAC GGGGAGGACG ACGACGTTCT CGAGCGCGAG AATAGATACG AGGCACACTC ACGGGACCGG GTGCACGCTC GCGAGCTCGA TCGCGGCGTC GTTGGCGCAA AGGTATGACG TTCCTACGGC GGTGCATCGA GCGAAGAGGT ACATCTCGGA GGCGATTCGA ACGAGTCCGG GGTACGGCGC GGGACACGGG CCGTTGAATC ATTTGCCGTT TCACGCCGGC GCGGCGGCGC GTGGAAAGCG GTTCGATCCG CGATGTTTGA AACTTTATCT CGTCAGCAGT GAGGCGTTGA CCATGGATAA GCTTCGACAG GCGCTCGAGG CGGGAGTGAC GATTGTGCAG ATGCGCGATA AGGATCCCTC GACGAGGGCC TTGATCGAAC GCGCCAAGGC GATGAAGGCG GCGTGCGATG AATACGGCGT CCCGTTCATC GTCAACGATC GTGTCGACGT CGCCATCGCG TGTGACGCCG ATGGTGTGCA CTTGGGTCAG TCAGACATGA CTTGCGCGGA GGCTCGACAA ATTCTTGGTC CGAATAAATG GATCGGAGTG AGCTGTCGAG AGGTGTCTTT GGCGCGCCAA GCAAGCGCCG ACGACGCCGA TTACATAGGG TGCGGCGCGT GTTTCGGCAC TAATTCCAAG GGTGACGCTA AAGTCATCGG CTTGGATGGT GTCGGGAAAG TTATCGCGGT AGCTCGCGAG CTTTCCTTAC CAGTCGTCGC CATCGGGGGC GTCTCGCTCG AAAACGCCGC GTCTGTTCGA GCGACGGGCG CCGACGGAAT AGCCGTCATT TCCGCCGTCG CCAACGCGGC GGACGTGAAA AAGGCGGTCC ATAAGTTGCT GCATTAG
|
Protein sequence | MVRGKVLVVA GSDSGGGAGV QADVKAILAH GAFATTAITA LTAQNTTGVH GVHAAPLEFI EAQIEAVVTD LPPDACKTGM LANAATTRAV ADAIERHRLT NVVVDTVMLA KGGASLLEAE ALEVMRDRLA PLATVITPNV PEAAALLNLS EEEFVMETMA TRAKELGKLG CQWVLLKGGH VKDDAEMSVD YLYEANTGRT TTFSSARIDT RHTHGTGCTL ASSIAASLAQ RYDVPTAVHR AKRYISEAIR TSPGYGAGHG PLNHLPFHAG AAARGKRFDP RCLKLYLVSS EALTMDKLRQ ALEAGVTIVQ MRDKDPSTRA LIERAKAMKA ACDEYGVPFI VNDRVDVAIA CDADGVHLGQ SDMTCAEARQ ILGPNKWIGV SCREVSLARQ ASADDADYIG CGACFGTNSK GDAKVIGLDG VGKVIAVARE LSLPVVAIGG VSLENAASVR ATGADGIAVI SAVANAADVK KAVHKLLH
|
| |