Gene Information |
Locus tag | OSTLU_31740 |
Symbol | |
ID | 5001867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | + |
Start bp | 455813 |
End bp | 457475 |
Gene Length | 1663 bp |
Protein Length | 468 aa |
Translation table | |
GC content | 63% |
IMG OID | 640417288 |
Product | predicted protein |
Protein accession | XP_001417777 |
Protein GI | 145346606 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1171] Threonine dehydratase |
TIGRFAM ID | [TIGR01127] threonine dehydratase, medium form |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.414163 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | CGCGACGCGC GACGCGGACC GCACTCGCGA CGCTCGGCGC ACCGAAGTCC GATGATGACG CACGCGATGC CGACGCGCGC GGCGATGCCG ACGCGCGCGG CGACGACGCG ACGCGAAAGA TGCGATGACG GCGTCGCGAT GACGACGACG ACGACGACGC GCCGACGCGC GGTGACGCGA CGCGCGGCGA CGCGAACGAC GACGCGGACG CGAACGGTGA TGCGCGCGGT GATACCGCGA CCGGGCGCGA GCGTGGCGGA GGGACTGGAG ATACCGTCGT ACGAACACGT GCGAGAGGCG TACGAAAAGG TGAGCGGAGG GGTGGTGCGC TCGCGGTGCA CGCACTCGAA GACGCTGAGC GCGCTGACGG GGATGGAGAT TTACTTGAAA CACGAGTGGG AACAAGCGAC GGGGTCGTTT AAGGAGCGAG GGGCGCGGAA CGCGCTGATG GCGCTCGACG CGGAGCAAAG AAAGCGAGGC GTCATCGCGG CGAGCGCGGG CAACCACGCG CTGGCGCTCG CGTATCACGG AAGAGAACTC GGGATTCCGG TGACGGTGAT CATGCCGTCG ATCGCGCCGC TGACGAAGAT TACGAAGTGC CAAAAGTTGG ACGCGCGCGT GATTTTGGAA GGGGACACGA TCGCGGACGC GGCGGCGTAT GCGAAAGAGA ACTACGTCGT GAAGGGCGAA ATGCTCAAGT ACATCAACGG TTTCGACGAC TTTGAAATCA TCTCCGGCGC CGGTTCGGTC GGTATCGAGA TGCTCGAAGA CGTTCGCGAC GCGGACGCCG TGGTGATTCC CGTCGGCGGT GGTGGCTTGA TTGCCGGTAT CGCGCTCGCC GTGAAGCACC TCAAGCCGGG GTGCCAAGTC ATCGGCGTCG AGCCCGAGCG CTGCGCCTCC ATGACGCTCG CGCTCGAAGC GGGTGAACCC GTCAAGGCGC CGACGACGGC GACGCTCGCT GATGGTCTCG CCGTTCCTAC CGTGGGTCCG CGCTCGTTCC AAACCGTTAA AAACTTGATT GACGATATCG TACTCGTCAG CGAGTCAGAC ATCGCCGTCG CCATGTTGAG ACTTCTCGAA AACGAAAAGC TCGTGCAAGA GGGCGCTGGG ATTTCCGGTT TGGCCGCACT CTTGACCGAT AAGCTCCCCG GTCTCAAGGG CAAGAAAGTA GTGGTCGCCA TGTGCGGTGG TAACATCGAT ACGAGCACGC TCGGTAAAGT GCTCGAGCGC GGCTTGGTCT CGGACGGGCG ACTCGTGCGC TTCGCGTGCG TCGTTCCCGA CAGACCGGGT GGTATCGCTG GCGTGTGCAA CAAGATAGCC GACGTGGGTG CTTCGATCAA GCACATCGTT CACGAGCGCG CTTGGTTAGA CCAAGACTCT CACTGCGTCC TCGTCGATGT CGAATGCGAA GTGACCGACA GTAGCATGGG CGAGGAGCTC TACGACGCCA TCAGCTCGGC GTACGTCCTG CGTACCGCTA ACTTTGGACC GGCGAACCGC CCGAAGGTCC CCGCGCGAAC GCCCGCACGT CGAGACGACG TCGACGCAAA TGATCTGAAG AATCCTGTGA CGTCGTGCGA TTTGTTCGAC GAAGACTGCA CGTCCATTTA CCCCGAAGAC GAGTAACTCC GGTTACGCCG TCGCAGCATC GCATGCATCG CGC
Protein sequence | MRAVIPRPGA SVAEGLEIPS YEHVREAYEK VSGGVVRSRC THSKTLSALT GMEIYLKHEW EQATGSFKER GARNALMALD AEQRKRGVIA ASAGNHALAL AYHGRELGIP VTVIMPSIAP LTKITKCQKL DARVILEGDT IADAAAYAKE NYVVKGEMLK YINGFDDFEI ISGAGSVGIE MLEDVRDADA VVIPVGGGGL IAGIALAVKH LKPGCQVIGV EPERCASMTL ALEAGEPVKA PTTATLADGL AVPTVGPRSF QTVKNLIDDI VLVSESDIAV AMLRLLENEK LVQEGAGISG LAALLTDKLP GLKGKKVVVA MCGGNIDTST LGKVLERGLV SDGRLVRFAC VVPDRPGGIA GVCNKIADVG ASIKHIVHER AWLDQDSHCV LVDVECEVTD SSMGEELYDA ISSAYVLRTA NFGPANRPKV PARTPARRDD VDANDLKNPV TSCDLFDEDC TSIYPEDE
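
The coordinate and sequence fields in this record can be sanity-checked with a few lines of Python. This is a minimal sketch, not part of the source database: the helper names (`gene_length`, `strip_blocks`, `gc_fraction`) are hypothetical, and the 1-based, inclusive coordinate convention is an assumption, although it agrees with the reported start, end, and gene length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def strip_blocks(seq: str) -> str:
    """Remove the spaces that split the displayed sequence into 10-letter blocks."""
    return "".join(seq.split())


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    seq = strip_blocks(seq).upper()
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # Start bp and End bp as listed in the record.
    print(gene_length(455813, 457475))  # 1663

    # First two blocks of the gene sequence, pasted as displayed.
    head = "CGCGACGCGC GACGCGGACC"
    print(len(strip_blocks(head)))  # 20
```

To check the full record, paste the complete Gene sequence or Protein sequence value into `strip_blocks` and compare `len(...)` with the listed Gene Length or Protein Length; `gc_fraction` can be compared with the listed GC content in the same way.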