Gene Information |
Locus tag | OSTLU_16744 |
Symbol | |
ID | 5003688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | - |
Start bp | 249959 |
End bp | 251419 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | |
GC content | 63% |
IMG OID | 640419109 |
Product | predicted protein |
Protein accession | XP_001419756 |
Protein GI | 145350738 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0285] Folylpolyglutamate synthase |
TIGRFAM ID | [TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase |
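
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(249959, 251419)
print(length, protein_length(length))  # 1461 486
```

Both results match the Gene Length (1461 bp) and Protein Length (486 aa) fields in this record.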
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.544898 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.122901 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGACG ACGACCGCAC GTACGACGCG GCGGTGCGCG AGCTCGCGAA GACGATCACG GGTAAAAAAC GCGCGGACCC GGGCGCGGTG ACGTGGGAGG CGCAGTTCGC GCAGCTGGAG ACGTACGTCT CGAGGCTGGA CCTGCGAGGG CGCGTCGACG CGCTCGACGT GGTGCACGTG GCGGGGACGA AGGGGAAGGG ATCGACGTGC GCGATGGTGG AGGGGATGCT CAGGGCGAGC GGGACGCGGG TGGGGACGTT CACGAGCCCG CACTTGATGG ACGTGCGAGA ACGATTTCGG ATCGACGGGG CGATGGTGGA CGAGGAGACG TTCGCGAGGG AGTTTTGGTG GACGCGGGAC GCGATCGCGG CGCGGTGCGG AGATCTGGGG ATGCCGGCGT ACTTTAGGTT TTTGACGCTG TTGGGGCTGC GAATTTTCTC GAGCGCGGGG GTGGAGGCGT GCGTGCTGGA GGTGGGGTTG GGGGGGCGCC TGGACGCGAC GAACGTGGTG CGCGCGCCGG CGGCGTGCGG GATCACGTCG CTCGGGATGG ATCACGTGGA TATTTTAGGG GATACTTTGG GGAAAATCGC GACGGAGAAG GCGGGGATCA TGAAGCCGGG CGTGCGAACG TTTACGGCGC CGCAAAAACC CGAGGCGATG GTCGCGCTCG AGCGTCGCGC CGCCGAGGTT GGTTCGCCGT TAGTCGTCGC GAGAGATTTA GACAGCTATG AGGGCGGGAG TGAGATCGAA GTCGGGTTGG CGGGACCGCA CCAACGCGTC AACGCCGCCG TCGCCGTGGA GTTGGTGCGG GAGTGGGCGA ACGCCACGAA CCAGCCGTGG GCGGAGGAGA TGGAGGCTTC GTTCGCGCGA AACGAGTTAC CGGAGAGCTT TCGCGTCGGG TTAGAGAAGA CGACGTGGCC GGGTCGATCG CAGGTGATGC ACGATCCCGA TGTGATGAAC TTGACGTTTT ATCTCGACGG CGCGCACACC ATCGAATCCA TGCGCCACAG CGCCGAATGG TTTACGAGCA CCGCGCGAGC GGTGACGGCG GAGCACAACA TCATGCTTTT CAACTGCATG GATGATCGCA AACCGGAGGA TTTACTCGAA CCGGTCGCGG ATGTTTTCCA CACGACTTCA AACGTACAGT TAGAGCGCGC TATTTTTAGC CCGCCGGATA GCACCACGAG CGGGTTAGAC AAGTGCGGGG ACGCAAAGGC GACCTCTTGG CAAGACAGGT GTGCGCGTAC GTGGGACGAT ATCATCGTCA AACGTGCGAA CGTGCTCGCA GAAAAAGCTC AAGACACTCG AGGTGTCGTG GTTTCGAGCA TTCATCAAGC GCTCAGTCTC ATTCGACATC GAGCGCGAGA GGTGGCGCCG GCGCGCGTCA ACGTCCTCGT CACTGGCAGT CTGTATTTGG TCGGCGACGT CCTGAGGCAT CTAAAGAAGT GCGTGAAATA G
Protein sequence | MRDDDRTYDA AVRELAKTIT GKKRADPGAV TWEAQFAQLE TYVSRLDLRG RVDALDVVHV AGTKGKGSTC AMVEGMLRAS GTRVGTFTSP HLMDVRERFR IDGAMVDEET FAREFWWTRD AIAARCGDLG MPAYFRFLTL LGLRIFSSAG VEACVLEVGL GGRLDATNVV RAPAACGITS LGMDHVDILG DTLGKIATEK AGIMKPGVRT FTAPQKPEAM VALERRAAEV GSPLVVARDL DSYEGGSEIE VGLAGPHQRV NAAVAVELVR EWANATNQPW AEEMEASFAR NELPESFRVG LEKTTWPGRS QVMHDPDVMN LTFYLDGAHT IESMRHSAEW FTSTARAVTA EHNIMLFNCM DDRKPEDLLE PVADVFHTTS NVQLERAIFS PPDSTTSGLD KCGDAKATSW QDRCARTWDD IIVKRANVLA EKAQDTRGVV VSSIHQALSL IRHRAREVAP ARVNVLVTGS LYLVGDVLRH LKKCVK
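
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any analysis. The following is a minimal Python sketch of how the Gene sequence row relates to the GC content and Protein sequence fields. It assumes the standard genetic code (the Translation table field in this record is empty, so this is an assumption) and that the gene sequence is already given in coding orientation; the helper names are hypothetical.

```python
# Standard genetic code (NCBI translation table 1), assumed for this record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-character display blocks."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three display blocks of the Gene sequence row.
prefix = clean_sequence("ATGCGCGACG ACGACCGCAC GTACGACGCG")
print(translate(prefix))  # MRDDDRTYDA
```

The translated prefix matches the first block of the Protein sequence row. Passing the full Gene sequence through `clean_sequence` and `gc_percent` gives a value that can be compared with the GC content field.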