Gene Information |
Locus tag | OSTLU_14033 |
Symbol | |
ID | 4999406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | - |
Start bp | 1098631 |
End bp | 1100190 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | |
GC content | 62% |
IMG OID | 640414827 |
Product | predicted protein |
Protein accession | XP_001416026 |
Protein GI | 145341871 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
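The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and that the gene length counts the terminal stop codon (the gene is reported as unspliced). The function names `span_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 1098631
end_bp = 1100190


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end], assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


gene_length = span_length(start_bp, end_bp)
print(gene_length)                  # 1560, matching "Gene Length"
print(protein_length(gene_length))  # 519, matching "Protein Length"
```

Under these assumptions both derived values agree with the reported Gene Length (1560 bp) and Protein Length (519 aa).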
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0882668 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGAAC TCGCGAGAGG GCTGGCGGGA TTGGGGTACG AAATCGTGTC CACGGGCGGG AGCGCGAAGG CGATCGAGGC GAGCGGGACG GCGGTGACGA GCGTGGACGC GGTGACGGGA TTCCCTGAGA TGCTCGATGG ACGCGTGAAG ACGCTGCATC CGGGCGTGCA CGGAGGTATA CTGGCGAAAC GTGAGGACGC GTCGCACATG GAGGCGATCG CGAAGCATGG GATCGATACC ATCGACGTCG TGGCGGTGAA CCTGTACCCG TTTCGGGAAA CCGTGGCGGG CGGAGGAGAT TTCGCGCAGT GCGTGGAGAA CATCGATATC GGTGGACCGG CGATGATTCG GGCGGCGGCG AAGAACCACC CGCACGTGTA CGTCGTCGTC GATCCGAACG ATTACGAAAA GTTGATTGAA CATTTGAAGA GTTCGCCGAG CGCGGCGGAC GACTTGAAGT TCAAGCGCGA GTTGGCGTGG AAGGCGTTCC AACACTGCGC CTCGTACGAC TCCGTGGTTT CGGAATACTT GTGGAGTCAA ATCGGCGAGG GTGCCCCAGC GCCCGAGCTT TCGGTGCCGA TGACGCTCAC GGCGGCGCTG CGATACGGGG AGAACCCGCA CCAACCCGCC GCGGTATACG CGGATGGTTC GCTCGCCGAG TCTACGGGCG AGGGCGTGGC GCGATCGATC CAGCATCACG GCAAGGAAAT GAGTTACAAC AACTATCTCG ATGCCGACGC CGCGTACGGG TGCGCGTGCG ATTACCCGAC GAGCGATCCG ACGTGCGTCA TCGTCAAGCA CACCAACCCG TGCGGCATCG CGAGCGCGAG CGGCGCCAAT GGCGATTTGC TCGAAGCTTA CCGCATGGCG GTTCGCGCCG ATCCGATCTC CGCGTTCGGT GGCATCGTCG CCTTCAACTG TACGGTCGAT GCGGACATGG CGCGAGAAAT TCGCGAGTTC CGCTCTCCCA CCGACGACGA GACGCGCATG TTTTATGAAA TCGTCATCGC TCCGTCCTAC ACTCCCGAAG GTTTGGAGGT GTTGAAGGGT AAGTCCAAGA CGCTGCGCAT CTTGGAAACC AAGCCGCGCA CGGGATCAAC GAAGAGCTTG CGACAAGTCG GCGGTGGCTG GCTCGAGCAA GCCTCGGACT CTCTCGTGCC CGAGGACATT ACGTTTGAAG CCGTCTCTGA CGTCAAACCC ACGCCAGAGC AACTCGAAGC CTTGAAGTTT GCCTGGCGCG CGGTCAAGCA CGTCAAGTCC AACGCCATCA CCGTCGCCAC CACCGGTCGA CTTCTCGGCA TGGGCTCCGG CCAGCCCAAC CGCGTGAACT CTGTTCGCAT CGCCCTCGAA AAGGCGGGCG AAGAGGCGCA GGGCGCCGTT CTCGCCTCGG ACGCCTTCTT CCCTTTCGCT TGGGGCGATT CCGTGGAGAT CGCGTGTCAG GCCGGCATCA AAGCCATCGC CCATCCCGGC GGTTCCATGC GCGACCAAGA CGCCGTGGAC GTGTGCAACA AGTACGGCGT CGCCCTCGTC ACCACCGGTC ATCGACACTT CCGTCACTAG
|
Protein sequence | MNELARGLAG LGYEIVSTGG SAKAIEASGT AVTSVDAVTG FPEMLDGRVK TLHPGVHGGI LAKREDASHM EAIAKHGIDT IDVVAVNLYP FRETVAGGGD FAQCVENIDI GGPAMIRAAA KNHPHVYVVV DPNDYEKLIE HLKSSPSAAD DLKFKRELAW KAFQHCASYD SVVSEYLWSQ IGEGAPAPEL SVPMTLTAAL RYGENPHQPA AVYADGSLAE STGEGVARSI QHHGKEMSYN NYLDADAAYG CACDYPTSDP TCVIVKHTNP CGIASASGAN GDLLEAYRMA VRADPISAFG GIVAFNCTVD ADMAREIREF RSPTDDETRM FYEIVIAPSY TPEGLEVLKG KSKTLRILET KPRTGSTKSL RQVGGGWLEQ ASDSLVPEDI TFEAVSDVKP TPEQLEALKF AWRAVKHVKS NAITVATTGR LLGMGSGQPN RVNSVRIALE KAGEEAQGAV LASDAFFPFA WGDSVEIACQ AGIKAIAHPG GSMRDQDAVD VCNKYGVALV TTGHRHFRH
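The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. The Translation table field is empty in this record, so the standard genetic code is used here as an assumption. The gene sequence is treated as already written in coding orientation (it begins with ATG even though the gene lies on the minus strand). Only the first 30 and last 9 nucleotides are copied from the record to keep the example short, and the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
# Standard genetic code (assumed; the record leaves "Translation table" blank).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of 10 and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


gene_head = "ATGAACGAAC TCGCGAGAGG GCTGGCGGGA"  # first 30 nt of the gene sequence
gene_tail = "CGTCACTAG"                          # last 9 nt of the gene sequence

print(translate(gene_head))  # MNELARGLAG, the first 10 residues of the protein
print(translate(gene_tail))  # RH, the last 2 residues; TAG is the stop codon
```

Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the reported GC content and the complete protein sequence.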