Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_51122 |
Symbol | |
ID | 5004623 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | - |
Start bp | 218219 |
End bp | 220779 |
Gene Length | 2561 bp |
Protein Length | 841 aa |
Translation table | |
GC content | 60% |
IMG OID | 640420044 |
Product | predicted protein |
Protein accession | XP_001420784 |
Protein GI | 145352925 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily [TIGR03550] 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofG subunit [TIGR03551] 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofH subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0345053 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGAGA CGCTCGCCGA CGTCGCGCGT CCTCGCGCGT GGCGCGCACG CGCGCGCGTC CAACGCGCGC ATCACCGTCC TCGCGCAGCC GTCGCGCGCG TCGCCGTCCT CGACGACGTC GCCGCCTACG CCTTGGTGAC AAAGTCGGCT GATGATTTAC TCGACGACGT CCGACGCCTC AACCGGGACG CGTCGTCGTC GTCGCCGCGA ACGACGAATG CAAACGTCGT GACGTACTCG CCCAAGGTGT TCGTTCCGCT CACGCGCGCG TGTCGGGACT CGTGCGGATA CTGCGCGTTC GTCGACTACG AACCGAGCGC GGCTGGAAAG CGCGTGTACA TGACGCTCGA GGAAATCGTC GACGTGGCGC GACGAGGCGC GGCGGCGGGG GCGACGGAGT GTTTGTTGAC GTTTGGTGAT CGACCCGAGG CGACGCGGGA GGATGCGCGA GAGGGATTGA GGGAGTTGGG ATGCGCGAGC ACGGCGGAGT ACGCGGCAAA GGCGTGCGAG GCGGTGTTGC GAGAGACGGG GTTGTTGCCG CACGTAAACG CGGGTGTGTT GACGAGAGAC GAGTTGAGGA TGCTACGACG CGTGAGCGCG TCGCAAGGGT TGATGTTGGA GACGACGAGC GAGCGGTTAT TGGGGCCGGG AATGGCGCAC GACGGGTGTG AGACAAAGCG ACCGAAGACG CGCCTGCGGT GCATCGAGCT CGCGGGAGAG GAGCGCATCC CGTTCACGTC TGGATTATTG ATTGGGATCG GCGAGACTCG CGAAGAGCGT ATCGATGCAC TTCTGGCGCT TCGCGATGTA CATGCCAAGC ACGGACACAT TCAAGAGCTC ATCATACAGA ATTTCTTATC GAAACCCGGC ACCGCGATGG CTGATTTTCC AAATCCTCCG CTGGAAGAGT TGACGTGGAC GGTGAGCGCA GCTCGCCTAA TTTTTGGCGC AGACATGATT ATACAGGCGC CACCGAATCT TACACCAGGC GAAGAGGCTG GCTGGCGCGC CCTTTTGCGC GCCGGTGCGA ATGATTGGGG AGGAATCTCG CCGGGCGTCA CGCCGGACCA CGTCAACGCC GAGGCGCCAT GGCCGCACAT AGAAGAGCTC GCCACCGTGT GCGCCGATGA AGGTTTCGCG CTCGTCCCGA GACTGCCAGT GCACCCTAAG TACTTGAGGG TAGACGATGA TCGAGTGAGC GTCGGGGGAT CCGCAGTTTG GCTTGACGAC AAAGTTTCGC CGTATCTTCG CAAACTCGCC GACAGCGAGT TTCTCGTTCG CGGTACGACA TGGTCGCCAG GACGTCCGGA TGATGAAAAG AAAGAGTTTG TGGATATCGT CGGCGTGAAT GGCTCTGTTC CTTGTCGTGG TACCAAGAGG CGTATATCGA GCGAAGTTCT GGCCGCCATA GCCGCCATAG TGGACGGAAA CTATGAGTTG GACGACATCG TGACGTGCTT ACAAGCGAGA GGCGCCGATT TCGACAAGGT GTGCGAGGCC GCGAATACTT TGCGAGAGCA GCAGTGTGGT GATACCGTTA CGTTTGTGAA CAATAGAAAC ATTAACTATA CGAATATCTG CACGTTGGCG TGCACGTTTT GTTCGTTTTC CAAGGGAAAG GCTGCAGAAG AACTTCGCGG TTCGCCGTAC CTGCTCGACT TGGACGAAGT CGCAAGGCGA ACGGCCGAGG CTTGGGAGCG TGGTGCGAGC GAAGTCTGCA TGCAAGGCGG CATTCATCCC TCGTTCACAG GCGAAGATTA TATGGCTTTT ATCGGAGCTG CGAAACGAGG GGCACCGGAC ATACACATTC ACGCCTTTTC ACCGCTCGAA ATCGCTCACG GAGCGCAGAC TCTCGGTCTG AGCGCTCGCG AATACTTGCG TAAACTCAAG GATGCAGGGC TGGGATCGCT CCCAGGTACT GCTGCCGAGG TTTTGGACGA CCAAGTTCGC GAAACACTCT GTCCAGATAA ACTCACCGCG AAAGAATGGC TCGATGTCGT CGAAGACGCT CACTTTGTGG GCGTGCCAAC GACGAGCACC ATCATGTTCG GTCACATTGA CGCCGACGGC CCGCGCGCGT GGGCGCGACA TCTCGTCTCC ATTCGCGATT TGCATCTCAA GACGGGTGGA TTCACGGAGT TCGTACCACT ACCTTTCGTG CATTTCGAGG CGCCGACGTA TCGTTTCGGC GCGTCTCGGA AAGGTCCAAC GCTGCGCGAG TGCATCCTGA TGCACGCCGT CGCGCGTCTC GTGCTGGGAC CGGCGGGAAT CACGAACATT CAGGCGAGCT GGGTAAAAAT GGGTCCCGAG CTCGCCTCAC TTCTCCTGCA CGCTGGATGC AACGATATGG GCGGTACACT CATGAATGAA TCCATCACTC GCGCCGCTGG TGCGACGTTT GGGCAAGAAA TCGACGCGCG CGAAATGCGT CGAATCATCG AAGCCGCCGG CCGCGTTCCG CTTCAACGCA CCACCTTGTA CGCTCACGCA CCCCAACATC GCGTCGAGCA CCCATCGATC GCGTAGGACG CGATCGTCGT AGATTGTATG TAGAGTCACT A
|
Protein sequence | MRETLADVAR PRAWRARARV QRAHHRPRAA VARVAVLDDV AAYALVTKSA DDLLDDVRRL NRDASSSSPR TTNANVVTYS PKVFVPLTRA CRDSCGYCAF VDYEPSAAGK RVYMTLEEIV DVARRGAAAG ATECLLTFGD RPEATREDAR EGLRELGCAS TAEYAAKACE AVLRETGLLP HVNAGVLTRD ELRMLRRVSA SQGLMLETTS ERLLGPGMAH DGCETKRPKT RLRCIELAGE ERIPFTSGLL IGIGETREER IDALLALRDV HAKHGHIQEL IIQNFLSKPG TAMADFPNPP LEELTWTVSA ARLIFGADMI IQAPPNLTPG EEAGWRALLR AGANDWGGIS PGVTPDHVNA EAPWPHIEEL ATVCADEGFA LVPRLPVHPK YLRVDDDRVS VGGSAVWLDD KVSPYLRKLA DSEFLVRGTT WSPGRPDDEK KEFVDIVGVN GSVPCRGTKR RISSEVLAAI AAIVDGNYEL DDIVTCLQAR GADFDKVCEA ANTLREQQCG DTVTFVNNRN INYTNICTLA CTFCSFSKGK AAEELRGSPY LLDLDEVARR TAEAWERGAS EVCMQGGIHP SFTGEDYMAF IGAAKRGAPD IHIHAFSPLE IAHGAQTLGL SAREYLRKLK DAGLGSLPGT AAEVLDDQVR ETLCPDKLTA KEWLDVVEDA HFVGVPTTST IMFGHIDADG PRAWARHLVS IRDLHLKTGG FTEFVPLPFV HFEAPTYRFG ASRKGPTLRE CILMHAVARL VLGPAGITNI QASWVKMGPE LASLLLHAGC NDMGGTLMNE SITRAAGATF GQEIDAREMR RIIEAAGRVP LQRTTLYAHA PQHRVEHPSI A
|
| |