Gene Information |
Locus tag | OSTLU_30829 |
Symbol | |
ID | 5000752 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 813378 |
End bp | 814825 |
Gene Length | 1448 bp |
Protein Length | 433 aa |
Translation table | |
GC content | 58% |
IMG OID | 640416173 |
Product | predicted protein |
Protein accession | XP_001416771 |
Protein GI | 145344503 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.461496 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCCTCGACGT TCCAAAATTC CTCTCGACGC CCCCGCCGTC GAGCCTCCCG CCGAGGCGAT GTCTCGCCTC GACGTCGATT TCTGCGTCGA CGAACGCGCG CGCTGGCGCT CCATCGACGC GCTTTTACTT CGCCGTGGGA GATTCACCGG TCCAGATTTT GAGCCGGGAG AAGACGTCGC GCGCGTGCTC CGCGAGCACG TCCGCGTGCT CGTCGTCGGC GCCGGAGGAC TCGGGTGTGA ACTGTTGAAA GGGCTCGGTG CGTGACGACG GCGCGAGAGA CGCGCGGACG TTGGTGGTGA TGAAGGACGC GCGACTGACG AGACGAACGC GACGACGCCG CGCAGCGCTG AGCGGATTCA CGACTCTGGA CGTGATCGAT ATGGACACGA TCGACGTGAC GAATTTAAAT AGACAGTTTT TGTTTCGCGC GGAGGACGTG GGGAAGAGTA AGGCGGAGAC GGCGGCGAGA CGAGTGCGGG AACGCGTGCG CGGGTGCGCG GTGAACGCGC ATCACGGACG AATAGAAGAG AAAGAGGATG GGTGGTATAA ACAGTTTGAT ATCATCGCTC TGGGATTGGA TTCTCTGGAA GCGCGGGCAT ACATCAACGC GGTGTGCTGT GGGTTCTTGG ATTACGACGA GGATGGGAAC GTGGATCCGG CGACGATTAA ACCGCTCGTG GACGGCGGTA CGGAGGGATT CAAGGGACAC GCGCGCGTCA TCGTTCCGGG AATGACACCG TGTTTCAATT GCACAATGTG GCTGTTCCCT CCGCAAACGA CGTTTCCGTT GTGCACGCTG GCAGAGACGC CGAGGAACGC AGCGCACTGC ATTGAATACG CAAAATTAAT TCAGTGGCCG GCGGAGCGAT ACGGGGAGAC GTTTGACGCG GATGTCGTCG AGCACATGAC GTGGGTGTAC ACGAAAGCGC TCAAACGCGC CGAGACATTT GGTATTCCAG GCGTAACGTA CGCTCACACG CAAGGTGTGA CGAAGAACAT CATTCCGGCG ATTCCTAGCA CGAACGCAAT CATAGCCGCG GCGTGCGTCA TCGAAACGTT GAAAATGGCG ACGATGTGCG CCAAGGGAAT GAACAATTAC ATGATGTACG TGGGCACGGA TGGTGTGTAC TCGCACACTG TGGAGTACGA ACGCGATCCA TCGTGCGTGG TGTGCTCACC CGGGATCGCT CACGCGTTGA ACGCGAACGC GACACTCGAA GAATTCATGG CTTCCATCGT CGCCGCGTAT CCAGATTCTC TCGCCGAACC GAGCGTGAGT TTCGGCGGGA AAAATCTGTA CTTGCGCGGC GTGCTCGAGT CCGAATTCGC GGAAAACTTG AATAAACCTA TGATTGAGCT CATGAATGGG CGCAAAGAAG GCTTAGTCGT GGTGAATGAC AAGAAAATGA AGAAGACGTC GATGCGGTTG CGACTGTCGT TAAAATGA
|
Protein sequence | MSRLDVDFCV DERARWRSID ALLLRRGRFT GPDFEPGEDV ARVLREHVRV LVVGAGGLGC ELLKGLALSG FTTLDVIDMD TIDVTNLNRQ FLFRAEDVGK SKAETAARRV RERVRGCAVN AHHGRIEEKE DGWYKQFDII ALGLDSLEAR AYINAVCCGF LDYDEDGNVD PATIKPLVDG GTEGFKGHAR VIVPGMTPCF NCTMWLFPPQ TTFPLCTLAE TPRNAAHCIE YAKLIQWPAE RYGETFDADV VEHMTWVYTK ALKRAETFGI PGVTYAHTQG VTKNIIPAIP STNAIIAAAC VIETLKMATM CAKGMNNYMM YVGTDGVYSH TVEYERDPSC VVCSPGIAHA LNANATLEEF MASIVAAYPD SLAEPSVSFG GKNLYLRGVL ESEFAENLNK PMIELMNGRK EGLVVVNDKK MKKTSMRLRL SLK
|
| |
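The Gene Length and GC content fields above can be cross-checked against the coordinates and the sequence. The following is a minimal sketch, not the database's own method: it assumes the Start bp and End bp values are 1-based and inclusive (which is consistent with 814825 - 813378 + 1 = 1448), and it assumes GC content is the fraction of G and C bases over all bases. The function names `gene_length` and `gc_percent` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases (assumed definition of GC content).

    Whitespace is removed first, so a sequence printed in
    space-separated blocks of ten bases can be pasted in directly.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the Gene Information table.
    print(gene_length(813378, 814825))
    # First block of the gene sequence, as a small demonstration only.
    print(gc_percent("CCCTCGACGT"))
```

Passing the full Gene sequence string to `gc_percent` should give a value that can be compared with the reported 58%, bearing in mind that the record may round or may compute the figure differently.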