Gene Information |
Locus tag | OSTLU_1406 |
Symbol | |
ID | 5001299 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 455367 |
End bp | 456920 |
Gene Length | 1554 bp |
Protein Length | 518 aa |
Translation table | |
GC content | 56% |
IMG OID | 640416720 |
Product | predicted protein |
Protein accession | XP_001417248 |
Protein GI | 145345504 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | |
|
|
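The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention); the variable names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table above.
start_bp = 455367
end_bp = 456920
reported_gene_length = 1554      # bp
reported_protein_length = 518    # aa

# Assumption: coordinates are 1-based and inclusive at both ends,
# so the span length is end - start + 1.
gene_length = end_bp - start_bp + 1

# Split the span into whole codons plus any leftover bases.
codons, leftover = divmod(gene_length, 3)

print(gene_length, codons, leftover)  # prints: 1554 518 0
```

Under that assumption the computed span matches the reported gene length, and the codon count equals the reported protein length with no bases left over. This suggests that every codon in the listed span is translated, that is, the span does not appear to include a terminal stop codon.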
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.000342481 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAGAACG CCAAGGTGCT CATGGTGGGC GCCGGGGGCA TCGGGTGTGA ACTTTTGAAA ACGCTCGTCT TGCACGGATT TCGAGACGTC ACGGCGATCG ATTTGGACAC CATCGACGTG TCGAATTTGA ACAGGCAGTT TTTGTTCAGG CGGCGGCACG TGGGGATGGC GAAGAGCGAG GTGGCGAGGG AGTCGGTGTT GAAGTTTAGA CCAGAAGCGA AGATTTCGGC GCTGCGAGCG AACGTGAAGG AGGCGAGATT CGATAAGGAG TATTTTAAGG GGTTCGACGT CGTGTTGAAT GGATTGGATA ACTTGGAGGC TCGACGACAC GTCAATCGGT TGTGTCTCGC GGCGGAAGTG CCGCTCGTGG AGAGTGGGAC GACTGGATAT AAAGGGCAAG TCACCGTGCA CGCGCGCAAA CAATGCGCGT GCTTTGAGTG CACGGAGAAG CCGACGCCGA AGAGTTATCC GATCTGTACG CTGAGGGATA CGCCGGACAA ACCGATTCAT TGCATCGTGT ACGCAAAAGA GCTTTTATTC AGCAAACTGT TCGGCGATGC GAGCGTGCAG AGCGATTTAG ATGAGGAGGA CGCGGTGGAA GCCGGGGCGT TTCGCCGAAA CGAAGGAGAG TCCGGCGTGG ATTTCGCCAA ACGCGTTTTT GCGTACGTTT TCGGGTCAAA GATCGAGGGT TTGCTGCTGA AGGATGACAT GTGGAAGACG AGATCCAGAC CGAAACCGCT GAAATCGGCG GACGTAGGCT TAGATTGCGA GTTTGTGGAG ACCGATTCAT CGGCGTCGAG TGCGCGACGA GCGCATGGTC TGATGGATCC CCACGTGGTT TGGTCCCCGA CCGAGTGCGC GAAGGTGTTT GTGAGCGCCA CAGCCCGACT TGTAGAGCGC GAGCGCCCGA TTGAGTTCGA CAAGGACGAC GACGACGCCG TGGAATTCGT TACGGCGGTG AGTAATTTGC GCTCGGTGAA TTACGGTATT CCTCCGCAAA GCGTATTCGA CGCCAAGGGT ATGGCTGGAA ACATCATCCA CGCCGTCGCG ACGACCAACG CGATCGTATC CGGTCTCATC GTCATAGAGG CGATCAAGAT TCTTCATAAA AGAATGGACC AAACCCGGTA CACGTTCGTC CTCGAGCACG CGAGCAACGG ACGTTTGCTG CAACCCATGT CGAAAGACGA CCCGAATCCA AAGTGTGCAG TTTGCGGGAA CGCACGCGTG GAATTAGTGT GCGACACCAC AAAGTTCACG AAGGGCGACC TAGTGAAGCG CGTGCTCAAG GGGAAGTTCA GCGTGAACGA GCCCACCGTG CAGTTTGGTG GGAACCTCCT TCACGAGACG GGCGAGGATT TAGACGAAGA CGAGGTGGAG CACTACGCTT CGCTCGACCC GCGCACCTTA GACAAGCTCC CGGGCGGTGG CGTCGTGAAC GGGACTATTT TACTCATCGA GGATTACTCG CAAGATTTCA AGTTTGAGCT CATGGTGACC CACCGCGAAG ATTGGGACGA TGAGAAGGAG CCGGACGGTT TCATCGTGCG CGGC
|
Protein sequence | VENAKVLMVG AGGIGCELLK TLVLHGFRDV TAIDLDTIDV SNLNRQFLFR RRHVGMAKSE VARESVLKFR PEAKISALRA NVKEARFDKE YFKGFDVVLN GLDNLEARRH VNRLCLAAEV PLVESGTTGY KGQVTVHARK QCACFECTEK PTPKSYPICT LRDTPDKPIH CIVYAKELLF SKLFGDASVQ SDLDEEDAVE AGAFRRNEGE SGVDFAKRVF AYVFGSKIEG LLLKDDMWKT RSRPKPLKSA DVGLDCEFVE TDSSASSARR AHGLMDPHVV WSPTECAKVF VSATARLVER ERPIEFDKDD DDAVEFVTAV SNLRSVNYGI PPQSVFDAKG MAGNIIHAVA TTNAIVSGLI VIEAIKILHK RMDQTRYTFV LEHASNGRLL QPMSKDDPNP KCAVCGNARV ELVCDTTKFT KGDLVKRVLK GKFSVNEPTV QFGGNLLHET GEDLDEDEVE HYASLDPRTL DKLPGGGVVN GTILLIEDYS QDFKFELMVT HREDWDDEKE PDGFIVRG
|
| |
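The sequence fields can be checked against the GC content and protein sequence reported in this record. The sketch below is illustrative only: the record leaves the translation table field empty, so the standard genetic code (NCBI table 1) is assumed here, and the helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1).
# Codons are enumerated in TCAG order, matching the amino-acid string below.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate whole codons from position 0; trailing partial codons are ignored."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks of the gene sequence listed above.
prefix = "GTGGAGAACG CCAAGGTGCT CATGGTGGGC"
print(translate(prefix))  # prints: VENAKVLMVG
```

The translated prefix matches the first block of the listed protein sequence, which begins with V because the first codon of the listed gene sequence is GTG. Passing the full gene sequence to `gc_percent` and `translate` gives values that can be compared with the reported GC content and protein sequence.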