Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_40637 |
Symbol | |
ID | 5005782 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009369 |
Strand | - |
Start bp | 456404 |
End bp | 458029 |
Gene Length | 1626 bp |
Protein Length | 510 aa |
Translation table | |
GC content | 63% |
IMG OID | 640421203 |
Product | predicted protein |
Protein accession | XP_001421803 |
Protein GI | 145355088 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAAGA GGACGACTCG GTCCGCGACG GTTCGACGAC ACCGAAGCGC GGTGCGACTC GAAGCGACGT CCGAGGCGGC GACTGCGGAG ACGAACGCGC CGCCGGCGGC GGCGGGGGAG TCGTTCGATT GGTCCAGCGC GTGGTATCCC CTTCGTCCGG TGAGCTTTTT GGATGCCAAC GAACCGAACG AGCTGCGCGT GCTCGGTAAA AAGCTCGTCG CCTACCTCGA CCCGACGTGC AAGGAGTGGC GAGTGTTGGA GGATAGCTGT CCGCACAGAC GTGCGCCTTT GAGCCTGGGA TACGTTCAAA AAGACGGCAC GCTGGCGTGC CGGTACCACG GCTGGGCGTT TGACGGCAAG GACGGCTCGT GCGTGTCCAT CCCGATGTCC GTCGATGAAG CCGCGGAAAA GACCGCGTGC GCATCGCCGC GATCTTGCGC TACGAGCTAT CCGAGCCGCG TCGAAGACGG AATTCTCTGG GTGTGGCCGA CGGCGGGCGC GGATGGCCTT CTCGCCAGCG CGGGCGCGCC GTGCGCGACG TCTCTGGCGC GAGAAGGCAC GCTTCCCGGG GAGTGGGGCA TGGTGGAGCT TCCCGTGGGT TACGCCCCGG CGCTCGAGAA TCAGTTTGAC CCGTCTCACG CGGAGTGGTT GCACGCGAGG TACGACGCCG AAGGACAGCT CGACGAGCGC GCGAACGCGG GTTTCGTGGC CATGACTGAG TTCAGCGTTC GCGAGGGGAC GATGCAAAAG GATGGATTCG TGGTCGAGCA CGGTGGATAC AACAAGTCGA ACGTCGGCGT ATCGGCATCG CGCGTGTTCA CCGCGCCGTG CTCGAGTCGA AGCGAATACT TGGACGCCAA GGGTAAAAAG TACCTCTCGG CGGCGATTCT CTACGCGCCG ACGGAGCCCG GACGAACGCT CATGTTTACA AAGTTCCAAG CCCACCAGGC GAGCGCGGTG CAGGGCGCGG GCGCGCGTAA GGTTTCACCC GCCGATCGCA TCAACTCCCT TGTCACCGCT CCCGCGACGT CGCTGTTTGA CTTTTACGTC GACAACTTTA CGAGCGACCC AAAGCTCGTG CGCGTAGGGC TGTCGCACGG CACGCCGCCG GGATCGAGCG CGTACAACTT GGGGGACCAG GATATCTTAG CCATGCACGG AGTTGAGGTC GAGATGGAGC TTCAAAACAA ACCGTGGAAA CAATCGTATT ATTTGCCGAC GCCCGCCGAC GCTGGAGTGT CGGCGTTTAG AAATTGGATG GACAAGCACG CCGGAGGCAA AGTCGCGTGG GCGCCGGGCG TCGTCGACGA CGCGTCGAAG GTGAAATCCG AAGCCGAACA GCTGGATCGG TACCATCGTC ACACCAAGCA CTGCGTGGCG TGTAAGACGG CGCTCAGCGA ACTCGGCGTG CTCGAAGAGC GATGCGTCGC CGCGAGCAAG TACTTGCTCG CCGGCGGGTT GTTCCTCGCC GTCACGGGTG CAGCGTTCGA TCAAGAAGCG CCAGCCATCA TCGCCACCTG TCTCGCCGGC GCCTCTCTCG TCGGCGCCGA AAAGGTTCGC GACATGCAGC ACGAATTCCT GTCGAGCGTG CCTCGAAGAG GCGTGCCGAA ACCGAAACTT TGGTGA
|
Protein sequence | MDKRTTRSAT VRRHRSAVRL EATSEAATAE TNAPPAAAGE SFDWSSAWYP LRPVSFLDAN EPNELRVLGK KLVAYLDPTC KEWRVLEDSC PHRRAPLSLG YVQKDGTLAC RYHGWAFDGK DGSCVSIPMS VDEAAEKTAC ASPRSCATSY PSRVEDGILW VWPTAGADGL LASAGAPCAT SLAREGTLPG EWGMVELPVG YAPALENQFD PSHAEWLHAR YDAEGQLDER ANAGFVAMTE FSVREGTMQK DGFVVEHGGY NKSNVGVSAS RVFTAPCSSR SEYLDAKGKK YLSAAILYAP TEPGRTLMFT KFQAHQASAV QGAGARKLVR VGLSHGTPPG SSAYNLGDQD ILAMHGVEVE MELQNKPWKQ SYYLPTPADA GVSAFRNWMD KHAGGKVAWA PGVVDDASKV KSEAEQLDRY HRHTKHCVAC KTALSELGVL EERCVAASKY LLAGGLFLAV TGAAFDQEAP AIIATCLAGA SLVGAEKVRD MQHEFLSSVP RRGVPKPKLW
|
| |