Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_43309 |
Symbol | |
ID | 5005382 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | + |
Start bp | 58496 |
End bp | 59974 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | |
GC content | 59% |
IMG OID | 640420803 |
Product | predicted protein |
Protein accession | XP_001421195 |
Protein GI | 145353812 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.231154 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00124797 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGCGCG CGGGCGCGGC GACGCGGCGA CGCGCGCTGC AGACGACGGA GGACACGCCC GTGAGCGCGG AGGCGAGGGC GGAGGCGTAC GATTGGCGCG CGCACTGGTA CCCGGCGGCG TACGTCGCCG ACGTGGAGAA GGACGCGCCG CTGACGTTTA CGCTCCTGGG CGAACCGCTG GTGTTCTGGC GGGACAAGAG CGGCGAGATG CGCTGCGTCG CGGATAGGTG CCCGCACAGG TTGGTACCGC TGAGCGAAGG ACGCGTCAAC GAGACGGGTG AGCTCGAGTG CGGGTATCAC GGGTGGACGT TTACGGGAGA GGGTAAGTGC ACGTCCATTC CGCAAATCGA GCAAGGGACG GGTTTGGAGA CGGCTTTGAA GTCTCCGAGA TCGTGCGTCG CGGCGTACCC GACGAAAGAG GCGCAAGGCA TGCTTTGGGT GTATCCGACC TCGATGGATA AAGCGCCGGC GACGCTGCCA GATTTGCCGC TAATCCCAGA GTACGACGAC CCGGAGTGCG TGTGTCAAGA CATCTTTCGG GATCTTCCCA TGGATTGGGC GACTTTGCTT GAAAACGTCA TGGATGTTAG TCATGTGCCG TTTACGCATC ACAACAGCGT TGGTAAGCGA GAAAATGCGA CGCCAGTGAA TTTGGAATTG GCGAGCGCCG CGGGCGTCAC GGCGAACGGA TTCGAAGGGA TATGGAAGGA AGGTCCGAGG AAAGGTAAAT ATGGGTCTCA ATACACCGAG TTCAAAGCAC CGACGTTAAT GCGCCACACG CTCAAGACGG AGGCTTTCAC GACGCTCACC GTCGTGTACG CGGTGCCGAC GACACCCGGC CGATGCCGAC TTATGGCGCG ATTTCCATTC ATCTTCAAAT CGGCGTTGCC GCGATTCTTT TTCGGTCTTT ACCCGCAATG GTTCTCGCAC ACGAATCAAA ATGCAATTTT AGAGGATGAC CAAATCTTCT TGCACAAGCA AGAGCGATTG ATCGAGGTTG AGCAAAAAGA AGGCAAGTCA TACGCGCAGT CGTGCTACAT GCCCACCAAG GCAGACGTCT ACGTTTCGGC GTTCCGCAAG TGGATTGTCG ACGTCGCCGG CGGCGGTCCA GCGTGGCCGA AGGATATGCC CACTGATTTG CCACCGCAAG AAACCACGCG CGAGGCTTTG CTCGATCGCT ACCATTCGCA CACGATAAAC TGCAAGTCGT GCGCCAGCGC TTTGGCGAAA ATCGGCAAGG CGCGAAAGGC GCTTCGCGTG CTCACCTTTG TCGCTCTCGC CGCTGCCGTG GCGACTTTTG CGCGAGCAGT ACCGTTGAAG TACACGATCG CCTTGTCGGT ACTATCCGCC GCGTGCGCTT TGGTTCGCGA GAAACTCGGC GCGTTTGCCG CGAAGATGAA AATCGGGCCG TATCCGCCAC CGCGCAGACC GCCATCCATG ATGGAGGCAG CGTTGCAACA AGCGCGAATC GCGTTTTAA
|
Protein sequence | MTRAGAATRR RALQTTEDTP VSAEARAEAY DWRAHWYPAA YVADVEKDAP LTFTLLGEPL VFWRDKSGEM RCVADRCPHR LVPLSEGRVN ETGELECGYH GWTFTGEGKC TSIPQIEQGT GLETALKSPR SCVAAYPTKE AQGMLWVYPT SMDKAPATLP DLPLIPEYDD PECVCQDIFR DLPMDWATLL ENVMDVSHVP FTHHNSVGKR ENATPVNLEL ASAAGVTANG FEGIWKEGPR KGKYGSQYTE FKAPTLMRHT LKTEAFTTLT VVYAVPTTPG RCRLMARFPF IFKSALPRFF FGLYPQWFSH TNQNAILEDD QIFLHKQERL IEVEQKEGKS YAQSCYMPTK ADVYVSAFRK WIVDVAGGGP AWPKDMPTDL PPQETTREAL LDRYHSHTIN CKSCASALAK IGKARKALRV LTFVALAAAV ATFARAVPLK YTIALSVLSA ACALVREKLG AFAAKMKIGP YPPPRRPPSM MEAALQQARI AF
|
| |