Gene Information |
Locus tag | OSTLU_119486 |
Symbol | SmkH |
ID | 5000128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | - |
Start bp | 423777 |
End bp | 425633 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | |
GC content | 43% |
IMG OID | 640415549 |
Product | Protein Lysyl hydroxylase fusion protein, putative |
Protein accession | XP_001416384 |
Protein GI | 145343554 |
COG category | |
COG ID | |
TIGRFAM ID | |
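
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS length counts the terminal stop codon. The function names `gene_length_bp` and `expected_protein_length_aa` are hypothetical helpers introduced here.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record for OSTLU_119486.
length = gene_length_bp(423777, 425633)
print(length)                              # 1857
print(expected_protein_length_aa(length))  # 618
```

Both results match the Gene Length (1857 bp) and Protein Length (618 aa) rows.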
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0222202 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAAG GTGACTTTCA CAAAGTTGAC TCGTCTACGA ATTTTCTCCT TCTACAAAAA AACAGTTCGC GGCAAACTTG CGGAAACTGT TTGCGTCTTT TCGGTAACTG CGACGATGAT AAGGCAAGTT TCATTAGGTG TGACTGTGGA GATATCTTCT GTAGACGAGA TTGTGCACCG CATGCGATGC GGGAATTAGG CTGGCATTCA ATTTTATGCT CAAATAGAAG TGAACCAATG TTGGAATTCT ACGAGTATTG CCGAAGGTCG AGCGAAAGTT TCACTTTTGC TGCTAAGGTG CTTGCTCAAC TTTTATCAGC AATGTCAGAG TGTGAAACGA GCGTATCAAA AATACAATTG AGTTGGTCGG AGCAATTCGA TCCGCGTCAT AAGTGGTCCG AACTTGCTGC ACGCGCCAAC TGGGAGTGTA CGGACTTCTA TGAGTTTGTG CGACGAGCGT GTATTTTAGA ACATCAAATA GCAGATGCAT GGGACTTATT GAAAGCACAT TTGAATAATA GATTTGATCT AGAGACAAAA ACACGATTTG CCGAGCTTTT AGACTTTGAG ATGTACGCAG ATATAGTGGC TTCCATTCAG CGGTTTGCTG TAGCTATTGG TACAGAGAAG ACAACAATTC CACCGACACG CCAATCTCCT TCCCAAAATG GTGATACTCA TCCTGCATTA GCTCATTACG GCTCAAAGTT TGATATTCAT CCGAAAGTTT CATACCATTG TGCAATTATG AATAAATTGC CAGATTACTG CGGTTTTTTG TTGGCACCAA TTCTGACCAA ATTCCAACAC AGCTGCAATC CAAGTGTGCA AGTTGAAGCT AATTCAACCC TTTCACAGGT GAAATATGGA TATATAGCGC TCAGAGAAAT TGATCAGATG GAGCCGATCT CTTTGACAAC TATACCTCTT CATTCTAGTG TAAGTGCCAG AGATGCCTCA CTGTTGAGGC AGCTTGGTAA AGTTTGCGTT TGCTCGCGTT GTTCGTGGGA ACGAGAAGAT TTTGGTAGAG TTTCGCCCGA ACAAATGAAG CAATTGGCTC TTCAAGCGCA GGAAGAGGGG CGATACTCAG AGGCAGAGCT CTTATTTCAC TGCACACTTC TTCACAATCC ACACGACACA GATGTGCTTC ATGCGTATGG TGTGACGCTC CTTAGTCAAG GCAAATGGAA ACTTGCCCAC AGTATCTGGC AACTCGCTTT TAAGGTTGAC AAAGCTCATC TATGGCTATC GAAACAATTC AACAAAGATG TTTCATACGC AGCGTATTTA TGTAGAAAAT GTCCCGAATA CTATGTGTCT TTTCAGACTA TTTCAATTGA TGAAATTTAT TTGACAACAA ATAGTGCAAT ATCACCATCA GCTTGTTCTA GCTGGATTAA AACCGCCGAA GCTCATGCAA CAAATAGAGG AGGCTGGGAC ACAGATCGTC ACAAATCTGT AGCCACAACT GATTTGCCTA TACACGAAAT TCCCTCTGTC TTGCGAGAGT GGAATTTGAT CTTCGGGCAA ATCATAGGTC CCTTCATTCA AGAACGTTTC AGAGTCGATG GTGACACGAA TCTTAGGGTT CACGATGCAT TCATAGTAAA ATACGATGCC AGTGATGGAC AGTGTCAGCT ACCAGTACAC ACCGATCAAG GTCACTTCTC CATAACTCTT TCTCTAAATG ATCCTATACA ATACAAAGGT GGCGGTACGA TTTTCCCGGA GCATGAGTTC ATTGTTCGAC CAAAGTGTGG AGATTTCGTC GCTTTCAGAA GTTATCTGAC GCACGGTGGC 
GTGCCAATCA CATCTGGAGT CAGGTACATA GTCGTGGCTT TTCTCTACTT GAGTTGA
|
Protein sequence | MSKGDFHKVD SSTNFLLLQK NSSRQTCGNC LRLFGNCDDD KASFIRCDCG DIFCRRDCAP HAMRELGWHS ILCSNRSEPM LEFYEYCRRS SESFTFAAKV LAQLLSAMSE CETSVSKIQL SWSEQFDPRH KWSELAARAN WECTDFYEFV RRACILEHQI ADAWDLLKAH LNNRFDLETK TRFAELLDFE MYADIVASIQ RFAVAIGTEK TTIPPTRQSP SQNGDTHPAL AHYGSKFDIH PKVSYHCAIM NKLPDYCGFL LAPILTKFQH SCNPSVQVEA NSTLSQVKYG YIALREIDQM EPISLTTIPL HSSVSARDAS LLRQLGKVCV CSRCSWERED FGRVSPEQMK QLALQAQEEG RYSEAELLFH CTLLHNPHDT DVLHAYGVTL LSQGKWKLAH SIWQLAFKVD KAHLWLSKQF NKDVSYAAYL CRKCPEYYVS FQTISIDEIY LTTNSAISPS ACSSWIKTAE AHATNRGGWD TDRHKSVATT DLPIHEIPSV LREWNLIFGQ IIGPFIQERF RVDGDTNLRV HDAFIVKYDA SDGQCQLPVH TDQGHFSITL SLNDPIQYKG GGTIFPEHEF IVRPKCGDFV AFRSYLTHGG VPITSGVRYI VVAFLYLS
|
| |
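
The gene sequence above is listed in coding orientation (it begins with ATG and ends with TGA) even though the gene lies on the minus strand, so it can be translated directly. The sketch below shows one way to check the protein sequence against it. Because the Translation table field is empty in this record, the standard genetic code (NCBI table 1) is an assumption here, and `clean` and `translate` are hypothetical helper names.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# T, C, A, G at each position. Using this table is an assumption, since the
# record leaves the translation table field blank.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence from the record.
print(translate("ATGAGCAAAG GTGACTTTCA CAAAGTTGAC"))  # MSKGDFHKVD
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after applying `clean` to remove its block spacing) extends the same check to the whole record.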