Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_24513 |
Symbol | |
ID | 5002044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 148338 |
End bp | 150816 |
Gene Length | 2479 bp |
Protein Length | 820 aa |
Translation table | |
GC content | 56% |
IMG OID | 640417465 |
Product | predicted protein |
Protein accession | XP_001417930 |
Protein GI | 145346921 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0270] Site-specific DNA methylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.558259 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGAGTC TTCGCGTGGC GTCCCTATTC TCCGGAGTCG GAGGCCTCGA TCTCGGTCTC CAGCAAGCCG GCCACCGCAT AGAGCTCATG GTAGAGCGCG ACGCCCACTG CAAGCAAGTC CTCTCGGCGC GCTTTCCGGG CGTCGCGCTG CTCAATGACG TGGCGGAGGT GCTCCCGTTC ATGCTCGAAA ACATCGACTG CGTCGTCGCT GGGTTTCCGT GCAACGACTG CAGCTGCGAG AATCTCAAGC GACCCGGACT CGAGCTCGGC GGCGCCACGC GTTCCGTCTC GCACGTGTTT CGCTTGCTCG AGGCGAGACG GGTGCCGTGG CTGTTGCTCG AGAACGTCGT CGGGTTGTTG AAGTGGCACA GCGACGGCGA ACAGAGACCG GCGATCGATT ACGTAGTCAA TGAATTGGAA AATCTCGGAT ACAGATGGGC GTATCGCGTC GTCGATCTTT TGTCCTTTGG GACGCCTCAT AAGCGACGAC GCGTTTTCGT CGTCGCATCT TTGCACGGTG ATCCCCGAGA CGTGTTATTG TCGCAGAGCG CGATGTGTTC GGGAGAGTGC GTCCAGCTGG GAATGAACAA CGAGTGTTAC GAGTGCTTCA TCACTCCACC GCGAGTACCG ACAAAGATGT TTTCGGCGTC GATAGATCTC GGGGAGAAGC GTCGAGCGCC GTGTTGCGAT ATCATGCACT GCTTCACGAC GAGCAACGGA CGTCGTACAT GTGTCGCGAC CCAAATCGGA AAGCAAAAGG CTGAGTTGTC AATGTTGGCG ATAGAAGACG CCGAACGATT GATGGGATTT CCTCCGGGTT ATACGGAACC CTGCTATCCG CTCATGCGTC CCAACGAACG AGCGCCGGTG TTCGACACGG ATTTACAAAC GATGAAAAGA TTTAGTCTCT TAGGTCTGGC GTGCAGCGTC CCACAAAGCC GGTGGCTTGG CGAGCAGTTG AAATGCCCGT ACAACGTGAA ATTTACCTAC GATGCGCTGG CGACGCCATT CGAAAAGCCG TGTCCGGGAC CAGCGACGCG CGATCGATCT TCAAAGGCAT GGCCGCTCGC CGCTTACAAC ATGCTCAACG TGAACGGCGA TCCGAAATGG ACAGGTCGAC AACGCGCGCC GAACGAAGTT TCAGAGTTTC CACTCATTCG CGGTTTCACA CCGCTCGGCG ATTTTCTTGA ATTCACAAAG AACAAACCCG TGCGGTACGA ACTTCGAGAA GGCTACTTGC GGAGACTCGA GCTCGCGCAC GAGAACATTG ACTCGACGAT TCTCGCGGCG TTAGACGTCA AGCGTGACTC GACACAAGTG ACCCTGCCGT CGCCATCAAA GAAGTCTAAG ATTGACATTT TAGAAGACAT TGACGCCGAA GACGACGAAC AAGCTGCCAA TGAGTACGAC GACGATTCGG ATGAAGACGA GGGGGAGAGC GAAGAAACAG AGGCTGCAAA GGAAAAGAGA CATCGAGACG AAAACAACGA AAACGACTTA GACGAACATG GCATGACGAC ACACGGCGAG TGCTCGTGGG TGAAATGGAA AGGAGCTGTG CGAGGAAAGA CCATGTACTG GCCATGCGTC GCGCTCCATC CGCTTCGGGA TCACGCCGTT ATCCCTGAGG GCGCACGTGT TGCAGCTTTC AGCCCCAAGT TCACCGAGGA TCATCGTTTA GTGATCTTTT TCGACGATCG CAGATCCTTC GCATGGGTTA AAGCGTCGGA AGTTTATCCG TTCGACAAAT TTTACGGCGA AGCGATGAAA CAACCTGTGT TTTCTTCAAA GGCGAAATTT ACGCAGTCAG TCGAGTCGGC ACGGGCGTGG TGCAATGCGA GGAATTTAGA GGCTCCCGTC AACCCAATCG TGCAGCGTAA CAACGCACAT CTCTTCACCG ATCCATCGCC ATGTTACGAA TGTGACGTGT GCGTCACCGA AGCGCACAAG GCTATGGCCG ATAGCAAACC GCAGACGAGA TCGCGGCGTT CGTCCGGCAC GGAGTCGGAG CTCGCGGCTG GTCGTTCTAA GATGAAATCA AAGTGCGCAC AGATGAAGAT CATAGAACTC GCGAGACACG GTCAAATCGG CGCCACACTA GCGCTTCGCA AGGACAAAGC CGTAGGCCAG AGGATAGTGG TGCTGTGGCA GCGGGACAAC GCGTTTTACT CTGGCACGAT TACCGCCTTC GATCCGCACA CGTATTCGTT TCGCGTCGAT TACGATGACG GCGACGTCGA CTTGAACTTC AAGCCGTGGA CTGAATCCGT CATGGTAGCG CAATACGTCC CGTCAAACGT CGACTCGGAT ATCGCTTTGG CAAAGAAGGC CAACGCCGCG TCCGCGCTCA AGGCGAAAAT CATCGTTCAC ACCGCCGCCG ACGCGGCGCT CGATGAAAAT CCTCTCTGCA CGAAAACCAT GCGACGCGAT GACGCCGGCG TCGCCATCCA GCTCAAACTC TAGTTTCCTT CTCGACGAT
|
Protein sequence | MVSLRVASLF SGVGGLDLGL QQAGHRIELM VERDAHCKQV LSARFPGVAL LNDVAEVLPF MLENIDCVVA GFPCNDCSCE NLKRPGLELG GATRSVSHVF RLLEARRVPW LLLENVVGLL KWHSDGEQRP AIDYVVNELE NLGYRWAYRV VDLLSFGTPH KRRRVFVVAS LHGDPRDVLL SQSAMCSGEC VQLGMNNECY ECFITPPRVP TKMFSASIDL GEKRRAPCCD IMHCFTTSNG RRTCVATQIG KQKAELSMLA IEDAERLMGF PPGYTEPCYP LMRPNERAPV FDTDLQTMKR FSLLGLACSV PQSRWLGEQL KCPYNVKFTY DALATPFEKP CPGPATRDRS SKAWPLAAYN MLNVNGDPKW TGRQRAPNEV SEFPLIRGFT PLGDFLEFTK NKPVRYELRE GYLRRLELAH ENIDSTILAA LDVKRDSTQV TLPSPSKKSK IDILEDIDAE DDEQAANEYD DDSDEDEGES EETEAAKEKR HRDENNENDL DEHGMTTHGE CSWVKWKGAV RGKTMYWPCV ALHPLRDHAV IPEGARVAAF SPKFTEDHRL VIFFDDRRSF AWVKASEVYP FDKFYGEAMK QPVFSSKAKF TQSVESARAW CNARNLEAPV NPIVQRNNAH LFTDPSPCYE CDVCVTEAHK AMADSKPQTR SRRSSGTESE LAAGRSKMKS KCAQMKIIEL ARHGQIGATL ALRKDKAVGQ RIVVLWQRDN AFYSGTITAF DPHTYSFRVD YDDGDVDLNF KPWTESVMVA QYVPSNVDSD IALAKKANAA SALKAKIIVH TAADAALDEN PLCTKTMRRD DAGVAIQLKL
|
| |