Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31489 |
Symbol | |
ID | 5002064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | + |
Start bp | 42020 |
End bp | 43759 |
Gene Length | 1740 bp |
Protein Length | 562 aa |
Translation table | |
GC content | 58% |
IMG OID | 640417485 |
Product | predicted protein |
Protein accession | XP_001417651 |
Protein GI | 145346348 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0415] Deoxyribodipyrimidine photolyase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0512006 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCGC CGAGCGTGAG CAACGGCACG ACGCGTTCGC TGATTTGGTT TCGCAAAGCG CTGCGCGTCC ACGATAATCC CGCTCTCGCC GCGGGCATCG CGCGCGCGAA GAGCGCGCAA CCGGTGTTCG TGCTCGATCC GTGGTTCTGT AAGCCCTCGC GGGTGGGGGC GAATCGCATG CGGTTCTTGT TGCAATCGCT GCGCGACTTG GACGCGACGC TGCGAGAACG CGGCAGTTCG CTGCTGGTGC TGCACGGCGA ACCGCGAGTG GTGTTGCCGC GAGCGTGCAA GACGTGGAAG GTTGATTTGG TGACGTGGGA ACACGATATC GAACCGTATG CGAAAATGAG AGACACCGCG GTGCGAGGCG CGCTGGAGCG CGCGGGAGTA GAGTGCGCGT CGTCGAGCGG ACACACGCTG TACGACGTGG AGGAGATGTT GGCGAAGTGT CATGGGAAAC CACCGACGAC GTATTCGCAG TTTTTGAAGA TTGTAGATAA AATGGGTGCA CCTGCGGCCG CTTTGGATGC GCCGAAGGCG ATGCCGGCGC CGTTCACGGG GACAGCAGAG GAGACGAAGG AATTAGTCGC GGGAGTGGCG GATGCGTACG GGATCCCCAC TTTGGAGGAG CTCGGTTACG AAGCCATGCA TGACGACGAA GGTTTTCAAG CGATCGGAGG TGAAACCGAA GGCTTGAGAC GTCTACGCAG ACAACTGTCG CGCACGGAAT GGGTGCACAC GTTTCAAAAG CCAGATACAA ATCCCACGAC GCTTTTTCAT GCCTTGGGCG CGAAAAAGCC AAAGCCAAAG AGTCCGTTCG AAATCGCAGC GCGAGACGCG GGAAGCAAAA ATGACGCAAC AAACACTGCA GCGAATAGTG ATATGCTGAT GACACCGTCC ACGACAGCGC TGAGTCCGTA CATGAAATTC GGTTGCGTGT CGCCGCGAGT GTTTTATCAC GAGCTCAACG CCGTGCTCGC AAAGTTTGAG GGCAAAGGTC AACCTTCGCA ACCGCCGGTG AGTCTCATGG GACAACTGAT GTGGCGCGAA TTCTACTATC TCGTCGGCGC GGGGACGCCG AATTTCGATA AGATGGAAGG AAATCCAATC TGCAGGCAAA TTCCTTGGAA CAAGGATCGT GAGCTCTTCG CGGCTTGGGA GAACGCGCAA ACCGGGTTCC CTTGGATCGA CGCCGCGATG ACCCAGCTTC GCCTCGAGGG ATGGATCCAT CACTTGGCTC GACACGCCGT GGCGTGCTTC CTGACCCGTG GTGATCTATT CGTGCACTGG GAGTGGGGCA GAGACACGTT CGATCGCGAC CTCGTCGACG CCGATTGGGC GCTGAACAAC GGCAACTGGA TGTGGCTCTC GTGTTCGTGT TTCTTTTACC AATATTTCCG AGTGTACGGT CCGCACTCGT TCGCGAAAAA GTACGACAAG GACGGCGCGT ACGTCAAGCA CTACCTCCCC GTGCTGAAGA ATATGCCCGC CAAGTACGTG TACGAACCTT GGCTCGCCCC TCTCGACGTG CAGAAGAAAG CCGGTTGCGT CGTCGGCGTC GATTATCCCG CTCCGATCGT CGACCACGCC ACGGCGAGCA AGGCGTGCAT AGATAAGATC GCCACCGCTT ACGCCGCTCA CAAGGACGCC ACCGCGGTGG CTGGGAAGAA GCGTAAAGCC GGCGAATAAA CTCGTTGATT CGTTAATTTA AAAAGCCTCT CGCGCGCGGC GTCCACCACA
|
Protein sequence | MTAPSVSNGT TRSLIWFRKA LRVHDNPALA AGIARAKSAQ PVFVLDPWFC KPSRVGANRM RFLLQSLRDL DATLRERGSS LLVLHGEPRV VLPRACKTWK VDLVTWEHDI EPYAKMRDTA VRGALERAGV ECASSSGHTL YDVEEMLAKC HGKPPTTYSQ FLKIVDKMGA PAAALDAPKA MPAPFTGTAE ETKELVAGVA DAYGIPTLEE LGYEAMHDDE GFQAIGGETE GLRRLRRQLS RTEWVHTFQK PDTNPTTLFH ALGAKKPKPK SPFEIAARDA GSKNDATNTA ANSDMLMTPS TTALSPYMKF GCVSPRVFYH ELNAVLAKFE GKGQPSQPPV SLMGQLMWRE FYYLVGAGTP NFDKMEGNPI CRQIPWNKDR ELFAAWENAQ TGFPWIDAAM TQLRLEGWIH HLARHAVACF LTRGDLFVHW EWGRDTFDRD LVDADWALNN GNWMWLSCSC FFYQYFRVYG PHSFAKKYDK DGAYVKHYLP VLKNMPAKYV YEPWLAPLDV QKKAGCVVGV DYPAPIVDHA TASKACIDKI ATAYAAHKDA TAVAGKKRKA GE
|
| |