Gene Information |
Locus tag | OSTLU_51395 |
Symbol | |
ID | 5005304 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | - |
Start bp | 318082 |
End bp | 320007 |
Gene Length | 1926 bp |
Protein Length | 565 aa |
Translation table | |
GC content | 66% |
IMG OID | 640420725 |
Product | predicted protein |
Protein accession | XP_001421440 |
Protein GI | 145354330 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0415] Deoxyribodipyrimidine photolyase |
TIGRFAM ID | |
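The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the coordinates are 1-based and inclusive (which matches the listed gene length) and that the reported span runs from the start codon through the stop codon; neither assumption is stated in the record. The function names are hypothetical.

```python
# Values copied from the gene record above.
START_BP = 318082
END_BP = 320007
GENE_LENGTH_BP = 1926
PROTEIN_LENGTH_AA = 565


def span_length(start: int, end: int) -> int:
    """Length of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


gene_span = span_length(START_BP, END_BP)
cds_bp = coding_length(PROTEIN_LENGTH_AA)
# Bases inside the span that do not code for protein (assumption: start-to-stop span).
non_coding_bp = gene_span - cds_bp

print(gene_span)      # 1926
print(cds_bp)         # 1698
print(non_coding_bp)  # 228
```

Under these assumptions the span is 228 bp longer than the coding sequence alone, which is at least consistent with the gene being flagged as spliced.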
| |
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00141783 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGAGCGG CGTGCGGGGG AGTGGGAGCG ACGCGCGCGT TCGCGTCGAG GCGGTACGAA CCGACGGCGG ACGCGGACGA CGAAGAGACG GCGGCGAGCG TGCGGGACGC GTGCGAGATC GTGCGCTTGC CGGGGTACTT GTTGTTTGAG CCAGAGAAGA TTCAGATCGA TCTGAGGAGG GAAAAGTACT TTTTCGGCAC GCTCATGCCG TTCGTGCACG CCGCGGAGAA GCTCGGAGGG AAAGTGGGGG CGCCGCGCGC GGCGCCGGCG AGCGCGCGAG TGGTGGATTT AGGCCCCGAG CACGGATTCG CGGCGTCGTT AGAAGAATTA GGGATAGTAT CAGACGTAGA TAAACGCTTG GACTGGGCGG CTGGGATTCG CGCGGAGTGG ACGATGTCGG AGTCGGGCGC CGCCTCTACT TGGGAGCACT TCGCTTCGGT GGGGTTGGAG AAGTTTGAGG ACGATCACGG AAGAGCGGAT ATCGTCGCTC CGGCCGCGGT GAGCACGCTT TCACCGTATT TACGCCATGG TCAGATATCG CCGCGGCAGA TTTACCACGA ATTGGCGACG AAAAAGATGG GCAGCGACGG GGTGGAGGGC AAAAAGCTCT CTCGCGTGTT TTGGCACCGT CTCTATCGCC GAGAGTTTGC GTACTGGCAA CTGCACAACT GGCCCGAGTT ACCGTCCAAG TCCGTGCGTG GGCACTACGA GAATCGCAAG GCTTGGTTGG AGGGCGACGA AGCCGCCGTT GCGCTTCATC GATGGCAAAC TGGAACGACG GGCTTCCCCA CCGTGGACGC GGGCATGCGC CGGCTTTGGG CCACCGGATG GATGCATCAG AGTGAACGCA TGATTGCGGC CACGTTTCTC GTCGATTACT GCGGCGTTCA TTGGACGCAC GGCGCAGATT GGTTCCTCGA CACTCTCGTG GACGCGGACT TGGCGATCAA TTCCATGATG TGGCAAAACG CGGGAAAGAG CGGACTCGAT CAGTGGGACG TCTTCGCGGG TTCGTTGACC CCAGATGGAT CCTCTCGAGC GCACGACCCC GAAGGCGAGA GCATCGCTCG ATGGATTCCA GAGCTCGCGG CGCTGCCGAA AGGGCACCTG CGACACCGCC CGTGGGAAGC GTCGGCGAAG CAGCTCGACG CCGCGGGCGT CGAGTTGGGA TCGACGTACC CGACGCGCAT GATTACAGAT CTCGAAGGCG CGCGACGCCG CATGCTGGAC GACGTGAACG CGCTTCGCGT CGACGAAATT AACAAAGCCG CGACGAGGGC GAACGCGGCG TCCGCGGCCG ACGTCGACGC GCGTTCTTCC GACGTCTTCG TCGACGTTCG TTCGGCGAAC GACTTCGTCG TCGCGCCGCC CGGCGCGACT AAAGATCACG CCGGCGCCCT CGTTCCGGTG TCGACGCGAA AGGAGTTCAA AACCGAACTC AAATCCACCA AAGGCGCGCA GGCCGCGATG CAAAGCGCCT ACGCCTGGGC CGACCGCGCC CTCGCCGTCG CCGAAACCGC CGCCTCCGCC GACGCCGCCA AAGCCAAGGC CCAAAAACGC GCCCAGCGCG CCCACGGCCA CTCCCACGCC GCGTCCACCG ACGCGTCTCG CGTCCACGCC CACGCCCACG GCCACTCCCA CGCCCCGTCC CGCGCGCCCG CGCGCCGAAG CAGCGCCTGG CAGCGTCCGT CGCGCGCCTC CGCGCGCGCG CTCTCCGACG CCCGCGCCGC CCGCGCCGAC CGCCGCGTCG TCAAGTCCCT CCGCCGCGAG GCTCGCGAGA TCAACGCCGC GCACGCCGGC 
CGACGCCCCG CCGCCGACGA CGACGACGAC GACGACGCGT AGCGCGCGCC GTCCGACCGA CCGCGTCCGA CCGACCGTCC GACCGATCGC GCTCCTCTAT ATAGTAAAAC TTCGACGCAT TATATA
|
Protein sequence | MRAACGGVGA TRAFASRRYE PTADADDEET AASVRDACEI VRLPGYLLFE PEKIQIDLRR EKYFFGTLMP FVHAAEKLGG KVGAPRAAPA SARVVDLGPE HGFAASLEEL GIVSDVDKRL DWAAGIRAEW TMSESGAAST WEHFASVGLE KFEDDHGRAD IVAPAAVSTL SPYLRHGQIS PRQIYHELAT KKMGSDGVEG KKLSRVFWHR LYRREFAYWQ LHNWPELPSK SVRGHYENRK AWLEGDEAAV ALHRWQTGTT GFPTVDAGMR RLWATGWMHQ SERMIAATFL VDYCGVHWTH GADWFLDTLV DADLAINSMM WQNAGKSGLD QWDVFAGSLT PDGSSRAHDP EGESIARWIP ELAALPKGHL RHRPWEASAK QLDAAGVELG STYPTRMITD LEGARRRMLD DVNALRVDEI NKAATRANAA SAADVDARSS DVFVDVRSAN DFVVAPPGAT KDHAGALVPV STRKEFKTEL KSTKGAQAAM QSAYAWADRA LAVAETAASA DAAKAKAQKR AQRAHGHSHA ASTDASRSLR REAREINAAH AGRRPAADDD DDDDA
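The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch for recovering a contiguous string and summarizing it follows. The helper names are hypothetical, and the inputs in the usage lines are just the first blocks of each sequence, not the full records.

```python
def strip_grouping(seq: str) -> str:
    """Remove all whitespace used for display grouping and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (grouping whitespace is ignored)."""
    cleaned = strip_grouping(dna)
    if not cleaned:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in cleaned) / len(cleaned)


# First two blocks of the protein sequence shown above.
protein_start = strip_grouping("MRAACGGVGA TRAFASRRYE")
print(protein_start)       # MRAACGGVGATRAFASRRYE
print(len(protein_start))  # 20

# First block of the gene sequence shown above.
print(gc_fraction("ATGCGAGCGG"))  # 0.7
```

Applying `strip_grouping` to the full protein field and taking `len` of the result gives a value that can be compared with the Protein Length field; `gc_fraction` on the full gene field can be compared in the same way with the GC content field.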
|
| |