Gene OSTLU_36787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_36787 
Symbol 
ID5006909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009375 
Strand
Start bp249543 
End bp250976 
Gene Length1434 bp 
Protein Length477 aa 
Translation table 
GC content60% 
IMG OID640422330 
Productpredicted protein 
Protein accessionXP_001422939 
Protein GI145357465 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID[TIGR00591] photolyase PhrII 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value0.00684152 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00249943 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACGACG CGGCGAAGGC GGCGTTGGAT GCGTCGGCGC GCGCGCGCGC GAGCGCGAGC 
GCGAGCGCGA GCGCGGGGCC GGTGGTGTAC TGGTGCGACC GGGACAGGCG GTGCGCGAAT
AACGACGCGC TGGGACGAGC GATGGAATTG GCGAACGAAA GGCGCGTCCC GCTCGTCGTG
GCGATGCACG TGGGGACAGA TTTGAGCGGG AGCGGCATCG GAGGCGCGCG CAGGGCGGTG
TTCGCGCTGA AGGGGTTGAA GGAATTGGAT GAGGATTTGC GAGCGCGAGG AGTGTCGACG
CGAACGACGA CGGGAAGCGA CGTCGCGGGA GGAATCGTGG AGACGTGCGA GACGCTGAAT
GCGAGTGCGG TCGTGTGTGA CTTTTCGCCG TTGCGAGAGG GGCGTGCGGC GAGGGAAGCG
GTGGCGCGTG TGGTGGAGGT TCCGGTGATT GAAGTGGACG CGCACAACGT CGTGCCGGCG
TGGGTGACGA GCGATAAGCA AGAGTACGCG GCGAGAACGA TTCGGCCGAA GATTCATCGA
AATCTCGGGG ATTTTCTCAC CGCACCGCAA GCGTTAGATG ATCTCATCGC CGCGCCGGAC
GCGTTGACGC CAAGTGAGAC GGATTGGGAC GCATTGATTG ACACCGCGCG CGTCAAGGGC
GCGCACGTCC CAGAGGTTGA CTGGATCAAA CCGGGTGAAC GTGCCGCCTT AGCCGCGCTG
CTCGATCCGA ATGTCGACTC TTTCCTCCCA CAGCGATTGA CACTCTACGG GGAGCGAAAC
AAGCCGACGT CGCCGCGCGC CGTGTCTCGC CTCTCGCCGT ACTTGAATCA CGGCCAGCTG
TCGCCACGTC GCGCCGCGTG GGAAGCTGCG CAACTTCGGG GAATCGTAGA CGACGAGGCG
ATCGATAGCT ACTTGGAAGA GCTCATCGTT CGAAGGGAAT TATCAGACAA CTATTGTCTC
TTCAATCCGT ATTACGACTC GTTGCAAGGA GCGAGTCAAT GGGCGCAAGA TTCACTGAGT
TTGCACGCCC GCGACGTTCG CGAGTACGTG TACGATTACA AAACACTCGA GCGTGGCAAC
ACGCACGACG AGCTTTGGAA CGCGGCTCAG AAAGAATTAT ACCATCTCGG ACGAATGCAT
GGGTTCATGA GAATGTACTG GGCGAAGAAG ATTCTTGAGT GGACGCCGTC GCCGGAGGTG
GCCCTGCAGA CGGCGATTCA ACTCAACGAC GCTTACGCGT TAGACGGTCT CGATCCCAAC
GGCTACGTTG GTTGTATGTG GAGCATTGCC GGTGTGCACG ATCAAGGATG GAAAGAGCGC
GCGGTGTTCG GTAAAGTGCG GTATATGAAT TACGCCGGTT GCAAGAGAAA GTTTCAAATC
CAAGATTACG TAGCGGCGGT CGACGCTGAG ATAAGCGGAA TAGGTCGCAA ATAG
 
Protein sequence
MNDAAKAALD ASARARASAS ASASAGPVVY WCDRDRRCAN NDALGRAMEL ANERRVPLVV 
AMHVGTDLSG SGIGGARRAV FALKGLKELD EDLRARGVST RTTTGSDVAG GIVETCETLN
ASAVVCDFSP LREGRAAREA VARVVEVPVI EVDAHNVVPA WVTSDKQEYA ARTIRPKIHR
NLGDFLTAPQ ALDDLIAAPD ALTPSETDWD ALIDTARVKG AHVPEVDWIK PGERAALAAL
LDPNVDSFLP QRLTLYGERN KPTSPRAVSR LSPYLNHGQL SPRRAAWEAA QLRGIVDDEA
IDSYLEELIV RRELSDNYCL FNPYYDSLQG ASQWAQDSLS LHARDVREYV YDYKTLERGN
THDELWNAAQ KELYHLGRMH GFMRMYWAKK ILEWTPSPEV ALQTAIQLND AYALDGLDPN
GYVGCMWSIA GVHDQGWKER AVFGKVRYMN YAGCKRKFQI QDYVAAVDAE ISGIGRK