Gene OSTLU_41733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_41733 
Symbol 
ID5004960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009367 
Strand
Start bp277493 
End bp278926 
Gene Length1434 bp 
Protein Length477 aa 
Translation table 
GC content60% 
IMG OID640420381 
Productpredicted protein 
Protein accessionXP_001420955 
Protein GI145353298 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID[TIGR00591] photolyase PhrII 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value0.908244 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.188048 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGACG CGGCGAAGGC GGCGTTGGAT GCGTCGGCGC GCGCGCGCGC GAGCGCGAGC 
GCGAGCGCGA GCGCGGGGCC GGTGGTGTAC TGGTGCGACC GGGACAGGCG GTGCGCGAAT
AACGACGCGC TGGGACGAGC GATGGAATTG GCGAACGAAA GGCGCGTCCC GCTCGTCGTG
GCGATGCACG TGGGGACAGA TTTGAGCGGG AGCGGCATCG GAGGCGCGCG CAGGGCGGTG
TTCGCGCTGA AGGGGTTGAA GGAATTGGAT GAGGATTTGC GAGCGCGAGG AGTGTCGACG
CGAACGACGA CGGGAAGCGA CGTCGCGGGA GGAATCGTGG AGACGTGCGA GACGCTGAAT
GCGAGTGCGG TCGTGTGTGA CTTTTCGCCG TTGCGAGAGG GGCGTGCGGC GAGGGAAGCG
GTGGCGCGTG TGGTGGAGGT TCCGGTGATT GAAGTGGACG CGCACAACGT CGTGCCGGCG
TGGGTGACGA GCGATAAGCA AGAGTACGCG GCGAGAACGA TTCGGCCGAA GATTCATCGA
AATCTCGGGG ATTTTCTCAC CGCACCGCAA GCGTTAGATG ATCTCATCGC CGCGCCGGAC
GCGTTGACGC CAAGTGAGAC GGATTGGGAC GCATTGATTG ACACCGCGCG CGTCAAGGGC
GCGCACGTCC CAGAGGTTGA CTGGATCAAA CCGGGTGAAC GTGCCGCCTT AGCCGCGCTG
CTCGATCCGA ATGTCGACTC TTTCCTCCCA CAGCGATTGA CACTCTACGG GGAGCGAAAC
AAGCCGACGT CGCCGCGCGC CGTGTCTCGC CTCTCGCCGT ACTTGAATCA CGGCCAGCTG
TCGCCACGTC GCGCCGCGTG GGAAGCTGCG CAACTTCGGG GAATCGTAGA CGACGAGGCG
ATCGATAGCT ACTTGGAAGA GCTCATCGTT CGAAGGGAAT TATCAGACAA CTATTGTCTC
TTCAATCCGT ATTACGACTC GTTGCAAGGA GCGAGTCAAT GGGCGCAAGA TTCACTGAGT
TTGCACGCCC GCGACGTTCG CGAGTACGTG TACGATTACA AAACACTCGA GCGTGGCAAC
ACGCACGACG AGCTTTGGAA CGCGGCTCAG AAAGAATTAT ACCATCTCGG ACGAATGCAT
GGGTTCATGA GAATGTACTG GGCGAAGAAG ATTCTTGAGT GGACGCCGTC GCCGGAGGTG
GCCCTGCAGA CGGCGATTCA ACTCAACGAC GCTTACGCGT TAGACGGTCT CGATCCCAAC
GGCTACGTTG GTTGTATGTG GAGCATTGCC GGTGTGCACG ATCAAGGATG GAAAGAGCGC
GCGGTGTTCG GTAAAGTGCG GTATATGAAT TACGCCGGTT GCAAGAGAAA GTTTCAAATC
CAAGATTACG TAGCGGCGGT CGACGCTGAG ATAAGCGGAA TAGGTCGCAA ATAG
 
Protein sequence
MNDAAKAALD ASARARASAS ASASAGPVVY WCDRDRRCAN NDALGRAMEL ANERRVPLVV 
AMHVGTDLSG SGIGGARRAV FALKGLKELD EDLRARGVST RTTTGSDVAG GIVETCETLN
ASAVVCDFSP LREGRAAREA VARVVEVPVI EVDAHNVVPA WVTSDKQEYA ARTIRPKIHR
NLGDFLTAPQ ALDDLIAAPD ALTPSETDWD ALIDTARVKG AHVPEVDWIK PGERAALAAL
LDPNVDSFLP QRLTLYGERN KPTSPRAVSR LSPYLNHGQL SPRRAAWEAA QLRGIVDDEA
IDSYLEELIV RRELSDNYCL FNPYYDSLQG ASQWAQDSLS LHARDVREYV YDYKTLERGN
THDELWNAAQ KELYHLGRMH GFMRMYWAKK ILEWTPSPEV ALQTAIQLND AYALDGLDPN
GYVGCMWSIA GVHDQGWKER AVFGKVRYMN YAGCKRKFQI QDYVAAVDAE ISGIGRK