Gene OSTLU_31489 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31489 
Symbol 
ID5002064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp42020 
End bp43759 
Gene Length1740 bp 
Protein Length562 aa 
Translation table 
GC content58% 
IMG OID640417485 
Productpredicted protein 
Protein accessionXP_001417651 
Protein GI145346348 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0512006 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGC CGAGCGTGAG CAACGGCACG ACGCGTTCGC TGATTTGGTT TCGCAAAGCG 
CTGCGCGTCC ACGATAATCC CGCTCTCGCC GCGGGCATCG CGCGCGCGAA GAGCGCGCAA
CCGGTGTTCG TGCTCGATCC GTGGTTCTGT AAGCCCTCGC GGGTGGGGGC GAATCGCATG
CGGTTCTTGT TGCAATCGCT GCGCGACTTG GACGCGACGC TGCGAGAACG CGGCAGTTCG
CTGCTGGTGC TGCACGGCGA ACCGCGAGTG GTGTTGCCGC GAGCGTGCAA GACGTGGAAG
GTTGATTTGG TGACGTGGGA ACACGATATC GAACCGTATG CGAAAATGAG AGACACCGCG
GTGCGAGGCG CGCTGGAGCG CGCGGGAGTA GAGTGCGCGT CGTCGAGCGG ACACACGCTG
TACGACGTGG AGGAGATGTT GGCGAAGTGT CATGGGAAAC CACCGACGAC GTATTCGCAG
TTTTTGAAGA TTGTAGATAA AATGGGTGCA CCTGCGGCCG CTTTGGATGC GCCGAAGGCG
ATGCCGGCGC CGTTCACGGG GACAGCAGAG GAGACGAAGG AATTAGTCGC GGGAGTGGCG
GATGCGTACG GGATCCCCAC TTTGGAGGAG CTCGGTTACG AAGCCATGCA TGACGACGAA
GGTTTTCAAG CGATCGGAGG TGAAACCGAA GGCTTGAGAC GTCTACGCAG ACAACTGTCG
CGCACGGAAT GGGTGCACAC GTTTCAAAAG CCAGATACAA ATCCCACGAC GCTTTTTCAT
GCCTTGGGCG CGAAAAAGCC AAAGCCAAAG AGTCCGTTCG AAATCGCAGC GCGAGACGCG
GGAAGCAAAA ATGACGCAAC AAACACTGCA GCGAATAGTG ATATGCTGAT GACACCGTCC
ACGACAGCGC TGAGTCCGTA CATGAAATTC GGTTGCGTGT CGCCGCGAGT GTTTTATCAC
GAGCTCAACG CCGTGCTCGC AAAGTTTGAG GGCAAAGGTC AACCTTCGCA ACCGCCGGTG
AGTCTCATGG GACAACTGAT GTGGCGCGAA TTCTACTATC TCGTCGGCGC GGGGACGCCG
AATTTCGATA AGATGGAAGG AAATCCAATC TGCAGGCAAA TTCCTTGGAA CAAGGATCGT
GAGCTCTTCG CGGCTTGGGA GAACGCGCAA ACCGGGTTCC CTTGGATCGA CGCCGCGATG
ACCCAGCTTC GCCTCGAGGG ATGGATCCAT CACTTGGCTC GACACGCCGT GGCGTGCTTC
CTGACCCGTG GTGATCTATT CGTGCACTGG GAGTGGGGCA GAGACACGTT CGATCGCGAC
CTCGTCGACG CCGATTGGGC GCTGAACAAC GGCAACTGGA TGTGGCTCTC GTGTTCGTGT
TTCTTTTACC AATATTTCCG AGTGTACGGT CCGCACTCGT TCGCGAAAAA GTACGACAAG
GACGGCGCGT ACGTCAAGCA CTACCTCCCC GTGCTGAAGA ATATGCCCGC CAAGTACGTG
TACGAACCTT GGCTCGCCCC TCTCGACGTG CAGAAGAAAG CCGGTTGCGT CGTCGGCGTC
GATTATCCCG CTCCGATCGT CGACCACGCC ACGGCGAGCA AGGCGTGCAT AGATAAGATC
GCCACCGCTT ACGCCGCTCA CAAGGACGCC ACCGCGGTGG CTGGGAAGAA GCGTAAAGCC
GGCGAATAAA CTCGTTGATT CGTTAATTTA AAAAGCCTCT CGCGCGCGGC GTCCACCACA
 
Protein sequence
MTAPSVSNGT TRSLIWFRKA LRVHDNPALA AGIARAKSAQ PVFVLDPWFC KPSRVGANRM 
RFLLQSLRDL DATLRERGSS LLVLHGEPRV VLPRACKTWK VDLVTWEHDI EPYAKMRDTA
VRGALERAGV ECASSSGHTL YDVEEMLAKC HGKPPTTYSQ FLKIVDKMGA PAAALDAPKA
MPAPFTGTAE ETKELVAGVA DAYGIPTLEE LGYEAMHDDE GFQAIGGETE GLRRLRRQLS
RTEWVHTFQK PDTNPTTLFH ALGAKKPKPK SPFEIAARDA GSKNDATNTA ANSDMLMTPS
TTALSPYMKF GCVSPRVFYH ELNAVLAKFE GKGQPSQPPV SLMGQLMWRE FYYLVGAGTP
NFDKMEGNPI CRQIPWNKDR ELFAAWENAQ TGFPWIDAAM TQLRLEGWIH HLARHAVACF
LTRGDLFVHW EWGRDTFDRD LVDADWALNN GNWMWLSCSC FFYQYFRVYG PHSFAKKYDK
DGAYVKHYLP VLKNMPAKYV YEPWLAPLDV QKKAGCVVGV DYPAPIVDHA TASKACIDKI
ATAYAAHKDA TAVAGKKRKA GE