Gene OSTLU_43003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_43003 
Symbol 
ID5005342 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp175340 
End bp177181 
Gene Length1842 bp 
Protein Length565 aa 
Translation table 
GC content66% 
IMG OID640420763 
Productpredicted protein 
Protein accessionXP_001421409 
Protein GI145354261 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.832505 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0789751 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGCGG CGTGCGGGGG AGTGGGAGCG ACGCGCGCGT TCGCGTCGAG GCGGTACGAA 
CCGACGGCGG ACGCGGACGA CGAAGAGACG GCGGCGAGCG TGCGGGACGC GTGCGAGATC
GTGCGCTTGC CGGGGTACTT GTTGTTTGAG CCAGAGAAGA TTCAGATCGA TCTGAGGAGG
GAAAAGTACT TTTTCGGCAC GCTCATGCCG TTCGTGCACG CCGCGGAGAA GCTCGGAGGG
AAAGTGGGGG CGCCGCGCGC GGCGCCGGCG AGCGCGCGAG TGGTGGATTT AGGCCCCGAG
CACGGATTCG CGGCGTCGTT AGAAGAATTA GGGATAGTAT CAGACGTAGA TAAACGCTTG
GACTGGGCGG CTGGGATTCG CGCGGAGTGG ACGATGTCGG AGTCGGGCGC CGCCTCTACT
TGGGAGCACT TCGCTTCGGT GGGGTTGGAG AAGTTTGAGG ACGATCACGG AAGAGCGGAT
ATCGTCGCTC CGGCCGCGGT GAGCACGCTT TCACCGTATT TACGCCATGG TCAGATATCG
CCGCGGCAGA TTTACCACGA ATTGGCGACG AAAAAGATGG GCAGCGACGG GGTGGAGGGC
AAAAAGCTCT CTCGCGTGTT TTGGCACCGT CTCTATCGCC GAGAGTTTGC GTACTGGCAA
CTGCACAACT GGCCCGAGTT ACCGTCCAAG TCCGTGCGTG GGCACTACGA GAATCGCAAG
GCTTGGTTGG AGGGCGACGA AGCCGCCGTT GCGCTTCATC GATGGCAAAC TGGAACGACG
GGCTTCCCCA CCGTGGACGC GGGCATGCGC CGGCTTTGGG CCACCGGATG GATGCATCAG
AGTGAACGCA TGATTGCGGC CACGTTTCTC GTCGATTACT GCGGCGTTCA TTGGACGCAC
GGCGCAGATT GGTTCCTCGA CACTCTCGTG GACGCGGACT TGGCGATCAA TTCCATGATG
TGGCAAAACG CGGGAAAGAG CGGACTCGAT CAGTGGGACG TCTTCGCGGG TTCGTTGACC
CCAGATGGAT CCTCTCGAGC GCACGACCCC GAAGGCGAGA GCATCGCTCG ATGGATTCCA
GAGCTCGCGG CGCTGCCGAA AGGGCACCTG CGACACCGCC CGTGGGAAGC GTCGGCGAAG
CAGCTCGACG CCGCGGGCGT CGAGTTGGGA TCGACGTACC CGACGCGCAT GATTACAGAT
CTCGAAGGCG CGCGACGCCG CATGCTGGAC GACGTGAACG CGCTTCGCGT CGACGAAATT
AACAAAGCCG CGACGAGGGC GAACGCGGCG TCCGCGGCCG ACGTCGACGC GCGTTCTTCC
GACGTCTTCG TCGACGTTCG TTCGGCGAAC GACTTCGTCG TCGCGCCGCC CGGCGCGACT
AAAGATCACG CCGGCGCCCT CGTTCCGGTG TCGACGCGAA AGGAGTTCAA AACCGAACTC
AAATCCACCA AAGGCGCGCA GGCCGCGATG CAAAGCGCCT ACGCCTGGGC CGACCGCGCC
CTCGCCGTCG CCGAAACCGC CGCCTCCGCC GACGCCGCCA AAGCCAAGGC CCAAAAACGC
GCCCAGCGCG CCCACGGCCA CTCCCACGCC GCGTCCACCG ACGCGTCTCG CGTCCACGCC
CACGCCCACG GCCACTCCCA CGCCCCGTCC CGCGCGCCCG CGCGCCGAAG CAGCGCCTGG
CAGCGTCCGT CGCGCGCCTC CGCGCGCGCG CTCTCCGACG CCCGCGCCGC CCGCGCCGAC
CGCCGCGTCG TCAAGTCCCT CCGCCGCGAG GCTCGCGAGA TCAACGCCGC GCACGCCGGC
CGACGCCCCG CCGCCGACGA CGACGACGAC GACGACGCGT AG
 
Protein sequence
MRAACGGVGA TRAFASRRYE PTADADDEET AASVRDACEI VRLPGYLLFE PEKIQIDLRR 
EKYFFGTLMP FVHAAEKLGG KVGAPRAAPA SARVVDLGPE HGFAASLEEL GIVSDVDKRL
DWAAGIRAEW TMSESGAAST WEHFASVGLE KFEDDHGRAD IVAPAAVSTL SPYLRHGQIS
PRQIYHELAT KKMGSDGVEG KKLSRVFWHR LYRREFAYWQ LHNWPELPSK SVRGHYENRK
AWLEGDEAAV ALHRWQTGTT GFPTVDAGMR RLWATGWMHQ SERMIAATFL VDYCGVHWTH
GADWFLDTLV DADLAINSMM WQNAGKSGLD QWDVFAGSLT PDGSSRAHDP EGESIARWIP
ELAALPKGHL RHRPWEASAK QLDAAGVELG STYPTRMITD LEGARRRMLD DVNALRVDEI
NKAATRANAA SAADVDARSS DVFVDVRSAN DFVVAPPGAT KDHAGALVPV STRKEFKTEL
KSTKGAQAAM QSAYAWADRA LAVAETAASA DAAKAKAQKR AQRAHGHSHA ASTDASRSLR
REAREINAAH AGRRPAADDD DDDDA