Gene OSTLU_30886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_30886 
Symbol 
ID5001100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp927085 
End bp928654 
Gene Length1570 bp 
Protein Length506 aa 
Translation table 
GC content61% 
IMG OID640416521 
Productpredicted protein 
Protein accessionXP_001417095 
Protein GI145345173 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID[TIGR00591] photolyase PhrII 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.25268 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGACGCCGAG TCCTCGCGCG CGCGCCGACG CTCGAGCGCG CGATGCCGTC GGCGTTGTGC 
GCGTCGAAGA AGCGCTATCG CGCGCTGACG CGCGGGACGA AGCCCGAGGC GGGACCGAAC
GCGCCAGTCG TGTACTGGCT GTCCAGGGAC CAACGCGTGC GAGATAACTG GGCGCTGCTG
CGCGCGTGCG ACGTCGCGCG CGCGAACGAC GCGCCGGTGG TGATCGCGTT TAATCTGCTG
ACAAAGTTCC TCGGCGCGGG GGCGCGACAG TTCGGGTTCA TGCTGCGTGG ACTTCGCGAG
CTGGAGGGCG CGGCGAAGGC GAAAAACGTG ACGTTCGCGA TGACGTACGG GGATGAGCCA
GCGATCGCGA TCGATGCGCT GGCCAAGAAG ATTGGTGCGA AAACGATCGT GTGCGACTTT
TCGCCGCTTC GCGATGGGTT GAAATGGCGG AAAGAACTGG CGGCATTGTG CGAGGCGCGC
TCGGCGCACT GCGAGGAATG CGACGCGCAC AACGTAGTGC CGTGCTGGGA AGCGAGCGAC
AAGCTCGAAG TCGGCGCGCG GACGCTTCGA GGGCGCTTGG CGAAGCGTTA CCCGGAGTTC
TTGCACGAAT TCCCGGAAGT GCCGGATGAT TTGCCCAAGT ACAGCGGCCC GGCGCTCGAC
GCGGTGAAGT GGGACGACAT CATCGCCGAG GCGCTGTCGC GCGGGCAAGC GGTGCCGGAA
GTGACGTGGG CGATCCCGGG CGAAACCGCG GCGCACGCTG TTTTGGACGA CTTTGTGAAC
TCACGCATGA AGTTGTACGA GAAGCGCAAC GATCCATCCA AGCCGCAGGC GCTGAGCGGG
CTGTCGCCTT GGCTGCACTT TGGGCAGATT TCTGGTCAGC GATGCGCGCT CGAGGCAAAG
AAAGCCGTCG GAAAGGCGTC GCCGCCCGCG TATGAGTCGT TCTTCGAAGA GCTCGTCGTT
CGTCGAGAGT TGTCCGACAA CTTTTGCTAC TACAGCCCGA AGTACGATCA GATTGAGGGA
CAAAAGTACG ACTGGGCCAA GGACACGCTT CGCCTGCATG CGAGTGACAA GCGTCCGTAC
TTGTACTCGC TCGAAGAACT CGAGCGGGCG AAGACGCACG ACGATCTGTG GAACGCCGCC
CAGCGAGAGT TGCGTTATGG AGGTAAGATG CACGGCTTTT GTCGAATGTA TTGGGCGAAG
AAGATTCTCG AGTGGACCGA GTCGCCCGAG CAAGCGTTGA AGTTTGCAAT TTATTTGAAC
GACACGTACT CGCTGGATGG TCGCGATCCG AGCGGCTACG TCGGATGCAT GTGGTCCATC
GTGGGCGTGC ACGATCAAGG TTGGAAAGAA CGCGAAGTTT TCGGCAAGAT TCGATACATG
GCGTACGACA GCACAAAGAA GAAGTTTAGC ATCCCCGACT ACATCGCTCG CGTCAACGCG
CTCGTCAAGG CGGCGAAAGC AGACTTCAAA TCTGGCGAGA GCTCGCACGC CGCTAATCCC
GGGTTGTTCA AAATCAACGT GCCGGCGGCA AACAAGCGCA AGGCTACGGA GGAGGCGCAA
TAATTCAAAT
 
Protein sequence
MPSALCASKK RYRALTRGTK PEAGPNAPVV YWLSRDQRVR DNWALLRACD VARANDAPVV 
IAFNLLTKFL GAGARQFGFM LRGLRELEGA AKAKNVTFAM TYGDEPAIAI DALAKKIGAK
TIVCDFSPLR DGLKWRKELA ALCEARSAHC EECDAHNVVP CWEASDKLEV GARTLRGRLA
KRYPEFLHEF PEVPDDLPKY SGPALDAVKW DDIIAEALSR GQAVPEVTWA IPGETAAHAV
LDDFVNSRMK LYEKRNDPSK PQALSGLSPW LHFGQISGQR CALEAKKAVG KASPPAYESF
FEELVVRREL SDNFCYYSPK YDQIEGQKYD WAKDTLRLHA SDKRPYLYSL EELERAKTHD
DLWNAAQREL RYGGKMHGFC RMYWAKKILE WTESPEQALK FAIYLNDTYS LDGRDPSGYV
GCMWSIVGVH DQGWKEREVF GKIRYMAYDS TKKKFSIPDY IARVNALVKA AKADFKSGES
SHAANPGLFK INVPAANKRK ATEEAQ