Gene OSTLU_42450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_42450 
Symbol 
ID5003260 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp187172 
End bp190168 
Gene Length2997 bp 
Protein Length975 aa 
Translation table 
GC content56% 
IMG OID640418681 
Productpredicted protein 
Protein accessionXP_001419304 
Protein GI145349776 
COG category[L] Replication, recombination and repair 
COG ID[COG1948] ERCC4-type nuclease 
TIGRFAM ID[TIGR00596] DNA repair protein (rad1) 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.488375 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCTCG TCCTCGCCGA CGCGAGTCCG AGCGAAGCGG TCGTCGACGG GGCCGACCTT 
TTGCCGTTCC AGCGCGAGAT CACCAAGGAA CTGCTCGCGC GCGATGGTTT TTGCGTTCTC
GCCGAAGGAC TCGGTGCGAG CGCAGTCATC GCCGCGCTCG TCGCCGTCGA CGACGCGCTG
TCTAAGACGC ACGTTCTAGG CGAACCACCC ATGGTGACGC TCATCGTCGG TGCGAGTGAG
CACGCGAAAG TGAGCGTGAA GGAGCGTATG ACGGCGCTGT TTCCGCGCGC GGCGCCGCCG
CTGGAGTTCA CCGCCGACTA CGCGGGCGAC AAGAGGAAAA AGTTTTACGA CGCGGGATGC
GTGGCGTTCG TGACGACTCG AATCGCGAGC GTGGATTTGT TGAGTGGGAG GCTGGACGCG
AAGCGAGTGC GAGGGATCAT AGTGTGCTCG GCGCATAGGA CGAACGAGAC GTCGGGGGAG
GGGTTCGTCG TGCGGTTGTT TAGAGAGGGG AATAGGAAAG GATACGTGCG GGCGATCAGC
GATCGCCCTG GGGATTTGAC GCGCGGCTTT AATAGCGTGG AACGATGTTT GAAGGCGCTG
ATGCTCACGC GCGTGCATCT GTGGCCGCGA TTTCATTTGC GAGTGAAGGA CGATTTAGAC
GCGCGTCCGC CAGAGGTGAT CGAATTGAGA CAGCCGATGA GCGAGAACGT GTTGAAGATT
CAAGAAGCGA TCGTCAGCGT CATGGATTCG TGCATGGCTG AGTTGAAGAA AAGTCGCTAC
ATCGACACGA GCGATTTAAC GCTCGAGAGT GGGTTGTTCA AGAGCTTCGA TTTGATTTTG
CAGCGACAAC TCGACAAGGT GTGGCACATC GCGCCGAGAC GAGTGAAACA GATCGTGTAC
GATTTAAAGA CGCTCCGCTT ACTCGCAGAT GCGTTGCTGA AGTACGATTC AGTCACGTTT
TTGAAGTACT TGCAGGCGTT GCGAGCGAGT GAGTCACGTG AGAGCATGTG GATGTTCACA
GAGGCCTCGC ACGCGATTTT CGAGTACGCG AAAAAGCGAG TTTACCTGCT CAAGCGTAAA
GCCGCGGCGG CGCAACCGAA AGGCTTGGGC GCAAAGCGAC CGCTTCCACC ACAAATCACA
GAGACGGACT TGATTCCGAT TTTAGAACCC ATGCCAAAAT GGACGCTCAT GGAAGAGATT
TTAGATGAAA TAGACGAAGA ACGTCGGCAA GGCGGAGAGC TCCTCGCCGT GGCGGACTCC
GAAACGGTCG TGGACCTGAC GTTTTCTCAG CCGTATGAGT CCCAAGAGCA CGGCACGCAT
AGAATGTTAA AATACAAGCA AGGAGCGACG TTGATTGTGT GTAAAGAAGA GCACGTGGCG
CGGCAACTGG AGTACTGCAT TCGTTACGGA ACGCCGGCGC TGATGAATGC GCATTGGGTC
GATTACTTGT TTAGCCGGGG CGGGAAGAAC GTGGCGGCGC AAGTGACGAA GCGACAGACG
GGATGGCGCG GTGCCGGGCG CGGTGGTGGC CGTGGTAGTG GCCGTGGTGG TGGGCGCGGT
GCGGCGCAGA AGCCTCAGCG TGTGTATTCA AAACTCGAGC GCATTCAGGC GAGAATGGAG
GGTCGAGAAA TCGACGAAGA TCCGGGGCCG GCGAAAGATG GGAAATCTGA CGAGAACAAA
CTCTTAGCGG CGGCCGCAGC GGAGGCGAAG AAGACCCTAG TCGCTGCGAA GAAGGCCGAA
GACGCGGATA AGAAGATGAA GGCTACGAAG GCGGCGAAAG AGTTAGATAT CAAAATAGAA
ATGAAAGCGG AAGAGGACGC GGCGGCTAGC GACGACGAAG TCATCGTAGT CGGCGATACG
CGCACGCGCT CGACCGTTAA GCGCGATACC GATAACATGT ACGTGTACGC GCACGAGCGC
AAATTGAACC TGCTGAACCG CATTCAGCCT TCGTTCGTGG TGATGTATGA TCCAGACGCG
TCATTCATCC GTGAACTCGA AGTCTATCAG GCGACGCGCC CGGACGTCCC GGTCAAGGTG
TACTTTTTGG TGTACGACAC GTCATTAGAG GAGCAAAAGT ATCTCAGTAG CATCAAACGC
GAGAGTGCGG CGTTCGAAAA CCTCATACGC ACGAAGCAGC ATATGGCTGT TCCCGCTGAA
CAAGAGGGTT GGACCGATTC AGAGAATCCG TTGCCGTTGT CGTTGCCGAG CTCGACCGCT
CGACATCGAA TCGAGGAGTC GCAAGAGGCG AGTACGCGTA AAGGAGGCAG ATCGCTCACT
ATTCGTTCGT CTCTCGAAGT CATAGTGGAT ATGCGTGAGT TCATGTCTGC GCTCCCTTGC
GTGTTGCATT CGGCAGGTTT CAAAGTGCGC CCGACGACGC TAGAAGTGGG CGATTACATC
CTCTCGCCCG ATATGTGCGT CGAGCGCAAA GCCATTCCAG ACTTGATTCA GTCCTTCGCG
TCTGGGCGTT TGATAGCACA AGTCGAAGCG ATGTGCAAAC ACTATAAGAC ACCGATTCTA
CTCATCGAGT TTGACGGCTC AAAAGCGTTC GCTCTGCACG CAGAAGCCGA CCTTCCTCGT
TTCGTCGGGC AGCAACATCT CATCACGAAG ATATGTATGC TCATCACACG TTTTCGAAAG
TTGCGTCTTA TTTGGAGTCG ATCGATGCAC ATGACGGCTG AAATTTTCGC AGAGTTAAAA
AGGCTTGAGC CTGAACCCTC ACTCGAAACT GCGCAGCGAA TAGGCGTTCC CGATGCCGAC
GGTGACGTGC ACAAACTCGT AAAGGATAAC CTCAACGACG CTGCCGTCGA TTTGTTGCGC
AGGCTACCGG GTATCACCGA CGGCAATTAC CGACGAGTCA TCGCACGAGT TGAAAGTATC
GAAAAGATGT GCGACCTGAG AGAAGACGAA CTCGCGGATA TCCTTGGCGA CGCACGGCAA
GCGAAGACGC TCCACACATT TTTACACGCG CCGTTTCCGA AAGAATTCAT GTTTTAG
 
Protein sequence
MSLVLADASP SEAVVDGADL LPFQREITKE LLARDGFCVL AEGLGASAVI AALVAVDDAL 
SKTHVLGEPP MVTLIVGASE HAKVSVKERM TALFPRAAPP LEFTADYAGD KRKKFYDAGC
VAFVTTRIAS VDLLSGRLDA KRVRGIIVCS AHRTNETSGE GFVVRLFREG NRKGYVRAIS
DRPGDLTRGF NSVERCLKAL MLTRVHLWPR FHLRVKDDLD ARPPEVIELR QPMSENVLKI
QEAIVSVMDS CMAELKKSRY IDTSDLTLES GLFKSFDLIL QRQLDKVWHI APRRVKQIVY
DLKTLRLLAD ALLKYDSVTF LKYLQALRAS ESRESMWMFT EASHAIFEYA KKRVYLLKRK
AAAAQPKGLG AKRPLPPQIT ETDLIPILEP MPKWTLMEEI LDEIDEERRQ GGELLAVADS
ETVQGATLIV CKEEHVARQL EYCIRYGTPA LMNAHWVDYL FSRGGKNVAA QVTKRQTGWR
GAGRGGGRGS GRGGGRGAAQ KPQRVYSKLE RIQARMEGRE IDEDPGPAKD GKSDENKLLA
AAAAEAKKTL VAAKKAEDAD KKMKATKAAK ELDIKIEMKA EEDAAASDDE VIVVGDTRTR
STVKRDTDNM YVYAHERKLN LLNRIQPSFV VMYDPDASFI RELEVYQATR PDVPVKVYFL
VYDTSLEEQK YLSSIKRESA AFENLIRTKQ HMAVPAEQEG WTDSENPLPL SLPSSTARHR
IEESQEASTR KGGRSLTIRS SLEVIVDMRE FMSALPCVLH SAGFKVRPTT LEVGDYILSP
DMCVERKAIP DLIQSFASGR LIAQVEAMCK HYKTPILLIE FDGSKAFALH AEADLPRFVG
QQHLITKICM LITRFRKLRL IWSRSMHMTA EIFAELKRLE PEPSLETAQR IGVPDADGDV
HKLVKDNLND AAVDLLRRLP GITDGNYRRV IARVESIEKM CDLREDELAD ILGDARQAKT
LHTFLHAPFP KEFMF