Gene OSTLU_92534 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_92534 
Symbol 
ID5000947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp979281 
End bp981320 
Gene Length2040 bp 
Protein Length679 aa 
Translation table 
GC content60% 
IMG OID640416368 
Productpredicted protein 
Protein accessionXP_001417114 
Protein GI145345214 
COG category[L] Replication, recombination and repair 
COG ID[COG1948] ERCC4-type nuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones49 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGACG CGCACGACGC GCGCTGGGCG ACGACGAGCG CGCGAGATGG GGAGCGCGCG 
CGCGCGGCGT ATTGGAAAAA GTACCGCGTG CGCGCGAACG CGGCGCTGGG AACGTTCGTG
GAGACGGTGT GCGAACTACT CGAACGCGGC GCGCGAACGG GAGAAGAGAC GGCGTACGGA
CGCGCGCTGC GGAACTGCGC CAAGGCGTGC GCCAAGGCGG AGACTGCGCT CTCGGGACGC
GAACGCGCGA TCGTGGTGAA GGGAATCGGG CCGAAAATGT GCGACGTAAT GGAGGAGTTT
TGGCGACGAA GTCGAGCGCC GACGGTGACA GAGTACGGCG AGACGCTGGG GGCGAGAAAT
TTCTTCGGAG CGGCGGGAGC AATCGCGGGA GAGCGGGCGC GGGAGACGCA TGGCGCGACG
AGTCGAAACG ACGTCACAGA TCTCGATCAT CTGCGACACA AAGTAGCGCG CGCGGCAGGA
GCGCGCGCGA CGGAGAGCGA ACCGGCGGCG AAACGCGCGA GAAATACCAA ACCGTGGGTG
CCCGGTTATC GCACGGCGGC GTTCGCGCTC TTGGTCACCG CGCATAGGCT CGCGCTCGAG
GGGCGGGAGG TGTTGACTAA AGACGAGCTC CAAGACGAAA CAGAGGTGAG TGGTTTGAGC
GCCAAGGGGA TCAAACCCAA ACCGACGTCG CGCGCCGTCA TGGGTGGGCG CGGCGCGGCG
CAGCACTTCG CGTACTGCGG TTGGAATTCC TTCAAGTCGT TGAAGACGTT GCAGAATGGT
TACGTCGAAC CCATGGTGAA CACGTGGAAG AAGTCCTACG CGATGCAAAT CCGTCTGAGT
AAGACGGGGA CGGAGCTCGC TGCGAAATTA CACGCCGCCG CGGAGGCGCG CGGGGACTGT
TCGTGCGGTT TCGCTGCGCC GGGCGAAAAC GTCAACCCAA ACTTTGCCCG CGAATGCGAA
GAAAACGACG ACGACGACGA CGAAGTGGCG ATGCTGGACG ACGCCGGAGT TTGGACACCA
GTGTGTTCGC AACCTCTGCC ATCGTCGTCG CAAGTACCGC GTGTGACGAA CGCGGCGCCG
GCTTTAAAGA ACCTCGTGAG TCCGTCTAGA GGCGAATGGG CGCTCCCGCC GCTCCAAGGA
GACGAAACGT ACGCTGACAG GTACGAAACC GTCTTAGTGG TAGACGTTTC CGAGACGAAG
TTCACAGAAC GCGATCTTGA GTTCTTCCGG AATGCGGGCG TGAAGACGCT CAGGCACAGC
CTCGACGCCG GTGATTTCGC TTGGGTCGCG TGCCCGAAAG GCCTCGCGCC GAGTTTGGGC
GACGCGTATG TGCTAGATAT CCTCATCGAA CGTAAAGAGG TGAATGATTT ACGAGCGAGC
ATCATACCGA GCGACAAGAG CGGACAGCGT TTTGTGCGGC AGAAGTATCG CATGAAAAAT
TATAGCGGGC TGAAGAACCT GGTCTATCTC ATCGAAGGCA ACTTGCGCAA CGTGAGCGCC
ATGTTTCGTC GCGATCGCGG CGGCGGCGCG CGCACGTCTG TCCCCACGCA TAGTGGAATG
ACGACCGTGG ATATGGTCGG TCGATTACTC AGCGCTCGCG TGCAAACGGA AATCTTTCAC
GGATTCAAAG TGGTAAATAC CATGCACCTG GAGGATACGA AGCGGTTATT GAAGAATTTG
ACGCTCTCCT TGCACGCGAC GTACGGTCCG CTCACTCGCG CCAGGGCGAG CAAGAAGGCT
CGCACTTTTG CCGAGTATGA ACGCGATTTT CGCGAGATAA AACACAAAGA AGAAAGCACG
GTTAAGGTCA CGTGGATGCG CATGCTCGCG CAAATCGACG GCGTCGGTCC AATCAAGGCT
CAGGCGGTCG TCGAGGTTTT CCCCACGCCA TCCTCACTGA AAAGCGTCGT GGATCGGGAC
AGGCACCGCG CGCGCATGGA GCTGCAAGTC ATTAGGACAG CCGCAGAGAC ACAGGCGAGA
TCCGTCGGTC CTTCTGCGAG TGATAAAATA TTAGAAGCAT TATTTCCCGT CGCGGACTAG
 
Protein sequence
MDDAHDARWA TTSARDGERA RAAYWKKYRV RANAALGTFV ETVCELLERG ARTGEETAYG 
RALRNCAKAC AKAETALSGR ERAIVVKGIG PKMCDVMEEF WRRSRAPTVT EYGETLGARN
FFGAAGAIAG ERARETHGAT SRNDVTDLDH LRHKVARAAG ARATESEPAA KRARNTKPWV
PGYRTAAFAL LVTAHRLALE GREVLTKDEL QDETEVSGLS AKGIKPKPTS RAVMGGRGAA
QHFAYCGWNS FKSLKTLQNG YVEPMVNTWK KSYAMQIRLS KTGTELAAKL HAAAEARGDC
SCGFAAPGEN VNPNFARECE ENDDDDDEVA MLDDAGVWTP VCSQPLPSSS QVPRVTNAAP
ALKNLVSPSR GEWALPPLQG DETYADRYET VLVVDVSETK FTERDLEFFR NAGVKTLRHS
LDAGDFAWVA CPKGLAPSLG DAYVLDILIE RKEVNDLRAS IIPSDKSGQR FVRQKYRMKN
YSGLKNLVYL IEGNLRNVSA MFRRDRGGGA RTSVPTHSGM TTVDMVGRLL SARVQTEIFH
GFKVVNTMHL EDTKRLLKNL TLSLHATYGP LTRARASKKA RTFAEYERDF REIKHKEEST
VKVTWMRMLA QIDGVGPIKA QAVVEVFPTP SSLKSVVDRD RHRARMELQV IRTAAETQAR
SVGPSASDKI LEALFPVAD