Gene OSTLU_40695 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_40695 
Symbol 
ID5005862 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009369 
Strand
Start bp150671 
End bp152962 
Gene Length2292 bp 
Protein Length749 aa 
Translation table 
GC content56% 
IMG OID640421283 
Productpredicted protein 
Protein accessionXP_001421725 
Protein GI145354926 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1199] Rad3-related DNA helicases 
TIGRFAM ID[TIGR00604] DNA repair helicase (rad3) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000149445 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.993011 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACTCGT ACACCATCAC CGTGCCCGCG GACGGGGTCA AGATCGCGCA CAAGCGCGCG 
TTCACGGTTG ACTTTCCGTT CGAAGCGTAC GACAATCAGC TCGTTTTCAT GGAAAAAGCA
CTCTTGGCGA TGTGTCGCGG CGAACACGCG CTCTTGGAGT CGCCCACGGG CACGGGGAAG
ACGCTGTGCT TGCTTGCGAG CGCGCTGGCG TTCGTGCGGA GCGAAGGACG GGCAAGGAAG
CGGAAATTTA GGGAGGAAGT GAGAGATGGG CGAGGCGGCG CCGGAGACGC CGACGACGTC
GCGGGGACGT CGGGACGGGA GTTCGTGGAT GACGTGGAAA AAAAGGAGCG CACGCGCGCG
CCGGTGATTG TGTACGCGAC GAGGACGCAC TCGCAGGTGG ATCAAGTGGT GCGAGAGTTG
AAATTGTTTG ATTCGACGAC GCGCGCGACG ACGCTCGCGA GTCGGCGGCA CGCGTGCGCG
CGGGACGACG TTCGAGCGTT GAATGGGACG GAACAGAAAA ATAGATGCGC TAAATTGGTG
CAGCAGCAAA AGTGTGGGGC GAAAGTGACG CTCGATCGGG CGCTGCAGGG GCGGGATGGG
CGATTCGATT TATTCAGCGA CGGCGTGCAA GACATCGAGG ATTTGGTGTC CAAGGCCAAG
GCGAGGGGGC CGTGTCCGTT TTATCTCGCC CGCACGAAGT GCGCGGAGGC GGAGATAATT
TTTATGCCTT ATAATTACCT GCTCGATGAG AGCGTGCGCA AAGGATTGGA AATCGCGTGG
GAGAACGCGG TGATTATCGT GGACGAGGCG CACAACCTCG AGTCGAGCGC TTCTGATAGC
ATGAGTTATT CCCTCACCGC GGCGAAATTG GCGAAAGCCA TCAAGGAATC GGAGAGAGCT
TACGAGACAA AGTTGACTTT AGAAGACACT AGCGGGGAAG GAATCGTGAG CGAGGCAGAT
TTGCAACTTT TTAGCAGAGG GATGGACAAG GACGCTGCGG CGTTCAAGGG TGAAGATTTT
AAAATGTTAA CGGGCGTGTT GGTGCAACTC GAAGGGTTGC TCGATAGTAT TTGTCGTGAG
GCGGCCAAGG CGCCGAACGC CAAGCACGAA GGAGGTCTCG GCGAGCGTAT TGGTGACGGC
GCGTTCATCT ACGTTATTCT CGCCGAGTTG AACATCACGG CGGATACTTA CGAGCACATC
ACAAAGCTCA TTAAGAGCGC TTCGCGGACG GTGCAACTCG GGAGCGATTT CATGGCGCAG
CCGACACAAA ACGAAACACC GTTAATGGAA ATCGGTAATT TCATAGAGAG GATATTCGTG
CATCGATATG AGCAGTATTT CGTCACGAGA CTTGGGCCGG ATATGGAGAA GTTTAAGACA
AGCAATCGCG CGCGCGCCGG GCCGACGTTG TCGTACTGGT GCTTCTTTCC AGGTCTTTGC
CTCAAGGCAC TGATCGATAA AGGCGTTGGA ACATTTTTGC TTGCCTCTGG TACGCTCTCC
CCCATGGAAT CTTTCGCGTC AGAGCTCGCA TTGGATTTTC CCGTGCGCTT GGAGAATCCG
CACGTGATCA AAAGGAATCA AGTTTGGGGC GGAGTCGTCA CGCACGGGCC GAATAACGGC
GTGCTGAACA GCTCATTTAG ATTTCGCGAT ACGCGCGAGT ATAAGACAGA GATAGGGAGC
GTGATTTTAA GTACGGCAAG AATCGTACCC GATGGATTGC TCGTGTTCTT TCCGTCGTAC
GGCGTCATGC ACTCGTGCGT CAATCACTGG AGATCGACTG GGCTTTGGAA CCAGCTCGAG
ACGAACAAAA CGTGCTTGGT TGAGCCAAGC AATGCAGATG AATTTCATGC GTGCTACGAT
AGCTACAACA AGGCGCTTGA AGAGGATTCG AGGCGCGGTG CGGCGTTCTT CGCCGTGTGT
AGGGGTAAGC TGAGCGAAGG CATCGATTTC GCAGACAAGG CGTGCCGTGG CGTGATTCTC
ACTGGTATTC CGTACGCGGG CGCGAAGGAT CCACTCGTGA TGCACAAGCG AACGTATTTG
GATAAGCGAA AAGCGGACAA CGGGGGTGCG TACTCTGGGA ACGAGTGGTA CTCTCAAACC
GCGATGCGCG CAGTGAACCA GGCGCTTGGG CGTGTCATTC GCCACAAGGA TGACTTCGGC
GCCGTTATCC TTGCCGACGA GCGTTTCGCG AACGAAAACG CGCGAAATCA ACTTTCACTT
TGGCTTCGAC CGTCAGTGCA AGTGCACTCG GTGTTTCACA GTGCTGTGCA CGGCTTGAAG
GAATTCTTCC AA
 
Protein sequence
MHSYTITVPA DGVKIAHKRA FTVDFPFEAY DNQLVFMEKA LLAMCRGEHA LLESPTGTGK 
TLCLLASALA FVRSEGRARK RKFREEVRDG RGGAGDADDV AGTSGREFVD DVEKKERTRA
PVIVYATRTH SQVDQVVREL KLFDSTTRAT TLASRRHACA RDDVRALNGT EQKNRCAKLV
QQQKCGAKVT LDRALQGRDG RFDLFSDGVQ DIEDLVSKAK ARGPCPFYLA RTKCAEAEII
FMPYNYLLDE SVRKGLEIAW ENAVIIVDEA HNLESSASDS MSYSLTAAKL AKAIKESERA
YETKLTLEDT SGEGIDAAAF KGEDFKMLTG VLVQLEGLLD SICREAAKAP NAKHEGGLGE
RIGDGAFIYV ILAELNITAD TYEHITKLIK SASRTVQLGS DFMAQPTQNE TPLMEIGNFI
ERIFVHRYEQ YFVTRLGPDM EKFKTSNRAR AGPTLSYWCF FPGLCLKALI DKGVGTFLLA
SGTLSPMESF ASELALDFPV RLENPHVIKR NQVWGGVVTH GPNNGVLNSS FRFRDTREYK
TEIGSVILST ARIVPDGLLV FFPSYGVMHS CVNHWRSTGL WNQLETNKTC LVEPSNADEF
HACYDSYNKA LEEDSRRGAA FFAVCRGKLS EGIDFADKAC RGVILTGIPY AGAKDPLVMH
KRTYLDKRKA DNGGAYSGNE WYSQTAMRAV NQALGRVIRH KDDFGAVILA DERFANENAR
NQLSLWLRPS VQVHSVFHSA VHGLKEFFQ