Gene OSTLU_32894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_32894 
Symbol 
ID5003237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp34096 
End bp37299 
Gene Length3204 bp 
Protein Length1067 aa 
Translation table 
GC content57% 
IMG OID640418658 
Productpredicted protein 
Protein accessionXP_001419051 
Protein GI145349251 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1199] Rad3-related DNA helicases 
TIGRFAM ID[TIGR00604] DNA repair helicase (rad3) 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0780237 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCACC TCGAGCGCGC GCACGACGCG CGCGCGACCG CAGACACCGA CGACGGCCGA 
CGCGCGTCCT CTGATCCGAG TCCGTCGACG CAGAGCGCGT GGAAACGCAC CATCGCGTCG
TGCGAGGTGC GATTTCCGTT GACGCCGTAC AAGTCGCAGG TGCAAGTGAT GAGCGCGGTC
GTTCGAGCGG CGCGCCGAGG AACGTGCGCG CTCGTGGAGT CGCCGACGGG GAGCGGGAAG
ACGCTCGCGC TGCTGTGCGC GGCGCTGGCG TGGAGCGAGA GCGAGAGCGA ACGCAGGGAA
GACGTGGGCG TGGACGACGA CGAAGACGGG GAGGAGGACG AAGCGCGCAA ACGCGAGACG
AAGAGAATCG GAAACGGTAA ACCACCAAAG ATTTATTACG CCACGCGCAC GCACGCGCAG
ATCGCACAAA TAGTCGGTGA ATTGTCGAGG ACGGCGTATA AGCCGCACAC CGTGGTGTTG
GCGTCGAGGG AGCATTATTG CGTGAATAAG AGCGCGAGAA AGGGCGGAGA CGTCAACGCG
GAGTGTCGAA GGTTGATGGA TGCCGGCGCC GCGGGTGGGG ACGGGAAAGG GTGTTTTTAT
AGCGGACAAG GGGCGAGTAA GTTGGCGTCG TTGGCGAAAA ACCATCCCGA CGCGCTGGAT
ATCGAGGATT TGGTGAAGAT GGGGACGTCG AAGAAGGGAT GCCCGTACTT TGCGTCGAAA
ATCATGGCGG AGAGCGCGGA ATTAATTTTC TGCCCGTACA ACTATCTCTT GGATCCGCGC
ACGCGTTCGG CCATGGACAT CGATATCGAA GGTTCACTCA TCATTTTCGA TGAGGCGCAC
AACATCGAGG ACACCGCGCG CGAGGCGGCG AGTGAGGAAA TAATTTTAGA CGACGTCGCG
AACGCCATCG ATCGTCTGAG CGAGATGCGG CGGCGCGCGA CGGCAAACGT GAGCGAGTGC
GAATTGGTGT TGAGGAGCAT GAAGGGCGTT TACGATTGGT TCATTGGATT TTGCGACGAA
AAAAGCCCAA GTTACGGTCT GAAGCAAGCG CAAGAAGCGT TATCGGCGAT GGTTCGAGGG
GAGCAGATTT TACAAACGTT GGCGGAAGCC GGGCTGACTG AAGAGAGCGT TTTAGAAGTC
ATGCGAGCGC TCGGCGTCAT CACCAAGTAT AATCAAGAGA ACAAAGATCC GAAAGAGCGC
GTCGCGGGCA GCGTTTTCAA CACCTGTGAA AAAGTGCTGA ATCCTATCAA GTTTTTGCTC
TCGCGAGGCG AAGTCACCGC GCGCGATTAC AAAATCGTCT TCACAAAGAC GCGCGAAAGC
GATCGAGTCG TTTCGACGCA ACGAGTGAAC TCTGAATTGA ATAGATTACC CGTGGAAGAA
CTGGTGAAGA TTAACTTTTG GGCACTGAAT CCAGCGCTGG CTTTCAGAGA GCTCGTGAGC
GAAAACGGCG GCGCGCGTTC AGTCGTGCTC ACGTCTGGTA CTCTCGCGCC GCTCAATTCG
TTCGCGAGCG AGCTCGGCGT ACCGTTTCCC ATCCGCATGG AAGCCCCGCA CTGCGTGGAT
ATGGATCGTC AGGTCTGGGG CGGCATCGTC GCCGCCGGTC CGAGCAACAT AGCTTTGAAT
GCCGGTTACA AGTCTCGCGG CGATACTAGT TTTCAAGACG AACTCGGCGC GTGTTTGCGA
GACGTCGCGA AAGTCACCCC TCACGGCTTG CTCATGTTCT TCCCATCGTA CAGTCTTTTG
GAAACCATCG TTCGACGATG GCGCGAAACG GGACTGCTGC GCGCGATAGA GCAGGCGAGC
GGCAAGAAGA TTTTTCAAGA ACCGGGTAAA TCATGCTCGT ACGGTAAAAA ACCGGTGACG
CTTGAGACGG TGTTGGAAAA GTACTATTCC GCGGTGGCGA CGAGCGTCAA GGCAGCAAAA
CATCCGTACG CTCCGGCTCC AGCGAACGCC AAATGTCGAG GTGCCATTCT CTTCGCCGTG
TGTAGAGGCA AGATTAGTGA AGGTATTGAC TTCGCCGACG CTAACGCGCG GGCGGTGATC
TGTGTTGGGA TTCCGTATCC AAACATCAAA GACGCGCTCG TCGCGGCGAA GCGAAGTTAC
AACGAAGAAG GAGCGCATCG AGGTTTGCTG AGTGGCTCGA AATGGTACGA TCAGCAGGCT
TTTCGGGCGC TCAATCAAGC CGTCGGTCGA TGCCTGCGGC ATCGTCACGA TCACGGCGCA
ATCATGCTCG TCGATTCGCG TTTCAACAAC AGCAACATCC AGGCGCTGCC GAAATGGTTA
CGTCCCGCCA TGCAGAAGAG CGCGTCTCGC TTTGGCGACC AAGTGAAAAG CTTGGAAAAT
TTCTTCGTTT CGCACGCCGA GAATCCACCG AGCGATGACG CTCCAGCGAG CAAGACAGGC
GACCAACGGA AGAATAGCAA ACGAGCGAGA GTGTCCATCG TCGCGACGTC TCCTCGAAAG
GCGATGCGTA ATACGCCAAT CACGAGCTTC TTTCAAAAGG CGTCGACGAG CAACGCGACT
CATTCAGATC CTCTCATGCG CGACATCGAG CAGCCATCGA AAAAGACAAA GGATGACCAA
ATCGGTGACT ACGAGGCGCC AGTGTTCACG GACGACGACG AGCTTGACGT CGACATTGAC
GCGCTCATGG CGGACAACCA TGAGTTATCG CCAGAGGCGA ATGTCGCGCA AGTTCTCGAC
GGTGTCGAGT GGGAGGATTG GGAAGACGAC GACGATCTAA TGGCGCAAGC CGTGCAAGCG
ACACAAACCG CGCACACGAT GAGTAAGGTA GAGAGCACGA CACCGACTGA ACAAGTCCTG
GATGTCGTCT CGCCTCTCAA GCAACGCGCG AAGAAAGTCT GCGGCGACGA CTCCGTCTGC
GCCAAGTGTG GAAACACTCG CTATTCGCAG ATTGACACCG AAAATGATGA TCGAATCGCG
CTTCAATCTC AATATTTGGA TTGTGTTCTG CAAACCAAGG CGAATGACAT CCATGAATTC
GCCGTGGTCG AGATGCGCGG CGGTTGTTGC GCCTCGCCGC CGTCGGAGAC GGTTGAATTC
GACCGCGACT TGCGCGTCGC CTTCAGTCGA GTGAACTCCT GCGATGGCGA TCGATTGATC
GGATTACGAG TGGAAGCAAC GAATGCGATG TATGCACACT TGTGCGATCG CGTCTTGCTC
CGGCAACACG ACGCCCGCGC GTAG
 
Protein sequence
MSHLERAHDA RATADTDDGR RASSDPSPST QSAWKRTIAS CEVRFPLTPY KSQVQVMSAV 
VRAARRGTCA LVESPTGSGK TLALLCAALA WSESESERRE DVGVDDDEDG EEDEARKRET
KRIGNGKPPK IYYATRTHAQ IAQIVGELSR TAYKPHTVVL ASREHYCVNK SARKGGDVNA
ECRRLMDAGA AGGDGKGCFY SGQGASKLAS LAKNHPDALD IEDLVKMGTS KKGCPYFASK
IMAESAELIF CPYNYLLDPR TRSAMDIDIE GSLIIFDEAH NIEDTAREAA SEEIILDDVA
NAIDRLSEMR RRATANVSEC ELVLRSMKGV YDWFIGFCDE KSPSYGLKQA QEALSAMVRG
EQILQTLAEA GLTEESVLEV MRALGVITKY NQENKDPKER VAGSVFNTCE KVLNPIKFLL
SRGEVTARDY KIVFTKTRES DRVVSTQRVN SELNRLPVEE LVKINFWALN PALAFRELVS
ENGGARSVVL TSGTLAPLNS FASELGVPFP IRMEAPHCVD MDRQVWGGIV AAGPSNIALN
AGYKSRGDTS FQDELGACLR DVAKVTPHGL LMFFPSYSLL ETIVRRWRET GLLRAIEQAS
GKKIFQEPGK SCSYGKKPVT LETVLEKYYS AVATSVKAAK HPYAPAPANA KCRGAILFAV
CRGKISEGID FADANARAVI CVGIPYPNIK DALVAAKRSY NEEGAHRGLL SGSKWYDQQA
FRALNQAVGR CLRHRHDHGA IMLVDSRFNN SNIQALPKWL RPAMQKSASR FGDQVKSLEN
FFVSHAENPP SDDAPASKTG DQRKNSKRAR VSIVATSPRK AMRNTPITSF FQKASTSNAT
HSDPLMRDIE QPSKKTKDDQ IGDYEAPVFT DDDELDVDID ALMADNHELS PEANVAQVLD
GVEWEDWEDD DDLMAQAVQA TQTAHTMSKV ESTTPTEQVL DVVSPLKQRA KKVCGDDSVC
AKCGNTRYSQ IDTENDDRIA LQSQYLDCVL QTKANDIHEF AVVEMRGGCC ASPPSETVEF
DRDLRVAFSR VNSCDGDRLI GLRVEATNAM YAHLCDRVLL RQHDARA