Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_32894 |
Symbol | |
ID | 5003237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 34096 |
End bp | 37299 |
Gene Length | 3204 bp |
Protein Length | 1067 aa |
Translation table | |
GC content | 57% |
IMG OID | 640418658 |
Product | predicted protein |
Protein accession | XP_001419051 |
Protein GI | 145349251 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1199] Rad3-related DNA helicases |
TIGRFAM ID | [TIGR00604] DNA repair helicase (rad3) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0780237 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCACC TCGAGCGCGC GCACGACGCG CGCGCGACCG CAGACACCGA CGACGGCCGA CGCGCGTCCT CTGATCCGAG TCCGTCGACG CAGAGCGCGT GGAAACGCAC CATCGCGTCG TGCGAGGTGC GATTTCCGTT GACGCCGTAC AAGTCGCAGG TGCAAGTGAT GAGCGCGGTC GTTCGAGCGG CGCGCCGAGG AACGTGCGCG CTCGTGGAGT CGCCGACGGG GAGCGGGAAG ACGCTCGCGC TGCTGTGCGC GGCGCTGGCG TGGAGCGAGA GCGAGAGCGA ACGCAGGGAA GACGTGGGCG TGGACGACGA CGAAGACGGG GAGGAGGACG AAGCGCGCAA ACGCGAGACG AAGAGAATCG GAAACGGTAA ACCACCAAAG ATTTATTACG CCACGCGCAC GCACGCGCAG ATCGCACAAA TAGTCGGTGA ATTGTCGAGG ACGGCGTATA AGCCGCACAC CGTGGTGTTG GCGTCGAGGG AGCATTATTG CGTGAATAAG AGCGCGAGAA AGGGCGGAGA CGTCAACGCG GAGTGTCGAA GGTTGATGGA TGCCGGCGCC GCGGGTGGGG ACGGGAAAGG GTGTTTTTAT AGCGGACAAG GGGCGAGTAA GTTGGCGTCG TTGGCGAAAA ACCATCCCGA CGCGCTGGAT ATCGAGGATT TGGTGAAGAT GGGGACGTCG AAGAAGGGAT GCCCGTACTT TGCGTCGAAA ATCATGGCGG AGAGCGCGGA ATTAATTTTC TGCCCGTACA ACTATCTCTT GGATCCGCGC ACGCGTTCGG CCATGGACAT CGATATCGAA GGTTCACTCA TCATTTTCGA TGAGGCGCAC AACATCGAGG ACACCGCGCG CGAGGCGGCG AGTGAGGAAA TAATTTTAGA CGACGTCGCG AACGCCATCG ATCGTCTGAG CGAGATGCGG CGGCGCGCGA CGGCAAACGT GAGCGAGTGC GAATTGGTGT TGAGGAGCAT GAAGGGCGTT TACGATTGGT TCATTGGATT TTGCGACGAA AAAAGCCCAA GTTACGGTCT GAAGCAAGCG CAAGAAGCGT TATCGGCGAT GGTTCGAGGG GAGCAGATTT TACAAACGTT GGCGGAAGCC GGGCTGACTG AAGAGAGCGT TTTAGAAGTC ATGCGAGCGC TCGGCGTCAT CACCAAGTAT AATCAAGAGA ACAAAGATCC GAAAGAGCGC GTCGCGGGCA GCGTTTTCAA CACCTGTGAA AAAGTGCTGA ATCCTATCAA GTTTTTGCTC TCGCGAGGCG AAGTCACCGC GCGCGATTAC AAAATCGTCT TCACAAAGAC GCGCGAAAGC GATCGAGTCG TTTCGACGCA ACGAGTGAAC TCTGAATTGA ATAGATTACC CGTGGAAGAA CTGGTGAAGA TTAACTTTTG GGCACTGAAT CCAGCGCTGG CTTTCAGAGA GCTCGTGAGC GAAAACGGCG GCGCGCGTTC AGTCGTGCTC ACGTCTGGTA CTCTCGCGCC GCTCAATTCG TTCGCGAGCG AGCTCGGCGT ACCGTTTCCC ATCCGCATGG AAGCCCCGCA CTGCGTGGAT ATGGATCGTC AGGTCTGGGG CGGCATCGTC GCCGCCGGTC CGAGCAACAT AGCTTTGAAT GCCGGTTACA AGTCTCGCGG CGATACTAGT TTTCAAGACG AACTCGGCGC GTGTTTGCGA GACGTCGCGA AAGTCACCCC TCACGGCTTG CTCATGTTCT TCCCATCGTA CAGTCTTTTG GAAACCATCG TTCGACGATG GCGCGAAACG GGACTGCTGC GCGCGATAGA GCAGGCGAGC GGCAAGAAGA TTTTTCAAGA ACCGGGTAAA TCATGCTCGT ACGGTAAAAA ACCGGTGACG CTTGAGACGG TGTTGGAAAA GTACTATTCC GCGGTGGCGA CGAGCGTCAA GGCAGCAAAA CATCCGTACG CTCCGGCTCC AGCGAACGCC AAATGTCGAG GTGCCATTCT CTTCGCCGTG TGTAGAGGCA AGATTAGTGA AGGTATTGAC TTCGCCGACG CTAACGCGCG GGCGGTGATC TGTGTTGGGA TTCCGTATCC AAACATCAAA GACGCGCTCG TCGCGGCGAA GCGAAGTTAC AACGAAGAAG GAGCGCATCG AGGTTTGCTG AGTGGCTCGA AATGGTACGA TCAGCAGGCT TTTCGGGCGC TCAATCAAGC CGTCGGTCGA TGCCTGCGGC ATCGTCACGA TCACGGCGCA ATCATGCTCG TCGATTCGCG TTTCAACAAC AGCAACATCC AGGCGCTGCC GAAATGGTTA CGTCCCGCCA TGCAGAAGAG CGCGTCTCGC TTTGGCGACC AAGTGAAAAG CTTGGAAAAT TTCTTCGTTT CGCACGCCGA GAATCCACCG AGCGATGACG CTCCAGCGAG CAAGACAGGC GACCAACGGA AGAATAGCAA ACGAGCGAGA GTGTCCATCG TCGCGACGTC TCCTCGAAAG GCGATGCGTA ATACGCCAAT CACGAGCTTC TTTCAAAAGG CGTCGACGAG CAACGCGACT CATTCAGATC CTCTCATGCG CGACATCGAG CAGCCATCGA AAAAGACAAA GGATGACCAA ATCGGTGACT ACGAGGCGCC AGTGTTCACG GACGACGACG AGCTTGACGT CGACATTGAC GCGCTCATGG CGGACAACCA TGAGTTATCG CCAGAGGCGA ATGTCGCGCA AGTTCTCGAC GGTGTCGAGT GGGAGGATTG GGAAGACGAC GACGATCTAA TGGCGCAAGC CGTGCAAGCG ACACAAACCG CGCACACGAT GAGTAAGGTA GAGAGCACGA CACCGACTGA ACAAGTCCTG GATGTCGTCT CGCCTCTCAA GCAACGCGCG AAGAAAGTCT GCGGCGACGA CTCCGTCTGC GCCAAGTGTG GAAACACTCG CTATTCGCAG ATTGACACCG AAAATGATGA TCGAATCGCG CTTCAATCTC AATATTTGGA TTGTGTTCTG CAAACCAAGG CGAATGACAT CCATGAATTC GCCGTGGTCG AGATGCGCGG CGGTTGTTGC GCCTCGCCGC CGTCGGAGAC GGTTGAATTC GACCGCGACT TGCGCGTCGC CTTCAGTCGA GTGAACTCCT GCGATGGCGA TCGATTGATC GGATTACGAG TGGAAGCAAC GAATGCGATG TATGCACACT TGTGCGATCG CGTCTTGCTC CGGCAACACG ACGCCCGCGC GTAG
|
Protein sequence | MSHLERAHDA RATADTDDGR RASSDPSPST QSAWKRTIAS CEVRFPLTPY KSQVQVMSAV VRAARRGTCA LVESPTGSGK TLALLCAALA WSESESERRE DVGVDDDEDG EEDEARKRET KRIGNGKPPK IYYATRTHAQ IAQIVGELSR TAYKPHTVVL ASREHYCVNK SARKGGDVNA ECRRLMDAGA AGGDGKGCFY SGQGASKLAS LAKNHPDALD IEDLVKMGTS KKGCPYFASK IMAESAELIF CPYNYLLDPR TRSAMDIDIE GSLIIFDEAH NIEDTAREAA SEEIILDDVA NAIDRLSEMR RRATANVSEC ELVLRSMKGV YDWFIGFCDE KSPSYGLKQA QEALSAMVRG EQILQTLAEA GLTEESVLEV MRALGVITKY NQENKDPKER VAGSVFNTCE KVLNPIKFLL SRGEVTARDY KIVFTKTRES DRVVSTQRVN SELNRLPVEE LVKINFWALN PALAFRELVS ENGGARSVVL TSGTLAPLNS FASELGVPFP IRMEAPHCVD MDRQVWGGIV AAGPSNIALN AGYKSRGDTS FQDELGACLR DVAKVTPHGL LMFFPSYSLL ETIVRRWRET GLLRAIEQAS GKKIFQEPGK SCSYGKKPVT LETVLEKYYS AVATSVKAAK HPYAPAPANA KCRGAILFAV CRGKISEGID FADANARAVI CVGIPYPNIK DALVAAKRSY NEEGAHRGLL SGSKWYDQQA FRALNQAVGR CLRHRHDHGA IMLVDSRFNN SNIQALPKWL RPAMQKSASR FGDQVKSLEN FFVSHAENPP SDDAPASKTG DQRKNSKRAR VSIVATSPRK AMRNTPITSF FQKASTSNAT HSDPLMRDIE QPSKKTKDDQ IGDYEAPVFT DDDELDVDID ALMADNHELS PEANVAQVLD GVEWEDWEDD DDLMAQAVQA TQTAHTMSKV ESTTPTEQVL DVVSPLKQRA KKVCGDDSVC AKCGNTRYSQ IDTENDDRIA LQSQYLDCVL QTKANDIHEF AVVEMRGGCC ASPPSETVEF DRDLRVAFSR VNSCDGDRLI GLRVEATNAM YAHLCDRVLL RQHDARA
|
| |