Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_40372 |
Symbol | CHR3502 |
ID | 4999958 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | - |
Start bp | 972053 |
End bp | 974041 |
Gene Length | 1989 bp |
Protein Length | 663 aa |
Translation table | |
GC content | 56% |
IMG OID | 640415379 |
Product | predicted protein |
Protein accession | XP_001415990 |
Protein GI | 145341798 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATGCGT GGGAAGAAGG GCAGTGGGCG CGAGAATCGG CGGAGAGCAA GCGGTTGGGA GAAGTGGAGA GCATCGTCGG GTGGCGACGA TGCGAAAAGG AGGAGACGGA GACGTTGATA AAGTGGAAAG GTACGTCGTA CGCGCATTGC ACGTGGGTGA AAGTCACGGC GTTGGAAAAC GATCCGACAT GTGGTGTGCA GGGTAAGATG CGTGTGGCGA GGTATTTCGA TAAGTATCCA AAGGAGCTCG GGCCGTGCGT GGACGTCAAA CCGGATTACT TGGTCGTCGA CCGCGTCTTT TCGATGTTTG AAGAGGTGGA CAGAACACTC GTGTGCGTGA AGTGGTCTCG AATGAGTTAT GACGAGACGA ATTGGGAAGA TATAACTGCC GTGCGCGAGA TGGAAGGTGG TGCGAGCGCT TTGGAAGAGT TTGAACGCGT CCGAAGCCGC GCTTCAGCGG CGCGCGAGCG CCAAGCCATC GTTGATGCGG AGACGGATGA GGACGTCGCC AATGCGTGGA GTTCGTACGA TGCCGACACC GTGCGCGACT CGTACGGAGA GTCCGACGAG TTGCGATCGT ATCAGAAGGA AGGCGTGAAG TGGATGGCGT TCAACTTCCG AGCCGGACGC GGGTGCATCT TGGCGGACGA GATGGGTCTC GGGAAAACTG CGCAGGCGCT AGCTCTCATA CATCACTGCT TGCAAGTGCG GCCAGGTCTC CCTGCTCTTG TCGTCGTTCC CCTTTCAACG ATTGTGAACT GGGAGCGCGA GGCGCAGCGC TGGGTCCCGG ACGCGTACGT GGTGACGCAC GTCGGCAAGC AAGCCGGTCG CGAATTCGCG CGAGAACACG ACTGGTATCA CCCAGTTGAC GAAACCCAGA GCATATCGCG AGCGTTTAAG GCTAATATCG TCCTCACTAC TTATGAAACG ATTACTGCCG ATCGCCAATC TTTCGCGAAG GCAAAATGGA GTACGATGGT CGTCGACGAA GCGCATCGCT TGAAACGAGT TGGAGGTAAG CTTGGGAACG ATTTGAACAG CCTCGCGGTG GAGCGCATTT GCTTACTCAC GGGCACTCCG CTTCAAAACA ACACCACCGA GCTCTGGTCG TTGCTGAACT TTGTCGATTC TAAGCACTTC TCCAACGCGG AGGAGTTTGA AGAAGCGTTT GGAGGCATGG CAAAGGCTGC GCAAGTCGAG CGTTTACAAA AGGTTCTTGG TCCGTACTTG CTGCGTCGAC TGAAGCGCGA CGTCGAGCAA AAGTTACCAC CGCGAAGTGA GACACTTGTC GAGTGCGAGC TCGCGCCTTT GCAGAAAAAG TGCTATCGTG CATTATTTGA GCGTAACTTT TCCTTTCTTC GGCAAGGTTG CGACTCGAGA GAGAGTTTTG CAAACTTTGC GAACATCATG ATGGAAGTCC GTAAGTGTTG CCAGCACCCG TTTTTGCTCG ACGGCGTCGA AGCTGCCATC GCGCCGGAAG GCGCGAGCAC CACTGCCTTG GTATCGAGCG CGGGAAAGTT GCAGCTCTTG GACAAGCTCC TTCCGCATCT TCGCGAAGGT GGGCATCGAG CTCTCATCTT CAGTCAAATG ACGCGCGTTT TGGACGTCCT GGAGGATTAT TGCCGCGCAC GAGGTCACTC TTACGTGCGA CTTGACGGTA GCATCACCGG CAAAGCACGT CAAGAAGCGA TCGACAAGTA TTGCGCTGAG GATTCTGACA CTTTTCTGTT TCTCCTCTCC ACGCGCGCCG GAGGCCAAGG CATCAACCTC GTCCAGGCTG ACACTGTCGT TATGTTCGAC AGCGACTGGA ATCCGCAAAA CGATGCACAG GCGCTCGCGA GAGCGCATCG CATCGGGCAA ACGCGCCAAG TCCAGGTATA TCGACTCGTC ATGCGGGCCA CGTACGAAAA GGAAATGTTT ACGCGGGCGT CGATGAAACT CGGTCTCGAA CAAGCCATCT TTGGGAGCGC AGAAAAGGAA GAGAAATCA
|
Protein sequence | MHAWEEGQWA RESAESKRLG EVESIVGWRR CEKEETETLI KWKGTSYAHC TWVKVTALEN DPTCGVQGKM RVARYFDKYP KELGPCVDVK PDYLVVDRVF SMFEEVDRTL VCVKWSRMSY DETNWEDITA VREMEGGASA LEEFERVRSR ASAARERQAI VDAETDEDVA NAWSSYDADT VRDSYGESDE LRSYQKEGVK WMAFNFRAGR GCILADEMGL GKTAQALALI HHCLQVRPGL PALVVVPLST IVNWEREAQR WVPDAYVVTH VGKQAGREFA REHDWYHPVD ETQSISRAFK ANIVLTTYET ITADRQSFAK AKWSTMVVDE AHRLKRVGGK LGNDLNSLAV ERICLLTGTP LQNNTTELWS LLNFVDSKHF SNAEEFEEAF GGMAKAAQVE RLQKVLGPYL LRRLKRDVEQ KLPPRSETLV ECELAPLQKK CYRALFERNF SFLRQGCDSR ESFANFANIM MEVRKCCQHP FLLDGVEAAI APEGASTTAL VSSAGKLQLL DKLLPHLREG GHRALIFSQM TRVLDVLEDY CRARGHSYVR LDGSITGKAR QEAIDKYCAE DSDTFLFLLS TRAGGQGINL VQADTVVMFD SDWNPQNDAQ ALARAHRIGQ TRQVQVYRLV MRATYEKEMF TRASMKLGLE QAIFGSAEKE EKS
|
| |