Gene Information |
Locus tag | OSTLU_16586 |
Symbol | CHR3511 |
ID | 5003492 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 647384 |
End bp | 650505 |
Gene Length | 3122 bp |
Protein Length | 983 aa |
Translation table | |
GC content | 58% |
IMG OID | 640418913 |
Product | predicted protein |
Protein accession | XP_001419229 |
Protein GI | 145349626 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
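The coordinate fields above agree with one another if the start and end positions are read as inclusive, 1-based coordinates: 650505 - 647384 + 1 = 3122 bp. The record does not state that convention, so it is an assumption here. The sketch below reproduces that arithmetic and shows one way a GC percentage could be computed from a sequence string. The helper names `gene_length` and `gc_percent` are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive, 1-based start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for display grouping."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Coordinates taken from the Start bp and End bp fields of the record.
print(gene_length(647384, 650505))
# Toy sequence for illustration only, not the gene itself.
print(gc_percent("ATGC GGCC"))
```

Stripping whitespace before counting matters for this record because the sequence fields are displayed in space-separated blocks of ten characters.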
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTCGA GCGACGACGA CGACGACGAC GACGACGACG AAGAAGAAGA CGAAGGAGAC GAATCATACG AGGCGCGCGA TGGCGACGAC GACGACGACG ACGTCGTCGT GACGCAACGG GACGTGAAAG CGGCGAACAT CGCGTCGCTC GTGAACGGGA CGTTGGAGAC GTGCGCGCGA CCGTACGTGC GGGGACTGAC GGTGGAGGAC CCGGCGGTGA CGCTGAAACG GGCGTTTAAA TCGCCGTTTC CGAACGCGCC GAATCGGAGC GGTGCGGGAC GAGCGATGCG ATGCGACGGC GATGCGAATG GGATGGAGGC GCGCGAGACG CGTTTGAATC TTCGCGTCGG CGAGGGAAAC CGGGCACCCG AGGGAAGAAG GCGGCGCGCG AAGACTGACG ATGAAACGCG CGCGTCGAAC GATCGCGATT CGTGCGAACA GCGGAATTGG AGCGACGGTT GGCGAGTCGA CGGGTGTTCG TGCCGTGGGG GTCGAAGGCG AGCGACGTCG CGAGGGCGTT GCCGAAGCCG ACGCAGTGCG CGGCGACGGA GGAGGCGATC GTATTGCCGG AAGGGATCGA GGATTTAGTG CTTTGGGAGC CGGATCGCGA GGGTGATGGT GGAGAGACGA GCGCGACGGC CGGTGGGGCG AGCGCGAAAC CGATCGTGGT CGATCGCATG CTCACGAGAT GGTTGCGACC GCACCAACGC GAGGGGGTCA AGTTTATGTT TGAGTGCGTC ATGGGTTTGC GCGATTTCGA AGGCCAAGGG TGCATTCTCG CGGATGACAT GGGTTTAGGT AAGACATTAC AAGGGATCAC GCTGCTCTGG ACGCTCTTGA AGCAAGGTAT CGATGGTACG CCCGCGGTGA AGCGCGCGTT AATCGTCTGT CCGACGTCGC TGGTGTCCAA CTGGGACGAC GAGTGCAACA AGTGGTTGAA TGGGCGCGTG AAAACGTTGC CCATTTGCGA CTCCACTCGC GCTGAAGTCG TGAGCTCGGT GAAGCAATTT CTCGCCCCGC GACATCTCGC GCAAGTCATG ATTGTGAGCT ACGAAACGTT TAGGATCCAC TCGGACCGTT TCAACTTTGA CGGCGCGGTG GATTTGATCA TGTGCGACGA AGCGCATCGC CTGAAAAACG GCGAGACGCT GACGAACAAG GCGTTGTGTT CGGTACCGTG CTTGCGAAGA GTGATGCTGA GCGGGACACC CATGCAAAAT CACTTGGATG AGTTTTACTC GATGGTTGGC TTTTGCAATC CCGGCTTGCT CGGCACCCCT CCCGAGTTCG CGAAGAAGTT TGAGCGACCC ATCTTGGCTG GACGCGAACC AGACGCGACG GAAAAGGAAC TCGAACGCGC GCAAGAGGCG AATAGCGAGC TCTCTGATCT CGTCAACAAG TTTATCTTGC GTCGCACGAA CACGATTTTG AGTAAGCACC TTCCGCCGAA AGTCGTGGAA GTCGTGTGCT GTAAGCTTTC GCCGCTTCAG CAGGCGCTGT ACGAGCACTT TCTCACGTCA AAGGCGGCGA ATCAGGCGTT GACTGGAAAG GCGACGGCCG TGTTACCCGC AATCACGGCG TTAAAGAAGT TGTGCAATCA CCCGAAATTG ATTTACGACA TGATTAATGG CGCAAAGAAC ACTGGTCAAG CGGCGAGCGG ATTCAGCACG TGCGCGGAAT TTTTTACTCC CGGAATGTAC GACGGCGGCG GGGGGCGGAG TGGTCGCGGT GGCGGTGGCA TGATGCACGG CTGGGAAGAG CACAGTGGAA AGTTTGCTGT ACTCGCGCGT CTTCTCGCTA 
ATTTGCGCGC CGAGACTAAG GACCGTATCG TCATCATTTC CAATTACACG CAAACTCTCG ACTTGGTGGG CAACATGTGC CGCGAGCGAA ACTACCCGTT CGTGCGCCTC GACGGCTCGA CGTCTATCGG AAAGCGTCAA AAGCTCGTCA AACAGTTCAA TGATCCCACG AGTAATTCTT TTGTGTTTCT CTTGTCTTCC AAGGCTGGCG GATGCGGTAT CAACCTCATC GGAGGTAATC GTCTCGTGTT ATTCGATCCC GACTGGAATC CCGCCAACGA CAAGCAAGCC GCCGCGCGGT GCTGGCGCGA CGGTCAAAAG AAGAAGTGCT ACTTGTATCG CTTTTTGGCC GCGGGCACGA TCGAGGAGAA AGTTTTCCAG CGTCAACTGT CCAAGGAAAG CTTGCAAAAC GTCGTCAACG GTTCGGGGGA ACTCGAACAA TCCGTCATGT CCAAGGACGA GTTGCGAAAG TTGTTTTCGT TGGACTGCAC GACGTACTCG GATACGCACG ACACGTGCGG TTGCAAGCGA TGCCCCGCGC GGAACGGTGA CGACCTCGGG TGCGACCCGG ACGCGGAATA CAAGGAATGG GAGGAACAAA TCGACGACGC AGACGAACAA AAGCTCGACG AGTGGGCGCA CCATCACCGC ATGGACAAAG TTCCCGACGA TATGATGAAA AAGTCGGCGG GCGAGGACGT ATCCTTCGTG TTTTCGCTCA AAGTTGAGGG CGCGGCTATT GATGAATCGA AGAAAGCTCC AGCACCCGAA AAGAAGTCGG AGGAAGCGTC TGAGAAGAAG CCGACGCCGA CGATGCCGTC GCACTCGACG CGCCCACCGA CCAGATTCGT GCCGCCGGCG CGACGTCCGC TCGCGCCGCG AGTTGCGCCG CAGGCGACGG CGATGCCGGC GTATCAACGA CCGAGTACTC TGATCGCTGC GAAGCCATCG GAAGGTCGCG TGCCGCCCAT GCGCGCAGCT GTGAAACCTG CCGCGAAAGC GCCCAAGAAG AAGAAGAAAC TTGAGAGCGA ATCCGAAGAC GAAGACACCG AGGAAGAATC CGAACAGGAA GAATCTGAAG CCGAGGAAGA ATCTGAAGAA GATGAGCCAG TCGCTGATAC GGAAGGGGAC GACGATGATG AAGTACCAGA TTCTGAAGAT GAAGACGTCG CCGACACCTC TGCGCCGCGT CAAAAGCGAT CATCACCGGA CGCGGATGAA AGCGCGACGC GATACAAGGC GGCGCGCCTG TCGTCCTTAA CGTCAGATCG AGGCTTTGCG GTGCCGCCGC GCTCGCCCAG CGACGAATCC GACGGATACT AG
|
Protein sequence | MRSSDDDDDD DDDEEEDEGD ESYEARDGDD DDDDVVVTQR DVKAANIASL VNGTLETCAR PYVRGLTVED PAVTLKRAFK SPFPNAPNRS AELERRLASR RVFVPWGSKA SDVARALPKP TQCAATEEAI VLPEGIEDLV LWEPDREGDG GETSATAGGA SAKPIVVDRM LTRWLRPHQR EGVKFMFECV MGLRDFEGQG CILADDMGLG KTLQGITLLW TLLKQGIDGT PAVKRALIVC PTSLVSNWDD ECNKWLNGRV KTLPICDSTR AEVVSSVKQF LAPRHLAQVM IVSYETFRIH SDRFNFDGAV DLIMCDEAHR LKNGETLTNK ALCSVPCLRR VMLSGTPMQN HLDEFYSMVG FCNPGLLGTP PEFAKKFERP ILAGREPDAT EKELERAQEA NSELSDLVNK FILRRTNTIL SKHLPPKVVE VVCCKLSPLQ QALYEHFLTS KAANQALTGK ATAVLPAITA LKKLCNHPKL IYDMINGAKN TGQAASGFST CAEFFTPGMY DGGGGRSGRG GGGMMHGWEE HSGKFAVLAR LLANLRAETK DRIVIISNYT QTLDLVGNMC RERNYPFVRL DGSTSIGKRQ KLVKQFNDPT SNSFVFLLSS KAGGCGINLI GGNRLVLFDP DWNPANDKQA AARCWRDGQK KKCYLYRFLA AGTIEEKVFQ RQLSKESLQN VVNGSGELEQ SVMSKDELRK LFSLDCTTYS DTHDTCGCKR CPARNGDDLG CDPDAEYKEW EEQIDDADEQ KLDEWAHHHR MDKVPDDMMK KSAGEDVSFV FSLKVEGAAI DESKKAPAPE KKSEEASEKK PTPTMPSHST RPPTRFVPPA RRPLAPRVAP QATAMPAYQR PSTLIAAKPS EGRVPPMRAA VKPAAKAPKK KKKLESESED EDTEEESEQE ESEAEEESEE DEPVADTEGD DDDEVPDSED EDVADTSAPR QKRSSPDADE SATRYKAARL SSLTSDRGFA VPPRSPSDES DGY
|
| |