Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_26305 |
Symbol | |
ID | 5003943 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | - |
Start bp | 572921 |
End bp | 576423 |
Gene Length | 3503 bp |
Protein Length | 1120 aa |
Translation table | |
GC content | 55% |
IMG OID | 640419364 |
Product | predicted protein |
Protein accession | XP_001420218 |
Protein GI | 145351726 |
COG category | [A] RNA processing and modification |
COG ID | [COG5161] Pre-mRNA cleavage and polyadenylation specificity factor |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.694924 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.69508 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAACG AGGACGCGCG CGAGAGCGGT TCAGGCGCGC GGTGTCAGCA CAATTACGTC GTCACGGTGC GCGCGCGCGC GTACGACGCG AAGCGACGCG ACGCTCGATC GTTGACGACC GAGCGAACGA CCGACTGCGC GACGCTCTGC GCGATGACGA GCGATCGACT GACGACGACG ACGACGACGA CGCGTGGTTT CCGCAGGCGC ACAAACCCAC CGTGGTCACG CACTCGGCCG TGGGAAAGTT TACGTCGAGC GAGGCGACTG ATTTAATCGT CGCGAAGAGC ACGCGACTGG AGGTGTATCG CTTGCACGCG GAGGGGCTGA AACCGGTGCT GGATGTGCCG ATAAATGGTC GAATCGCGAC GATGTCGCTG TGCCAAACGG GATCGGGCGA TGGGAAGGCG CGGTTGTACC TGACGACGGA GAGGTATGGG TTTACGGTGC TGTCGTACGA CGAGGCGAAC GAAGAGTTGA AGACGGAGGC GTTCGGGGAC GTGCAGGATA ATATCGGTCG ACCGGCGGAT GATGGACAGA TTGGGATCGT GGATGACACG TGTCGAGCGA TCGGGTTGCG GCTGTACGAC GGATTATTCA AGGTGATTCC GTGCGACGAA AAGGGCGGGG TGAAGGAGGC GTTTAACATT CGCCTCGAGG AGTTGCGCGT GGAGGACATC AAGTTCTTGC ACGGCACGCC CAAGCCGACG ATCGCGGTGC TTTACCGAGA CACTAAAGAT GCGGTACACA TTAAGACGTA CGAGATCGGC ATTCGCGAGA AGGAATTCGT GTCGTCGCCT TGGGCGCAGA ACGATTTAGA GGGTGGCTCG AATAAAATAA TTCCGGTCCC CGCGCCCATC GGCGGTGTCG TCGTCTTGGG GCAGGAAATT ATCGTGTACT TGAACAAGTT CGAAGACGAC GCAGATGTGT TTCTCAAAGC GATCAACATC CCCAACATCC CCGATCGGAC GAACATCACG TGCTACGGCG CTATTGATCC GGACGGCTCG CGCTACCTGC TGGGCGACGC GGACGGCATG CTCTACTTGC TCGTCATCTT ACACGACGGC AAGCGCGTTC GAGAGCTTAA AATTGAGAGA CTTGGCGACA CGTCAATCGC GAGTACGCTT AGCTATCTCG ATAACGGCGT GGTATTCGTC GGTAGCACGT ACGGCGACTC GCAACTCATC AAGCTGCACG CGGAGAAGAC GAGCATCGAT AAAGACGGCA ACCCGACGTA CGTGCAAATT TTAGAAGAGT TCACCAATCT CGGTCCCATT GTAGACTTTG CATTCGTTGA CTTAGAGCGG CACGGTCAAG GGCAAGTGGT TACGTGTAGC GGGGCGTTAA AAGATGGGAG TTTACGCGTA GTGCGCAACG GCATAGGCAT TGACGAGCAA GCGGTGATTC AGCTTCCCGG TGTCAAAGGC TTGTTTTCAC TTCGCGATAG CGACGATAGC CAAATGGATA AGTACTTGGT CGTCACATTC ATAAACGAAA CTCGCATCTT GGGCTTTGTC GGGGACGAAG GGGACACGTT GGACGAGACA GAAATTGCGG GTTTCGACGC TGAAGCGCAA ACTTTGTGCT GTGGCAACAT GCAAGGCAAT GTTTTCCTTC AAGTGACGCA CAGGGGCGTC CGCCTCGTCT CAAGAGGTGG CGACTTACTC GATGAATGGA AGCCAAAGGA CGGTGCGGAG ATTCTTTCGG CGAAGTGCAA CCCGACTCAA ATTCTTGTTG CAGCGGCGGG CGGGCAGTTA CACTGCTTGA ACGTGGCGAA GGGCAAAATC GTCCTCTTGG CGAGTAAGAC ATTCGAGAAC GAAATAGCTT GCTTAGATTG CACGCCCATG GGCGATGGGA TGAGCTCGCC GGTGTGCGCG GTTGGTTTGT GGTCCATGGA TATCGTGCTC GCGTCAATGA GTGACTTGAG CGTCATCACG AAAGAAAGTA CAGATGAAGA CATCATCCCT CGATCGACGT TGCTGTGTTC GTTCGAAGAC ATTCCATACT TGTTCGTAGG CTTAGGTGAC GGTCAGTTGA TCACGTACGT TTTAGATCAG AACACTGGGG CTTTGAGCGG GCGCAAAAAA TTGAGCCTCG GTACCAAGCC GATTACGTTG CAAACGTTTA AGAGCCATGC GACGAATGTG AGTAGTGTGT TCGCGGCGTC GGATAGGCCG ACGGTGATCT TCAGCAACAA CAAAAAGCTA ATCTACTCCA ACGTCAACGT GCAAGAGGTG TTGCATGTCT GTCCGTTCAG CAGCGAAGCG TTCCCGGACG CTCTCGCGTT GGCGGGCGAC GAAGATTTAA CCATCGGCGG GATTGACGAC ATTCAAAAGC TGCACATTCG CACGATCCCA CTCGGTGGTC ACCCGCGTAG GATCGCGCAT CAGGTCGACA CGAACACATT CGCGGTCGCG GTCGAGCATT TGATGTCGAA GGGTGATCAA GAACTCTTCA TCAGACTCAT CGACGACGGT TCGTTTGACA CGCTCCACCA GTTTCGATTA GAAGAGCACG AATTGGCGAG TTCCTTGATG TCGTGCTCGT TCGCGGGCGA TTCGAGAGAG TACTACGTCG TCGGTACGGG GTTTGCTTAC GAGCAAGAAG ATGAACCGTC GCGCGGGCGC ATTCTCGTCT TACGCGTTGA GGCGGACGCC CTTGAACTCG TATCCGAGAA AGAAGTTCGA GGCGCCGTGT ACAACTTGAA CGCATTCAAG GGTAAACTTC TCGCGGGGAT CAATTCTAAG CTAGAGCTAT TCAAATGGAC ACCGCGCGAA GACGACGCGC ACGAGCTGGT GAGCGAGTGC TCGCACCACG GCCAAATCAT CACGTTCTCC GTCAAGACGA GAGGCGATTG GATTCTCGTC GGCGATTTGC TTAAGTCCAT GTCGCTGTTA CAGTACAAGC CCGAAGAAGG CGCGATCGAC GAGATCGCGA GAGACTTTAA CGCAAATTGG ATGACTGCGG TGGCGATGTT AGACGATGAC GAAACCTATC TCGGCGCCGA GAACAGCTTG AACCTGTTTA CCGTCGCTCG CAACATGAAC GCTATGACGG ACGAAGAGCG TAGTCGTTTG GAAATCACGG GCGAGTATCA CCTAGGTGAG TTCGTCAACG TGTTCTCTCC TGGTTCACTC GTCATGAGCC TCAAAGATGG CGACAGTTTA GAAGTCCCCA CTTTGCTCTT CGGCACTGGT AACGGCGTCA TCGGCGTCTT GGCCAGTCTG CCGAAGGACG CCTACGATTT CGCCGAGCGC CTTCAAACCT CCATGAACAA GCACATCCAA GGCGTCGGTG GCTTGAAACA CGCGGAGTGG CGTTCATTCC GCCACACGCT TCGCCGCAAG AGCGATCCTT CGAGGAATTT CGTCGACGGC GACCTCGTCG AATCGTTTCT CGACTTAAAA GTCGAGCAAG CCGACGTCGT CGCCGCTGAC ATGAAGTGCG ATCGCGCCGA AATCATTCGT CGCGTCGAAG AGCTTCAGCG CTTGACGCAT TAA
|
Protein sequence | MTNEDARESG SGARCQHNYV VTAHKPTVVT HSAVGKFTSS EATDLIVAKS TRLEVYRLHA EGLKPVLDVP INGRIATMSL CQTGSGDGKA RLYLTTERYG FTVLSYDEAN EELKTEAFGD VQDNIGRPAD DGQIGIVDDT CRAIGLRLYD GLFKVIPCDE KGGVKEAFNI RLEELRVEDI KFLHGTPKPT IAVLYRDTKD AVHIKTYEIG IREKEFVSSP WAQNDLEGGS NKIIPVPAPI GGVVVLGQEI IVYLNKFEDD ADVFLKAINI PNIPDRTNIT CYGAIDPDGS RYLLGDADGM LYLLVILHDG KRVRELKIER LGDTSIASTL SYLDNGVVFV GSTYGDSQLI KLHAEKTSID KDGNPTYVQI LEEFTNLGPI VDFAFVDLER HGQGQVVTCS GALKDGSLRV VRNGIGIDEQ AVIQLPGVKG LFSLRDSDDS QMDKYLVVTF INETRILGFV GDEGDTLDET EIAGFDAEAQ TLCCGNMQGN VFLQVTHRGV RLVSRGGDLL DEWKPKDGAE ILSAKCNPTQ ILVAAAGGQL HCLNVAKGKI VLLASKTFEN EIACLDCTPM GDGMSSPVCA VGLWSMDIVL ASMSDLSVIT KESTDEDIIP RSTLLCSFED IPYLFVGLGD GQLITYVLDQ NTGALSGRKK LSLGTKPITL QTFKSHATNV SSVFAASDRP TVIFSNNKKL IYSNVNVQEV LHVCPFSSEA FPDALALAGD EDLTIGGIDD IQKLHIRTIP LGGHPRRIAH QVDTNTFAVA VEHLMSKGDQ ELFIRLIDDG SFDTLHQFRL EEHELASSLM SCSFAGDSRE YYVVGTGFAY EQEDEPSRGR ILVLRVEADA LELVSEKEVR GAVYNLNAFK GKLLAGINSK LELFKWTPRE DDAHELVSEC SHHGQIITFS VKTRGDWILV GDLLKSMSLL QYKPEEGAID EIARDFNANW MTAVAMLDDD ETYLGAENSL NLFTVARNMN AMTDEERSRL EITGEYHLGE FVNVFSPGSL VMSLKDGDSL EVPTLLFGTG NGVIGVLASL PKDAYDFAER LQTSMNKHIQ GVGGLKHAEW RSFRHTLRRK SDPSRNFVDG DLVESFLDLK VEQADVVAAD MKCDRAEIIR RVEELQRLTH
|
| |