Gene Information |
Locus tag | OSTLU_36244 |
Symbol | |
ID | 5000269 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | + |
Start bp | 247309 |
End bp | 249399 |
Gene Length | 2091 bp |
Protein Length | 696 aa |
Translation table | |
GC content | 58% |
IMG OID | 640415690 |
Product | predicted protein |
Protein accession | XP_001416108 |
Protein GI | 145342048 |
COG category | [A] RNA processing and modification |
COG ID | [COG5107] Pre-mRNA 3'-end processing (cleavage and polyadenylation) factor |
TIGRFAM ID | |
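The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS includes its terminal stop codon (the gene sequence in this record ends in TAA); the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag OSTLU_36244.
length_bp = gene_length(247309, 249399)
print(length_bp)                  # 2091
print(protein_length(length_bp))  # 696
```

Under these assumptions the computed values agree with the Gene Length (2091 bp) and Protein Length (696 aa) fields of this record.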
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGACA CGGCTCGCGC GACGCGGGTG AAGAATAAAG CGCCCGCGCC GGTGCAAATC ACCGCGGAAC AAATCGTGCG CGAATCCGCG GAGCGCGCGG AGGACGTGTA CGGGGCGCCG AAGCGGAAGA TCGCGGACCA GGAGGAGCTG AAGGAGTACA GATACGAACA GCGGAAGCAG TTCGAGGATC GAGTGCGGTC GTCGTACTGG GAACCGCGGG CGTGGATACG GTACGCGAAG TGGGAGGAGG GACAGGGGGA TTTGCCGAGG GCGCGAAGCG TGTGGGAGCG AGCGCTGGAA CATCACGGAC GGGACGTGCC GATATGGTTG CAGTACGCGG AGATGGAGAT GAAGAACAAA GCGATCAATC ACGCGAGGAA CGTGTGGGAG CGGGCGTGCT CGACGCTGCC GAGGATTGAT GTGTTTTGGT ACAAGTACGT GAACATGGAG GAGACGCTCG GGCAGGTTGC GGCGGCGAGA CAGGTTTTCG AGAAATGGAT GAAGTGGGAA CCCGAACACA CGGCGTGGAA CGCGTATGTG AAGATGGAGC AGAGGTACGG GGAGAAAGAG CGGGCGAGGG ATATCTTTCA GCGGTATGTG CAGGTGCACC CGGACGTGAA GGCGTGGACG CGCTGGGCAA AGTTTGAGTT TTCGTCGGGC GAACGCGACA AGGCGCGCGA GGTGTACGAA GCCGCGGTGG AGTTTCTGCG GAACGAACCA GAGGTTGGGA ACTTATACGC CAACTTTGCC AAGTTTGAGG AGATGTGTCA CGAAGTTGAG CGCGCTCGGG CGATTTATAA ATTCGCTCTC GATCGTTTGC CCAAGGAGCA AGCCGAGTCT GTGTACAAGG AGTTTATGAA GTTTGAAAAG ATGCACGGAA ATCGAGAAGG GATCGAGGAC GTCGTTGTGG GGCAACGACG GTTCAAGTAC GAGGAAGAAG TGTCCAAGAA TCCGCTGAAC TACGACACTT GGTTCGATTA CATTAGACTA GAAGAAAACG CCGGCGACAT GGCGAAGACG CGCGAGGTGT ACGAACGCGC CATCGCCAAC GTGCCGCCAG CGAACGAAAA ACGCTTTTGG CAGCGATACA TTTATATTTG GATCAACTAC GCGCTTTACG AAGAGCTCGA GGCGCGAGAC GTGGAACGCA CGCGAGAGGT GTACCGAGCG TGCCTCAAGG TGATTCCACA CGCCGAGTTT TCGTTTTCAA AGATTTGGAT CATGGCGGCA AAGTTCGAGC TTCGTCAGCG ACGACTCGAT GCGTGCCGTA AGATTTTTGG ATTGGCCATT GGATTAGCAC CGAAGGCAAA GATATTCGCG ACTTACATTG AGATCGAGTT CCAGCTCGGG AACGTCGACA GGTGTCGCAC CCTGTACGAA AAGTACTTGG AAATTGAGCC GCAAAATTGC TCGACGTGGA TCAAGTACGC CGAACTCGAG CGATCGCTCG GAGAAATCGA ACGTGGGCGC TCGATTTTCG AACTCGCGGT TGATCAGGCC ATGCTAGACA TGCCGGAGGT ACTTTGGAAG GCGTACATCG ATTTCGAGAC GTCCGAGGGA GAGCGCGAGC GTACGCGCGC GCTTTACGAG CGTCTTCTCG AGCGTACGAA GCACGTCAAG GTATGGATGT CTTACGCGCG CTTCGAAGCA ACACCGATTG TTGTTGTGGA TGACGACGCC GACGACGCTG CGATAGCCGC CGCCACCGCC GCGGCGGAGA ACGACGAGCA CGAACGCCTC GAGACGCGCC AAGCGAAAAG CCGCGCCGTG TACGAGCGAG CGCTCGCTGA ACTCAAAGAA 
ATCGATCCAG ACGCCAAAGA AGAGCGTGTC ATGCTTCTCG AGGCGTGGAA GTCATTCGAA GACACCTTAC CGAGCGAATT CTCGAAATCC GCCGACGTCA AGGCGCGTCT TCCCAAGCGC GTTAAGCGCA AGCGCGCCGT CGAGGACGAC GACGGTCGCG AAATCGCACA GGAAGAGTAC TACGATTACG TCTTCCCCGA CGACGCCGGC GCCGCGCAAC CGTCGCTCAA GCTTCTCGAA GCCGCGTACG CGTGGAAGAA GGCCAAAGCC GCGGCTGAGG CGACGGATTA A
|
Protein sequence | MSDTARATRV KNKAPAPVQI TAEQIVRESA ERAEDVYGAP KRKIADQEEL KEYRYEQRKQ FEDRVRSSYW EPRAWIRYAK WEEGQGDLPR ARSVWERALE HHGRDVPIWL QYAEMEMKNK AINHARNVWE RACSTLPRID VFWYKYVNME ETLGQVAAAR QVFEKWMKWE PEHTAWNAYV KMEQRYGEKE RARDIFQRYV QVHPDVKAWT RWAKFEFSSG ERDKAREVYE AAVEFLRNEP EVGNLYANFA KFEEMCHEVE RARAIYKFAL DRLPKEQAES VYKEFMKFEK MHGNREGIED VVVGQRRFKY EEEVSKNPLN YDTWFDYIRL EENAGDMAKT REVYERAIAN VPPANEKRFW QRYIYIWINY ALYEELEARD VERTREVYRA CLKVIPHAEF SFSKIWIMAA KFELRQRRLD ACRKIFGLAI GLAPKAKIFA TYIEIEFQLG NVDRCRTLYE KYLEIEPQNC STWIKYAELE RSLGEIERGR SIFELAVDQA MLDMPEVLWK AYIDFETSEG ERERTRALYE RLLERTKHVK VWMSYARFEA TPIVVVDDDA DDAAIAAATA AAENDEHERL ETRQAKSRAV YERALAELKE IDPDAKEERV MLLEAWKSFE DTLPSEFSKS ADVKARLPKR VKRKRAVEDD DGREIAQEEY YDYVFPDDAG AAQPSLKLLE AAYAWKKAKA AAEATD