Gene Information |
Locus tag | OSTLU_42368 |
Symbol | |
ID | 5003289 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 302538 |
End bp | 304439 |
Gene Length | 1902 bp |
Protein Length | 563 aa |
Translation table | |
GC content | 57% |
IMG OID | 640418710 |
Product | predicted protein |
Protein accession | XP_001419339 |
Protein GI | 145349849 |
COG category | [A] RNA processing and modification |
COG ID | [COG5107] Pre-mRNA 3'-end processing (cleavage and polyadenylation) factor |
TIGRFAM ID | |
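The coordinate and length fields above can be cross-checked with a short calculation. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the reported protein length excludes the stop codon; the function names are hypothetical and are not part of the database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with a stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values copied from the record above.
gene_len = span_length(302538, 304439)   # genomic span of the locus
cds_len = coding_length(563)             # 563 aa plus an assumed stop codon
non_coding = gene_len - cds_len          # bases not accounted for by codons

print(gene_len, cds_len, non_coding)
```

Under these assumptions the span matches the reported Gene Length of 1902 bp, and the bases left over after the coding length is subtracted are consistent with the record marking the gene as spliced.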
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0945405 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.143131 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGATTGA AGGCTGTCGA TCTCAAGAAG CGCTGCGCCG ACGCTGGGCT CGACGATAGC GGCACGAAAG CCGATCTCGT GGGAAGATTA CTCAATCAAG CGTCAAACGA AGATTCTTCG CCCCCGGACG CCGGCGATGC GCCCGAACGC GTCGTCGCGA CGGACGCCGA CGACGCAAAG TTAGAAGATG CGGACGTGGA TGCGGTCGAA ACCGACGGGA GCGGCGATCA CGATGATTTG CTGACGCATT TAAGAGAGAA TCCGTCAGAC GCCGCCGGGT GGGAACGGTG CGCGAGACTG GCGTTGGCGA CGACGATTCC GAGCGCTCGA CCGCTGTTCG ACGCCATCAC CGAACAGTTC CCTCGATCAT CGTTGGCGTG GTGCTGGTAC GTCGACGCGG AGCTTTCGAA GAACGATGCG GGGACGCCGG ACGACGAAGC GATCCGCGCC ATTTTCGGGA AGTGCTTGAT ACCGTGCCCG AGCGCGTTGC TCTGGCGAAG ATACGCGTCG TACATGGCGA GCACGAACGA TGTGACGACA GAAGAGGGGG TGAACACAAT GAAATCGGTG TACGAGTACT CCGTGGACGT CGTCGGCGAG GACGCGGACG CTGGTGATCT TTGGATGGAC TATTGTCAGT TCTTGCGCAG CACCGAGGCG ACGCTGATTG TCACCGACGT CGCTGTGGAG CAAGCGCCGA GTGCGAGAGA TATGATCGTT CGCAGGACGT ACCAGAAGGC GATTTCAGTG CCGATGCACA AATTAGATGC AGTTTACAAG GTTTACGAAG CGTTTGAGCT TGAGAAGAAC AAGGCGCTCG CGAGGGCGTT GTTGCAGGAA ATAGCGCCAA AGTTGTTGCT CACACGCACG GCGCTCGGTA AGAGGAAGAA GGTGCTCGCC GACGTCGTCG TGGGCGCGGT GTGTGTTGAT CCAGTGCAGC GCGGTGCGGA TGGGTTGACT TCGATTATTT GTCCCGCGTT CAATGCCGCT GGGTGTAGAG GTTGTGCGCG CGCGCACGAA TGTAAATTTT GCGGCTCGCC AGCGCACGGG GCGAAGGCCT GCAAGAGTGG ACGATACGCG CTTCTTGCGG CATCACCCAC GGCGTGCGCA GCTCAATGGG CAGAGATTAT CGATTTTGAG AAGTCAAACG TGCAAAAGCT TGAAGGTGCA ACGCCGAATG AACCGTCCCC ACAACTCTAT GCGCGCGTGA AGCACGCCTA CGAATTAGCA GGTCTGTCTC TCGGTGAGAC ACCTGAGTTT TGGCTAGAGT ACGCGCACTG GCACGAGAGC GAGAATCGTT CAGACGAGGC GGTGGAAGTT TTACAGCGAG CGCGGGAGGC GTTACCGTAT TGCACGCTTA TCACTTTTGC GTCGGCGGAT ATCGAAGAGA CTCGCGGCGA CGCCGACGCG TGTAGAGCCA TCTACGAGTC CGTGCTTGAC GCCTATGAAG AGAGCGCTGA CGAAGCCATC GAGCGAGGTG AGGAGATAAT GATGCCCTCG GATATAATTC TAACGTACTG CGAATACGTT CGCGCGTCGC GTCGTGTTGG AGATCAAGAT TCGAGTCGAA AAGCTTTCAT GCGAGCCCGT AAGGCGCCCG GGGCGACGTG GGAAATTTAC GCAAACTCCG CGATGATTGA ATGGCAGTAC GATAAGAGCG ATAAACCGGC GAGAAACATC TTTGAGCTCG GGCTTAAGAA ATTTTTGACG TCGCCAGACT ACGTCGAGCG TTATGCCGAG TTCTTGATTG GCGTGAACGA TGTTGCCAAT GCCCGAGTTT TGTTTGAACG TTCCCTGAGT 
GAATCGCCGT CGATGAAGAT TTGGGATATG TTTGTCGATT TTGAGCGTTC GCATGGCACA GTGGATACGA TTTTAGATGC GGAGGCACGG CGTAATGCCG CG
|
Protein sequence | MRLKAVDLKK RCADAGLDDS GTKADLVGRL LNQASNEDSS PPDAGDAPER VVATDADDAK LEDADVDAVE TDGSGDHDDL LTHLRENPSD AAGWERCARL ALATTIPSAR PLFDAITEQF PRSSLAWCWY VDAELSKNDA GTPDDEAIRA IFGKCLIPCP SALLWRRYAS YMASTNDVTT EEGVNTMKSV YEYSVDVVGE DADAGDLWMD YCQFLRSTEA TLIVTDVAVE QAPSARDMIV RRTYQKAISV PMHKLDAVYK VYEAFELEKN KALARALLQE IAPKLLLTRT ALGKRKKVLA DVSGRYALLA ASPTACAAQW AEIIDFEKSN VQKLEGATPN EPSPQLYARV KHAYELAGLS LGETPEFWLE YAHWHESENR SDEAVEVLQR AREALPYCTL ITFASADIEE TRGDADACRA IYESIMMPSD IILTYCEYVR ASRRVGDQDS SRKAFMRARK APGATWEIYA NSAMIEWQYD KSDKPARNIF ELGLKKFLTS PDYVERYAEF LIGVNDVANA RVLFERSLSE SPSMKIWDMF VDFERSHGTV DTILDAEARR NAA
|
| |
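The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a display string could be cleaned, checked for GC content, and translated follows. It assumes the standard genetic code (the Translation table field in the record is blank), and it only uses the first three blocks of the gene sequence as input; whether the record's 57% GC figure was computed the same way is not stated. All function names are hypothetical.

```python
def clean_sequence(display: str) -> str:
    """Remove the spaces and line breaks used for block display."""
    return "".join(display.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


# Standard genetic code (NCBI table 1), bases ordered T, C, A, G.
# Using this table is an assumption; the record does not name one.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate complete codons in frame 0; unknown codons become 'X'."""
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First three display blocks of the gene sequence in this record.
excerpt = clean_sequence("ATGCGATTGA AGGCTGTCGA TCTCAAGAAG")
peptide = translate(excerpt)

print(len(excerpt), round(gc_fraction(excerpt), 3), peptide)
```

The translated excerpt can be compared with the first block of the protein sequence. Translating the full gene sequence in a single frame would not reproduce the full protein, because the record marks the gene as spliced and the displayed gene sequence appears to include the intervening regions.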