Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_30338 |
Symbol | |
ID | 5000553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 25966 |
End bp | 27270 |
Gene Length | 1305 bp |
Protein Length | 429 aa |
Translation table | |
GC content | 57% |
IMG OID | 640415974 |
Product | predicted protein |
Protein accession | XP_001416530 |
Protein GI | 145344005 |
COG category | [A] RNA processing and modification |
COG ID | [COG5623] Pre-mRNA cleavage and polyadenylation factor IA/II complex, subunit CLP1 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCCTCGTCAT GGCGTCCGAT GGTGACGATG GCATCGCCGT GCAGACGTTC ACGCTGGAAC AAGAACAAGA ACTGCGCGTG GAGACGCCCG CGAGGGGGGA GATAAAGCTC AAGCTCGTCG ATGGCACGGC GGAAGTCTTC GGGGCGGAGA TCGCCGTCGG GCAGAGCATT ACGTGTGTTT CGGGACGTAA ACTCGCCGTT TTCACCTATC ACGGCGCGAC GATCGAAGTG AGAGGAGAGG TAGAGATCGC GTACGTCGCC GGGGAGACGC CGATGGTGAG TTACGCGAAC ACGCACTCGG TTTTGAATGC GAAACGCGTG GCCGCGGCGA GCGAGAATTC GAGCGAAGCC GAGGGACCGA GGGTAATGTG TGTCGGACCG ACGGACGTGG GTAAGAGCAC GGTGTGTTCT ATATTATGTA ACTACGCCAC GCGCGCCGGA CACGCTCCGC TGTACGTGGA TTTAGATTTA GGACAGGGCG CGGTCACGGT GCCGGGAACG ATTTGCGCCG CGCCGATTGA CGCGCAGATA GACCTCGAAG AGGGAATACC GCTGGAGATG CCTCTGGTGT ACTTTTACGG CGACTTGACT GTGAATAATC CCGATTACTA TAAGCACATC GTCTCGAGGT TGGGCACTAT GCTAGACGAG CGAAGCAAGG CAAACGAAGA GGCGCGCGCG GCTGGATGCG TGGTGAATAC GATGGGTTGG ATCGATGGCG TCGGCCTGGA GCTCTTGCTT CACGCTCGAG AGGCGCTCAA GATTGATCAC GTCCTTGTCA TTGGTCAGGA GCGTTTGTTC GGGCAACTGC AGCAAAAACT TAAGGGAACG GACTGCCAAG TGTTTCGACT GCAAAAGTCT GGCGGCGTCG TTGAACGCAC GCCCGAGTAC CGCCGAGCAT CTCGCGATCG CATGTTCAAG GAATACTTTT ACGGCGCTAC CGGCGAGCTC GCACCGGCGT CGCAGACGGC TTATTTCTCG AAAATCAGCA TATATCGCAT CGGGGGTGGT CCGCGAGCGC CGACGTCCGC GTTGCCAATC GGTCAAGCTC CGTCCACGGA TCCCATGCGA GTCACTCCCG TTGTGCCTTC CACGTCGCTT TTGCACTCCG TCTTAGCGGT TAGTCACGGG AAAACACAGG GGGACTTGCT CACTTCGAAC GTTGCTGGTT TCATTTATAT CACCGAGGTG AACATGATGC AGAAATCGTT CACGTATCTG TCGCCGTGCC CGGGCGAATT GCCGTCAAAC GTCTTGCTCT CTGGTAACTT GAAATGGTTA GGCGAAGATG TGAAGTAGCG AGTGA
|
Protein sequence | MASDGDDGIA VQTFTLEQEQ ELRVETPARG EIKLKLVDGT AEVFGAEIAV GQSITCVSGR KLAVFTYHGA TIEVRGEVEI AYVAGETPMV SYANTHSVLN AKRVAAASEN SSEAEGPRVM CVGPTDVGKS TVCSILCNYA TRAGHAPLYV DLDLGQGAVT VPGTICAAPI DAQIDLEEGI PLEMPLVYFY GDLTVNNPDY YKHIVSRLGT MLDERSKANE EARAAGCVVN TMGWIDGVGL ELLLHAREAL KIDHVLVIGQ ERLFGQLQQK LKGTDCQVFR LQKSGGVVER TPEYRRASRD RMFKEYFYGA TGELAPASQT AYFSKISIYR IGGGPRAPTS ALPIGQAPST DPMRVTPVVP STSLLHSVLA VSHGKTQGDL LTSNVAGFIY ITEVNMMQKS FTYLSPCPGE LPSNVLLSGN LKWLGEDVK
|
| |