Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_4881 |
Symbol | |
ID | 5004769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | + |
Start bp | 294262 |
End bp | 295215 |
Gene Length | 954 bp |
Protein Length | 318 aa |
Translation table | |
GC content | 63% |
IMG OID | 640420190 |
Product | predicted protein |
Protein accession | XP_001420651 |
Protein GI | 145352650 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1218] 3'-Phosphoadenosine 5'-phosphosulfate (PAPS) 3'-phosphatase |
TIGRFAM ID | [TIGR01330] 3'(2'),5'-bisphosphate nucleotidase, HAL2 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.00138812 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.468917 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GCCGCGCGCG CGGTGCGGCT CGCGGGCGCG CTGTGCCGGA AGATGCAGTT CGAGCTGCGA ACGAACGAAA AAGTGTCGAA ATCGGACGAC TCGCCGGTGA CGGTGGCGGA TTTCGCGGCG CAGGCGGTGG TGTCGCACGT CCTGGGCGTC GCGAGGCCGG ACGTCGGGCT GGTGGCGGAG GAAGACGCGC GGAGTATGCG GGAACCAGCG GGCGCGAAAT TGCGAGCGAG AGTGACGGCG GTGGTGAACG ATGCGCTCGA AGGCGTGGTG GAGCGCAGAC TGAGCGAGGA GGAGGTCATG GACGCGATCG ATCGCGGGGC GACGGACGGC GGCGCGTCGG GGTCGTTTTG GATTCTCGAT CCAATCGACG GCACGAAAGG ATTCATTAAT GGTCGGCAGT ACGCCATCGC TTTGGCGCTC ATGGAGGACG GCGAAGTTAC GGGTGGTGTT CTCGGGTGTC CGAACATGCC GAGCGAGAAG ATACCGCGAG GAGCGACGGA AATTCCGACG GCGGCGCCGG GAGTAATTTT CGTCGCGTAC AAGGGGCGCG GGACGACTGT GGGGGCGTTC GACGCGGAGC ATCCTCTGCG AGATGGCGCG AAAATAACGA CGAATAAAGT GGCCAGTTCG AGCGAAGCGA CGTACATGGA ATCGTGGGGG GACTCCATCG TCGCCGATCA TGGGTTTACG AATTCTTTGA GCGCGGCGAT GGGCGTAACG GCGCCGCCCG TGCGCATCGA TAGCATGGCA AAGTACGGTG CGCTCGCCCG TGGAGACACG AATATGTATC TCAGGTTTCC GCCCGCGAGT TATAGAGAAA AAGTTTGGGA TCACGCCGCG GGCGCGATCG TGGTTCAGGA GGCGGGAGGG GTCATCACCG ATGGCGCCGG GAATCCACTC GATTTTTCAA AGGGACGATT TTTGGACATC GACATCGGCA TCGTGGCCAC GTCT
|
Protein sequence | AARAVRLAGA LCRKMQFELR TNEKVSKSDD SPVTVADFAA QAVVSHVLGV ARPDVGLVAE EDARSMREPA GAKLRARVTA VVNDALEGVV ERRLSEEEVM DAIDRGATDG GASGSFWILD PIDGTKGFIN GRQYAIALAL MEDGEVTGGV LGCPNMPSEK IPRGATEIPT AAPGVIFVAY KGRGTTVGAF DAEHPLRDGA KITTNKVASS SEATYMESWG DSIVADHGFT NSLSAAMGVT APPVRIDSMA KYGALARGDT NMYLRFPPAS YREKVWDHAA GAIVVQEAGG VITDGAGNPL DFSKGRFLDI DIGIVATS
|
| |