Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_33487 |
Symbol | |
ID | 5003667 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | - |
Start bp | 296266 |
End bp | 297450 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | |
GC content | 65% |
IMG OID | 640419088 |
Product | predicted protein |
Protein accession | XP_001419770 |
Protein GI | 145350768 |
COG category | [C] Energy production and conversion |
COG ID | [COG1227] Inorganic pyrophosphatase/exopolyphosphatase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.265227 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0616217 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACGC GCGCGAACGT CGACGACGGC GCGCGCGAGG CGAACGCGCT GCGAACGTTC CTGCGCGACG CCAAGGAGGC GTTCGCGCGC GATCCGGGGG CGTGCGACGT GAGCGTGGGG AACGAGGCGT GCGATTTGGA CTCGGTGGCG TGCGCGGTGG CGACGGCGCG AGCGGCGAGC GCGAAGCGCG GACGCGACGA TGGCGAGCGC GAAACGCGCG CGGTGCCGAT CGTGTCGTGC GCGAGGGAAG AACTGAAATT ACGACCCGAC GTGGTCTTGG CGCTGGCGAA CGCGGGGGTG AAGTTGGGCG ATTTGACGTG CGCGGAGGAC GTCGCGGCGG CGGCGACGAA GGCGACGCCG CGAAGCGTGA CGTTGGTGGA TCATAACGCG CTGAGCGCGC GGTTGTTTCC GGACGCGTGG CAAGCGCGCG TGGTTCGGGT GATTGATCAT CACGAGGATT CGGGGATGTA CGCGGAACGG GCGGATAGGG TCATCGAGTT GATCGGATCG TGCTCGAGTT TGGTGTACAG GGACGTCGTG GCGAAAGCCG CGGACGAGGG CGTCGCGCGA GACGTCGCGC GTTTGCTTCT GGGAGCGATC GTGTTGGACA CGAGAATGCT GGACGCGACG ACGACGCGGG CGGCACCCGT GGACTTTGCC GCTGCGGAAT CGCTGCGAGA TATTTTGGGA TGGGACGAGG ACGCGACGCG AGCGGAGTAC GAATCGCTCT CTCGCGCGCG TCACGATCAG AGCTCGTTTT CGTGCGCGCA ACTCTTGGCG AAAGATTACA AGCAGTGGAC GATGGGGTCG CTCGAGGTCG GCATCGCGTC GTTCGGCGTG CGGTTTCAGG ATTTGCTGGC GCGACAGGAC GCTTCATCCG TCAACGATGA AATCGTCGCC TTCGTCGACG CGCGGCGCAT CGACGTGTTA TTTATGATGT CCTCGTTCGA AGACGCCGAC GCCGACGGCG CGTTCGCGCG TCAGATCGAT GTCACGAAAT CGAGCGCGTG CTCGATCGAG CTCGAAGCCG TCATGCGCGA CTTGGGCGAG CGAACGCCGC TCGCGCCGCT GCGTCTTCCC GAAAACGACT TCGGCGTGTT CAAATCCGCG CGCGCGCAGC TCGACGTCAA GGCGAGTCGG AAGAAAGTCC AACCGATTCT CCTCGAGATT TTAGCGAGAT TTTAG
|
Protein sequence | MATRANVDDG AREANALRTF LRDAKEAFAR DPGACDVSVG NEACDLDSVA CAVATARAAS AKRGRDDGER ETRAVPIVSC AREELKLRPD VVLALANAGV KLGDLTCAED VAAAATKATP RSVTLVDHNA LSARLFPDAW QARVVRVIDH HEDSGMYAER ADRVIELIGS CSSLVYRDVV AKAADEGVAR DVARLLLGAI VLDTRMLDAT TTRAAPVDFA AAESLRDILG WDEDATRAEY ESLSRARHDQ SSFSCAQLLA KDYKQWTMGS LEVGIASFGV RFQDLLARQD ASSVNDEIVA FVDARRIDVL FMMSSFEDAD ADGAFARQID VTKSSACSIE LEAVMRDLGE RTPLAPLRLP ENDFGVFKSA RAQLDVKASR KKVQPILLEI LARF
|
| |