Gene Information |
Locus tag | OSTLU_19644 |
Symbol | |
ID | 5003203 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 579522 |
End bp | 581660 |
Gene Length | 2139 bp |
Protein Length | 268 aa |
Translation table | |
GC content | 67% |
IMG OID | 640418624 |
Product | predicted protein |
Protein accession | XP_001419425 |
Protein GI | 145350026 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0024] Methionine aminopeptidase |
TIGRFAM ID | [TIGR00500] methionine aminopeptidase, type I |
|
|
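The coordinate and length fields above can be cross-checked with a short calculation. This is a minimal sketch, assuming `Start bp` and `End bp` are 1-based inclusive positions on the replicon (an assumption, though it reproduces the listed gene length); the function name `gene_length` is hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the length of a feature from inclusive start/end coordinates.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
length = gene_length(579522, 581660)
print(length)  # 2139

# Bases needed to encode the 268 aa protein, not counting a stop codon.
coding_bases = 268 * 3
print(coding_bases)  # 804
```

The gap between the 2139 bp gene length and the 804 coding bases is consistent with the record marking the gene as spliced.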
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00530685 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.519725 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAATAC CTATTTTCCC CGGCACCAAC CACAGCTTGG TTTGTTTCAA TCGCCTGCGT TTTCGCGCCC GCTTCAAAAA CTCGCGCTGC AACTCGGTGC CGTCTCGCAC GACGACGAGC GCCTTCTCCC CCTCCATCGC CCCGCAGTGC TCGGCGGTGA AGGCGTCGTA AAATATCGAT GAATCAAACA CTCCGTCACG CGTGAGCGTC AAAACTTCGA TGCGTTGCCC CATCTGCGCC ATCTCCTTGG CGCGTTGAAT CAACTTGGCG CCCGAGCTCC CAGATTTCAA CGGCGCCGCC TCGTTCGTGA ACAAATACAC GCTCTTCCGC CCCGCTCGCT TCGGTCCGTT CTCGAGCATG TGCGACGCCG TCCATAACCC CTTCGTCAGC GCGTCCTCTT GATTGAAAAA ATCGCCCGTC CCATCGTCGT CCTCGGGTTT CAACTCGCCG AATCGCTCGC GAAATTTGTC GGCGCCCTTC TCCCCGTTCG CGTATTCGCT CAATTCCAGC GCTCCCGCCG CGCTCGGGTT CTCCGCCCTT CGCACCTCGC ACACGCGTTC CATCCCTATC CCACCCACGC TCTTCCCGGT GTTGTACGCG CACACGCCCA GCACGTCGTC CGGCGCCACG ACGACGCGCG CGCGCGCGAA TTCAAAGCAC GCCCGCGTCG CCGCCGTGAA CGCGCACGCC CCGTCCGCGC GCGTCGCGTC CGCCTCGAAC ATCGCCGGCG AGCAATCGAT CAGCATCACC ACCGCGTCGC GCTGCGCGTC CGGGTCCCAC GCGCGCGCGC CGTCGCCGTC GCCGTCGCCG TCTTCCGCCG CGCCATCGAA TTCGAAATCA TCGTCCGAAT CGTCGTCGTC GCGCGCCATT TCGCCCGCGC GCGCGTCCGA CGGGCGCCGC GCGACGTCCC CCTTCGCTCG CGCGCCGCCC TCGCGCGCGC GCGTCGCCGT CGCGCCGCCG CGCGACGTTT TCTGCCGCAC TCTTCGACGC GCGCTTCGAC GCATCGTTTC ATTCGTCATG CGCGTCGCCG CGCGTCGCGC CGACGCCCTC GCGTTCGCGC GCGCCGTCGC GCGTCGTCCT CGCGCGTCGC TCGACCGTCT CCGGCGCCCG CGCTTCGGCG CGCGCGCGAC GGAGACGCGC GCGAAGAAGA AGGGCCTGCT CGGCGACCTG CTCAACGTCG CGAGGGACCG CGCGCGCGAC GAGGCGACGT GGTACAACGG GCGCGCGCCG CTTCGACCGG GGACGTACGC GCCGCAGAAG ACGGTGCCGG CGTCGATCGA GCCGCAGCCG CCGTACGCGA GGGACGGACA CCTGCCCGAG TACGACGACG GCGTCGTGCA GGTGCAGACG ACGGCGGCGG ACGTCGAGGG GATGCGACGC GCGGGAAGAC TCGCGGCGGA GGTGCTGGAC ATGGCGGAAA AGATGATCAC GCCCGGGACG ACGACGACGA ACGACATCGA CGAGGCGGTG CACGCGATGA CGATCGCGGC GGGGGCGTAT CCGAGCCCGT TGAACTACGG CGGGTTTCCG AAGAGCGTGT GCACGAGCCT GAACGAGTGC ATCTGTCACG GAATCCCGGA CGACACGGTG ATCCTGGACG GAGATATCAT AAATATTGAC GTCACGGTGT ACCTGAACGG GTATCACGGC GACACGTCGA GGACGATCAT GGTGGGGAAC GTGACGGAGG AGGTGCGGCG GCTGGTGGAG ACGACGGAGC GAGCGCTGGA CGCGGCGATC GCGATTTGTA AGCCCGGGAC GCCGGTGAGG AAGATCGGGG CGACGATTCA TCAGATCGCG 
GACGACGCCA AGTTCGGGGT GGTGGATAAG TTCGTCGGGC ACGGCGTGGG GAAGGTGTTT CACAGCGGAC CGACGGTGCG GCATCATCGC AACAACGACC CGGGGACGCT GCGGGTCGGT CAGACGTTCA CCATCGAGCC CATGCTGACG ATCGGGACGA CTCGAGACAA GATGTGGAAG GACGGATGGA CGAGCGTCAC CGCGGACGGG AAGTGGACGG CGCAGTGCGA GCACACGCTG CTCGTCACCG AGACGGGCGT CGACGTCTTA ACCGCGTCGC CGTATCGCGC GTCTCTGTCC GAGGCGTGAC CGCGCGCCGG ACCCTCGTCG CGTCGTCGC
|
Protein sequence | MGIPIFPGTN HSLVQTTAAD VEGMRRAGRL AAEVLDMAEK MITPGTTTTN DIDEAVHAMT IAAGAYPSPL NYGGFPKSVC TSLNECICHG IPDDTVILDG DIINIDVTVY LNGYHGDTSR TIMVGNVTEE VRRLVETTER ALDAAIAICK PGTPVRKIGA TIHQIADDAK FGVVDKFVGH GVGKVFHSGP TVRHHRNNDP GTLRVGQTFT IEPMLTIGTT RDKMWKDGWT SVTADGKWTA QCEHTLLVTE TGVDVLTASP YRASLSEA
|
| |
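The sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch of stripping that display spacing and computing a GC fraction, which could be compared against the `GC content` field. The helper names `strip_blocks` and `gc_fraction` are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the full record.

```python
def strip_blocks(display_seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(display_seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence shown above.
fragment = strip_blocks("ATGGGAATAC CTATTTTCCC")
print(len(fragment))                     # 20
print(round(gc_fraction(fragment), 2))   # 0.4
```

Applying the same two functions to the full gene sequence would give a value to compare with the 67% listed in the record; that result is not computed here.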