Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_28526 |
Symbol | |
ID | 5006471 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009373 |
Strand | + |
Start bp | 17412 |
End bp | 18693 |
Gene Length | 1282 bp |
Protein Length | 379 aa |
Translation table | |
GC content | 60% |
IMG OID | 640421892 |
Product | predicted protein |
Protein accession | XP_001422365 |
Protein GI | 145356288 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0024] Methionine aminopeptidase |
TIGRFAM ID | [TIGR00495] 42K curved DNA binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 63 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGACT CCGACAGCGA CTACGCGAGC GACGACGAAT CGACGATCGA CGAGAGCGAG TTCACGTGCG AGAACGTGCG TCGATCGACG CGACGCGCGA CGCGACGCGG TGACGCGCGA CGACGCCGCG ACGAGCGAAG AGGACGATCG AACGATCGAC GACGACAGGC GCGAGCGACT GACGACGACG CCGAGACGAC GCGCGCGGAC GAAACAGCCG GATGTGGTGA CGAAATATAA GATCGCGGCG GACTGCGCGA ACGCGGCGAT GAAGGAGGTG CGCGCGGCGA TCGCGGTCGG GGCGAAGGTT GTGGATTTGT GCGCGCTCGG CGACGCGGCG ATCGAGCGAG AGACGGCGAA GTATTACAAT AAAAAGGATA AGGATGGGAA TAAGGTGGAG AAAGGGATCG CGTTTCCGAC GTGCGTGTCG ATCGATAACT GCGTGTGCCA TAACTCGCCG GACGCGAGCG ATGCGAAGAC GATCGAGGAC GGGGCGAGCG TGAAGATCGA TTTGGGGGCG CACGTGGACG GGTACGTGGC GACGACGGCG ACGACTGTCG TCGTGGGCGG TAAACCGGTG ACGGGGGCGC AGGCGGACGT GATGAAGGCG GCGGAGTTGG CGAGTGAAAT CGTCATTCGT AAGCTCAGGC CGGGGGCGTC GACGGGGGAA ATCGGCGGCG TCATCGAGGG CGTGGCGAAG GATTTCGGCG TCAACGTCGT CGAGGGCGTG ATGACGCATA ACATGAAGCG TTTTATCATC GACGGTAACA AGGTGATTCT GAACAAGTCT ACGCCGGAGA TGAAGGCTGA TCCCGAGGAG ATTGAGCTCT ATGAAGTCTA CGCGTTGGAC ATTGTCATGT CGAGCGGCGA GGGCAAGCCC AAGCAAAGGG ACGAGAGAGA AACCAAGGTT TACAAGCGCG CCATCGAAAA GAACTACCAG TTAAAGATGC AAGGTTCGCG CGCGGTTTTC TCGGAAATCT CAAAGCGATT CCCCACCATG CCCTTCACCG CGCGGGCGTT GGAAGAGAAG CGCGTCAACT TTGGCCTCGT CGAATGCTGC AACCACGGGT TGTTGCACGC CTACCCGGTG CTTTACGAAA AGGATGGTGC CGCCGTCGCG CACGTCAAGA GCACGTTCTT AGTGTTGAAG AAGGGCAACG ATCGCATCAC GACCTTTGAG CCGCAAGAGG TGCAATCGGA CAAGTCGCTT AGCGACAACG CGTTGGTCGA ACTCATCGCG ACCGAACTCA AGCCGAAGCC CAAGCCCAAA AAGAAGAAGT GA
|
Protein sequence | MADSDSDYAS DDESTIDESE FTCENPDVVT KYKIAADCAN AAMKEVRAAI AVGAKVVDLC ALGDAAIERE TAKYYNKKDK DGNKVEKGIA FPTCVSIDNC VCHNSPDASD AKTIEDGASV KIDLGAHVDG YVATTATTVV VGGKPVTGAQ ADVMKAAELA SEIVIRKLRP GASTGEIGGV IEGVAKDFGV NVVEGVMTHN MKRFIIDGNK VILNKSTPEM KADPEEIELY EVYALDIVMS SGEGKPKQRD ERETKVYKRA IEKNYQLKMQ GSRAVFSEIS KRFPTMPFTA RALEEKRVNF GLVECCNHGL LHAYPVLYEK DGAAVAHVKS TFLVLKKGND RITTFEPQEV QSDKSLSDNA LVELIATELK PKPKPKKKK
|
| |