Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_18226 |
Symbol | |
ID | 5005252 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | + |
Start bp | 684794 |
End bp | 685918 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | |
GC content | 63% |
IMG OID | 640420673 |
Product | predicted protein |
Protein accession | XP_001421359 |
Protein GI | 145354157 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 54 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGCGA TACCGCACCG CGCCTCGACG GTCGCGTCGC GCGATCGAAG CGCTTCACGA CGACGCGCGG AGCGCGTTTC GACGACGCGA CGTCATCGGG GCGTCGCCAT CGCGCGCGCG GTGGCGCCTA CGTACGAAAA GGTGTTCGAC GAGCGCACGG GACAGGACGT GGATGTGCTT TCGCAAAGCG TCGAACGCGT CGAGTTGGGA CAAGGCGTGA CGTGGGCGTA TCGTCGAGGC GTCGCGGCGC CGAAGGAGGG GGCGACGGCG CGCGAGACGC CGGTGGTGTT CGCGCACGGA CTGGGCTCGC GTGCGTATGG ATTTCGAGCG ATGTCTCGCG AGTTGCAAGA GAATGGATTT GAGACGTACG CGGTAGACGT CACGGGACAC GGAGATTCGA GTAAGCCAGC GGTGGGGAAA GGGTTGGCGG CGTACGACGC CGCGGCGACG GGGGCGGCGA TGGAGGCGTT TTTGGAAAAA ATTGGGTTGG CGAATGATCG CGTGGACTTG ATCTTGCACG GATTCGTGAT ACCGCAACAC TTGTTACTGC TCGTCGCGCG CCGACCTGAA TTGTTTCGTC GGGTGGTGAT CTTAAACTCG CCGTTGGCGC CGTCGCACGC GTACCCGCCG CAGATGGCGA CGTATACGAG ACCGTTCGGC ATGGGCAAGG GCGCACCGTT CGACGCGGCC GGATATTTGT ATAACGGAAA CGAGTTTGCG TTACCGGGCG ATGTTTTGGC CGAGTACGAA AAGCCGTACG TTGGCGCAGA GGCTGAAGCG GCTCGCGCGG CGGCGGAGGC GTACGTCACC AAGTCGAGCG ACTTGAAAAA GCTCAACGCC GAAGTGAAGA ACGCCTTGAG CGCGCGCGGG TTGCCTAAGA TTCGCGTCGT CTGGGGGACC GCAGATCGCT ACCTCGACGA CGCACCGATT TACGATTGGT GCGCGGACAT TCGCGCGTCC TTTTCCGCCA TGCGTAAAGT CGGGCACTGC CCGCACGAAG ACTTCGCCGC CGAAGCCGCC GCGCGCTGCC AAGAGTTCTT CCTCGCGGAT TTGCGCGCGA GCGCCAAGGC GGCACTCAAC TCTGTGCGCG TCGGCAAAAT TACCACTGAC GATGGTCAAG GGTAA
|
Protein sequence | MRAIPHRAST VASRDRSASR RRAERVSTTR RHRGVAIARA VAPTYEKVFD ERTGQDVDVL SQSVERVELG QGVTWAYRRG VAAPKEGATA RETPVVFAHG LGSRAYGFRA MSRELQENGF ETYAVDVTGH GDSSKPAVGK GLAAYDAAAT GAAMEAFLEK IGLANDRVDL ILHGFVIPQH LLLLVARRPE LFRRVVILNS PLAPSHAYPP QMATYTRPFG MGKGAPFDAA GYLYNGNEFA LPGDVLAEYE KPYVGAEAEA ARAAAEAYVT KSSDLKKLNA EVKNALSARG LPKIRVVWGT ADRYLDDAPI YDWCADIRAS FSAMRKVGHC PHEDFAAEAA ARCQEFFLAD LRASAKAALN SVRVGKITTD DGQG
|
| |