Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_49070 |
Symbol | |
ID | 5000653 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | - |
Start bp | 24718 |
End bp | 25903 |
Gene Length | 1186 bp |
Protein Length | 381 aa |
Translation table | |
GC content | 57% |
IMG OID | 640416074 |
Product | predicted protein |
Protein accession | XP_001416824 |
Protein GI | 145344615 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TCACATCCAT CGCACGATGG CCGCCGTGAC TTCTATCCCG CGCACGACGC TGCGTCGAAT CCCTCTCGGC AGCGCGAAAG ACGTTTTCGT CACCGACGTG TGCCTGGGGA CGATGACGTG GGGCGTGCAA AACACCGAAG CCGAGGCGCA CGAGCAGTTG GATTACGCCG TCAAACAACG AGGCGTGAAC TTCATCGACA CCGCGGAGAT GTACCCGGTG CCGTCGAGCG ATGCGCGATG GAAACCTGGG ACGACGGAGG AAATCATCGG GAATTGGCTC GCAAAGAACG TCGAGCTGAG AAAGGAGCTC GTCGTGGCGA CCAAGGTGAG CGGATACCAA GCCAAGAGCG AGACGGCGGG TAACCGAACG GTGCCTGCGG GCGCGCCGTG CGCGGCGAGA TTGGATAAAC AAAGCATATT TCAAGCGTGT GATGCGTCGC TGCGACGATT GAGAACGGAT TACATCGATT TATACCAAGT GCACTGGCCC GACAGGTATC TGCCCATCGG CGCGTTCACG GGATCGACAG AGTACATTCA GAGCAAGGAG AGATCGGACT CTGTCCCTAT TCGCGAGACG GTCGAAGCGC TCGGTGAGCT CATCAAGGCT GGGAAGATCA GGCATTACGG GTTATCAAAC GAGTCAACGT TCGGAGTGTG CGAGTTTGTT CGCGCGGCGG ATGAGCTCGG CGTTCCCCGT CCGGTGTCGA TTCAGAACTC TTTTTGCCTT CTGCATCGAC AGTTTGACAC TGAAGTCGCC GAGGCGTGCT CGAAGTCAAA CTACAACATT TTACTCCTTC CCTGGACCCC ACTCGCGGGC GGAGCCTTAT CGGGCAAATA CCTCGACGGC GCTCGTCCGG AGGGCGCTCG CATGTCTGTC TTCAAACATT TCCACCAGCG TTACCTGAAC GAAAACTCCG TCAAGGCGAC GAAGCAGTAC AAAGAAATCG CCGATAAGGC GGGTATGAGT CTCACCACCA TGGCGCTTAA CTGGTGCAAG ACGCGCGCTT TCAACACTTC CACCATCATC GGAGCCACCA CGCTCGAGCA GTTGAAGGAG AACATCGATG CGTTTGAGCC CTCGGTTGTG TTGAGCAAGG AAACGCTCAA GGCCATAGAC GCCGTGCATC AGCAGTGCAG AGACCCGTGC ATCGCCGTTT AAACGTGCGA CTCCTTCGTC GCTGTC
|
Protein sequence | MAAVTSIPRT TLRRIPLGSA KDVFVTDVCL GTMTWGVQNT EAEAHEQLDY AVKQRGVNFI DTAEMYPVPS SDARWKPGTT EEIIGNWLAK NVELRKELVV ATKVSGYQAK SETAGNRTVP AGAPCAARLD KQSIFQACDA SLRRLRTDYI DLYQVHWPDR YLPIGAFTGS TEYIQSKERS DSVPIRETVE ALGELIKAGK IRHYGLSNES TFGVCEFVRA ADELGVPRPV SIQNSFCLLH RQFDTEVAEA CSKSNYNILL LPWTPLAGGA LSGKYLDGAR PEGARMSVFK HFHQRYLNEN SVKATKQYKE IADKAGMSLT TMALNWCKTR AFNTSTIIGA TTLEQLKENI DAFEPSVVLS KETLKAIDAV HQQCRDPCIA V
|
| |