Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_16034 |
Symbol | |
ID | 5003059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | - |
Start bp | 162408 |
End bp | 163940 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | |
GC content | 58% |
IMG OID | 640418480 |
Product | predicted protein |
Protein accession | XP_001418857 |
Protein GI | 145348852 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.0673665 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.542124 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGCGG AGGAAGTGCG AGAGAGAATC GCCAAAGCGG CGAAGGCGCA GAGAGAGTGG GCGAAAAGCT CGTTCGCGAC GCGTCGGAAG CTGCTGCGGG TGATACAGCG GTTCATATTG GAAGAGCAGG ATACGATTTG TCGAGTGAGC GCGCGAGACA GCGGTAAGCC GTTGGTGGAC GCGGCGTTCG GGGAGGTTCT GGTGACGCTG GAGAAGATTC GATGGCTGTG CAACGAGGGC GAGCGGTGGT TGAAACCCGA GAAGCGATCG ACCGGGGCCA TGATGTTTTA TAAGAAGGCG CGGGTGGAGT ATCATCCGGT GGGCGTCATG GGCGCAATCG TGCCGTGGAA TTATCCGTTT CACAACGTTT TCAATCCTTT GGTGGCGAAC TTGTTCGCGG GGAACGCGCT CGTCGTCAAG GTGAGTGAGT ACGCGAGCTG GAGCTCGCAG TACTACGGTC GCGTCATCGA TGCCGCGCTT GATGCCGTGG GAGCGCCGCG CGACTTGGTG CAAATCATCA CCGGTTACGG CGAAGCGGGC AGCGCGTTGG TCACTGGCGG TGTGCAAAAG GTAGTCTTCG TCGGATCCAC TGGCATCGGA CGTAAGGTTA TGGAGGCAGC GGCGAAGACT TTGACTCCGG TCGTCCTCGA ACTCGGTGGT AAAGATCCGT TCATCGTCTG CGCAGACGCG GATCTCAAGC AGTGCGTTCC CATGGCGCTG CGCGGCGCGT TTCAATCGTG CGGTCAAAAC TGCGCGGGAG CCGAACGATT CTACGTGCAT GAGAAGATTC ACGACAAGTT TTTAGCCAAA GTACTGGAGT CGGCGAGGAA GTTACGTCAA GGATGGGCGC TGAGCTCGTC CGTGGATTGT GGGGCGATGT GCATGCCAAA GCAAGCGCAG TACGTGCAGT CTCTCATCGA CGACGCCGTC GCACGTGGTG CGACCGTCCA CGTCGGTGGT AAGATCGAAC TGGGCGCCCA GGGTGGTCAG TTCTACCCTC CGACAGTGAT CTCTGGCATC ACGCACGATA TGCGAATCGC TCGCGAAGAG GTCTTTGGTC CCGTGCTCGC CATCGTCAAG ACAAAGAGCG ACGAGGAATC CATCTCGCTC GCGAACGACT GCGACTTTGG TCTCGGATCA AATGTTTTCA CGCGCTCAAC GAGACGCGCA GAAAAGCTTG GCTCACAGCT GGAGGCCGGT ATGACTTCCA TCAATGACTT TTGCTCGACG TACATGGCGC AGTCCCTTCC CTTTGGCGGC GTCAAGGAAT CCGGCTTCGA CCGCTTCGCC GGTATTGAAG GACTCCGCGG TTGCTGCGTC CCGAAATCCG TCGTCGTCGA TCGATTTCCA TGGCTCATGA AGACCAATAT CCCTCCTCCG TTGTGCTACC CCGTGGCGGA TAACGCGTTT GCCTTTTGCA AGGCGCTGTC GCGCATGTTC TTCGGCTTGA ACGTCGCGCA ACAGTTCGGT GGATTGTTGT CGCTCGCCAA GTGCTTCCTC ATGCCATCGA ATTCTTACAC CAAGTACGAT TAA
|
Protein sequence | MRAEEVRERI AKAAKAQREW AKSSFATRRK LLRVIQRFIL EEQDTICRVS ARDSGKPLVD AAFGEVLVTL EKIRWLCNEG ERWLKPEKRS TGAMMFYKKA RVEYHPVGVM GAIVPWNYPF HNVFNPLVAN LFAGNALVVK VSEYASWSSQ YYGRVIDAAL DAVGAPRDLV QIITGYGEAG SALVTGGVQK VVFVGSTGIG RKVMEAAAKT LTPVVLELGG KDPFIVCADA DLKQCVPMAL RGAFQSCGQN CAGAERFYVH EKIHDKFLAK VLESARKLRQ GWALSSSVDC GAMCMPKQAQ YVQSLIDDAV ARGATVHVGG KIELGAQGGQ FYPPTVISGI THDMRIAREE VFGPVLAIVK TKSDEESISL ANDCDFGLGS NVFTRSTRRA EKLGSQLEAG MTSINDFCST YMAQSLPFGG VKESGFDRFA GIEGLRGCCV PKSVVVDRFP WLMKTNIPPP LCYPVADNAF AFCKALSRMF FGLNVAQQFG GLLSLAKCFL MPSNSYTKYD
|
| |