Gene Information |
Locus tag | OSTLU_50750 |
Symbol | |
ID | 5004022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | - |
Start bp | 371416 |
End bp | 373346 |
Gene Length | 1931 bp |
Protein Length | 523 aa |
Translation table | |
GC content | 61% |
IMG OID | 640419443 |
Product | predicted protein |
Protein accession | XP_001420154 |
Protein GI | 145351589 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0113332 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCCGCGC CGCCGCCGGG ACGATGCGGA CGGGTGGAAC AATTTATTAA TAACGCGTGG GTGCGCGCGC CGCAGGCTGC GGCGCAGGCG CAGGCGCTCG GCGGTGCCGC GAGGACGCTG CCGGTGGTGA ACCCGCACGA CAATAAAAAC ATCGGTGCCG TCGGTGCGGG CGATCGCGCG ATGATCGACG ACGCGGTGCG GGCGGCGCGC AAGGGATATA AGGTGTGGAG CGCCACGCCG GGGCGCGAAA GGTCGCGGGT GCTGCGGGGG ATTGCGAGAG GGATCGAACG GCGGAAACGC GCGCTCGCCG AGCTGGAGAC GCTGGACGCG GGGAAGCCGA TCGAGGAGAG CGAGTGGGAT ATCGATGATG TCAGCGCGTG CTTTGATTAT TACGCCGATC GGTGCGATGA AGTGTTCGGG GATAAGGCGT ACGCGGAGGA GGATGTGAAG TTACCCATGG ACGAGTTCGC GGGACGGTTG CGACGCGAGG CGCTGGGGGT GATCGGTTTG ATCACGCCTT GGAACTATCC GTTGCTCATG GCGACGTGGA AAGTCGCGCC CGCGCTCGCG AGTGGATGTG CGGTGGTGCT GAAACCGAGT GAGCTCGCGA GCTTGACGTG TCAGGTGTTG GGGGACGTGT GCGTCGAAGC AGGGTTGCCG CCGGGGGCGT TCAACGTAGT CACGGGGCGA GGCGACGAGG CTGGCGCTGC GCTGTGCGCA CACAAGGGTG TGGATAAAAT ATCGTTCACT GGCTCGTTGC AAACCGGCCG CATCATCATG AGCGCGTGCG CGAAAGATGT CAAGCCCGTA TCGTTGGAAT TGGGCGGCAA GAGCGCGTTG GTCATTTTCG ACGACTGCGA TTTGGAAAAG GCGGTGGAGT GGGCTCTGTT TGGGTGTTTC TGGACGAATG GTCAGATCTG CAGCGCGACG TCGCGCGTCT TCATTCACGA GCGCATTCGA GAGAAGTTCT TAGCGCGTCT CAAGGAGGCT GCGGAAGCCA TTCCGTACGG CAACCCACTC GTCAAGGGGT GTCGCCTCGG CCCGCTCGTG AGCGAAGGGC AGTACAAAAA GGTGATGAAA ATGGTCGAAC GCGCGAAGCG CAAGGGATAC ACGCTTTTGA CGGGCGGCAA GAAACCGAGC GATCCGGACT GTCGAGAGGG GTTCTATCTC GAACCCACGG TATTCGTCGA CGTCCCTATG GATGCGGAAG TGTGGCGAGA AGAAATTTTT GGCCCGGTGA TGTGCGTCAA GACGTTCGCC TCTGAAACGG AAGTCGTGGC GATGGCGAAC GATTCCGATT ACGCCTTAGC CGCGGCGGTG ATCACAGACG ACTTAGCGCG ACGCGAGCGT ATGACTGCCG CGTTTGACAC CGGGATCGTG TGGGTGAATT GCTCGCAACC CTGCTTCGCT CAGCTCCCGT GGGGCGGTCG CAAGCGAAGC GGATTCGGTC GCGACCTCGG CGCGAACGGC ATGGACAAGT ACTTGCACCA AAAGCAAGTC GTCACCTACG TGTCCGAAGA CCCGTTCGCG TGGTACCCCA TGTTCGACGC CAAGCCGAAC AGCAAGCTGT AGCGCGTGTA CTATCGTCGT CGATGAGTGA ATGAATAAAT ATGTATTTCA AGATTTCAAC GCGAGCGTCG AGCGCTCCTT CGCCCTCCGG CGACTCAACT CGTTCGTCTC GACGCCTCAC GGCGGCAACG CCGCGGTAAT TTGCACCTCA ACCAACAATT CAGGTCGCGC CAACTCGCTC TGCACGCACG CCCGCGTGGG TTTATTATCC GGATCGATCC ACGCGTTGTA CACGGCGTTG AACTCGGGCG CGTCTTTGAT GTCCCGCAGC CAGCACATCG AGGTCAATAT TCGCGACTTA TCCGTCCCCG CCATCGCGAG CAGTTTGTCA AACTTTTCCA GCGTTTCGAT CGTTTGTTCT C
Protein sequence | MAAPPPGRCG RVEQFINNAW VRAPQAAAQA QALGGAARTL PVVNPHDNKN IGAVGAGDRA MIDDAVRAAR KGYKVWSATP GRERSRVLRG IARGIERRKR ALAELETLDA GKPIEESEWD IDDVSACFDY YADRCDEVFG DKAYAEEDVK LPMDEFAGRL RREALGVIGL ITPWNYPLLM ATWKVAPALA SGCAVVLKPS ELASLTCQVL GDVCVEAGLP PGAFNVVTGR GDEAGAALCA HKGVDKISFT GSLQTGRIIM SACAKDVKPV SLELGGKSAL VIFDDCDLEK AVEWALFGCF WTNGQICSAT SRVFIHERIR EKFLARLKEA AEAIPYGNPL VKGCRLGPLV SEGQYKKVMK MVERAKRKGY TLLTGGKKPS DPDCREGFYL EPTVFVDVPM DAEVWREEIF GPVMCVKTFA SETEVVAMAN DSDYALAAAV ITDDLARRER MTAAFDTGIV WVNCSQPCFA QLPWGGRKRS GFGRDLGANG MDKYLHQKQV VTYVSEDPFA WYPMFDAKPN SKL