Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_46693 |
Symbol | |
ID | 5004237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | - |
Start bp | 53590 |
End bp | 56669 |
Gene Length | 3080 bp |
Protein Length | 979 aa |
Translation table | |
GC content | 54% |
IMG OID | 640419658 |
Product | predicted protein |
Protein accession | XP_001420062 |
Protein GI | 145351387 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.649333 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GCGCGCGCGC GATCGGTGAC GGTCGGTGAA CGGGACGGTG ACGATAGACT CGAGCTCGGG GAGTCGAAGA GGACAAAGAG TTTACAGATG CCGCGAGGGA CCACGACGAC GACGCGAGCG ATGAGCGGAG GAGAAGCGGG AGGGGCGACG ATTGAACTCG AAGCCGCCGC TGAATTCGCC AAGGATGAGT TTTACACGAA AAGTTTCGTC GTCAAGGCGC GCGAGCGCGT GGAAGCCTCT GTGAAAGTTC GGGTCGAGAC GAATGGTGTT AAGTACAAGG TCACCGTGGA AACGGACATG GACGCCGAAG GGCAAGATTT GCGCATGCAT TGGGGCGTGG CGACTAGTAA GGAAACGTGG GATAACATGG AGACACCTCC AGTAAAAATT AGACCGCCGT TTACGATTGA AACCACGGGG GTATGTCAAA CGCGCATGAC GCCGACGTCG TCACGCTTGA GCCCCACGCT CACGGCTACG ATCGAAGGTG ACGTAGCGGA CGGCTTTTAC GCAATCAACT TCCTGTTCAA AGAACCAAAG AAAGATCGCT GGATTCATAA CACGAACGGA AGAGATTGGC ACGTGCCGAT ACCACAGCCG CCGGAACCTG AGTTAGTGAC GCGTACGATT ACTATGCTGG AAGATTATGA GGAAGAGATT GAAGAGGACG AAGAATACGA AGAGGAACAA GAGGAGCAAG AGTTGGACGA ATCTCTCGCC TCACTTTCAT CAGTGGTCGA AGTGAACATC GATTTGAACG CGCGCGCGAG CTCTCCCGTC AAGGAGGAAG TCGTCTCAGC AGCGTCGTCA GAAATCTCCG AGCCGGCAAA CGCGTGGAAA CGTCTTCTAA CCCCGCCTGT GGACCCCAAG CCGTTCGACA CGAAAGACGA GAATATTAAC GCCTTAGTTG AGTCACTTTT TGGCGCTGGT TCGAACGGTT TGACATCTAT CAAGCCAATC GCCGCGCGAG ATGAAGGACC GCCTGAGGAT GAGGACTCTG AACCGGAACT TGAACTGGCC AAAGCGGCTC CCATGGAAAA GCCGAAGACG CTCAAAACTG GGATGCGAAA GGTGAAGAGG ATGGTAACAA AGTCCCGCAC TATCACTAAG TCGATAGATG AACCGCTGCC GGTGAAGTTG GTGGAGGGTC CTTGGCTCAC TACGAACACC ATTGAAGAAA AGTTGATTCG CGAGAAAGAA GTGAGTTATA GAGTTGGCGC CGTAGTGGAA AAGAACGTCG AGGGTGGTGG AGTCTTGGTG CGTGTAGAAG CTGAACTTCC GTGGAATATT GTTTTGCATT GGGGTATTGT TCCTCGCGGC GCGCGTGCGG ATGTTTGGAC GTTGCCTCCG GAACAATGGC GCCCAGAAGG TTCAGTGGTC GGGGACACGG GCAAAGCGTG CGAGACTCCC ATGAAGAAAT GCGAGAACCC GCTGTCCGAT CGGATTCAGA TGAGTTATGC GGAGCTTCAG CTCGGAAACG CACCCACTGC GATGCGGTTT GTTCTCAAAG AAAATGGTGG AGAAGGACGG TGGCTTGATC GCAATGGCGA CGATTTTGTC ATTCCCATGC CCGAGCCGGC GTACGCAAGC ACCACGCTCG ATCTCACTGG TGAGCGGACT AAGGAAGCGA TAGACGTCGC CACAGCCGCT GCGTTTCGTG CGGCGGAGTT GCATCTCGAC TCCATGGATG AAGTCGAAGA GAAGGAGCTT GATATGAATA TGCAAGTCGA GTACGAGGTG GCGACTCCAC CACTGAGCGA AGACGACTTT GCTCCGGCGG AGCGAATTCA AAAGCCTCAG AAATCAGCCG TCGGTAACGG TCGGGAAGTT TTGCTTCAAG GCTTCAACTG GGAGTCTTGC AAGGCTCCAT GGTACCAAGC CGTCGAGCGA CTTGCACCCA CCATTGCGGA ACTAGGATTC ACGGTGGTTT GGTTGCCTCC GCCGACATCG AGCGTCAGCG AGCAAGGTTA CATGCCTCTG GATTATTATA ACTTGGACTC TCGATACGGC ACAAAGGAAG AATTAAAGGG CGCCATCAAG GCTTTGCACG ATAATGGTGT GATGGCGCTC GGTGATGCCG TGCTTAACCA TCGATGCGCC CATTTCATAG GCGACGTACC CGGCACGTAC AACAAGTTTG GAGGCAAGTT ACCGTGGGAC GCGACAGCCA TCGTGGCAGA CGATCCCAAT TTTCATGGCC GAGGAAACAA AGCCGATGGT GAAATGTTCC ATGCAGCTCC TAATATCGAC CATAATCAAG CTTTCGTCAA GGCTGATCTC GAAGATTGGA TGAGCTGGCT CATGCGAGAA GTCGGCTACG ACGGTTGGCG ACTCGATTAC GTGAGAGGTT TCTGGGGTGG TCACGTGAAG GATTACATGG AGGCGACGAA TCCGCAGTTC GCGGTCGGTG AGTACTGGGA TTCGTTGGCC TATAACATGG ATGCTCTGGA TTACAACCAA GATGGTCATC GACAACGGAT CGTGAACTGG CTCAACGCCG CCGGCGGAAA CGCCGGCGCG TTCGACGTTA CCACCAAGGG CATCCTGCAC GCCGTATTTG AACGCCAAGA GTATTGGCGC TTATCGGATA AAGCAGGCAA GGCTCCGGGT GTCATGGGTT GGTGGCCTAG TCGAGCGGTG ACTTTCATAG AAAATCACGA CACTGGATCG ACGCAAGGCC ACTGGCGGTT TCCTCGCGAC AAAGAGCTTC AAGGATACGC GTACATCCTG ACCCACCCGG GCACGCCGAC AATCTTCTGG GATCACATCT TTGACAACAA CTGGGGACAT TTGCACAAAC CCATCGAAGA CATGATTCGT ATTCGCAAAC AGTCCGGTAT TCACTGCCGG AGCGAAGTGA AAATCGTCAA GTGCGAGCAG AGCGTGTACG CCGCCGTCAT CGATGACCGA CTTTTGATGA AAATTGGACC TGGTCATTTT CATGCAGACG ACGCGTGGGA TTGCGTGCTT TCTGGACAAG ACTTTGCAAT CTGGCGCAAG AAGAAAGATC GTTCGCAGCC GCGGTGATTT CTCATCTCTA CTTAATATAC TGTAATACTA CAACTCGCGA GCGAGCAAGC
|
Protein sequence | MPRGTTTTTR AMSGGEAGGA TIELEAAAEF AKDEFYTKSF VVKARERVEA SVKVRVETNG VKYKVTVETD MDAEGQDLRM HWGVATSKET WDNMETPPVK IRPPFTIETT GVCQTRMTPT SSRLSPTLTA TIEGDVADGF YAINFLFKEP KKDRWIHNTN GRDWHVPIPQ PPEPELVTRT ITMLEDYEEE IEEDEEYEEE QEEQELDESL ASLSSVVEVN IDLNARASSP VKEEVVSAAS SEISEPANAW KRLLTPPVDP KPFDTKDENI NALVESLFGA GSNGLTSIKP IAARDEGPPE DEDSEPELEL AKAAPMEKPK TLKTGMRKVK RMVTKSRTIT KSIDEPLPVK LVEGPWLTTN TIEEKLIREK EVSYRVGAVV EKNVEGGGVL VRVEAELPWN IVLHWGIVPR GARADVWTLP PEQWRPEGSV VGDTGKACET PMKKCENPLS DRIQMSYAEL QLGNAPTAMR FVLKENGGEG RWLDRNGDDF VIPMPEPAYA STTLDLTGER TKEAIDVATA AAFRAAELHL DSMDEVEEKE LDMNMQVEYE VATPPLSEDD FAPAERIQKP QKSAVGNGRE VLLQGFNWES CKAPWYQAVE RLAPTIAELG FTVVWLPPPT SSVSEQGYMP LDYYNLDSRY GTKEELKGAI KALHDNGVMA LGDAVLNHRC AHFIGDVPGT YNKFGGKLPW DATAIVADDP NFHGRGNKAD GEMFHAAPNI DHNQAFVKAD LEDWMSWLMR EVGYDGWRLD YVRGFWGGHV KDYMEATNPQ FAVGEYWDSL AYNMDALDYN QDGHRQRIVN WLNAAGGNAG AFDVTTKGIL HAVFERQEYW RLSDKAGKAP GVMGWWPSRA VTFIENHDTG STQGHWRFPR DKELQGYAYI LTHPGTPTIF WDHIFDNNWG HLHKPIEDMI RIRKQSGIHC RSEVKIVKCE QSVYAAVIDD RLLMKIGPGH FHADDAWDCV LSGQDFAIWR KKKDRSQPR
|
| |