Gene Information |
Locus tag | OSTLU_29467 |
Symbol | |
ID | 5006763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | + |
Start bp | 289230 |
End bp | 290841 |
Gene Length | 1612 bp |
Protein Length | 514 aa |
Translation table | |
GC content | 60% |
IMG OID | 640422184 |
Product | predicted protein |
Protein accession | XP_001422543 |
Protein GI | 145356656 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2317] Zn-dependent carboxypeptidase |
TIGRFAM ID | |
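The listed lengths can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive (the record does not say so, but the listed gene length of 1612 bp is consistent with that) and that the coding region ends in a single stop codon. The function names are hypothetical and for illustration only.

```python
# Values copied from the record above.
START_BP = 289230
END_BP = 290841
PROTEIN_LENGTH_AA = 514


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_length_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon (assumed)."""
    return (protein_length_aa + 1) * 3


gene_length = span_length(START_BP, END_BP)
cds_length = coding_length(PROTEIN_LENGTH_AA)

# Prints: 1612 1545 67
print(gene_length, cds_length, gene_length - cds_length)
```

Under these assumptions, the 67 bp difference appears to correspond to the bases that precede the first ATG in the gene sequence listed in the Sequence section.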
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.000501593 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0333028 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GACGCTCGCG CGCGCGTTCG CCGTCTCCCG CGCGTCGCTC GCGCCGAGAC GCCGGCGCGC GCGCGCGATG GCGACGCGCG AGGCGTCGTA CGACGCGCTC AAGCGCGAGC TCGCGACGCT GAACGACCTG CGCTCGCTCG AGGGGCTCGC GGGATGGGAC GAACTCGTGA TGATGCCGTC GGGCGACGGC GCGGCGAACG CGCGCGCGCG CGCGATGGCG ACGCTCGCGG GCGTGATTCA CGACAAGGCG ACGTCGAAGG CGCTCGGTGC GCTGATCGAG GCGTGCGATG GGACGCCGCG CGGTAAGGAT GGGACGCTGC GCGGTAAGGA TGGGGCGAAC GTGCGAAGAG CGAAGGAGAG CTACGCGAAG GCGACGGCGA TTCCGAGCGA GATGGCGAAG AAACGAGCGG AGCTCGGCTC GCGCGGGTAT CAGACGTGGG TGAAGGCGAG AGAGAGCGGA GAGTACGGCG CGTTCGCGCC GGTGCTGGAG GAATGGGTGG CGTTGACGAA GGCGCGGTGC GAATTGATCG CGCCGGGCGC GGACGCGTAC GACGTCGCGT TGGACGACTA CGAGCGAGGG ATGACTTCGG CGCGACTGCG TGAGATTTTC ACCGTAGTGC GCGAGGGCGT GGTGCCGCTG ATCGAAAAGG TGTACGCGAA AGATGGTCCA AAGGCGCTGA GAGGACGCGC GAATCCGCTC AGTGGGACGT TTGACGTCGA TAAGCAGGCT GAACTCGCGA GAGACGTCGC GGTTGCTCTG GGATTTGATT TAACCAAGGG GCGCTTGGAC GTCAGCGTGC ATCCTTTCAC GGGTGGTTGC GGGCCAGATG ACGTCAGAAT GACAACGAGG TACAAAGCAG ATGATTTGCT CGAAGGTTTG TCAGGATGCG TGCACGAAGC CGGTCACAGC GCGTACGAGC AAGGACGCTC AGTCGACTAC CGCGGGCAAC CGGTGAGCGA AGCGCACTCG ATGGGTGTGC ACGAATCGCA GTCGCTGCTC TGGGAGCGCA TGGTGGCGCT GAGCGAGCCT TTTGCGCATT TCCTTCTCCC AAAGTTACAA TCGACGTTTC CCGGGAGGTT CGATGGCGTC ACCGAGGAAG CGCTTTATGC CGGGTACAAC GTCGTGAAGA AGCCGAGCGT CATTCGCGTC GAATCCGACG AAGTCACCTA TCCCATGCAT GTTATCCTAC GAACGGAATT GGAGATGGAT TTGTTGAGCG GTAAAATTAC CGTACATGAT TTGCCCAAGC TCTGGAACGC GAAGATGAAG GAATATCTGA ACGTTGACAT CGAGAACGAC AAACAAGGAG TTTTACAAGA CGTGCACTGG GGGTCTGGAG CCATCGGCTA CTTCCCGACG TACATAATCG GCCAAATATT GGCGTGTCAA ATCTTCAACG CCGCCAAACG CTCGATCGAT GATTTGGACG GGCAGATCAA GCGAGGTGAA TTCGCCCAAC TCTTGGCTTG GTTGCGCGTC AACGTGCACG AGCGCGGATC AGAGTGCGAC AGCGTAGACG AGTTGATGAT GAAAGTCACA GGCAAGCCGC TCGATGCGGC AGAATTCGTG ACTTATTTGA CTGAAAAGTA CACCAAGCTC TATGATCTTT GA
Protein sequence | MATREASYDA LKRELATLND LRSLEGLAGW DELVMMPSGD GAANARARAM ATLAGVIHDK ATSKALGALI EACDGTPRGK DGTLRGKDGA NVRRAKESYA KATAIPSEMA KKRAELGSRG YQTWVKARES GEYGAFAPVL EEWVALTKAR CELIAPGADA YDVALDDYER GMTSARLREI FTVVREGVVP LIEKVYAKDG PKALRGRANP LSGTFDVDKQ AELARDVAVA LGFDLTKGRL DVSVHPFTGG CGPDDVRMTT RYKADDLLEG LSGCVHEAGH SAYEQGRSVD YRGQPVSEAH SMGVHESQSL LWERMVALSE PFAHFLLPKL QSTFPGRFDG VTEEALYAGY NVVKKPSVIR VESDEVTYPM HVILRTELEM DLLSGKITVH DLPKLWNAKM KEYLNVDIEN DKQGVLQDVH WGSGAIGYFP TYIIGQILAC QIFNAAKRSI DDLDGQIKRG EFAQLLAWLR VNVHERGSEC DSVDELMMKV TGKPLDAAEF VTYLTEKYTK LYDL
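As a rough consistency check between the two sequences, the start of the gene sequence can be translated and compared with the start of the protein sequence. This is a minimal sketch, not the pipeline used to produce the record: the Translation table field is empty, so the standard genetic code is assumed, and the 67-base offset to the start codon is inferred from the sequence itself. `translate` and `GENE_PREFIX` are hypothetical names.

```python
# Standard genetic code (an assumption; the record leaves "Translation table" blank).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str, offset: int = 0) -> str:
    """Translate DNA from a 0-based offset, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(offset, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 100 bases of the gene sequence, copied from the record.
GENE_PREFIX = (
    "GACGCTCGCG CGCGCGTTCG CCGTCTCCCG CGCGTCGCTC GCGCCGAGAC "
    "GCCGGCGCGC GCGCGCGATG GCGACGCGCG AGGCGTCGTA CGACGCGCTC"
)

# The ATG matching the protein start sits at bases 68-70, i.e. 0-based offset 67.
print(translate(GENE_PREFIX, offset=67))  # MATREASYDAL
```

The result matches the first eleven residues of the listed protein sequence (MATREASYDA L...), which supports reading the first 67 bases as sequence upstream of the start codon.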