Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_34179 |
Symbol | |
ID | 5000789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | - |
Start bp | 852779 |
End bp | 854512 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | |
GC content | 56% |
IMG OID | 640416210 |
Product | predicted protein |
Protein accession | XP_001417073 |
Protein GI | 145345125 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | [TIGR02290] oligoendopeptidase, pepF/M3 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.137613 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.109742 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGCGG TGGAGAAGGC GTGCGAAGGA TTCAAGGCGA CGTACGAGGG GACGCTCGAC ACGTCGCTGC TCGCGGCGAT CGAAACGTAC GAGGGCATCG AGCAAACGCT GACGACGGTG CTGAGCTACA TGTCGTTGAG CGCGGACACG CGGTTGATGG ATGATGCGGT GCAGCAAAGA AAAGGGGCGC TGATGCAGCA GTACGGACAG GCGAGCGGGA ATTATTTGAC GTGGTTCGGA CTCGAGTTGG CGGATATGGC GGAGAGCGCG TTGGAGGCGC AGTACGCAAA GGACGCAAAG TTGAAGAATT ACAAGCCGTA CATCGATGAG GTGCGTCGAT CGGCGCCGTA TAACTTGTCC AAAGAAGTCG AGCGGGCGTT GACCGTGCGC AGCTCGTTCA CGGGCAAGGG CAAGGTTATT GAATTCTACG GCAACGAGCT CTCTCGGTTG CAGTTCGACC TCGACGGCAA AAAGGTCAAC ATGGAAGTTT TGCTGGCGAA GATGACGAGC AGCAAGGACA CGATGGTGCG TCGCCAGTGC ATGAAGACGC TCAACGATGG ATTGAAGGAT TTTGAGAGAA TCTGCGCGCT TTCGCTCAAT ATGGCTGCCG GGTCCTGGGC AATCGAAAAC AAGGAGCGCG GGTTTAAGAA TTTGCGTTCG TCGCGCAACC TGAGCAACAA CTTGCCCGAC GACGTCGTCG ACTCGCTCTT GGCCGCCGTG GCGACCACAG GTGTGGATTA CTGCAAGAAG TACTACACGT TGAAGAAACA AATCTTGAAG TCGACGCAAG GACTTGAAAC CTTCACTTGG GCCGATCGCA ACGCTTCCAT CGACATCGGT TCGGGCGAGG ACAACTACTC GTGGGAACAA GCGGTGGAAC TCGTGCACAA GGGCTACAAC AAGTTTTCTC CCACTATGGC GGACATGTTC ACGAAGTTCG TCGACGAAAA GCGCATCGAC GTGCCGGCGC AAGATGGGAA AAGAGGCGGC GCATACTGCT CTTCGGCGTA CGGCTCCGGC CCGTTCCAGC TTCTCAACTT TACCCACTCC GCGCGTGACG TCGCCACGCT GGCGCACGAG TCTGGGCACG CAGTCCATTT TGAGTTGAGT TACCCGCAAG GCATCTTGCA ATTCCACCCG CCGCTCACGC TCGCGGAAAC CGCGAGCATC TTTGGAGAAA TGATTGTGTT CCGTGACTTG TTGGAGAAGA CTCCGAGCGA TGAAGATCGT TTGGCGATGA TCATGTCCAA GGTGGACGAC ATCATCAACT CCGTCGTGCG TCAATGCTCT TTTGATAAAT TTGAAGAAAA GGTGCACACC ATGCGCGCCA AGGGTACGGT GACGCCGGAG GAAATGTCCA AGGCGTGGCG CGAGTGCACG GAAGAGTATT ACGGCAAGGA GGGTGAAATC TTCGACTCTC TCGAAGACAC TTCTCACCTT TACGCGTACG TCTCGCACTT CCACAACGTC GCCTTCTACG TCTACGCTTA TGCATTCGGT GATCTCTTGG TCGGCTCGTT GTACGGCGCG TACATGAAGC AACCGGAGGG CTTTGAAGCG AAGCTTCTCG ATCTCTTGCG AGCCGGCGGC ACGAAGGACT TCGTCGAGGC GGTCGAACCG TTTGGTTTGG ACCCGAAATC GGCGACGTTC TGGAGCGACG CGTTGCATGC CCACCTCGGT GGTCTTATGG CCGAGGCTGA AGAGTTGTCC AAGAAGCTCG GATACTCTTC GTAG
|
Protein sequence | MSAVEKACEG FKATYEGTLD TSLLAAIETY EGIEQTLTTV LSYMSLSADT RLMDDAVQQR KGALMQQYGQ ASGNYLTWFG LELADMAESA LEAQYAKDAK LKNYKPYIDE VRRSAPYNLS KEVERALTVR SSFTGKGKVI EFYGNELSRL QFDLDGKKVN MEVLLAKMTS SKDTMVRRQC MKTLNDGLKD FERICALSLN MAAGSWAIEN KERGFKNLRS SRNLSNNLPD DVVDSLLAAV ATTGVDYCKK YYTLKKQILK STQGLETFTW ADRNASIDIG SGEDNYSWEQ AVELVHKGYN KFSPTMADMF TKFVDEKRID VPAQDGKRGG AYCSSAYGSG PFQLLNFTHS ARDVATLAHE SGHAVHFELS YPQGILQFHP PLTLAETASI FGEMIVFRDL LEKTPSDEDR LAMIMSKVDD IINSVVRQCS FDKFEEKVHT MRAKGTVTPE EMSKAWRECT EEYYGKEGEI FDSLEDTSHL YAYVSHFHNV AFYVYAYAFG DLLVGSLYGA YMKQPEGFEA KLLDLLRAGG TKDFVEAVEP FGLDPKSATF WSDALHAHLG GLMAEAEELS KKLGYSS
|
| |