Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_28244 |
Symbol | |
ID | 5006250 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009371 |
Strand | + |
Start bp | 64828 |
End bp | 66531 |
Gene Length | 1704 bp |
Protein Length | 526 aa |
Translation table | |
GC content | 59% |
IMG OID | 640421671 |
Product | predicted protein |
Protein accession | XP_001422089 |
Protein GI | 145355699 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.719833 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GACGCGGCGA GGAGGTCGAG AGATGTCTCG CGCGCTCGTC GCGCTGAGCG CCGCGCTCGC GCTGAGCGCG CGACGCGTCG ACGCGAACGA TGTCGTCGAC GACGCCGCGC GGACGCGAGA CGCGCTGACG TCGCGATTGA CGACGAAAAC GGTGCGACTC GAACGATTCG CGACGGATTT GGAATCTCTA GCGGCGAATG ATTACGACGA GTACGCCGCG TCGAGCGGGT ATTTCGCGCT CAATCGCACG ACGAAGGACG CGCACATGTT TTACACGTTT TTTGATGCGC GCTCGGGAGG CGCGGAGAGC GAGGACGCGA TCCCAATCAT TCTGTGGCTC ACGGGCGGAC CGGGATGTTC TTCGGAATTG GCAGCGTGCG TGTCGTCGGC GCGCGCGCGC GCACGCGCGA TCTTTCGCGT CTTGAACGAT ACAGGAGAGG GACGCGACAC TGACTGACCA CCGTCTTCCA CACTAGTCTA TACGAAAACG GACCGTTCGC GTTCGACGAA GACGACGCGA CGAAATTGAA GCGACGCAAA TACGCGTGGA ACGACGCCGG AAGGTTGCTT TACGTGGACT CGCCGGTGAA CACGGGATTT TCGTATTCGA GCTCGCGGCG CGACGCGGCG AAGGACGAGA CGACGGTGGC GAACGATTTG TTGGAGTTTT TGTACGCCTT CATGTTGAGT AGACCGATGC TCGTGGATGC GCCGGTTTAC GTCACGGGAG AATCGTACGC GGGGCACTAT GTGCCGGCGT TCGCGAGGGC GATTTTCGAC GCCAACGCTC GAGACGATGG ACCCGTGAGA ATAAATCTTC AAGGCTTAGC CATCGGAAAT GGGCTGACGG ATCCCGCTAT TCAGTACGCG GCGTACGCGG ATTATTCGCT CGGGAACGAC ATCGTGAGCG CGGCGACGGT GAAGCAAACG GCGAAGAAAT TACCGTCGTG CGTGGAGAAA ATCAAGTCGT GCGCGAGCGG TAAAACGTCG AGCAAGGAAA ATCGCGCCGA ATGCTTGGAC GCGGTGGATT CGTGCCAAGC CATTCCTGAG GCATTGCTCG AAGATGCCGC TGAACGCAAC GGTGGGAAGG CAATCAACGT GTACGACATA CGTAAATCGT GCGACGCCGA GCTTTGTTAC GATTTCAGCG CCGCGGAAGC GTTTTTGAAC CGTAAAGACG TTCAAGAAGC GTTCGGGGTG AGTAAGAAAT GGGAAATGTG CGACGCGAGC GTGCACCAAG ATATGATGGG GGATTGGATG CACGACTACG AGACGTTGAT TCCAGACATG ATCGAGGCCG GGATTCGCGT GATGATTTAC GCCGGCGAGG ATGATTTCAT CTGCAACTGG CTCGGAAATC TTCGCTGGGT GAAGGCGATG CAATGGAACG GACGCGAAGC GTTCAACGCC GCACGTCCTG AGCCTTTCAT CATCCAGGGT GCGGGTGACG GCGAAGACGA CGTTGTGGGT GGCGACGTGC GCGAACACGG CGGTTTATCC TTCGTAAAGA TCAGCGAAGC GGGACACATG GTACCTATGG ATCAGCCACG AAACGCGCTG ACAATGATTC AGCGCTTTGT GAACAACGAA CCGATCGCGC GCGGTCGAGG TGGTGACGAG CCGAAACTCT CCGCTGCACC ACGACGTTTC GGCCCCGTCG AAGACGACGT CGTTGGCCGC CTCGCTGTGG CGACTCAGAA ATGA
|
Protein sequence | MSRALVALSA ALALSARRVD ANDVVDDAAR TRDALTSRLT TKTVRLERFA TDLESLAAND YDEYAASSGY FALNRTTKDA HMFYTFFDAR SGGAESEDAI PIILWLTGGP GCSSELAALY ENGPFAFDED DATKLKRRKY AWNDAGRLLY VDSPVNTGFS YSSSRRDAAK DETTVANDLL EFLYAFMLSR PMLVDAPVYV TGESYAGHYV PAFARAIFDA NARDDGPVRI NLQGLAIGNG LTDPAIQYAA YADYSLGNDI VSAATVKQTA KKLPSCVEKI KSCASGKTSS KENRAECLDA VDSCQAIPEA LLEDAAERNG GKAINVYDIR KSCDAELCYD FSAAEAFLNR KDVQEAFGVS KKWEMCDASV HQDMMGDWMH DYETLIPDMI EAGIRVMIYA GEDDFICNWL GNLRWVKAMQ WNGREAFNAA RPEPFIIQGA GDGEDDVVGG DVREHGGLSF VKISEAGHMV PMDQPRNALT MIQRFVNNEP IARGRGGDEP KLSAAPRRFG PVEDDVVGRL AVATQK
|
| |