Gene Information |
Locus tag | OSTLU_4151 |
Symbol | |
ID | 5000009 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | + |
Start bp | 858141 |
End bp | 859037 |
Gene Length | 897 bp |
Protein Length | 299 aa |
Translation table | |
GC content | 60% |
IMG OID | 640415430 |
Product | predicted protein |
Protein accession | XP_001415624 |
Protein GI | 145341040 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
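The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed 897 bp). The function name `feature_length` is hypothetical and is not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_len = feature_length(858141, 859037)   # 897
codon_count, remainder = divmod(gene_len, 3)  # 299 codons, remainder 0

print(gene_len, codon_count, remainder)
```

Under that assumption the span holds 299 complete codons, which matches the listed protein length of 299 aa, so the listed span does not appear to include a stop codon.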
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.690672 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | AATTCGTTCG TGCGGGATGC GGCGGACACC ATCGGACCCG CGGTCGTGCG CATAGACAGC GATCAAGCGT CGCTGAGGAA GGCGATGACG GTGAGTGGAT CGGTGAGCGA AGGACGGGTG GGCGGGAAGC CGACGAGCGG GCGAAGCGGC GTGATTCGCG GCACCGGCTG CGGATTGGTG ATCGATGCCG AGGAAGGGTA CGTGGTGACG AACAGTCACG TGGTGAAGCG GGAGGAAAAG GTTAAGGTGA CGTTCATCGA TGGGAACGTG TACGACGGGG AAGTGAAAGG GGTGGATTCG TTGACGGATA TCGCGCTGAT TAAGCTCAAG CCGCGAGCGG GGCACGCGCT GCCGGTGGCG ACTTTAGGGA ATAGCGACAG CGTCGAGGTT GGGGATTACG CCATCGCGCT CGGGAACCCT TTGGGTTTGG ATAACAGCGT GACTTTGGGG ATCATCAGTA ACGTGCACAG GACTTCGGCG GAGCTCGGGA TTACGGACAG GCGGGTCGAT TTCGTGCAGA CGGATTGCGC CATCAATCCG GGGAATTCCG GCGGGCCACT GGTGAATGAA TTCAGTGAGG TCGTCGCGTT GAACACTGCC ATCCGCGCCG ACGCGGAAGG CATCGGATTC GCGATCCCGA TCAACACCGT GAAGCGCGTT GCGAGTCTAC TCGCCAACGG CGAAAAGGTG CCGCATCCGT TCATTGGTAT ACGCATGGTA GATAACTTCA CAGCGAACTC AGCGTTTGGT GACCGCGCGC TGGAACTTCC GGTGGGAACT GTGATAGTAG ACGTGATCGA AAAGTCACCC GCTGCTCTCG CGGGACTGGC TGTCGGCGAT GTCATCACCG CCGTCAACGG CGTCGTGATG AACGGCGCGC AAATTCTCGA TCGCGTA
Protein sequence | NSFVRDAADT IGPAVVRIDS DQASLRKAMT VSGSVSEGRV GGKPTSGRSG VIRGTGCGLV IDAEEGYVVT NSHVVKREEK VKVTFIDGNV YDGEVKGVDS LTDIALIKLK PRAGHALPVA TLGNSDSVEV GDYAIALGNP LGLDNSVTLG IISNVHRTSA ELGITDRRVD FVQTDCAINP GNSGGPLVNE FSEVVALNTA IRADAEGIGF AIPINTVKRV ASLLANGEKV PHPFIGIRMV DNFTANSAFG DRALELPVGT VIVDVIEKSP AALAGLAVGD VITAVNGVVM NGAQILDRV
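The relationship between the gene sequence and the protein sequence can be checked with a short script. This is a sketch only: the record leaves the translation table blank, so the standard genetic code (NCBI table 1) is assumed here, and the helper names `clean`, `translate`, and `gc_percent` are hypothetical. The sequences are printed in space-separated blocks of 10, so whitespace is stripped first.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
# ASSUMPTION: the record does not state which translation table applies.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; unknown codons become 'X'."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record.
prefix = "AATTCGTTCG TGCGGGATGC GGCGGACACC"
print(translate(prefix))  # NSFVRDAADT
```

The ten residues produced from the first 30 bases match the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the listed GC content.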