Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_38040 |
Symbol | |
ID | 5004062 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | - |
Start bp | 192915 |
End bp | 194288 |
Gene Length | 1374 bp |
Protein Length | 437 aa |
Translation table | |
GC content | 56% |
IMG OID | 640419483 |
Product | predicted protein |
Protein accession | XP_001420108 |
Protein GI | 145351488 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.990763 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00560331 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGAAGT ATCCTCGCTG GCTCGAGATT ACGGCGTTCC TGTTCAACGC GTTGACGCCG ATGGTGATGG TGTTCTATGC GTGGCTTATT TACGAAGGCA CGTACAAGGA TAGCTCGGAA GACATGTTCG GTAACATGAC TACGCGTAAC TACGACTCGA ACGTTCGCCA AGGGATGACG TTGAAGGACA TCACCGGCAT CGACAACGTC AAGGCGGAAA TGTTTGAACT CATTTCCTAC TTGAAGGATT TCGAAAAGTA CAACTCCATG GGCGCGCGCA TTCCCGCAGG CGTGCTTTTG TGCGGTCCGC CCGGTACGGG TAAGACGTTG CTCGCTCGTT GTGTCGCGGG CGAGGCAAAT GTGCCCTTCT TCTCATGCGC TGGTACGGAG TTTATGGAGA TGTTCGTCGG CGTCGGAGCC GCGCGTATTC GCAACTTGTT TGATCAAGCC AAGAAGGTTG CGCCGTGCAT CATCTTCATC GATGAATTCG ATGCCGTCGG CACGAAGCGT ACTGAGACAC AATCCGGTCA GGTTTACGGT AACGACGAAG CGACGGCGAC GATCAATCAA ATGCTGACGG AGATGGACGG TTTCTCGACC GCCACGGGCA TCATGGTGTT GGCGGCGACG AACCGTCCGC AAGTACTCGA TCCTGCGCTC ATTCGTGCCG GTCGTTTTGA TCGCATCATC GAGATGGGTC TGCCGAACAA GAAGTCGCGT CAAGAAATCT TGTTCTTGCA CTGCAACAAG CCATCGTTCG CGTCGAGTGT CGATCCCAAC TTGGACTACG AGTACCTCGC CAGACAAACT GCCGGTTTCA GCGGCGCCGA CATCGAGAAC CTCACCAAGT CGGCCGTCAT GCGCTGTGCG CAAGGCGAGA AGGCGCTCGC GTCTACGGGT GACTTCTTGT TTTGCATCGA CGATATTCGA CGATCGCAAG CGTTCGTTCG CAACGGAAGT GGCAGCGGAA GTTTGGCTCG CGATAGAATG CTCGAAGATA CCCTCATCGC GCAACTCGAC GCGTACGAAC GAGACTCGGT GGTGAATTAT TACGCCGCGC AGGCCGTGGT CGCGATGCAC ATGCCTTCGT ACGACGAGAT TAGCAAGGTG ACGGTGTTTA ACGGTGGTGT AGCCACGGGT CAAATCGTCT ACGTGCCGGA TGAAGTCGAC TCTCCCGCGG CTCGCACGGT GCGCTCCATG GAGTATTACG AAGCCAAGCT GTGCGTACTC CTCGCAGGCC AGATGGCGGA GCGTTATTTG TACGGTCCCG AGAACGTAAC GACCCGCGGC ATGCACGATG TCGCCGCTGC GACGAATTTG GCGTGCGAAA TGGTGATGCA GAACGGGTGG AGTGATTTAG GTCCCATCGC CCTC
|
Protein sequence | MEKYPRWLEI TAFLFNALTP MVMVFYAWLI YEGTYKDSSE DMFGNMTTRN YDSNVRQGMT LKDITGIDNV KAEMFELISY LKDFEKYNSM GARIPAGVLL CGPPGTGKTL LARCVAGEAN VPFFSCAGTE FMEMFVGVGA ARIRNLFDQA KKVAPCIIFI DEFDAVGTKR TETQSGQVYG NDEATATINQ MLTEMDGFST ATGIMVLAAT NRPQVLDPAL IRAGRFDRII EMGLPNKKSR QEILFLHCNK PSFASSVDPN LDYEYLARQT AGFSGADIEN LTKSAVMRCA QGEKALASTG DFFLARDRML EDTLIAQLDA YERDSVVNYY AAQAVVAMHM PSYDEISKVT VFNGGVATGQ IVYVPDEVDS PAARTVRSME YYEAKLCVLL AGQMAERYLY GPENVTTRGM HDVAAATNLA CEMVMQNGWS DLGPIAL
|
| |