Gene Information |
Locus tag | OSTLU_35099 |
Symbol | |
ID | 5003803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | + |
Start bp | 414541 |
End bp | 416496 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | |
GC content | 61% |
IMG OID | 640419224 |
Product | predicted protein |
Protein accession | XP_001419590 |
Protein GI | 145350390 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
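The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, that the gene is unspliced (as the record states), and that the gene length includes the terminal stop codon (the listed gene sequence ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS; the stop codon encodes no residue."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp 414541, End bp 416496.
length = gene_length_bp(414541, 416496)
print(length)                     # 1956
print(protein_length_aa(length))  # 651
```

Under these assumptions, 1956 bp corresponds to 652 codons, of which 651 encode residues, which matches the listed protein length.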
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.235498 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000591243 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGACGAACG GCGCGTTCGC CGGGGCGGCG CGCGCGGACG CGTTCAACGC GGCGCCGACG GAACAAGTGC AACAACAGCG CGGTGAGGCG GTGTCCGCGT TCGCGGATGC GTCCAAGGAA GCACCGGCGG TGAACGCCGA TGGGTTGCCG GAAGGGAACA ACTGGCGTTA CAGCGAATTC ATCAAGGCGG TGATGAGCGG CAAGGTTGAG CGCGTGCGCT TCTCCAAGGA TGGCTCCGCG TTGCAGTTGA CCGCGGTCAA CGGTGCGCGC GCGACCGTCA TCTTGCCCAA CGATCCGGAA CTCGTCGACA TCCTGGCCAA GAATGGTGTC GACATCTCCG TCAGCGAAGG CGAGCAACAG GGCAACGCCG CCTCTTTGGT CGGCAACTTG TTGTTCCCGC TCGTGGCGTT CGGAGGTTTG TTCTTCTTGT TCCGCCGCGC GCAAGGTGGC GATGGCGGCA TGGGTGGCAT GGGTGGTATG GGCGGTGGAC CCATGGACTT CGGTAAGTCC AAGTCCAAGT TCCAAGAAGT TCCGGAAACC GGTGTGACGT TCGCGGACGT CGCCGGCGTC GAAGGCGCCA AGTTGGAATT GCAAGAAGTC GTCGACTTTT TGAAGAACCC AGACAAGTAC ACCGCGCTCG GTGCCAAGAT CCCGAAGGGT TGCCTTTTGG TCGGTCCGCC GGGTACCGGT AAGACCCTGA TCGCCAAGGC GGTCGCCGGT GAAGCCGGTG TGCCGTTCTT CTCTTGCGCC GCGTCCGAGT TCGTCGAACT CTTCGTCGGC GTTGGCGCGT CTCGCGTTCG CGACTTGTTC GAAAAGGCCA AGGCCAAGGC TCCGTGCATC ATCTTCATCG ATGAAATCGA CGCCGTCGGT CGCCAACGTG GCTCCGGTAT GGGTGGTGGC AACGACGAGC GCGAACAGAC CATCAACCAG CTTCTCACCG AGATGGATGG TTTCGAAGGC AACACGGGCG TCATCGTCCT TGCGGCGACG AACAGACCGG ACGTGCTCGA TAGCGCGCTC CTTCGCCCGG GACGTTTCGA TCGTCAAGTT ACCGTCGATC GTCCGGACGT CGCTGGTCGC ATCCGCATCC TCAAGGTGCA CGCCCGTGGC AAGACTTTGG CCAAGGACGT CGACTTCGAC AAGATCGCTC GCCGTACGCC GGGTTTCACG GGTGCCGATT TGGAAAACCT CATGAACGAG TCCGCGATTC TCGCCGCGCG CCGTGAACTC ACGGAAATCT CCAAGGAAGA AATCGCCGAT GCTCTCGAGC GCATCATCGC CGGTGCCGCC AGAGAAGGTG CCGTCATGTC TGAGAAGAAG AAGAAGCTCG TGGCGTACCA CGAAGCTGGC CACGCGCTCG TCGGGGCCCT CATGCCGGAT TACGACGCCG TGACGAAGAT TTCCATCGTC CCGCGCGGTA ACGCCGGTGG TTTGACTTTC TTCGCCCCGA GCGAAGAGCG TCTCGAATCT GGCTTGTACT CTCGCACGTA CCTTGAGAAC CAAATGGCTG TCGCCATGGG TGGTCGCGTC GCCGAAGAAC TCATCTTCGG CGCTGAAGAC GTCACCACGG GCGCGTCCGG TGATTTCCAG CAAGTCACCC GCACCGCGCG TATGATGATC GAGCAAATGG GTTTCTCCAA GCGAATTGGT CAAATCGCCA TCAAGTCTGG CGGCGGTAAC TCTTTCCTTG GCAACGACAT GGGTCGCGCC GCTGATTACT CCGCCGCCAC CGCCGCCATC GTCGATGAAG AAGTCAAGAT CTTGGTCACT GCGGCCTACC GCCGCGCCAA GGACTTGGTT 
CAATTGAACA TGGACGTCTT GCACGCCGTC GCGGACGTGT TGATGGAGAA GGAGAACATC GACGGCGACG AATTCGAGCG CATCATGCTC GGCGCCAAGT CGGAGCTCTA CCTCAAGGCG GACGAGCCTT CGGTCGCAGT GCCGTACCAA AACTGA
|
Protein sequence | MTNGAFAGAA RADAFNAAPT EQVQQQRGEA VSAFADASKE APAVNADGLP EGNNWRYSEF IKAVMSGKVE RVRFSKDGSA LQLTAVNGAR ATVILPNDPE LVDILAKNGV DISVSEGEQQ GNAASLVGNL LFPLVAFGGL FFLFRRAQGG DGGMGGMGGM GGGPMDFGKS KSKFQEVPET GVTFADVAGV EGAKLELQEV VDFLKNPDKY TALGAKIPKG CLLVGPPGTG KTLIAKAVAG EAGVPFFSCA ASEFVELFVG VGASRVRDLF EKAKAKAPCI IFIDEIDAVG RQRGSGMGGG NDEREQTINQ LLTEMDGFEG NTGVIVLAAT NRPDVLDSAL LRPGRFDRQV TVDRPDVAGR IRILKVHARG KTLAKDVDFD KIARRTPGFT GADLENLMNE SAILAARREL TEISKEEIAD ALERIIAGAA REGAVMSEKK KKLVAYHEAG HALVGALMPD YDAVTKISIV PRGNAGGLTF FAPSEERLES GLYSRTYLEN QMAVAMGGRV AEELIFGAED VTTGASGDFQ QVTRTARMMI EQMGFSKRIG QIAIKSGGGN SFLGNDMGRA ADYSAATAAI VDEEVKILVT AAYRRAKDLV QLNMDVLHAV ADVLMEKENI DGDEFERIML GAKSELYLKA DEPSVAVPYQ N
|
| |
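The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any calculation. The sketch below shows one way to strip the whitespace, compute a GC fraction, and translate the start of the gene. Because the Translation table field is empty in this record, the standard genetic code (NCBI table 1) is an assumption here, and the helper names are hypothetical. It uses only the Python standard library.

```python
# Standard genetic code (NCBI table 1) in TCAG order. This is an assumption,
# since the record leaves the "Translation table" field empty.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Codons with characters outside TCAG are reported as 'X'.
    """
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_30 = "ATGACGAACG GCGCGTTCGC CGGGGCGGCG"
print(translate(first_30))  # MTNGAFAGAA, the first ten residues listed above
```

Passing the full gene sequence to `gc_fraction` and `translate` allows a comparison with the GC content and protein sequence fields. Since `clean_sequence` removes all whitespace, a sequence pasted across several lines is joined automatically.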