Gene Information |
Locus tag | OSTLU_44098 |
Symbol | |
ID | 5004568 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | - |
Start bp | 383927 |
End bp | 385510 |
Gene Length | 1584 bp |
Protein Length | 494 aa |
Translation table | |
GC content | 56% |
IMG OID | 640419989 |
Product | predicted protein |
Protein accession | XP_001420493 |
Protein GI | 145352310 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0738436 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTTTT TGGCGTACTC CGTCGGCGGC TTGGCGTTTA TGATCGGTTC GATTTACTTG GGCATCGTCC GTAGGACGTC GGTGCCGCAA GATCAGTTCC AGGCGATGCA GTTTGCGCAG TCCCGCGCGG GTGCGAGGCG AGACGGCACG GTGGACGTGA CTTTGGAAGA CGTTGGTGGG TTGGAGAACA TCATAGAAGA TTTGGAGGAA GTCGTCGCCT TTTTAAAGGA ACCCGAGCGT TTCGCCAAGG TTGGCGCTAG GCCCCCCAAG GGTTTGCTCA TGGAAGGAGG CCCGGGCGTT GGTAAGACGC TGATCGCGAA GGCGATCGCG GGTGAAGCCA AAGTGCCGTT CTACTCCATG TCTGGCTCTG AGTTTGTTGA AATCATCGTC GGCGTCGGCG CGGCTCGAGT GAGAGATTTA TTTAAACGCG CGCGCATCAA CGCGCCGTGC TTGATATTCG TCGACGAAAT CGATGCGTTA GGCACGAAAC GCGCCGCTGC TGGCACGCGT GGCACGGAAG AGCACGAACA AACGCTGAAT CAGCTTCTCA CGGAGATGGA TGGTTTCACG CCAGACACTG GCGTCGTCTT CATCGGCGCG ACAAACCGAG CCGATTTACT CGATCCCGCG CTGTTGCGGC CAGGTCGATT CGATCGTAAA GTTCGCGTCG GCCTTCCAAA CGTCGAGGCA CGGGCTAAGA TTTTGCAGAT TCACTTGTCC AAGCGAAACT GCAACCCAGA AATTGACACC AAGCGATTGG CGCAGAACCT TCCCGGTTTG TCGGGCGCCG AGATTGCAAA CATTTGCAAC GAAGCCGCGG TGCACTGCGT GCGACGCCAG GGTGAACAGA TCGAAGAGCA CGACGTCTTG GATGCCGTTG AGCGCGTCGT GAGCGGCATT CGTCTTACAG CACACCCCAA GGAAAGCGTG ACGACGCGCA AACTCGCGGC GCACGAAGTC GGCCACGCCC TGGTCCAGAA CTTGCTCCAT AAGAGCAACG GTCTCATCGA AGACATCGAG ATGATTTCCA TCATTCCGCG AGGTTTCGAG CCAGCGATTA CGCTGATCCA GAGAAAGCGT GACGAGGATT ACCGATATCC TACGCGCGCG CGAATGTGCG AGCGCGTGCA AGTTTTGCTC GCGGGCCGCT CCTGCGAAAA AGTGTTGTTT GGAGAGGCGA GCACTCGCGG TTCGGAAGAT GTTTGCGAAG CGAATGATTT GCTTCGAAAT ATGATCGTAA ACTTTGGCTT GGGGCAGCCC GGCATGATGA CGACGTACAC ATACGACCCG AAGCATTTGA ACAAGTCGGA GAGACGAGTG GCGCGCTTGC AAGGTGCGGT GTCTAAATCT GGAGAGTTGA TGCCCTTGGA CGAACTGTTG ACGATTGCAG GTCCGATCAG AGAGGTAGTC CCCGATCACT ATCAATACGC CGAGCGCAAG ATGGTTCAGA TTCTTCAAGA AGCGGAGGCA AACTGTTTGG CGATCATCGC CGCGCACGAG GACGCAGTCA ACGCGATGGT GGATCGCTTG ATTGAAAACG AGACGCTTTC TTTGGCCGAG TTCGAAGAAA TCCTCGCGGC TCAT
|
Protein sequence | MEFLAYSVGG LAFMIGSIYL GIVRRTSVPQ DQFQAMQFAQ SRAGARRDGT VDVTLEDVGG LENIIEDLEE VVAFLKEPER FAKVGARPPK GLLMEGGPGV GKTLIAKAIA GEAKVPFYSM SGSEFVEIIV GVGAARVRDL FKRARINAPC LIFVDEIDAL GTKRAAAGTR GTEEHEQTLN QLLTEMDGFT PDTGVVFIGA TNRADLLDPA LLRPGRFDRK VRVGLPNVEA RAKILQIHLS KRNCNPEIDT KRLAQNLPGL SGAEIANICN EAAVHCVRRQ GEQIEEHDVL DAVERVVSGI RLTAHPKESV TTRKLAAHEV GHALVQNLLH KSNGLIEDIE MISIIPRGFE PAITLIQRKR DEDYRYPTRA RMCERVQVLL AGRSCEKVLF GEASTRGSED VCEANDLLRN MIVNFGLGQP GMMTTYTYDP KHLNKSERRV ARLQGAMVQI LQEAEANCLA IIAAHEDAVN AMVDRLIENE TLSLAEFEEI LAAH
|
| |
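The fields in this record can be cross-checked against each other with a little arithmetic. The sketch below is not part of the source database. It assumes that the start and end coordinates are 1-based and inclusive (which matches the stated gene length), and the helper names `clean_sequence`, `gene_length`, and `gc_percent` are hypothetical, introduced only for illustration.

```python
def clean_sequence(blocks: str) -> str:
    """Join the space-separated 10-character blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(blocks: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(blocks)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Values copied from the record above.
length_bp = gene_length(383927, 385510)  # matches "Gene Length | 1584 bp"
coding_bp = 494 * 3                      # bases needed to encode 494 aa
non_coding_bp = length_bp - coding_bp    # bases in the gene span that are not translated

print(length_bp, coding_bp, non_coding_bp)
```

The 102 bp difference between the gene span and the translated length is consistent with the record marking the gene as spliced. The listed gene sequence ends at the codon for the final histidine, so no stop codon is counted here. Passing the full "Gene sequence" field to `gc_percent` gives a value to compare against the "GC content" field.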