Gene Information |
Locus tag | OSTLU_38163 |
Symbol | |
ID | 5003937 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | + |
Start bp | 557031 |
End bp | 559505 |
Gene Length | 2475 bp |
Protein Length | 800 aa |
Translation table | |
GC content | 63% |
IMG OID | 640419358 |
Product | predicted protein |
Protein accession | XP_001420035 |
Protein GI | 145351332 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
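The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a stop codon. The function names are illustrative and do not belong to any database API. Under those assumptions the 2475 bp span and the 800 aa protein differ by 72 bp, which would be consistent with the gene being marked as spliced.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein (assumption: one stop codon)."""
    codons = protein_aa + (1 if include_stop else 0)
    return codons * 3


# Values taken from the record above.
gene_len = span_length(557031, 559505)   # genomic span of the gene
cds_len = coding_length(800)             # coding bases implied by 800 aa
noncoding = gene_len - cds_len           # bases in the span that are not coding

print(gene_len, cds_len, noncoding)
```

The difference is only a plausibility check. It does not locate the intron or confirm the exon boundaries.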
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.158186 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.217653 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGA CGACGACGCG CGCGACGCGC GGGACGGTCG TCGTCGTCGG TGGGCGCGCG CGCGCGCGCG AGGGACGCGA GGGCGCGCGC TCGGGACGGA CGCGAAGGGA GGCGCGAGGG CGCGAGGACG AGGACGAGAC GGGCGAGGAC GAGGCGCGAG GCGCGAGCGC GGAGGGATCG AGCGAAACGA CGACGGGAGG CTCGGAAACG TCGACGTCCG AGACGGCGGC GACGGCGACG GGACGCGAGG GGGGGAAAGC GTCGTGGATA AGGGCGAAGG CGAAGGCGTT CGCGGACGGT TTACCCTTCG CGGCGAATAA AAAGAAGTGG GACGCGCTCT TCGCCGACGC CGACGCGAGT CCGCTGGACG CGGCGAAGCA GGACGTGTTG ATGCGAGAGC TGTTGGCGTA CGATAGGAAC GATGATTTGA TGAAGCGGTT CGAGACGAGA CGGTACGCGA GCGGGCCGGT GTCGGTGCTC GCGTACGTCA CGGCGCTCGT GCGGACGAAT AGATTGGAAC ACTTCGTCGC GGGCGGCGAT GCGGGATTCG GGAAGCCGTT GCCGAGCTTC GACGAGTCCA CGGCGTATCG CAAGCTTCCG GATTTGTTGG GCGATCTCTC CGAACGCTCG AAAGGCGCGG ACGCGCTCGT GCCTTTGGAG ACGGGGAAGA GCGCGCAGGC GCCGTTGCAC GTCGCCTTTG TCGGTGGTGT CGGTGGATTG GCGCCGCCGC AAAGCGCGGG GCGGCGATTG TTGAATGCGT TTTTAGGTTT CATGCTCTTC ATCGCGAGTT TAAGCTTCTT GAGCACGCTG GCGTTGCGTC ACATCGCGGT TCGCGTCATC GAGAGCGGGC CGAGTCACAG TTCGAGTCAC TCGTTGCCGG CGCCGCGAGA CGAATCGGGC TCGTCTGCGT CTTCGAGCTT GGGGCCGGGT TCGAACGGTG GTGGACCGAA CTTTGACCCG AAGCAGTTCA ACAAAGACAC GATGCCGGAG AAGAGTTTGA AGACGTTCGA CGACGTCAAA GGGTGCGACG AGGCCAAGGA CGAGCTCGCG GAAATCGTCG AGTATTTGAG AAATCCGGAA AAGTTTACGC GACTCGGTGG CAAGTTGCCG AAGGGCGTCT TACTCACGGG TCCACCGGGA ACGGGTAAGA CCTTGCTCGC GCGCGCGGTC GCGGGCGAGG CGGACGTGCC ATTCTTTTAT CGAAGCGGGA GCGAGTTCGA AGAAATGTTC GTGGGCGTGG GGTCCAAGCG CGTTCGCCAG CTCTTCGCCG CGGCGAAGAA GAAGACGCCG TGCATCGTCT TCATCGACGA AATCGATTCC ATCGGTACGT CGCGCAAGTC CATCGAGAAC CAGCACCGAA AGACGCTGAA CCAGTTGTTG ACGGAGATGG ATGGGTTCGA GCAAAACGAC GGCATCATCG TCCTGGCGGC GACGAACATC CCGGAGTCGC TCGACCCCGC GCTCACGCGT CCGGGGCGAT TCGATCGCAT GGTGCACGTG CCTAATCCCG ATATCGGTGG CCGCCGCGAG ATCCTCGAGC ATTACTTGGA CGACAAGCCG ACGACGAGCG ACGTGGACGT CGACAAAATC GCGCGCGGCA CCGCCGGATT TAGCGGCGCC GAGTTGTTCA ACCTGGTGAA CATGGCGGCG GTGCAAGCGG CGATGGCAGA CGCGCCGGCG ATCACCGCCG CGGATCTTGA TTGGGCGCGC GATCGCGTGT TGATGGGTGC CGAGCGCAAA TCCGCGGTAT TGAGCGAAGA AAACCGAAAG TTGACGGCGT ACCACGAAGC CGGACACGCG 
CTCGTGGCGT TGAAATCCGA CGCGGCGCTT CCGATTCATA AAGCCACCAT CATGCCTCGA GGGTCGGCGC TGGGGATGGT GATGCAACTT CCCGACAAGG ACGAGACAAG CGTGAATCGC AAGCAACTCA TGGCGCGTTT GGACGTGTGC ATGGGCGGTC GCTTGGCCGA GGAGCTCATC TTTGGCTCGG ACGAAGTCAC CACGGGGGCG AGCGGGGATC TGCAGCAAGC CACACGTTTG GCGTTTTACA TGATTAGCGA CGTCGGCATG AACACCAACT TAGGTCCGGT GCATCTTTCG AGCATTCGCG GTGGAAACGC CGGACGCGGT GCTTCTGGGT CCACGGAGTC CGCCGTCGAC GGCGAAGTCA TCAAGCTCTT GAAGGATTCG CAAACGCGCG TGCAAAAGCT GTTAAAATCC AACCTGAGCG ATTTGCACAC CATCGCCAAG GCGTTGATGG AGAAGGAAAC GCTCACGGGG AACGAAATCC GCGCGCTCAT CGGGATGCCT CCGGCGAAAG AACCGGTGGC CGTCGTATCG CGCGCCAAGC CCAAGGCGCC GAAGCCCAAG GCGCCCGAGG CCAAGGCGGA AAAACCGAGC GAAGACGCGG CGGAGGAGGA CGAAACGACG ATCATCGTCA TTCAGCTCAA GGACGAGGAC GATTCGCCCA AGTAG
|
Protein sequence | MTTTTTRATR GTVVVVGGRA RAREGREGAR SGRTRREARG REDEDETGED EARGASAEGS SETTTGGSET STSETAATAT GREGGKASWI RAKAKAFADG LPFAANKKKW DALFADADAS PLDAAKQDVL MRELLAYDRN DDLMKRFETR RYASGPVSVL AYVTALVRTN RLEHFVAGGD AGFGKPLPSF DESTAYRKLP DLLGDLSERS KGADALVPLE TGKSAQAPLH VAFVGGVGGL APPQSAGRRL LNAFLGFMLF IASLSFLSTL ALHESGSSAS SSLGPGSNGG GPNFDPKQFN KDTMPEKSLK TFDDVKGCDE AKDELAEIVE YLRNPEKFTR LGGKLPKGVL LTGPPGTGKT LLARAVAGEA DVPFFYRSGS EFEEMFVGVG SKRVRQLFAA AKKKTPCIVF IDEIDSIGTS RKSIENQHRK TLNQLLTEMD GFEQNDGIIV LAATNIPESL DPALTRPGRF DRMVHVPNPD IGGRREILEH YLDDKPTTSD VDVDKIARGT AGFSGAELFN LVNMAAVQAA MADAPAITAA DLDWARDRVL MGAERKSAVL SEENRKLTAY HEAGHALVAL KSDAALPIHK ATIMPRGSAL GMVMQLPDKD ETSVNRKQLM ARLDVCMGGR LAEELIFGSD EVTTGASGDL QQATRLAFYM ISDVGMNTNL GPVHLSSIRG GNAGRGASGS TESAVDGEVI KLLKDSQTRV QKLLKSNLSD LHTIAKALME KETLTGNEIR ALIGMPPAKE PVAVVSRAKP KAPKPKAPEA KAEKPSEDAA EEDETTIIVI QLKDEDDSPK
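The sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch, with hypothetical helper names, of stripping that layout and computing the GC fraction of a nucleotide string. The example uses only the first three blocks of the gene sequence, so its value is not expected to match the 63% reported for the full gene.

```python
def strip_blocks(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three ten-base blocks of the gene sequence shown above.
sample = "ATGACGACGA CGACGACGCG CGCGACGCGC"
clean = strip_blocks(sample)

print(len(clean), round(gc_fraction(clean), 3))
```

Pasting the full gene sequence into `sample` would give a figure to compare against the GC content field of the record.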