Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_32286 |
Symbol | |
ID | 5002527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | - |
Start bp | 610141 |
End bp | 613486 |
Gene Length | 3346 bp |
Protein Length | 1034 aa |
Translation table | |
GC content | 61% |
IMG OID | 640417948 |
Product | predicted protein |
Protein accession | XP_001418530 |
Protein GI | 145348173 |
COG category | [R] General function prediction only |
COG ID | [COG1026] Predicted Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.414837 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.522406 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGGCGACGCG CGCGCGCGCG AAGGAACGAA CGCGCGACGG ACGAGGGACG AGCGACGCGA AGACGACGCC GAACGCGCGG TGCATGCTTC GAGCGGTCGC GCGACGGTCG ACGACGACGG CGACGACGAC GACGCGCGGA TTCGCGAGGA CGAGACTCGG CGAACGACGC GGGCGAAGAG GACGAGACGC GACGCGAGCG ACGCGAGCGA CGCGGCGAGA GGTGATGGTC GGGACGAGCG CGGCGTCGAT GTCGGCGATC GGGGCGGGAC GGGTCGGGGG GGCGAACGCG ACGGCGACGG CGACGAAACG GGCGACGAGC GAGGACGCGA AGGCGACGCT CGACGCGCTG GAAAATTTGA AACCGGGGAC GAGGCTCGCG GGAGGGGCGT TCGAGGTGAC GTCGACGAAG CGAGTGATGC CGTACGACGT CGTGGCGGTG GAGCTGGAGC ACGTGAAGAC GGGGGCGAAG GTGCTGCACG TGGGCGCGGA CGACTCGAAC GCGGGATTCA ACGTGGCGTT TCGGACGACG CCGCGGGATT CGACGGGCGT GGCGCACGTG TTGGAACACA CGGTGTTGTG CGGAAGCGAA AAGTTTCCGG TGAGAGATCC GTTCTTTAAC ATGCTGCGAC GATCGTTGAG CACGTTCATG AACGCGATGA CGGCGAGCGA TTTCACGTGC TACCCGTTTT CGACGATGAA CCGCGTGGAT TACAAAAACT TGCTCGACGT CTACTTGGAC GCCGCGTTTT TCCCCAAGAT CGCCGCGGAG GATTTCTCGC AAGAAGGTCA CCGCTTCGAG TTCGCGAAGA TGGACGACCC GACGAGCGAT TTGATTTACA AGGGCATCGT GTTCAACGAG ATGAAGGGCG CGATGGGGTC GCAGAGCGCG CGATACGGCA GAGCGCTCGG CGAGAACTTG TTCCCGACGT CCACGTACCA TTGGAACAGC GGTGGAGATC CGGTCAACAT TCCCGACTTG ACGTACGAAC AGCTCAAGGC GTTCCACGCG TTGCATTATC ACCCGTCAAA CGCGAAGTTT TACACGTACG GCGACTTGCC GCTCGAGGAG ACTTTGCAGC AAATCGAAGA CTCGGCGTTG CACCGCTTTG ATAAACTCGA CGTGAGCAAG TTGATCGTCG AAGACGAAAA GCGTTTCACC GCGCCGAAGC GCGTCGAAGC CACCGTACCC GCGGACGCGG TGGTGGCTGA CGCGAACAAG CAGTCGCTCA TCTCTCTCGC GTGGCTCATG GTGAACCAAA TCGAGGATCC AGTTTCGTTG GACAACTTCG CCTTGGGCGT CGCCAGTGAC TTGCTCACGA GCGGACCGCA ATCGTACCTT TACGAAGCGC TCCTCGAGCC CGGCCTCGGG AGCGGTTTCG CCCCGGGCAC TGGCTACGGT GGCTCTCGCC GAGAGACGTC CTTCGCCGTC GGCTTGAAGG ATGTCGCCGA CGCGGATATG GACAAGATTG AAAAGACTAT TCTCGACGTT CTCGAGCGCA TCTCTCGCGA AGGGTTCCCG CGCGAGCGCG TCGAAGCGGT GATGCACCAG CTCGAACTCG ACTCCGCGGC GGTTACGACG CAGTTCGGAT TGTACACCGG TTTCGGAGCC TTCTCGACGT GGGTGCACGA CGGCGACTCG CTGCGCGCGT TGCGCACGCC CGAGCTCGCG GCCAAGCTCA ACGCCGCGCT CGACGCGGAT CCGCAGTACT GGCAAAAGCT CATCAAAAAG TGGTTCTTAG ACAACACGCA CCGTCTCACG ATCACCGCGC GCACTGATCC TGATTACGAC AAAAAGCTCG ACGAAGCGGA GAAGGCAAAG CTGAAGAGCA TCGAAAAGAC GTTGACCGAG GACCAGAAGA AGAAGATCGT CGCCGACGCG CTCGTGCTCA AAGAGAATCA AGATAAGAAG GAAGACGTAT CGGTGTTACC CACCCTGATC GTCGCCGAGG CCGTGCCGAA AGACATTAAG CGCTGGGGGT CGAAGAATAT GAAAATCGCC GGTAACATTC CGCTTCAGTA CGACGAGCAA CCGACCAACG GTGTCGTCTA CTTTAGCACG CACTTTGATC TGGATGGTCT TCCGCAGCGG TTGGTGCCTT ATTTAGACAT GTTCATGGAT TTCATCGATC AGCTCGGCAC GGAAAAGATG AAGTACAAGG ATTTGGCGGA ACAAATCAAG CTTCGCACAG GCGGCTTTTC CGTCGGCTCC GTCGTTCGTA CTCCCACCGA CGGCAAAGGT ACGCCGACGA TGTCGCTGTC CATCAGCGGT CACGCGCTCG AACGCAACGT CGACGCGATG TTCGATATTC TCACCGATTT GCAAACGGCG AAGTGGCGAG GGGAAGAAGA GCGCGTCAAG TTGTTGTTGA CACGTCGTGC CGCCGCACTC GGCGCCTCCG TTGGACAGCA AGGCATGCAA TACGCGAGAA ACCTCGCAGG CGCTCAAATC AGCGCCACGA GTGCGTTGTC GAACGAAACG AGTGGATTGC CTCACGTCGG CCTCGTCTCG CGCTTGTCCA AGGAAGGCGC GATCGATGAA GTTGAAACCG CGATGGCCGA AATCGCCGCG TTCGCGCTTC GCCCCGAGCG CGTACAGCGA TGCCGCATCG CGTGCCAAAA AGAAAGCTTT TCCGCCACCG AACGTCGATT CGCAAAGTTC TTGAAAGATA TCAAGCCGGT TGCCGCCAGT CCGAGCGACA AGGACACCGT GGCGACGAAG TTGAAGACGT TCAAACCCGA GCTTTCCAAG GTGTTTGTCT CCATCCCGGG GCAAACCAAT TACTGCTCCG CCGCGCTTCC GGCGTTGCCG TACAGCCACC CCGACGCTCC GGCATTATTC TTGCTCGCGC AAGCGTTGAG CGCGGGCTAC CTTCACCGTG AAATTCGCGA AAAGGGCGGC GCTTACGGCG GTGGTTGTGC CTCGGACCCG ATGTCCTCGC TCTTCACCTT CTTCTCCTAC CGCGATCCGA ACACGACGGA GACGTTGGAC ACTTTCACCA AATCCATCGA ATGGGCCACG AACTCGGAGA ACATCACCAC AAAAGAGCTC GAAGAGGCTC AACTTCGCGC GTTCAAGCAA CTCGACGCCC CGCTCGCTCC CAGCGCGCGC GGAAACTCCG GTTTCTTAAC CGGCGTCACC GACGAAGAGC GTCAGCGTTT CCGCGACGGG TTGCTCGCCG CGTCGCCGGC GGATTTATCT CGCGTCGCCG CCGCCCACCT CCGCGGCGTC GCCCCCGCGA TCGCCATCAT CGGTTCATCC GAGAAGGCTC CGCTCGCGGA CGCGAGCTGG GTGAATCTTG ACGCCCAGGG CGCGCCTCGC GTCGCCTAGT CGTCGCCGCG CCCTCG
|
Protein sequence | MVGTSAASMS AIGAGRVGGA NATATATKRA TSEDAKATLD ALENLKPGTR LAGGAFEVTS TKRVMPYDVV AVELEHVKTG AKVLHVGADD SNAGFNVAFR TTPRDSTGVA HVLEHTVLCG SEKFPVRDPF FNMLRRSLST FMNAMTASDF TCYPFSTMNR VDYKNLLDVY LDAAFFPKIA AEDFSQEGHR FEFAKMDDPT SDLIYKGIVF NEMKGAMGSQ SARYGRALGE NLFPTSTYHW NSGGDPVNIP DLTYEQLKAF HALHYHPSNA KFYTYGDLPL EETLQQIEDS ALHRFDKLDV SKLIVEDEKR FTAPKRVEAT VPADAVVADA NKQSLISLAW LMVNQIEDPV SLDNFALGVA SDLLTSGPQS YLYEALLEPG LGSGFAPGTG YGGSRRETSF AVGLKDVADA DMDKIEKTIL DVLERISREG FPRERVEAVM HQLELDSAAV TTQFGLYTGF GAFSTWVHDG DSLRALRTPE LAAKLNAALD ADPQYWQKLI KKWFLDNTHR LTITARTDPD YDKKLDEAEK AKLKSIEKTL TEDQKKKIVA DALVLKENQD KKEDVSVLPT LIVAEAVPKD IKRWGSKNMK IAGNIPLQYD EQPTNGVVYF STHFDLDGLP QRLVPYLDMF MDFIDQLGTE KMKYKDLAEQ IKLRTGGFSV GSVVRTPTDG KGTPTMSLSI SGHALERNVD AMFDILTDLQ TAKWRGEEER VKLLLTRRAA ALGASVGQQG MQYARNLAGA QISATSALSN ETSGLPHVGL VSRLSKEGAI DEVETAMAEI AAFALRPERV QRCRIACQKE SFSATERRFA KFLKDIKPVA ASPSDKDTVA TKLKTFKPEL SKVFVSIPGQ TNYCSAALPA LPYSHPDAPA LFLLAQALSA GYLHREIREK GGAYGGGCAS DPMSSLFTFF SYRDPNTTET LDTFTKSIEW ATNSENITTK ELEEAQLRAF KQLDAPLAPS ARGNSGFLTG VTDEERQRFR DGLLAASPAD LSRVAAAHLR GVAPAIAIIG SSEKAPLADA SWVNLDAQGA PRVA
|
| |