Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31874 |
Symbol | |
ID | 5001998 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 684984 |
End bp | 688354 |
Gene Length | 3371 bp |
Protein Length | 1060 aa |
Translation table | |
GC content | 54% |
IMG OID | 640417419 |
Product | predicted protein |
Protein accession | XP_001418083 |
Protein GI | 145347243 |
COG category | [R] General function prediction only |
COG ID | [COG1204] Superfamily II helicase |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.630707 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACGCGC GTCCGCGTCG GGACGCGCTG CGCGCGACGA CGGACATAGA ACCTTTAGAC ACGGCCAGAG CGTTTGATTT CAAGTGCGTT CGCAACGGTC GCGCGGGAAC GACGGTGTCG CGCGCGACGC GACGGACTGA CGCGAAGGCG TCGCGACGAC GCGCGCGTCG TAGGTATTTC AACAGCATTC AGAGCGAGAT GCTCGAGTTT TTGGTGAACA CGCGACGGAG CTTCGTGATG AGTGCGTGAC GCGACGCGAA CGCGAGACGC GCGGCGAGTG CCTCGTTTTG GAAAGGACTG ACGAATGAAC GCGTGATCGC GCGTCGAACG CGAACGCAGG CGCGCCGACG GGGAGCGGGA AAACGACGGT GTTTGAGTTG GCGATGTTGG CGGCGTTGCG AGGACCGAAC GCGACGGAGG GGGGATTGAG CGCGGCGCGA GGGCGGAGAA AGGTGATTTA TCTCGCGCCG AGTCGGGCTT TGGTGAGCGA AAAGGCGCGC GAATGGCGAG AACGATTAGG CCGGATCGGA ATCACGTTTG CGGAGTTGAC GGGAGATAAC GATTTTGGTG GGGACGTGTG GGGGGAGATT GAGAATGTCG ATGCGATTTT GGCCACGCCG GAAAAATTCG ACCGCGTGAC TCGTCTGGAC GCGAACCGCG GCGGGATGTC GTTCTTCAGC GACGTCGCCG CGGTGTTGAT TGATGAAGTG CATCTCATCG GTGACATTCG TGGAGGGTGT TTGGAAGCCA TTGTTAGTCG TTTAAAGTTG CTGAGCAAGT CGTCGGCTCT ATTGCAATCG CACTTGCGAA ACGTTCGCTT TGGCGCCGTG AGTGCGACGA TTCCGAACAT TGAAAACCTC GCGAATTGGC TCGGCGCCAA TCGCGATGGC ACGTTTGTGT TCGGTGAAGA GTTTCGACCT GTCAAGTTAC AGACGTACGT GCGCAGTTTT CCGGACACAA CGAGCGATTT TTTATTCAAC AAGTATCTCA AGCAAAAAGT GTTTGCGGTG ATTCGGGAAT TCTATCGCGG CAAGCAGACG CTCGTGTTTT TGGGCAGTCG CAATGACGCG CAGCAGACGG CGAAGCAGCT CGTGGTGGAC TCTCGTAGAC AGTTCGTAAA CCCACAACTC TCCCAATTCT TGCTCGAAGC GTCCATGCAA GCGCAGAACA AACATCTTGC TGAGTGTATC ACCGCCGGCG TAGCGTTTCA TCACGCCGGG CTGGAACGAG GCGATCGCGA GTTGGTCGAG GGATTGTTTT GCTCTCGCGC TATTATGGTT TTGTGCAGTA CGAGCACTTT AGCTGTGGGC GTTAATCTTC CAGCGTACTT GTGCGTCGTC GCTGGTACGG ATATTTACGA CGGTGGTGGC GCGTACAAGG AAATATCCAT GGACACCCTA CTTCAGATGA TCGGCCGCGC GGGGCGACCT CAATTCGACA CCGAGGGCGT GGCGGTGGTG ATGACGAAGA ACTCTTTGCG ATCGCGCTAC GAGGGATTAG TTCATGGCAA GTATCCGCTC GAGTCAAGTC TCGGTTCATC GCTGCCTGAG TACTTAAACG CTGAAATAAG CTGTCGAACT GTGAACACGG TGGACGACGT CCTGGAGTGG GTGCAAAGCA CGTATTACTA TATTCGCGTT TGCGCGGAAC CAAGAAAGTA CGGAATTAAG AACGATGATA CGGTGACAGA ACACGTGAAG CGGTTAATCA AGGCGACTTT GGATGAGCTT ATTGCGTCAG GAATGTGCGC CGTGGTAGGA AATAGCGCGC TTCAACCGCT CAAAGCCGGA GACATCATGT CTCTTCGTTA TTTGCGCTTC AAAACGATGA AGAACATCAT GCGAAACGTC GCGACACCGT CGTACGCGGA TCTTTTAAGA ATGTTGTGTG AGAGTTACGA GTTCAAAGAC ATCAAGCTTC GCCGAGATGA AAGGAAGTTG TTGAAGCAGT TGAACACAGA TCAAAAGATT ATTCGCTTTC CCGTGCAAGA GACTACTGGA AAGTCGCAAA AGTTGTCCTT AGCGAAGGTA ATTCGTACGC CGGGCGAAAA GCTGTATCTT TTGGCGCAGT ATATTTTAAC TGACGTGGTC GAACCGAGCA TCACGCTCTC GCATCCGTCG ATGCGAATGG AAGGTGATAA AGTTCTTCAG CTCGGAACTC GCATCATGCG CGCGGCGTCA GAGTACTATC AATCCACGTT GACGTTTACC GCTGCAGCGA ACGCCTTTTC GCTCGCCAAA GGTCTCGACG TGCACATGTG GCCGGACACG AAAGTGCAAA TTCGACAGCT CAAGCACTCG CGCATGAAAA AAGTTATCGA AAAGCTCATT GAAGCTAGAA TTATGACGCT CGAAGACGTT GAAGACGCAG ATCCGCGTCG CATCGAAATG AAGCTCGGCA AGTCGTTTCC GTTTGGCAAC ACCCTTCAAG GTGACGTAAA GGATTTTCCA TCCGAGCTCA AGGTTGAAAT GACGCATCAC GCCATGTCGA AGGAGTCCTA CAGCGTCGAC GTCAAGCTGT CGTTTAAGAA TACTTCTGAG AGCCGCTTGA GCACGAATCA TCTACCGGCA AAATATCCGG GGATGCTCTT CATCGGCTCA GAGCACGACG ACCGGTTGTT GTACGTTCAG CGTTTGCCGA CGCGAGAGTT TGATTTGGAT GCTAACATGG ACGCATCCGC GAACATCGTC TTCCACGGCC GGTTCACGTG TACGAACGTG TCGAGCGGAA ACGTCCCGCT TTGTTTCGTG GCGCGTGTGA TTTTCGAACG TTGCATCGGT CGCGACGTCG TCGAATACCA CGTTGTCAAT CGCGGCGGCG CGAGTCCGCG AACTCCGGAC AAATCAGGTG TTTCTCCCGC GAAGTCGCCC ACGTCGACGA CGCCCGCGTC AACGATGAAA CAAACAAAGA TTATCGCTGA CGCGCACACC AAAAAATTCA GACTGATGAA TTCGGACGAC GAAGTTCGCC GGAAGGTGGC ATTCGAAGAC TTTAAGCTGT CTAACGAATC GTCGCCGAAC TCGAGTCCGC GTGAACGCAA AGCTACGTCG CGTTCAGAAG ATGTTTGCAA TAGGTGCGGT GTCAAAGGCC ATTGGGCAAA GGACTGCTTA TATCCCGACA ATCGACCCGA AGAGCTGCGA CCAGGGCCGA AACCGACGGA CAAATGTCGT AGATGCGGTG AGCTTGGGCA CTTCGCTCGC GATTGTTCGT TCGACGAGGA CACATGCAAA ATTTGTCAGC AGCACGGCCA TCGGGCTCGA GATTGTCCAA GCGTGGCCGA CGTCTTTGCT TCGCTTGACG ACACGACAAC GACGGTGAAC GACGCATCCG ATTCGGACAA GGAAGAAAAC TGGGAGTTCG TCTTTGGTTG A
|
Protein sequence | MYARPRRDAL RATTDIEPLD TARAFDFKYF NSIQSEMLEF LVNTRRSFVM SAPTGSGKTT VFELAMLAAL RGPNATEGGL SAARGRRKVI YLAPSRALVS EKAREWRERL GRIGITFAEL TGDNDFGGDV WGEIENVDAI LATPEKFDRV TRLDANRGGM SFFSDVAAVL IDEVHLIGDI RGGCLEAIVS RLKLLSKSSA LLQSHLRNVR FGAVSATIPN IENLANWLGA NRDGTFVFGE EFRPVKLQTY VRSFPDTTSD FLFNKYLKQK VFAVIREFYR GKQTLVFLGS RNDAQQTAKQ LVVDSRRQFV NPQLSQFLLE ASMQAQNKHL AECITAGVAF HHAGLERGDR ELVEGLFCSR AIMVLCSTST LAVGVNLPAY LCVVAGTDIY DGGGAYKEIS MDTLLQMIGR AGRPQFDTEG VAVVMTKNSL RSRYEGLVHG KYPLESSLGS SLPEYLNAEI SCRTVNTVDD VLEWVQSTYY YIRVCAEPRK YGIKNDDTVT EHVKRLIKAT LDELIASGMC AVVGNSALQP LKAGDIMSLR YLRFKTMKNI MRNVATPSYA DLLRMLCESY EFKDIKLRRD ERKLLKQLNT DQKIIRFPVQ ETTGKSQKLS LAKVIRTPGE KLYLLAQYIL TDVVEPSITL SHPSMRMEGD KVLQLGTRIM RAASEYYQST LTFTAAANAF SLAKGLDVHM WPDTKVQIRQ LKHSRMKKVI EKLIEARIMT LEDVEDADPR RIEMKLGKSF PFGNTLQGDV KDFPSELKVE MTHHAMSKES YSVDVKLSFK NTSESRLSTN HLPAKYPGML FIGSEHDDRL LYVQRLPTRE FDLDANMDAS ANIVFHGRFT CTNVSSGNVP LCFVARVIFE RCIGRDVVEY HVVNRGGASP RTPDKSGVSP AKSPTSTTPA STMKQTKIIA DAHTKKFRLM NSDDEVRRKV AFEDFKLSNE SSPNSSPRER KATSRSEDVC NRCGVKGHWA KDCLYPDNRP EELRPGPKPT DKCRRCGELG HFARDCSFDE DTCKICQQHG HRARDCPSVA DVFASLDDTT TTVNDASDSD KEENWEFVFG
|
| |