Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31794 |
Symbol | |
ID | 5001777 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 541475 |
End bp | 543736 |
Gene Length | 2262 bp |
Protein Length | 753 aa |
Translation table | |
GC content | 54% |
IMG OID | 640417198 |
Product | predicted protein |
Protein accession | XP_001418044 |
Protein GI | 145347161 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0561113 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCGG AGAAGAAGAA AATGGGAAAG AAGGAGCGAC GAGCGGCGTT CGGACCGAGC GTGGCGTTGA ATAAAGAGCT GATGTCGTGT GAGACGTTGG AGGCGTTGGC GGAGACGTCG CGTCGCGTCG CGCGGGACAT GTCGGCGGTA AACGTCGCGA CGACGTACGG AAAATTGGCC AGGTTCGCGC GAGGGGGACG AGGGAAAGTT AGTGAAGAGA TCAAGCGGTC GGAATGGTTC GCCGAACTCG AAGAGCGCGC GATGGAGAAG CTGGGGGAGT GCGAAACCAA GGCGCGCGCC GTCGCGCAAA TTGCGTGGGC GTGCGGGTAC TTGGATAGGG GTCGACATCT GCATGGTGAA GATGCGTTTT GGGATGAATT AGAGCGGGCT ATTGAGAGAG AAATTAGTAA ATGCGAGCCA CAAGGTGCGA GTAACGTCGC GTGGGCGTAC GCAAAGCTTG AGATGCGCAT GCCGAACGGC ATTCGCAACG CCATAGAAAA GCACATTGTG CAAAATGCGA GTGCGTACAA ACCGTTCGAA TTGAGCATCA CGTTTTGGGC GCTCACGAAG TTTGGAGACA TTCCGACGGA GGAAATGCTA GATGTATTCG AGCGCGAGAT GCGACTTCAA TCGTGTGGTT CACAAGAGCT TACGAACATC GCTTCGGCGT ACGCGCGCAT CAGCGGACGG AGAGTGCGTC AGGGGACTCA AGGTTTCCTG AAAGAGTTAG CCTCGAATGC TTTCACCATT CTGCACGAAT TCGACAACAC GGAACTTGCG ATGTTTCTCT GGGGGTTGTC AAACGCGGGT TACTACCTGG ACGACGACGA TGCGATGGAG ATATTCAAAG TAGTTGAGAG GCGAGCGAGC GGCGCCAAAC GATTGGAGCC ACAACAAATT GCCTTGATCA CGGGATCGTT TGCGACGTTC ACGGACGACG TCACGGTGAC GCACTACAAT TGCGAGACCG GAAGTCCGCT TTCGCATCTT CGAATGAGTG CGCAAACACG AGCGGCGCTC AAATCCGCGA TCGTAGCTCT GGAGAACACC TTTCTGGCCT CCATCGCGAA GTGCAACATG GACGATTTGA GCTACGTAAT GTGGGCGTTC GCGCACCTGG AGCATCGGCC CAGTGACGAG TTTGTTCGAC GGCTCGAGGA AGAAGCGATC GATAAAATCG AAGAAGCGAG CGCAAAAAAT CTATCAAACC TATTGTACGG ATTTGGAACG CTAAACTTGG CAGGACTCGG AGTGTTTACG CACGCCATGT TTTGCGTCTC GCAAAAACTG GAAGAGTTTA CACCAGTTGG AATCTTCATG GTTTGCTCGG CTTTGGCGAG TAGTAACTAC GATCCGGGAC CACAGATGAT GTTACAGTTC GAAAACAAGC TCATGAAATC GGCGCATGCG TTTGAATCGC AAGATTTCAC CGAATTCTTG CGCGTATTCG CGAGGCTTAG GTACATGCTC GCGGACGAAA CTTTCGATTT CATCGGCGTA AGCTCGGCGA AGACGCTTGA TCGCTTCGAC TCATACAGGA TAAGCATGAC TTTATGGTCG CACGCGACGC TGTGCGCACA GCCTCATGAT GCTTTACTCG CGCGGATTGA AGACGAGATT CGAGGTTCTG CGTCCCAGTT CAAACCGCAA AACTTTGTGT TGGCTTTGTG GTCGCTCGTT TTGCTCGGCT CACTGGAAGA CGCGCGAGAC AGTGTCGTGC GTGTCTTGCA CGCGCTCGTC AAACTTCAAG GTGGTGCTTT GACGAGTTCG GAAGACTTGG AGGATGCGCA ACTATGCTCT CTCTACATGG CGCGTCTCAC TTCTATGGGA AAGCCGTTCG AAGAGCTCAT CCTTGGCGTC ACGGATGGCG TCGCCGACGA ATGCGAGCGT GCCTGGCTCC GCGCAAAGGC TCAAGATCCG ACGATTAGCA AAGTTCAGCA TCACATCGGC GAAGTACTCC GCGAAATTGG CGCGCAAGAT TTTGAGGTTG AAGCGCTCGT GGAGGGCGGC AAGATTCGTT CTGATATCGT ATTCCCGAAC TCGCGAATCG TCGTTGAAGT CGACGGTCCG CATCACTACA GCCGCGACGC ATCCGGTCGT CTTCGCGAGC TCGGTCAAAC CGTCATGCGC AACAATCTTT TAAAATCATG GGGTTGGCGT GTCGTCATCG TTCCCTACGC CGATTGGGGC GACATGCTCA CCATTGAAGA GAAAGCGTCG TATTTACGTT CTCTCCTCGG CGACGAAGTC TTCGTCGCGT AG
|
Protein sequence | MTAEKKKMGK KERRAAFGPS VALNKELMSC ETLEALAETS RRVARDMSAV NVATTYGKLA RFARGGRGKV SEEIKRSEWF AELEERAMEK LGECETKARA VAQIAWACGY LDRGRHLHGE DAFWDELERA IEREISKCEP QGASNVAWAY AKLEMRMPNG IRNAIEKHIV QNASAYKPFE LSITFWALTK FGDIPTEEML DVFEREMRLQ SCGSQELTNI ASAYARISGR RVRQGTQGFL KELASNAFTI LHEFDNTELA MFLWGLSNAG YYLDDDDAME IFKVVERRAS GAKRLEPQQI ALITGSFATF TDDVTVTHYN CETGSPLSHL RMSAQTRAAL KSAIVALENT FLASIAKCNM DDLSYVMWAF AHLEHRPSDE FVRRLEEEAI DKIEEASAKN LSNLLYGFGT LNLAGLGVFT HAMFCVSQKL EEFTPVGIFM VCSALASSNY DPGPQMMLQF ENKLMKSAHA FESQDFTEFL RVFARLRYML ADETFDFIGV SSAKTLDRFD SYRISMTLWS HATLCAQPHD ALLARIEDEI RGSASQFKPQ NFVLALWSLV LLGSLEDARD SVVRVLHALV KLQGGALTSS EDLEDAQLCS LYMARLTSMG KPFEELILGV TDGVADECER AWLRAKAQDP TISKVQHHIG EVLREIGAQD FEVEALVEGG KIRSDIVFPN SRIVVEVDGP HHYSRDASGR LRELGQTVMR NNLLKSWGWR VVIVPYADWG DMLTIEEKAS YLRSLLGDEV FVA
|
| |