Gene Information |
Locus tag | OSTLU_15588 |
Symbol | |
ID | 5002029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 793718 |
End bp | 795685 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | |
GC content | 59% |
IMG OID | 640417450 |
Product | predicted protein |
Protein accession | XP_001418110 |
Protein GI | 145347297 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
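The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The record does not state either convention. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(793718, 795685)
print(length)                  # 1968
print(protein_length(length))  # 655
```

Both results agree with the Gene Length (1968 bp) and Protein Length (655 aa) fields. This is the expected outcome for a gene reported as not spliced.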
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACGC GACGCGCGCG GGCGATCGTC GTCGTCGCGT GCGTCGCGTG GGCGATCGCG AGCGGGGTCA CGCGCGCGGA CGCGCTGTGG GGACGTCGCG GGGGGGGGGC GAACGCGGCG AGCGCGGGCG CGGCGGACGA TCGGCGCGTG TGCTTCGTCG CGGCGACGGT TCCGTGGACG TTCGGCGCGT ACCAGGCGCA GGCGCTGGGA CTCTCTAAAG CGTTCGCGGA TAATGGGTAC GAGACGTTTT GGATGCCGCG CGTGCCGGGG GTGCGGTTGC CTCCGGGCGA GTACGCGGAT TGGCGCCGCG CTCGCGAACG CTTCGCGGGC GACGTGCGCG CGCCGACGAA AGCGGAGCAA AAGGCGATGG AGCACCTGAC GTTTTTGGGC GTGCCGGACG TGCACGCGCC GTACGCGCGG ATAAACGAGC TGTCGATGAC GATGAAGCAG GTGAACGAGG CGTCGTCGAC GCACCGACTG GACGCCTTCG TGTTGCTCAT GGATATCGGA CAAATGTATT TCGATGGGTA CGATTTCGCC GTGCCCGTGG TTTTGTGGAT GCCGTATCAC CACGAAGAAG CGGACGCGTC GGTGGCGGTG GTCCGGACGT ACTCGGGCGT GGCGGCGTTG TCCGACACCA CGGGTCGCGC GATCGCGACG GAGCAGCCGC TCACGCGCAC GATTCCTCAT TTTATCGATC GCGCGTCGCT CAACGCGCTC GCGGACTCTT TCGAGACGAA ACTGAAAATC GCCGACGATG TCAAAAATAG ACGCAAAGTC ATTCGCCGCG CGGTGTTTGA TTCTAAAAAG TACGACAGAA TGTTCTCGAG GAGCATCGAC GAAGTCGACG ACGACACCTT CTTGGTGCTC ATGCAAGGTG GCAACTACGA AAACTCTGAC AGGAAAGGAT GGGTGGCGTC AATCAACGCG TTCGCTGAAT TTCAACGAGC GAACCCGAAA TTAAAGACGC ACCTCTGGAT TCATACGGTG GACAGCGCTA TGACGCAAAA CGATCTGAAC CACAAGAGCA AGCCGCCCGT CGCGGTTTTG CGAACGGGTA TATCCCTACG TGCCGCGCTT CAACGCGCAC AAATTCCATC GAACATGTAC ACGCTGGATG AGAATTTGCA CGACAAGGTT TTCACGTCGG CACTGAAAAG ACACGCCGAC GTGTGTTTGC ACACGTCGAA ATCGGAAGGT TTCGGCATGG TCGTCCTCGA GTGCCAAGCG CTGGGTACGC CCGTAGTGAC GACGAACTAC ACCGCGATGC GCGATTACAC CAAATATGGG TTAGCGGTCG AAATTGGGGC ACGTGAATCG ATACAAGGTG CGTATTTCGC AGCGCCGAGC GTACCGGGCG GCGCCGCGGC TTTGACGGCT ATTGCCACGG GCGCGGCAAG TCTTCCTCTG GTCGAAGACG TGTACAAGTG GATCGATGAA GAGCTTTCCT TGAGCGCGGT CTTTACAAAG TTCGAATCTC TCCTCGGCGA AGCCAAGATT GCGCACGCAA AGCGTACACC GTGGAGCCAA GTCGACACGT ACAAAACTAG ACCGCTCTTC ACCGTGACCA CCGATGAATA TCCGAGACTC GCGACGTGGG ACACTCCTTG GACGCTGTAC CACCATCCGG GCGTCGAAGT TGATTACGAT ACGATTCAGC GGATGTTGAT TCAAGCGACA AGTTCAGGAG ATTATTACGT CATCGCCGTG GCTCAGACGC GAAGAGAAGG CGTATTACTG CCGTTCGATC CTCAAGATGG TGTGCACAAC ATCAACCCGT ACCACGTCGC TATCGTTCGC 
ACGTGGATGC TCAGGCAGTT TCAAGAGGCG AACAGTTATA TATGGTCATC CGTGTATCAA ATCATGCAAA CGTGCGGCGA TCACAAGTTT TTCTTGCCGC TTCCGGAGGG GCTGGCGAAC ATCAAGCCAA ACCTGGATCC GGAGATCGCG AGCCGCGACG AGCTGTAG
|
Protein sequence | MATRRARAIV VVACVAWAIA SGVTRADALW GRRGGGANAA SAGAADDRRV CFVAATVPWT FGAYQAQALG LSKAFADNGY ETFWMPRVPG VRLPPGEYAD WRRARERFAG DVRAPTKAEQ KAMEHLTFLG VPDVHAPYAR INELSMTMKQ VNEASSTHRL DAFVLLMDIG QMYFDGYDFA VPVVLWMPYH HEEADASVAV VRTYSGVAAL SDTTGRAIAT EQPLTRTIPH FIDRASLNAL ADSFETKLKI ADDVKNRRKV IRRAVFDSKK YDRMFSRSID EVDDDTFLVL MQGGNYENSD RKGWVASINA FAEFQRANPK LKTHLWIHTV DSAMTQNDLN HKSKPPVAVL RTGISLRAAL QRAQIPSNMY TLDENLHDKV FTSALKRHAD VCLHTSKSEG FGMVVLECQA LGTPVVTTNY TAMRDYTKYG LAVEIGARES IQGAYFAAPS VPGGAAALTA IATGAASLPL VEDVYKWIDE ELSLSAVFTK FESLLGEAKI AHAKRTPWSQ VDTYKTRPLF TVTTDEYPRL ATWDTPWTLY HHPGVEVDYD TIQRMLIQAT SSGDYYVIAV AQTRREGVLL PFDPQDGVHN INPYHVAIVR TWMLRQFQEA NSYIWSSVYQ IMQTCGDHKF FLPLPEGLAN IKPNLDPEIA SRDEL
|
| |
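The gene sequence above is displayed in blocks of 10 bases and reads 5' to 3' along the coding strand, even though the gene lies on the minus strand of the replicon. Its first codons translate directly to the start of the listed protein. The sketch below shows one way to strip the block spacing, compute GC content, and translate a fragment. The Translation table field is empty in this record, so the standard genetic code (NCBI table 1) is used here as an assumption. All function names are illustrative.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
# Assumption: the record leaves "Translation table" blank, so table 1 is used.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three and last two display blocks of the gene sequence in this record.
print(translate("ATGGCGACGC GACGCGCGCG GGCGATCGTC"))  # MATRRARAIV
print(translate("AGCCGCGACG AGCTGTAG"))               # SRDEL
```

The two fragments reproduce the first ten and last five residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.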