Gene Information |
Locus tag | OSTLU_119488 |
Symbol | LacZ |
ID | 5000442 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | - |
Start bp | 426181 |
End bp | 429767 |
Gene Length | 3587 bp |
Protein Length | 1164 aa |
Translation table | |
GC content | 44% |
IMG OID | 640415863 |
Product | Beta-galactosidase, putative |
Protein accession | XP_001416385 |
Protein GI | 145343556 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
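The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence includes one stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Number of coding bases implied by a protein length (3 bases per codon)."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
start_bp, end_bp = 426181, 429767
protein_aa = 1164

gene_len = gene_span_length(start_bp, end_bp)  # matches "Gene Length"
cds_len = coding_length(protein_aa)            # coding bases incl. stop codon
non_coding = gene_len - cds_len                # bases not accounted for by codons

print(gene_len, cds_len, non_coding)
```

Under these assumptions the span is 3587 bp, the 1164 aa protein accounts for 3495 coding bases, and the remaining 92 bp would be consistent with the gene being marked as spliced.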
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.873023 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTACTTTT GGAACCTCTT TGCAAGCTTC GTGTTTCGTT GCTTTAGTCT CCAGGGAATA ATCAACTTTT ACGAAAGCTG GACTTTTCGC ACATCGCGCG TGCTATCAAT TATACTCAGA ATCGGGGAGT GTTCCTACGC AACCTTTTTC GACAGTTGCG TTCAGGAGCA ACTGGTTTCA GTACGTGTCG CATACAACTA ATTGAAATGC AATCACACGA AAAATAGGTA GACGAGCCTT TATCGTCGGC CGAAATCGCC GACTGGGAGA ACGCATCTAT TGTTGGTCGT GATCGAAGGT AGGTCGTGTC GACTGTTTCT ATATGTGTGC TAAATATGCC TAGGCCAAGT CACTGCACCT TATATTCTTT CAGAACTGTG GACGAATGTA TATCTTTCTG GAAAGGTGGA GGCCTCTGGA ATGAGAGAAT TAATTTAGCC AACGTTCAGA ACCTGAATGG CACGTGGAAG TTCAAGCTTC TGAAGAATCC ACGCGCCATA TCTGATGAAT TTACACTGTC AAACTTCTCA GACACGTTCT GGGCAGAGAT TCCAGTCCCT GCAAATTGGC AATGCAAAGG ATGGGATCGA CCTATTTACA CAAACTTTCA ATATCCTTTT CCATTGCATC CACCAGTCGC GCGTACGTCT ATTAAGCTAG GTATTGATGC TGGTGTTCTG TGTGAAAACT GCGTGCACGC GACAAATCCG ACGGGCCTGT ACAGGCGAAC CTTCCAGCTA GATATGGACT GGAATGAATC ATATGAAAGG ACTTTCATCG TGTTTGAGGG GGTCGATGCA GCGTTTCATA TATGGATCAA TGGTCAACTT GTCGGTTACT CTCAAGATAG CAAAATGACC GCCGAATTCG ACGTATCAGA TTCTTTGCAA AGCGGGACTA ACTTGGTGGT GGTCCGCGTT TATCGATGGT GCGATGGTAG TTACTTAGAG GATCAAGATC AATGGTGGCT CAGTGGAATA TTCAGAGATG TTTATTTGTA TCGTAAAAGA GCGTCACATA TCTGCGATTA CTCCGTGCAG ACAGAGTGTT GTGATTGGCA GTCAGGCACA TGGGAGCTTC GAGTTGGGAT TGAAATTTGT GACACCGCCA ATGCGTACTT CAACAATGAC AATAAACGTC TTCGTGTTCG TCTTTTTGAT TCTTCCTTAC GTGAAATATC TACTGGAACT ACGGACAAGT TTCTTATCCA ACACTATTCA CCTGATTTTG GTACAACGCA AGAAAAAAAC ATAGAGGAAG TTAGGCAATA TGCAAATGTA TGCTTCAACA TAAGCAACGT GGCAGAATGG TCGTCTGAGC GTCCAACGCT TTATCTACTT GCCATTATTT TGGAGAGTGA ATCTGGAGAG TGTTTAGACT GCGAAGGGTG CAGAGTAGGC TTTCGAACTG TCCAGATTTT CAACAAACAA CTATTAGTGA ACGAAAAACG AATCACTTTT CAAGGAGTAA ATAGACATGA ACACTGCCCT GTAGAGGGAA AGGCGGTATC GGAGAAGCTC ATGATAGAGG ACATTTTGCT GATGAAACGT AACAATTTCA ACGCAGTGCG CACTTCACAT TATCCGAATC ATCCGAGGTT TTATGAGTTG TGTGATGAAT ACGGATTGTA CGTTATTGAT GAGGCCAATA TTGAGACACA TGGGTTTGAG TTTGGCTTAC ATTCGACGCC ATATCTAGCA AATGATCCTG TATGGAGAAA TGCATACATG TCCAGAGTAT CACGCATGGT ACAACGCGAC AAAAATCATT GCTCTATCAT TATTTGGTCA CTTGGCAATG 
AGTCAGGTTG TGGCGGCGCT CACTTTGCTA TGTATTCATG GGTAAAACAA AATGACAAAA CCCGTCCAAT TCAATATGAA GGTGGTGGCT TCAAGACAAA GTGCACCGAT ATTATTTGTC CCATGTATGC AACACCTAAG ATTTGCCAAG ATCTTGCATC ACAGATGGAT GATCGACCAG TCATTTTGTG TGAGTATTCT CATGCCATGG GAAATAGCAA TGGTGGTCTC GCCAAATATT GGGAAGTGTT TCGGAGCAAC CGTAGTGCGC AAGGAGGGTT CATATGGGAC CTTATCGATC AAGGATTAAA CTGTTCAACA AATGGGCGAA TTCATTGGGG CTATGGCGGA GATTTTGGAG ATTCACCAAA CGATAAGCAG TTTTGTATCA ACGGGTTGGT CTTCCCAGAC CGTTCGCCTC ATCCTGCCAT GGAAGAAGTG AAGTATCTCC AACAACCTAC GATGATACGC GCACAAGGAG ATAAGATTAT CGTTGAAAAT CGATATCACT TTACAAATCT CGAGTGTATG AAGTTCGACT GGTGCGTGAT TCTCGATAGT GGCTTCATAC TTAAGAAAGG CCAATTTAGC AAACTTCACA TTCAACCGGG AGCAGAAACT TCTTACGAAT GGTCCCAGTT ATTCCCATCA CTATCATCTC TCGCGCAGCT TGTCCAGAGG AAGCAGTTTG TGTTTGGAGA ATGGTGGATT GACGTATCGG CTTCGTTCAT CAAACATCAG TCTTGGATAC CTGAAGGTGT GAGTATTGCA AAGTGTCAGC TTATGTTGCC GCAACAAAAA GTCCCTGTTC CGGGAAGCTT GAACAGTGAA GCTGCAATTC ATATAGAGCA TTTGAGTGAT GCGATTTGGG TCACTTCGCA AGACAGCACA TATGTTTTCA ATGCAGCTTC CGGTAGATTA TTGAAGTTTC AATTTCGCGG CGAAATGTTG ATTCAATCAG GCCCGATAGC GAGTTTATGG CGAGCTCCAA CAGACAATGA CAGTGGTGGT TGGATTTTTT CTTTTGCCGA GCGATGGGCC AAAGCAGGAC TGGATACTTT GCATGAGCAC GAAGAAGCTG TCGAAACTTT CGTAGACAAC TTCGGAAGAT TTCATTGTGC TAGTAAGCTT GTGCTTCGAA CTGCTCAAAA GAAAACTGTA TGTCGTCTTT GCTCTCATTA CACGGTCTTG GCGTCTGGTC ATCTAAATGT TACTTGCACA TTCGATTTAT CACCTCATTT ACCACCTCTT CCTCGAATTG GAGTTCTGAT GCAATGTCGA GCAACCATGC AACAAGTTGA GTGGCTTGGA CTTGGTCCAC ATGAAAATTA TTTAGATAGA AAGTCCTCTG CGTTTCTGGG TCGCCACTCC GCAACTGTTG ACGATCTTCA TGTTCCATAT ATAGTACCAA GTGATAATGG TGCTCGCCAA GAAGTGCGCT GGCTGGCTTT AGAATCTTCC GCAAGCGGAA ATAAGTGCTT GTTCACGAGC AAGGAGAATT TCAATTTCAA TGCGTCAAAT TTCTCTGACG CTGAACTGGC GCGAGCCAAC CACCAGCACG ATCTTCAGCG AAGTGACAGT ATTCACGTAC ATTTAGATAC ATTCCAAATG GGTTTAGGAG GAGACTGCAG TTGGTTTCCA TGTGTGCACT CTGAATTTCT TGCACCTGCT CGGAAGCGTT TCACTTTTAC TTTTGTCATC GCAGGAGTTG GAGGAAACGA AAACCCAAGT GATGTTTTTC AAGAATTGCG ATTTTCGCGT GACACAGTCG AACTCACTTC ATCGTGA
Protein sequence | MYFWNLFASF VFRCFSLQGI INFYESWTFR TSRVLSIILR IGECSYATFF DSCVQEQLVS VDEPLSSAEI ADWENASIVG RDRRPSHCTL YSFRTVDECI SFWKGGGLWN ERINLANVQN LNGTWKFKLL KNPRAISDEF TLSNFSDTFW AEIPVPANWQ CKGWDRPIYT NFQYPFPLHP PVARTSIKLG IDAGVLCENC VHATNPTGLY RRTFQLDMDW NESYERTFIV FEGVDAAFHI WINGQLVGYS QDSKMTAEFD VSDSLQSGTN LVVVRVYRWC DGSYLEDQDQ WWLSGIFRDV YLYRKRASHI CDYSVQTECC DWQSGTWELR VGIEICDTAN AYFNNDNKRL RVRLFDSSLR EISTGTTDKF LIQHYSPDFG TTQEKNIEEV RQYANVCFNI SNVAEWSSER PTLYLLAIIL ESESGECLDC EGCRVGFRTV QIFNKQLLVN EKRITFQGVN RHEHCPVEGK AVSEKLMIED ILLMKRNNFN AVRTSHYPNH PRFYELCDEY GLYVIDEANI ETHGFEFGLH STPYLANDPV WRNAYMSRVS RMVQRDKNHC SIIIWSLGNE SGCGGAHFAM YSWVKQNDKT RPIQYEGGGF KTKCTDIICP MYATPKICQD LASQMDDRPV ILCEYSHAMG NSNGGLAKYW EVFRSNRSAQ GGFIWDLIDQ GLNCSTNGRI HWGYGGDFGD SPNDKQFCIN GLVFPDRSPH PAMEEVKYLQ QPTMIRAQGD KIIVENRYHF TNLECMKFDW CVILDSGFIL KKGQFSKLHI QPGAETSYEW SQLFPSLSSL AQLVQRKQFV FGEWWIDVSA SFIKHQSWIP EGVSIAKCQL MLPQQKVPVP GSLNSEAAIH IEHLSDAIWV TSQDSTYVFN AASGRLLKFQ FRGEMLIQSG PIASLWRAPT DNDSGGWIFS FAERWAKAGL DTLHEHEEAV ETFVDNFGRF HCASKLVLRT AQKKTVCRLC SHYTVLASGH LNVTCTFDLS PHLPPLPRIG VLMQCRATMQ QVEWLGLGPH ENYLDRKSSA FLGRHSATVD DLHVPYIVPS DNGARQEVRW LALESSASGN KCLFTSKENF NFNASNFSDA ELARANHQHD LQRSDSIHVH LDTFQMGLGG DCSWFPCVHS EFLAPARKRF TFTFVIAGVG GNENPSDVFQ ELRFSRDTVE LTSS
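The sequences in this record are displayed in space-separated blocks of ten residues. A minimal sketch of how such a field might be cleaned and summarized follows; the helper names are hypothetical, and the GC calculation simply counts G and C over all bases, which may differ from how the GC content field of the record was derived.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stops in the standard genetic code


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a standard stop codon."""
    return seq[-3:] in STOP_CODONS


# First two display blocks and the final bases of the gene sequence above.
first_blocks = clean_sequence("ATGTACTTTT GGAACCTCTT")
last_bases = clean_sequence("AACTCACTTC ATCGTGA")

print(len(first_blocks), gc_percent(first_blocks))
print(ends_with_stop(last_bases))
```

Passing the full Gene sequence field through `clean_sequence` before `gc_percent` would give a whole-gene figure to compare against the reported GC content.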