Gene OSTLU_119488 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_119488 
SymbolLacZ 
ID5000442 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp426181 
End bp429767 
Gene Length3587 bp 
Protein Length1164 aa 
Translation table 
GC content44% 
IMG OID640415863 
ProductBeta-galactosidase, putative 
Protein accessionXP_001416385 
Protein GI145343556 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.873023 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACTTTT GGAACCTCTT TGCAAGCTTC GTGTTTCGTT GCTTTAGTCT CCAGGGAATA 
ATCAACTTTT ACGAAAGCTG GACTTTTCGC ACATCGCGCG TGCTATCAAT TATACTCAGA
ATCGGGGAGT GTTCCTACGC AACCTTTTTC GACAGTTGCG TTCAGGAGCA ACTGGTTTCA
GTACGTGTCG CATACAACTA ATTGAAATGC AATCACACGA AAAATAGGTA GACGAGCCTT
TATCGTCGGC CGAAATCGCC GACTGGGAGA ACGCATCTAT TGTTGGTCGT GATCGAAGGT
AGGTCGTGTC GACTGTTTCT ATATGTGTGC TAAATATGCC TAGGCCAAGT CACTGCACCT
TATATTCTTT CAGAACTGTG GACGAATGTA TATCTTTCTG GAAAGGTGGA GGCCTCTGGA
ATGAGAGAAT TAATTTAGCC AACGTTCAGA ACCTGAATGG CACGTGGAAG TTCAAGCTTC
TGAAGAATCC ACGCGCCATA TCTGATGAAT TTACACTGTC AAACTTCTCA GACACGTTCT
GGGCAGAGAT TCCAGTCCCT GCAAATTGGC AATGCAAAGG ATGGGATCGA CCTATTTACA
CAAACTTTCA ATATCCTTTT CCATTGCATC CACCAGTCGC GCGTACGTCT ATTAAGCTAG
GTATTGATGC TGGTGTTCTG TGTGAAAACT GCGTGCACGC GACAAATCCG ACGGGCCTGT
ACAGGCGAAC CTTCCAGCTA GATATGGACT GGAATGAATC ATATGAAAGG ACTTTCATCG
TGTTTGAGGG GGTCGATGCA GCGTTTCATA TATGGATCAA TGGTCAACTT GTCGGTTACT
CTCAAGATAG CAAAATGACC GCCGAATTCG ACGTATCAGA TTCTTTGCAA AGCGGGACTA
ACTTGGTGGT GGTCCGCGTT TATCGATGGT GCGATGGTAG TTACTTAGAG GATCAAGATC
AATGGTGGCT CAGTGGAATA TTCAGAGATG TTTATTTGTA TCGTAAAAGA GCGTCACATA
TCTGCGATTA CTCCGTGCAG ACAGAGTGTT GTGATTGGCA GTCAGGCACA TGGGAGCTTC
GAGTTGGGAT TGAAATTTGT GACACCGCCA ATGCGTACTT CAACAATGAC AATAAACGTC
TTCGTGTTCG TCTTTTTGAT TCTTCCTTAC GTGAAATATC TACTGGAACT ACGGACAAGT
TTCTTATCCA ACACTATTCA CCTGATTTTG GTACAACGCA AGAAAAAAAC ATAGAGGAAG
TTAGGCAATA TGCAAATGTA TGCTTCAACA TAAGCAACGT GGCAGAATGG TCGTCTGAGC
GTCCAACGCT TTATCTACTT GCCATTATTT TGGAGAGTGA ATCTGGAGAG TGTTTAGACT
GCGAAGGGTG CAGAGTAGGC TTTCGAACTG TCCAGATTTT CAACAAACAA CTATTAGTGA
ACGAAAAACG AATCACTTTT CAAGGAGTAA ATAGACATGA ACACTGCCCT GTAGAGGGAA
AGGCGGTATC GGAGAAGCTC ATGATAGAGG ACATTTTGCT GATGAAACGT AACAATTTCA
ACGCAGTGCG CACTTCACAT TATCCGAATC ATCCGAGGTT TTATGAGTTG TGTGATGAAT
ACGGATTGTA CGTTATTGAT GAGGCCAATA TTGAGACACA TGGGTTTGAG TTTGGCTTAC
ATTCGACGCC ATATCTAGCA AATGATCCTG TATGGAGAAA TGCATACATG TCCAGAGTAT
CACGCATGGT ACAACGCGAC AAAAATCATT GCTCTATCAT TATTTGGTCA CTTGGCAATG
AGTCAGGTTG TGGCGGCGCT CACTTTGCTA TGTATTCATG GGTAAAACAA AATGACAAAA
CCCGTCCAAT TCAATATGAA GGTGGTGGCT TCAAGACAAA GTGCACCGAT ATTATTTGTC
CCATGTATGC AACACCTAAG ATTTGCCAAG ATCTTGCATC ACAGATGGAT GATCGACCAG
TCATTTTGTG TGAGTATTCT CATGCCATGG GAAATAGCAA TGGTGGTCTC GCCAAATATT
GGGAAGTGTT TCGGAGCAAC CGTAGTGCGC AAGGAGGGTT CATATGGGAC CTTATCGATC
AAGGATTAAA CTGTTCAACA AATGGGCGAA TTCATTGGGG CTATGGCGGA GATTTTGGAG
ATTCACCAAA CGATAAGCAG TTTTGTATCA ACGGGTTGGT CTTCCCAGAC CGTTCGCCTC
ATCCTGCCAT GGAAGAAGTG AAGTATCTCC AACAACCTAC GATGATACGC GCACAAGGAG
ATAAGATTAT CGTTGAAAAT CGATATCACT TTACAAATCT CGAGTGTATG AAGTTCGACT
GGTGCGTGAT TCTCGATAGT GGCTTCATAC TTAAGAAAGG CCAATTTAGC AAACTTCACA
TTCAACCGGG AGCAGAAACT TCTTACGAAT GGTCCCAGTT ATTCCCATCA CTATCATCTC
TCGCGCAGCT TGTCCAGAGG AAGCAGTTTG TGTTTGGAGA ATGGTGGATT GACGTATCGG
CTTCGTTCAT CAAACATCAG TCTTGGATAC CTGAAGGTGT GAGTATTGCA AAGTGTCAGC
TTATGTTGCC GCAACAAAAA GTCCCTGTTC CGGGAAGCTT GAACAGTGAA GCTGCAATTC
ATATAGAGCA TTTGAGTGAT GCGATTTGGG TCACTTCGCA AGACAGCACA TATGTTTTCA
ATGCAGCTTC CGGTAGATTA TTGAAGTTTC AATTTCGCGG CGAAATGTTG ATTCAATCAG
GCCCGATAGC GAGTTTATGG CGAGCTCCAA CAGACAATGA CAGTGGTGGT TGGATTTTTT
CTTTTGCCGA GCGATGGGCC AAAGCAGGAC TGGATACTTT GCATGAGCAC GAAGAAGCTG
TCGAAACTTT CGTAGACAAC TTCGGAAGAT TTCATTGTGC TAGTAAGCTT GTGCTTCGAA
CTGCTCAAAA GAAAACTGTA TGTCGTCTTT GCTCTCATTA CACGGTCTTG GCGTCTGGTC
ATCTAAATGT TACTTGCACA TTCGATTTAT CACCTCATTT ACCACCTCTT CCTCGAATTG
GAGTTCTGAT GCAATGTCGA GCAACCATGC AACAAGTTGA GTGGCTTGGA CTTGGTCCAC
ATGAAAATTA TTTAGATAGA AAGTCCTCTG CGTTTCTGGG TCGCCACTCC GCAACTGTTG
ACGATCTTCA TGTTCCATAT ATAGTACCAA GTGATAATGG TGCTCGCCAA GAAGTGCGCT
GGCTGGCTTT AGAATCTTCC GCAAGCGGAA ATAAGTGCTT GTTCACGAGC AAGGAGAATT
TCAATTTCAA TGCGTCAAAT TTCTCTGACG CTGAACTGGC GCGAGCCAAC CACCAGCACG
ATCTTCAGCG AAGTGACAGT ATTCACGTAC ATTTAGATAC ATTCCAAATG GGTTTAGGAG
GAGACTGCAG TTGGTTTCCA TGTGTGCACT CTGAATTTCT TGCACCTGCT CGGAAGCGTT
TCACTTTTAC TTTTGTCATC GCAGGAGTTG GAGGAAACGA AAACCCAAGT GATGTTTTTC
AAGAATTGCG ATTTTCGCGT GACACAGTCG AACTCACTTC ATCGTGA
 
Protein sequence
MYFWNLFASF VFRCFSLQGI INFYESWTFR TSRVLSIILR IGECSYATFF DSCVQEQLVS 
VDEPLSSAEI ADWENASIVG RDRRPSHCTL YSFRTVDECI SFWKGGGLWN ERINLANVQN
LNGTWKFKLL KNPRAISDEF TLSNFSDTFW AEIPVPANWQ CKGWDRPIYT NFQYPFPLHP
PVARTSIKLG IDAGVLCENC VHATNPTGLY RRTFQLDMDW NESYERTFIV FEGVDAAFHI
WINGQLVGYS QDSKMTAEFD VSDSLQSGTN LVVVRVYRWC DGSYLEDQDQ WWLSGIFRDV
YLYRKRASHI CDYSVQTECC DWQSGTWELR VGIEICDTAN AYFNNDNKRL RVRLFDSSLR
EISTGTTDKF LIQHYSPDFG TTQEKNIEEV RQYANVCFNI SNVAEWSSER PTLYLLAIIL
ESESGECLDC EGCRVGFRTV QIFNKQLLVN EKRITFQGVN RHEHCPVEGK AVSEKLMIED
ILLMKRNNFN AVRTSHYPNH PRFYELCDEY GLYVIDEANI ETHGFEFGLH STPYLANDPV
WRNAYMSRVS RMVQRDKNHC SIIIWSLGNE SGCGGAHFAM YSWVKQNDKT RPIQYEGGGF
KTKCTDIICP MYATPKICQD LASQMDDRPV ILCEYSHAMG NSNGGLAKYW EVFRSNRSAQ
GGFIWDLIDQ GLNCSTNGRI HWGYGGDFGD SPNDKQFCIN GLVFPDRSPH PAMEEVKYLQ
QPTMIRAQGD KIIVENRYHF TNLECMKFDW CVILDSGFIL KKGQFSKLHI QPGAETSYEW
SQLFPSLSSL AQLVQRKQFV FGEWWIDVSA SFIKHQSWIP EGVSIAKCQL MLPQQKVPVP
GSLNSEAAIH IEHLSDAIWV TSQDSTYVFN AASGRLLKFQ FRGEMLIQSG PIASLWRAPT
DNDSGGWIFS FAERWAKAGL DTLHEHEEAV ETFVDNFGRF HCASKLVLRT AQKKTVCRLC
SHYTVLASGH LNVTCTFDLS PHLPPLPRIG VLMQCRATMQ QVEWLGLGPH ENYLDRKSSA
FLGRHSATVD DLHVPYIVPS DNGARQEVRW LALESSASGN KCLFTSKENF NFNASNFSDA
ELARANHQHD LQRSDSIHVH LDTFQMGLGG DCSWFPCVHS EFLAPARKRF TFTFVIAGVG
GNENPSDVFQ ELRFSRDTVE LTSS