Gene OSTLU_17572 details

Gene Information

Locus tag: OSTLU_17572
Symbol:
ID: 5004625
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009366
Strand:
Start bp: 228025
End bp: 231798
Gene Length: 3774 bp
Protein Length: 1257 aa
Translation table:
GC content: 48%
IMG OID: 640420046
Product: predicted protein
Protein accession: XP_001420789
Protein GI: 145352935
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
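
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon, which is consistent with the values listed above.

```python
def feature_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


gene_length = feature_length(228025, 231798)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 3774 1257
```

Both results agree with the Gene Length and Protein Length fields of the record.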


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.632728
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCTAAAA ATAAGTCTCT CGAGCATGGT TGCCTGGCGT GCAGCTTTGG ATTTGATTTC 
AAGCGACGGG GCAACTTGAC TCGAGTACGC GATTCAGTCG TGGCGTACAC TGTTGGAAAT
CTCTTGACCT TGCGAGACGT GAAAACACAA AGGCAGACGT ACCTGCGGAG CGCGGAGGGG
GGCAATATCG GCTTGGTCGC CGTGGATGGA GATGGCACGC ACATTGCGAT CGGCGAGAAT
GGTGCGGCGC CCCGAATATC AATATATAGT TACCCTTCAC TTGAGCTCGT AAACACGATT
CACGGCGGCG CCGCGGCGTG TTACTCGACT GGAAAATTTT CTCCAGACGG ACAGCTTCTC
GCGAGCGTGA GTGGGCTCCC AGACTATTGG CTCACTTTGT GGGACTGGAA GACAGCAAAG
ACAGTCCTTC GTGCCAAAGC GTATGGACAA GAAGTGTTTG ACTTATCCTT CCTAGGCGCT
AACTCGGAGC GAATGATCAC TCACGGCGCT GGTCACATTC AGTTTTGGCA GATGGTCAAG
ACGTTTACTG GATTGAAGTT ACAAGGTACT CTTGGAAAGT TCAACGGCGA ACCTCTGTCC
AATATCACGT GCGTGTTGGA GTTACCCTTT AATGATATGA TTTTGACGAG CACAGAGTAT
GGCGCTCTAC TAATCTGGGT CGATGGGCGC GTTCAGTACA AAATAATGCG AGCAAACGCG
TCGAGCGACG ATGAAGACTG CCACGGCGGT TCGATCGAGA CGATGATTTA TATCAACAAC
GGTTCAGTGA TTCTCACGGC GGGCGATGAT GGGTGGGTTC GAGAATGGAA TATTGACGAG
ATTCGTCGCG TCATATCGTC GCAAAGCAAA GGGGTGGTTT GTATGAAACC CATCTCAGAG
CGCTCAATCG ATGGAGCCAA TATCCGCCAC ATTTGCGCGC TTGAAAATAA GTTATTCATT
CAAGACAGCG GGCGAGGGTT AGTGTGTTCT TTGGACGATG ATGATATGGA GCGTCCTGTT
GTTTCTGCGC ACAGTGGTGT CATCACTGCA TGCGAGACTT CGCTGAACAG TACTCACTTG
ATCACTTGCG ACAATATCGG AGAAGTGCAC TGCTATGATT ATTTATCCCG TGCGTTTCTC
TACAAAAGTC GGTTCAGCTG TTCGGCGACG GCACTTACTA ACGTACCTTT GAATGTCAAT
GAAGATGGGA GATGTATCGT CGTTGGATTT GCCGATGGTG TCGTTCGCGT TCTCGTGAGA
AACACGTCCT CTTGGCGCTT GGCTCTCGCG CTCAAGCCAC ACGCGCATGC CGTCGAGCAG
TTCACATGGA GTTCTGATGG TGAGTATTTG GCAATCGCTG CTGCTAATAA TACTGTGTTT
ATTTTCAGTT CATCTTCCTT CGCATCGACG TTGCAATTTG AGCCAATTGG TTTCATACGC
CTGAGTGCTC GCCCGAAGAA GATATTTTGG GATGGTGAAA GTATTTTTGG TGCTCAAATC
GAGGACGAGA TGCACTGCTT TGCCGTCCAT CGTGACTTGG GTGAGTCTCG TAAAGATTCG
TACGAAATCG ATGTCACTCC TCGTATTACC ACCGATGTAG AAACCAAGAA CGCCGTTGAT
GAATCTTTCA GCGCGATAAC GTATGATAAT TCTTACTGCG TTAAAATCTG TAAATCGGGA
TCTTTTGAAG TACACGCGTT AAGCGATGCT TCTCCATCAA ACACACACTC TTCAATCGAC
GAGGCGCTGT GCATCTATCC AGAAGTCGAA GCGATCGACA CACTCTCCGC ACAGTGCGCG
TCCATCGAAG AGGTACTGCA GCGAGAACGC GCGAAGGAAT TGAGCGAGGA AGAAGCGGCA
GCACTTATAC GACTCTCGAG CTTGGTGGCG CTCTGTCGAG ATGAATTCAA TGTCATCATT
GAAGAAAATG CAAAACTTCC TCGAGAGATG CGAATCTCTG GCGCAGAACT TCTCGTAGAC
GATACACAAA TGCGCCGAAT TCGCGAGGAT TCAGTACACT GGTTAAAAGC GACTCACGAC
AAGCTTCAAA GCTCAGTTAT GTGTGCTGAA TCTGTCTCAA ATCAACTGCA AAAAAGTTAT
TACGACAAAA TAATGGCCCT TCCTCGTATG ATTCATTCAT CTAGCGAAGC TTTGACGTGC
TCATCTTTTG CGTTGCGCGC TGGAAGCGAG GGTGATTGGC AGACAACTAC GGTCGCAGAG
GCGACAGTTC ATGAATGCGA ACAAGAGATG CGTGTCGTTC ATGTGCAAAA TACTTGCATA
AGTGAAAATG CGCACGAAAG ACAGAGCATC GAAGGCTCTG AAATGAGTTC AGAGGCTCGA
CGGCGCTTCG AAAGATTACG GAGGAACGAA GAGTTGAGAA AATTGGAGAA GGCAGAGCCT
TTGAAAGACC AAACAGACGA CAGCGAAGCG TTGAAACGCG ACAATCTTGG CGATGGGTTT
CCATTGCGTT GCGTTCCGGA TTCACGTGTA CCACCTGAAG ACAAATTAAC GACGGAGGGT
AAACTTTCCG AATTGAGATA TGTCGATGGT CAACTACGCG GCAACCAGAG CCGGTTCAAT
CGAGAGTTTG ATTGCATCGA TGCGAAGGAC GCTGTCACCG ATGAAGATCT TATCTCGCTA
TGCCATCTAA AAGCAGATAT CATCGCGCGA AGTAAAGTCA TTGAACTTTT TCGTATTGTA
GTGCATCGCG AGCTGAGCGT CATGCCTGAA TATGACGTGA AGAACACCGA AATGCAAAAT
GCTCAACGGC AAGCAAAGGA AGAGGTCGAA ACCGCTCGTG CGGCGGTAGA AAATGCCCAG
ACGGCCGTAA ACGATCAACA GACCAAATTA GAGGGGCAGA ACGGAGTAAA AGCGGAAATC
GAGGCGACAT TTGACAAATT CATGATTCCA GGCGTCCCGC GAGCGGCATT GCTCAAAGTT
TTCCACAAAC GCGTCGTCAA TAAAACCGAA CAAAAGAGCG ACAAAGGTGA CTCCGATTCC
GACTTAGACT CTGATTTTGA CGATGATTCG TACTATTCGT CGTCAGACGA AGAAAATGGC
GACACTTGCC CAAAAGGTTG CGACAGTGCT GTCTACGAGC GCGTGTGCGA TCTTCGTTTG
CAAAGAATCG AAGCCATGGA CGCAATAACT GAAATACAAC GGGCATTGGA TGGGAAGAAA
AAGATTCACG ATGCCGCACT GAAAAAGTTG CGAAGTTTGG AGGAAAACGT ACGGGTACTC
GAGAGAGATG CGGATGCTTT TGAGAAGGTA AAGCAGCAAA GTTTGAACGG GCTCAGGACT
GCATTCGTGT TGCCGGTGAG CTGCGTCGAG ACGAGTCAAG GCGACATGGA CGACATGATT
ATTTTCTCCA AGACTCGCTT AGCAGAACTT GAGAATAAAG TCGTAGAGTG GGATGCAGAA
GTCTTAAAAT TGAAACGTCA GCAGCAGGAG TTAAAGCGAG AGCACTCCTC GCTCATCGCG
CAACGTTCCG CTAAGGCGAA AGAATTTCAG AGTCTGCAAC GCAAATACTC TGAGACTCAG
ATCCGAAAGT TTGGAAAGCT CATCGTTCTG GAAGATTTAG ACAAGGTTGT GAATAATGGT
CAAGGCACAG AAGATTTACG CGACAAGCTC AAGGTACAAG AGCTGGAGAA TGCGCGTGAG
CTGCGAGAGA TTAAAAAAGA CATCTCAAAT GTTGAAAGAG AAGTGTTGAA TCTCACGGAA
GCGCATACCA CGTGTCTCAA TGAACTTCTC GCGCTGCGCT CAGTCAAAGT CTAA
 
Protein sequence
MAKNKSLEHG CLACSFGFDF KRRGNLTRVR DSVVAYTVGN LLTLRDVKTQ RQTYLRSAEG 
GNIGLVAVDG DGTHIAIGEN GAAPRISIYS YPSLELVNTI HGGAAACYST GKFSPDGQLL
ASVSGLPDYW LTLWDWKTAK TVLRAKAYGQ EVFDLSFLGA NSERMITHGA GHIQFWQMVK
TFTGLKLQGT LGKFNGEPLS NITCVLELPF NDMILTSTEY GALLIWVDGR VQYKIMRANA
SSDDEDCHGG SIETMIYINN GSVILTAGDD GWVREWNIDE IRRVISSQSK GVVCMKPISE
RSIDGANIRH ICALENKLFI QDSGRGLVCS LDDDDMERPV VSAHSGVITA CETSLNSTHL
ITCDNIGEVH CYDYLSRAFL YKSRFSCSAT ALTNVPLNVN EDGRCIVVGF ADGVVRVLVR
NTSSWRLALA LKPHAHAVEQ FTWSSDGEYL AIAAANNTVF IFSSSSFAST LQFEPIGFIR
LSARPKKIFW DGESIFGAQI EDEMHCFAVH RDLGESRKDS YEIDVTPRIT TDVETKNAVD
ESFSAITYDN SYCVKICKSG SFEVHALSDA SPSNTHSSID EALCIYPEVE AIDTLSAQCA
SIEEVLQRER AKELSEEEAA ALIRLSSLVA LCRDEFNVII EENAKLPREM RISGAELLVD
DTQMRRIRED SVHWLKATHD KLQSSVMCAE SVSNQLQKSY YDKIMALPRM IHSSSEALTC
SSFALRAGSE GDWQTTTVAE ATVHECEQEM RVVHVQNTCI SENAHERQSI EGSEMSSEAR
RRFERLRRNE ELRKLEKAEP LKDQTDDSEA LKRDNLGDGF PLRCVPDSRV PPEDKLTTEG
KLSELRYVDG QLRGNQSRFN REFDCIDAKD AVTDEDLISL CHLKADIIAR SKVIELFRIV
VHRELSVMPE YDVKNTEMQN AQRQAKEEVE TARAAVENAQ TAVNDQQTKL EGQNGVKAEI
EATFDKFMIP GVPRAALLKV FHKRVVNKTE QKSDKGDSDS DLDSDFDDDS YYSSSDEENG
DTCPKGCDSA VYERVCDLRL QRIEAMDAIT EIQRALDGKK KIHDAALKKL RSLEENVRVL
ERDADAFEKV KQQSLNGLRT AFVLPVSCVE TSQGDMDDMI IFSKTRLAEL ENKVVEWDAE
VLKLKRQQQE LKREHSSLIA QRSAKAKEFQ SLQRKYSETQ IRKFGKLIVL EDLDKVVNNG
QGTEDLRDKL KVQELENARE LREIKKDISN VEREVLNLTE AHTTCLNELL ALRSVKV
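
Both sequences are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such blocks could be joined and checked against each other. It assumes the standard genetic code, since the Translation table field of the record is empty, and all function names are hypothetical. Only the first three blocks of the gene sequence are used, to keep the example short.

```python
# Standard genetic code (assumed), built in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def join_blocks(text):
    """Remove the spaces and line breaks between the ten-character blocks."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Fraction of G and C bases in a non-empty DNA string."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_blocks = "ATGGCTAAAA ATAAGTCTCT CGAGCATGGT"
dna = join_blocks(first_blocks)
print(translate(dna))  # MAKNKSLEHG
```

The translated prefix matches the first block of the protein sequence. Passing the full joined gene sequence to `gc_content` would give a value to compare with the GC content field.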