Gene OSTLU_50104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_50104 
Symbol 
ID5002923 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp71692 
End bp76035 
Gene Length4344 bp 
Protein Length1386 aa 
Translation table 
GC content60% 
IMG OID640418344 
Productpredicted protein 
Protein accessionXP_001418827 
Protein GI145348791 
COG category[A] RNA processing and modification 
COG ID[COG5161] Pre-mRNA cleavage and polyadenylation specificity factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.544218 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.543331 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCACG CGGTGCACCG CGAGGTGCAC CCGCCCACGG GCGTCGACCA CGCGGTGACG 
GCGTACTTCA CGCGCCCCGT GGGCGACGGC GGGGACCCGA ATCTGATCGT CGCGAGCGCG
AATCGCATCA CCGTGTACGC CGTCAATCGG CGCGGTGACG AAGAATCGCT CGACGTGTGC
GCCGAGTTCG ACGCGCAGGG CGCGATAGGG TCGATGAGCG TCCTGAGACG ACGGTTCGGG
GCGCCGAGGA ATCAGCGAGA CGCGCTGTTG ATCGCGATTC GCGAGCGAAA GCTGAGCGTG
GTGGAGTACG ACGCGGCGAC CGGGGACGTG TGCTGCTCGT CGATGCACTC GTTCGAGAGT
GCGCTGGGGT GTAATCCACT GGGAACGACG CTGCGGATGT CGAGAGAGGC GCCGCTGGTG
GTGAGCGATC CCGAGGGTCG GTGCGCGGCG GTGGTGCTGC GGGAGGACGG CGTCGCGGGC
AAGGTGCGAG TGCTGCCGAG CGTGGACGGG GGGTTGGGGC TGGTGGCGAA CGACGACGAG
GGACGCGTGC GAGGACCCGC GGCGAGCGTG CGCGAGTCGT TCCCGCTGCA CCTGCCGGGG
GTGCGATTGA TTCGCGACGC GTGTTTTTTG CACGGATACG GCGAGCCGGC GCTGGCGGTT
TTGTACGAAA AGACGCCGAC GTGGGCTGGG CGGTATAACC TGAGCAAAGA TACGTGTGAG
ATCGTGGCGC TGAGCGTGGA CGTTGACAAG CAAAAGGGGA CGGTGATTTG GCGTCGGCAG
AACTTGCCGT CGAGCTCGTA CAAGTTGACG GCGCTCTTGC CGCCGCTCGG CGGCGCGTTG
GTGTTTTCGC AAGACTTTTT GCTGCACGAA TCCCAAGAGA GCTCGTCCGT GCTCGGGTTG
AATACTTTCG GTCACGGTGG GCCGCAAGAA GGGAACGACG CCGAAGTCGC CGCGCGCGCG
GCGGGAATGG GGGAGAACGC CATGGCGAAC CCGCCGCCGG CGTGCGCGGC GCGCGCCGTC
GATTGCGGAT TAGAGATCAC GTTAGATGGC GCTCAGGCGT CTGTTGTGTC GGAAGATCGC
GTCTTGGTGA CAACAAAAAC CGGTGCGTTG TTGCTCTTAG CTCTTCACAC CGATGGCCGT
AGTTTACGTC GCATGATGCT TCAGCGCGCC GGCGGCGCTG TGCTCTCTTC GGGCATGTGT
CTGCTGTCGA GAGATTTGTT GTTCCTAGGA AGTCGCATTG GTGACTCGCT TTTGGTGAAG
TTCACGCCTA AAGAAGAACC GACGGCACCT CTCATGCTTC CCGACGCCGA AGACGAGAGC
GAGGATGAGG CGACGGAGAA GTCGAAGGGT AAGCGCTCTA AATCGGGCGG CGCGGCGAAT
CGCAAACGCG CCAAAACCGC CGAGGCGCCG CCACCAGCGC CGTCGACGCC GAGTCCCGAA
GATGACGATG ACGAGCTCGA GGCGTTGCTC TACGGCACGA CGAAAACGGA GACTGTGCAG
ACCGACGCCG TGCAAACGGA GAAGAAGCGC GAAGGCTTGG CCGGCATCAT CCCAGGCTTG
AAGGTTGCTG GTTACGATTT GAAGGTCAAG GACTCTTTGC TCGGCGTCGC GCCTGTGGTG
GATATCGCCG TCGGCGCGAG CGCGCCCATG GGTTCGAACA AGAACGAGCG CACCGAGCTC
ATCACCGCGT GCGGTCAAGG GAAGAACGGT GCGCTGGCAA TTTTGACTCG TGGCGTCCAG
CCTGAACTCG TCACCGAAGT CGAATCCGGT ACGCTGCCCA ATTTACAAGG TTTGTGGACG
TTGCACTATC GCAAAGAAGG TTCGAAAGAA GAACGCGAAC CTTTCCATCA TCATTTGTTA
CTGAGCATGA AGTCGTCGAC GATGATCATG GAAACCGGCG AGGAGCTTCA AGAAGTGAGC
GCCTCGCTCG AGTTCATCAC GAATCAAGCC ACGTTGGCGG CGTCGAATAT TTTCGGACAT
TACTGCTCGG TCCAAGTCAC GGGAACGGGC ATCCGCGTGT TAAAGGGAGG CGTGAAGGTA
CAAGACGTCG GTTTGCAGGA CATGGACGCA CCGAAGGGAG CCGCGATCGC GTCCGCACAG
ATTTTAGATC CTTACATCAT CGTTCGGCTA TCGGATGGTT CGATAAGATT GTTGTCCGGA
GACGAAAAAC AGATGAGTGT TTCGTTGATG GAAACGGGGG CGATCCCAAC ATCTTCAGTA
ACGGCGTTCG CTCTGGTGGA TGATTCTGTT GAAGCCGCAG ACGCGGCGGG CGGGGGCGAA
CGCAAGAGCG GATGGATTCA TCGAGCTGCG ACGAATGGCA CCATTACGGG TTTGGAAGGG
AACAAGAAGA GCGGCGCGTG CAACAACAGC GAGGCCATCG TCGCGCTGAC GCGCGAGGGT
GGAAGTCTGG AATTGTTTTC GTTGCCCAGC TGCACGCGCA TCTGGTGCGC CGATGGGCTG
TCCGAGGGCA TGCGCGTATT GAGTCCGCAA ACACCCGTCA ACGCTGAGTC CAGTGTCCCT
GAGATTGTCG ATATTCGCAT CGATTCATTC CAAGATGCGC ACGAGCGCCC TTTGCTCACC
GCAGTGCGTG GCGATGGCAC GTTGCTCTTG TATAAGGGTT TCATTGTTCC CGCCGGGACG
ACGTACGAAG GACAAGACGA ACCGCTCGAA AAGAATGAGT TGCGATTTTC TCGCGTCAAC
GTCGACGTTG AAGGTTCTGG TTTGAATGTC GCTGGCATAG GCGCCGCAGG TCAGCTTAGA
GACTCCCTAG CGGGCGCGCG ATTGACGCGC ATCGGTAACG TCGGCGAAGG ACAAGGCGTG
CAAGGCATCT TTGTCGCAGG CCCAAACCCG TTGTGGCTCA TCGTTCGCAG GTCTCGCGTA
TTAGCTCTCC CGACGCGTGG TGAAGGCGAG GTCGTCGCAT TCACGGTATT CCACAACGTC
AACTGTCCGC ACGGTTTCAT TCTAGGCACC GCGTTGGGTG GCGTGCGCAT TTGTCAGATG
CCTAGTAAAA TGCACTACGA AGCTGCGTGG CCGGTGCGCA AGGTGGCGCT CAAGTGTACG
CCGCACACGA TTACGTACCT GCCAGATTTC AAGCTCTACG CACTGGTTAC ATCAGCTCCT
GTGCCTTGGG TCGAACGGGA AATAGAGCAA GATAATGTCC ACGGTATCGC CTTGGCAAAA
GTGCGACGCG AGCGCGCGAA AGCGAACGAT GACATGGAGT TACAATACTC GGTGCGACTC
CTCGTTCCTG GATCGCTCGA TAGCGCGTGG CAACACGCGC TCGAACCGGG CGAGCACGTG
CAGTGCGTTC GAAATGTCCA ATTGAGAGAC ATTAACACTG GGGCACTCCT TTCACTTCTT
GCCGTCGGTA CGGCGATGCC TGGAGGAGAA GACACGCCGT GTCGCGGTCG CGTCATTTTA
TTTCAAATGG TGTGGGAGCG CGACGCCGAA TCCATGGATG GGTACAGATG GAAAGGACAA
GTGTGCTGCG TGCGCGAAGC GAAGATGGCG TGCACCGCGT TATCGGCGCT CGACGGTCAC
CTGATTGTTG CGGTCGGTAC CAAGCTCACC GTGCACACGT GGGATGGCGT CGAATTGAAT
AGTGTCGCTT TCTTCGACAC CCCAATTCAC ACCGTGAGCA TCAACGTCGT GAAGAATTTC
ATCCTGGTGG GCGATTTAGA GAAGGGCTTG CACTTTTTCC GCTGGAAGGC GAACGGCTTC
GAGAAGTCGA TCATTCAGCT CAGCAAGGAT TTCGATCGCA TGGACGTCGT GAGCACAGAG
TTCTTGATCG ACGGCGCCAC TCTGAGTTTG CTCGGGTCCG ACATGAGCGG CAACGCGCGC
ATCTTTGGCT ACGATCCAAA ATCGCTCGAG TCGTGGAAAG GACAGAAACT CCTCGTGCGT
TCGGCGTACC ACGTCGGTTC GCCCATCTCT CGCATGGTGC GTTTTAACGT GGAAGGTACG
ACCGCGAAAG CCGCGCCGGG AGAACGCCCC AAAGGCACCA ATCGACACGC CGTCTTCTTC
GGCACGCTCG ACGGCGCTTT GGGCATCTTC ATGCCCACCG ACGAGCCGAC GTACGCCAAG
CTCCACGCCC TTCAACGCGA GTTGAACACC ACGGTGCGCT CACCGATCGG TTGCAACCCG
CGCACGTTTC GCACCCCCAA AGTCTTCGAA GGCAAGCATG TGCAATTACT CGCCCCCCTC
GACGTCCTCG ACGGCGGTTT ACTCTCCAAA TTCGAGACGC TCACCTTCAC CGAGCAGCGC
GCCGTCGCCG AGCGAAGCGG CGTCGACCGC GATTTGGCCC TCGGTTTGAT CCAGCACCTC
AGCGCCAGCA ACGCGTTCGT GTGA
 
Protein sequence
MSHAVHREVH PPTGVDHAVT AYFTRPVGDG GDPNLIVASA NRITVYAVNR RGDEESLDVC 
AEFDAQGAIG SMSVLRRRFG APRNQRDALL IAIRERKLSV VEYDAATGDV CCSSMHSFES
ALGCNPLGTT LRMSREAPLV VSDPEGRCAA VVLREDGVAG KVRVLPSVDG GLGLVANDDE
GRVRGPAASV RESFPLHLPG VRLIRDACFL HGYGEPALAV LYEKTPTWAG RYNLSKDTCE
IVALSVDVDK QKGTVIWRRQ NLPSSSYKLT ALLPPLGGAL VFSQDFLLHE SQESSSVLGL
NTFGHGGPQE GNDAEITLDG AQASVVSEDR VLVTTKTGAL LLLALHTDGR SLRRMMLQRA
GGAVLSSGMC LLSRDLLFLG SRIGDSLLVK FTPKEEPTAP LMLPDAEDES EDEATEKSKD
DDDELEALLY GTTKTETVQT DAVQTEKKRE GLAGIIPGLK VAGYDLKVKD SLLGVAPVVD
IAVGASAPMG SNKNERTELI TACGQGKNGA LAILTRGVQP ELVTEVESGT LPNLQGLWTL
HYRKEGSKEE REPFHHHLLL SMKSSTMIME TGEELQEVSA SLEFITNQAT LAASNIFGHY
CSVQVTGTGI RVLKGGVKVQ DVGLQDMDAP KGAAIASAQI LDPYIIVRLS DGSIRLLSGD
EKQMSVSLME TGAIPTSSVT AFALVDDSVE AADAAGGGER KSGWIHRAAT NGTITGLEGN
KKSGACNNSE AIVALTREGG SLELFSLPSC TRIWCADGLS EGMRVLSPQT PVNAESSVPE
IVDIRIDSFQ DAHERPLLTA VRGDGTLLLY KGFIVPAGTT YEGQDEPLEK NELRFSRVNV
DVEGSGLNVA GIGAAGQLRD SLAGARLTRI GNVGEGQGVQ GIFVAGPNPL WLIVRRSRVL
ALPTRGEGEV VAFTVFHNVN CPHGFILGTA LGGVRICQMP SKMHYEAAWP VRKVALKCTP
HTITYLPDFK LYALVTSAPV PWVEREIEQD NVHGIALAKV RRERAKANDD MELQYSVRLL
VPGSLDSAWQ HALEPGEHVQ CVRNVQLRDI NTGALLSLLA VGTAMPGGED TPCRGRVILF
QMVWERDAES MDGYRWKGQV CCVREAKMAC TALSALDGHL IVAVGTKLTV HTWDGVELNS
VAFFDTPIHT VSINVVKNFI LVGDLEKGLH FFRWKANGFE KSIIQLSKDF DRMDVVSTEF
LIDGATLSLL GSDMSGNARI FGYDPKSLES WKGQKLLVRS AYHVGSPISR MVRFNVEGTT
AKAAPGERPK GTNRHAVFFG TLDGALGIFM PTDEPTYAKL HALQRELNTT VRSPIGCNPR
TFRTPKVFEG KHVQLLAPLD VLDGGLLSKF ETLTFTEQRA VAERSGVDRD LALGLIQHLS
ASNAFV