Gene OSTLU_25878 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_25878 
Symbol 
ID5006790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009374 
Strand
Start bp389014 
End bp392128 
Gene Length3115 bp 
Protein Length1005 aa 
Translation table 
GC content56% 
IMG OID640422211 
Productpredicted protein 
Protein accessionXP_001422576 
Protein GI145356724 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.266953 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GACGCGCGCG CGACGAGCGA CGCGCCGCGC GACGAGCGAC GACGCGCCGG TCGAGCGTCG 
AGCTCGACGC GCGCCCGGAT GAGCGACAGC GAGAGCGATT ACAGCGACGA CGACGCGCAC
ATCGGCGGCG GCGATCGGGA TGCGGAGGAA ACGTACGACG GCGCGGGAGG CGAGGCGACG
CTGAACGTGG ACGTCGGATC GTCGCTCACG AGCGCGGATT GGGCGGACAT CGATAACTCG
GACCTCGTGG CCAAGGTGCG CGGGAGCGCG GGGTTTCGAA TCATGCGAGA GATTCAGCCG
TGGGACGGGG CGCTGGACGC GCCGCCGGAT TTGATGAAAC GACACGTGTT TGAATACTTT
TACACGCAGC GCAAGGACAG TGACTTTTAT AAAAAGTTGG CCAAGGATTG CTTCCCGAGC
GAGTACGAAA GAGCGGGGGC GGATAAATCG GAGAAGAAGC GGTTGGCGAA CCTCTGGGCG
GGGGCGGTTT TGGACGAGTT GAAGGCGATG CGAGACGGGG CGAAGCACAC CTTGCGATGG
ATCAACGCCA TCAAGGAAGC GCAGGCGGGT GATTTGTTAT TTTTAAAGCG TGAGGCTCTG
ACTCACGGGC CCGGGAGCGT GCAGGAGATT GCCAGTCAAA TTGTTTGGGA CAACGTGAGT
AATCGCATCA AGGATCAAAC CTTTGCGGCG ATGCACGCGT GGACGAACAA ATCGGGTGCT
CAATCCGGTG GCGTCGCGCG CATGTACAAG CATCAGCACG AACTCATCGA TCAACTGGAC
AAGCACTATG CCGCCATCGG GAAAATGCAA GAGGCGAAGA AGCCGTTACC CAAGCCGTTG
CACGTCATCT TATCGACGCC GACGGGGTCG GGCAAGACGT TCACCGCCAT TCTCGTTCAC
CTGCGTCTTT TGAAGGTCAA GTACCCGGAC GCTATCTTGC TCTACTCAGT GCCGACGAAA
CAAGTTTTGA AGCGCGTTGG ACAAGAGTGT GAAGCGCACG CGGTGCCGTA CTGGACGGCG
GCGCGCGACA ACACGGGCGA TGGAACGCTT CACCAAGTGC GTCGTCCGTA TTCCATTCGT
GATAAGGCTC GAAACAAGAA AGCACTTCGT GCGGCGCGTG CTATCGGTGA AAAGGTCAGC
GCTGGTTCCG GTACGATTCA ACAACAGCTC GAATACGCCG CAGATGTGGG TTACAAGCTC
AAGGATCACG GCGCTGGTAA ACCGGATATC ATCATCGCGG ATCTCAATAC CACGGCGGCG
TTGCTCAAAG CCTGCAAGGA AGAAAGCTCC AGCTCGTTCT ACCACGAGTC TAAGATTCTT
TTGTACTTTG ACGAGCCGAA TATGGGTATT CACCTCGATC CGAATGTTTT GGGTGTGGTT
TCGTCTATCC AGGCCAACAT GCCGCTCACC GCCGTACTTG CTTCGGCGAC CCTAGGCGCG
TGGGAAGGTT TGGAGCCTTG GTGGCGCGGC CCGACCGATG CGAACCAAAT TACTATTAGT
TTGGAGCCGT ACGAGTTGCC AATGGCGAAG CTCGCCGTTT TCAACGAAGG CACGAGCGAG
TTTTCTCCCA TGAGTCCGCT CAACTTGATT GAAAATTATG CCGAATACCA ACGAGTGATG
GAGGATTACC GTTTGCCCAC GCTTCTGTTG CGTCACTTGA CAGCTCGACA TGGTAATGAT
TTATTGCAGA TTCAACCGCC CGGTGGACCT TGGGATAAGG TGCAGGGTGA CGTGAAATCG
TTGCGTTTGG CAATCGAACC GACGTTTACG AGCCTGTCGC AAAAAGAATT TGAACGACTG
CAAGGTCGAT GGAAGATGGG CGAGGACGCT CCTACCAAGG TCGATGGCAT CCGTGGTGCA
CTCTCAAAAG AGGGCGTCAC TATGGTCGGT TGCTTGGATC CGCGCAAGGT GGCGTTCGAG
CTCGCCGGCT TTGCTGACCA GGAGGCTTGG ATCCAAGACG TTCATAAATT GAACAACAAG
CTCAAGGAAG CGGAAAGAAT GGTAAAAGAG AACGCCAAGG CTGAAAAACG CAAAAAGAAG
GATGACGAAG ACGAGGCTAA GGATGGTGGC GACGAAGGCG GCGTCGGAGT CGTGACTCTT
CGACCGATGT TAAAGATTAG TCTCGCCGAA GCTCTCGAGG CTGACATCAA CACCTTGGTC
ATGCTTTCTA AGGGTATCGC TTATGCGTGC GGGTCAGGCA CAGAGCCTAT GGTGAAACGT
CTTTATAACC AAGCGTTGCT CACCGTTCCC GATTCTCTTC GAGGACGATC TCCGCCGCTC
AACGTGCTGG TTGTCGACTA CTCGTCCATT TACGGAACGG ATTGTCCCGC GGTTGATACT
TTGTTGTTGC AAGAGGATTT GGGTCGACTC TTAGCTTGGG AAGATCTTCA GCAGTTCCTT
GGTCGTCTTC GACGCGATGG AACAGCCGTT TTCTACTCCA AGAAAACTCT TCGCCGGGCC
GCTCTCGGCG CCGCGGTCGA AGAGGAAGAG ACGACGGCGC TTATTGAATT CCAAAAGCTT
GTTGAAAACT CTGTGCTGGA TCTCGAAAAG GCTCAAAAAC GCGACACCGA CAGCGTCACC
GCCCTCGTGA CCAAGCTCTC TGCGTCTTCT GGTCGCAGTG CGGGAGAAGT CGCTTCGTAC
GTGTTGGCGT CTGTTATATC GTTTGCGCTA TCTGCTCCGT CTCATCTCGA TGGTGCGGGC
GTGTACCCGG CGACCATCCC GGAGAGCGAC AAAGAGCTCT TAGCCGCGAT CACAAAGCGC
GTGGAGGGTT ACTCGGACGC ATTGGAAAAT GTGTTGAAAA AGATGTCGGA GCAAGTTCGT
GCCGTGAGCG CGATCGAAGC CCTCTCGCTC TCTGCGAACC CGTTCGCCAG CCGTACCGGC
GGCGCTCGCG TGCTCGGCAT TGCTGCGCAA GTGTTGAAAA TGCTGTACGA TGCTGATATC
CTCTCCGAAG AAGCACTCTT TGCTTGGGCT AATGCGAAGC GCAAAGAACT TCTCGCTGAG
TCAGACGGTG ACGCTCGCTT CTTTGGCAAA GCAAAGCCAT TCATTCAGTG GTTATCCGAA
GCAAGCGATG AAGACAGCGA TGAAGAAGAA GAATAGGTCA CACACACATC ACATT
 
Protein sequence
MSDSESDYSD DDAHIGGGDR DAEETYDGAG GEATLNVDVG SSLTSADWAD IDNSDLVAKV 
RGSAGFRIMR EIQPWDGALD APPDLMKRHV FEYFYTQRKD SDFYKKLAKD CFPSEYERAG
ADKSEKKRLA NLWAGAVLDE LKAMRDGAKH TLRWINAIKE AQAGDLLFLK REALTHGPGS
VQEIASQIVW DNVSNRIKDQ TFAAMHAWTN KSGAQSGGVA RMYKHQHELI DQLDKHYAAI
GKMQEAKKPL PKPLHVILST PTGSGKTFTA ILVHLRLLKV KYPDAILLYS VPTKQVLKRV
GQECEAHAVP YWTAARDNTG DGTLHQVRRP YSIRDKARNK KALRAARAIG EKVSAGSGTI
QQQLEYAADV GYKLKDHGAG KPDIIIADLN TTAALLKACK EESSSSFYHE SKILLYFDEP
NMGIHLDPNV LGVVSSIQAN MPLTAVLASA TLGAWEGLEP WWRGPTDANQ ITISLEPYEL
PMAKLAVFNE GTSEFSPMSP LNLIENYAEY QRVMEDYRLP TLLLRHLTAR HGNDLLQIQP
PGGPWDKVQG DVKSLRLAIE PTFTSLSQKE FERLQGRWKM GEDAPTKVDG IRGALSKEGV
TMVGCLDPRK VAFELAGFAD QEAWIQDVHK LNNKLKEAER MVKENAKAEK RKKKDDEDEA
KDGGDEGGVG VVTLRPMLKI SLAEALEADI NTLVMLSKGI AYACGSGTEP MVKRLYNQAL
LTVPDSLRGR SPPLNVLVVD YSSIYGTDCP AVDTLLLQED LGRLLAWEDL QQFLGRLRRD
GTAVFYSKKT LRRAALGAAV EEEETTALIE FQKLVENSVL DLEKAQKRDT DSVTALVTKL
SASSGRSAGE VASYVLASVI SFALSAPSHL DGAGVYPATI PESDKELLAA ITKRVEGYSD
ALENVLKKMS EQVRAVSAIE ALSLSANPFA SRTGGARVLG IAAQVLKMLY DADILSEEAL
FAWANAKRKE LLAESDGDAR FFGKAKPFIQ WLSEASDEDS DEEEE