Gene OSTLU_94867 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_94867 
Symbol 
ID5004010 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp325251 
End bp327020 
Gene Length1770 bp 
Protein Length575 aa 
Translation table 
GC content57% 
IMG OID640419431 
Productpredicted protein 
Protein accessionXP_001419971 
Protein GI145351197 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0308188 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0300986 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAAGC GACAATACAT GGGCGGAGAC GCGAAGTCGA AAAAGGTGAA GACAGCAAAT 
AGAGGAAAGT TTGTGTTCGA TTGGCGGAAA GAGGAAGACA CGTCGAGAGA TTTGAATCCA
CTGTACGATC GGCCGCACGA AGTGGCGCCG ATGTTCGGTC GAGGGATGAT CGGGGGCGTG
GATCGAAGGG AGCAGGCGCG CTCGAACGCC GAGCGCGAGC GCGAACTCAT CGTCAAGTCG
AGGAAGGATC TCGGATCGAA GGACGCGGCG GGTGATGTGC GAAAGATGGA AGTTGAGCGC
GAGCGTAAAC GTAAGGATGT CGAGGCGCGC GAGTTGAAAC GAACTTTCAA GGAGCACTGG
AGTGATAAAA AGTTGGAAGA CATGACAGAG CGCGATTGGC GCATTTTTCG AGAAGACTTT
AACATTTCGT ACAAAGGCGG CAAGTTACCG CTTCCCATGC GCGCGTGGAA AGAGTGCACG
AGCTTGCCAC AAGAGATATT GCGCGCTATC GCGCAGGTTG GGTACGAAAA GCCGTCGCCG
ATTCAAATGG CTAGCATTCC GATCGGTTTG CTGAAGAGAG ACGTCATCGG CATCGCCGAG
ACGGGTTCGG GTAAGACGTG CGCGTTCGTC GTCCCCATGC TCGCGCACAT CATGCAGCTT
CCGAAAATGA CGGACGAAAT TGCCGCGCAC GGGCCGTACG CCCTGATCAT GGCCCCTACG
CGCGAGTTGG CGCAACAGAT TGAGGAAGAG ACTCTCAAGT TTGCGCAGTA TTTGGACTAT
CGCGTCGGCT TGGTCGTCGG CGGTCAATCG ATCGAAGACC AAGGTTTTAA ACTTCGCAAA
GGGGTGGAGA TATTAGTCGG TACGCCCGGT CGTATCATAG ATGTCATTGA GCGCCGATAC
ACCGTGCTCA GTCAGTGCAA CTACATCGTG CTCGACGAAG CCGATCGCAT GATCGACATG
GGTTTCGAAC CGCAAGTCGT GGCGGTGATG GAGGCGATGG GATCGGGTAA CTTGAAACCC
GAGGACGAGG CGGAAGAGCT CGACGGCCAG GCGCTCGAGC AAGGTGGGCC GACGTCGTCA
AAGTACCGAA CGACGTACAT GTTTTCCGCC ACCATGCCTC CGAGCGTGGA GCGTCTGGCG
AGAAGTTATT TGCGCAATCC CGCGGTGGTC ACCATCGGCA GCGCCGGGAA GACGTCCGAT
TTGATCAAGC AAGAGATTAT TTGGGTGTCG AGAAACGAGC GCGACTCCAA ATTTGAGCTC
GTGTTATCGC GACATCCCAA CACGCAAGCC ATCGTGTTCG TGAACGCCAA ACGCTCGGTG
GACGCCGTGG CGAATCTGTG CTACCGTCTC GGGTACTCGT GCGCGTCCAT ACACGGCGGC
AAATCGCAAG ACCAACGCGA GGAGTCTTTG CGCGGGTTCA AGGCTGGGGA TTACGACATC
TTGGTCGCCA CCGATGTCGC CGGTCGCGGG ATCGACGTCA AGGGCATCGA TCTCGTCGTC
AATTACGAGT TGCCGCACAC GATTGAAAAT TACACCCATC GCATCGGGCG CACCGGTCGC
GCCGGTCGCA AGGGCACCGC CGTGAGCTTC CTCACGAGCG ACGATCGCGA CATCATGTAC
GAGCTCAAAG AACTTCTCAT CGAGAGCAAG AACCACGTCC CAGATGCGCT GGCAAACCAC
GAAGCGGCGC GCGTAAAGCC TCAGCGCGAC GACAGAGGCA GACGCATGAA CCGCGAAGAC
ATTCGAGGGC AAGAAGCCAT CATCTACTGA
 
Protein sequence
MLKRQYMGGD AKSKKVKTAN RGKFVFDWRK EEDTSRDLNP LYDRPHEVAP MFGRGMIGGV 
DRREQARSNA ERERELIVKS RKDLGSKDAA GDVRKMEVER ERKRKDVEAR ELKRTFKEHW
SDKKLEDMTE RDWRIFREDF NISYKGGKLP LPMRAWKECT SLPQEILRAI AQVGYEKPSP
IQMASIPIGL LKRDVIGIAE TGSGKTCAFV VPMLAHIMQL PKMTDEIAAH GPYALIMAPT
RELAQQIEEE TLKFAQYLDY RVGLVVGGQS IEDQGFKLRK GVEILVGTPG RIIDVIERRY
TVLSQCNYIV LDEADRMIDM GFEPQVVAVM EAMGSGNLKP EDEAEELDGQ ALEQGGPTSS
NVERLARSYL RNPAVVTIGS AGKTSDLIKQ EIIWVSRNER DSKFELVLSR HPNTQAIVFV
NAKRSVDAVA NLCYRLGYSC ASIHGGKSQD QREESLRGFK AGDYDILVAT DVAGRGIDVK
GIDLVVNYEL PHTIENYTHR IGRTGRAGRK GTAVSFLTSD DRDIMYELKE LLIESKNHVP
DALANHEAAR VKPQRDDRGR RMNREDIRGQ EAIIY