Gene OSTLU_51117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_51117 
Symbol 
ID5004844 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009366 
Strand
Start bp73734 
End bp75842 
Gene Length2109 bp 
Protein Length685 aa 
Translation table 
GC content64% 
IMG OID640420265 
Productpredicted protein 
Protein accessionXP_001420745 
Protein GI145352844 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase
[COG0631] Serine/threonine protein phosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0207292 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACGC GCGACGGCGC GCGCGCGAAT CGATGCTTTC GAGGGTGTTG CGTCGGCACC 
GCGCGCGTGC GATTGGACTG CGACGTCGGC GCGCGCGAGG GACGACGCGG AAGCGACGAC
GACGGCGACG ACAGCGTCGC GCGCGGCGAC GTCCTGTACG CGGGACCGCT GAGCGTGGTG
TTCTACGGGC GCTATCGCGA CGAGGCGATC GCGATAAAAA GGCCAAAGCT GCGGACGACG
CGCGAGATCG ATCGGTACCA CGCGGAGTTG GGACTCATGC TCGAGGCGCG ACACGAAAAC
GTGCTGGGCG TGGTCGGCGC GCGCGCGGCG CCGCCGAATT ATGAGTTGTT CTTTCCGTTC
ATGGAGAACG GCGCGGTGGA TGAGGTGGTG TATGAGCAAG GATGGACGCC GACGTGGCAG
GCGGTGCTGA AACTGGCGCG AGAGGTGTGC GCGGGGTTGA CGTATTTACA CGACGTCGGC
GTCGTGCATC GAGACGTAAA GCCGAGCAAC GTGCTGTTGG ACGGATCGTG GACGGCGAAG
ATCGGGGACT TTGGCTTGGC CGAGCGCGAG AGCGAGCTGC GAGCGTCTTT GCAGGCGGCG
ATTTATTCCA CCGAAGACGC CGAGGGTAAG GCGAGGGTGG AGGGGAAATG GATCGCGGGC
GAGCACGGCG CGCCGAGCGG GGGGTTTCAG AAGCAGCACA TGGTGGGGTC GATGTTGTAC
ATGTCGCCGG AGGTGTTGAT GCGACAAGTG AGCGGGTACG GCGCGGATGT GTACGCGTAT
GCGGTGACGA TATGTGAGAT AGCCACAGGG ACGGTGCCGT TCAGCGATCG GGCGAGAAAC
GTCGCGCTCG CGCACACCGT GCTCGACGCG AGCTACAACG AACAGGATTT AGCCATCGCC
ATCGCGAGCG AGCACTTGAG ACCGATTTTA CCAGGAGAGA CCGCGGCGGG GGGAGCGGGC
AAGGTTCCCG ACGGTTTGAA CGACCTCATC ACACGTGCAT GGGCACCCGT AGAATCGTCG
CGACCGCGAA TGCCGGAGAT CACTCGCGAG CTCGAAGGCG TCGTCGCGGC GTATTGCGCC
GAAAACGGCT TGGACGACGT CGCCGCGGTG TGGTTGCCCC CAGCGAACGA TCGTGCCGAG
GCGGCGACGG CGGCGGCGAC GGCGCTCAAC GAGCCTTTAG ATTGGGAAAT GCAAGAGCCA
GCGTTTGCGG CACCGAATCC GGAGCACGCG ATCGCCGCCG CCGACTTTTC CGCCGGCGTC
TTTAGCACGC CCGGCGCGCG CGGCGCAGAC AAGATGGAAG ATCGTCACAT CGTCGTCAAC
AACCTCGGTG GTCGCGCGCA CGCCCATCTT GTCGCTGTGT TCGACGGACA TCGAGGGCAC
GAAGCCGCGG AGTTCGCCAT GGTGCACATC GAACGTGCGA TTCGAAGCGA GTGGGGCGCT
CACGGCGACG ACGTCGAGAG CGCGCTCTCC GCCGCGGTGA CGAAATTAGA CGCCGCGTTT
TGCGCGCGTT TCGAGGCGAT CAAGGCGAAA GAGATGAGCG CATCAAAAAG TGCGCAACAA
AGCAAACGTA ATCCAGGCTG CACCGCCATC GTGGGTTTGC TGTGGGGTGA TCGATTGTGC
GTCGCCAACG CGGGCGATTG CCGCGCGATT CTCTCGCGCG ACGGCGTGGC GCTCCCGTTG
AGCGTCGATC ACGACGCTGA GAGCAACGCG AGTGAGCGTC ATCGCATCGA GCGCGACTTT
CCAGGGGCAT TGCGTCAGCA CAACGGCGTT TGGCGCGTCG GAGACGCTGG TGTGGCCGTG
ACGCGAGCCA TCGGAGACGC CGACGCCAAG GCTTTCGGCG TCGTCGCCGA GCCGGAGATG
ACGACGGTGT CCGTCAATCT CGCCACCGAC GACTGCCTCG TTCTCGCGTG TGATGGCTTG
TGGGACGTCG TCGATAATCA CGATGCCTTG GCGATGATCA AAGACACGGT GAAAGAGCCA
TCCATGTGTG CCAAGAGACT CGGATGCGAA GCGTTGACGC GCCTGTCGGG CGACAACGTC
ACCGTCCTCG TCGGCTTTTT GCGAGGCAAT CGCACGTGCG AAAACGTCTC TTGGGCGCGC
GCGTTTTAG
 
Protein sequence
MATRDGARAN RCFRGCCVGT ARVRLDCDVG AREGRRGSDD DGDDSVARGD VLYAGPLSVV 
FYGRYRDEAI AIKRPKLRTT REIDRYHAEL GLMLEARHEN VLGVVGARAA PPNYELFFPF
MENGAVDEVV YEQGWTPTWQ AVLKLAREVC AGLTYLHDVG VVHRDVKPSN VLLDGSWTAK
IGDFGLAERE SELRAVEGKW IAGEHGAPSG GFQKQHMVGS MLYMSPEVLM RQVSGYGADV
YAYAVTICEI ATGTVPFSDR ARNVALAHTV LDASYNEQDL AIAIASEHLR PILPGETAAG
GAGKVPDGLN DLITRAWAPV ESSRPRMPEI TRELEGVVAA YCAENGLDDV AAVWLPPAND
RAEAATAAAT ALNEPLDWEM QEPAFAAPNP EHAIAAADFS AGVFSTPGAR GADKMEDRHI
VVNNLGGRAH AHLVAVFDGH RGHEAAEFAM VHIERAIRSE WGAHGDDVES ALSAAVTKLD
AAFCARFEAI KAKEMSASKS AQQSKRNPGC TAIVGLLWGD RLCVANAGDC RAILSRDGVA
LPLSVDHDAE SNASERHRIE RDFPGALRQH NGVWRVGDAG VAVTRAIGDA DAKAFGVVAE
PEMTTVSVNL ATDDCLVLAC DGLWDVVDNH DALAMIKDTV KEPSMCAKRL GCEALTRLSG
DNVTVLVGFL RGNRTCENVS WARAF