Gene OSTLU_38674 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_38674 
Symbol 
ID5001799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp386973 
End bp388166 
Gene Length1194 bp 
Protein Length397 aa 
Translation table 
GC content53% 
IMG OID640417220 
Productpredicted protein 
Protein accessionXP_001418004 
Protein GI145347077 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.201986 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.16101 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATCG TAGAAGCGAT GCAGAAGTTT TGGAAGCGGC TTCGAGTGAT GGACCGAAAC 
GATATTTTGC GTTACAAACA TGTGTGGTAC GTGCCGACTC TAGAACATCT CTCAGACGAC
ACGCAAACGC TATCGCACGG TTCCGGTGCT AGCAGTGGTG ACGAGGACGA TGCCTTTCGC
ATCAGGAAGG GCGAGCACGT GGCGTTTCGA TTTGAAATGT TCAATTCACT CGGCGCGGGA
AATTTCGGTC AAGTTGTGCG TTGCTTTGAC CATAAATATA AGCGCGAAGT GGCACTTAAA
CTGATTTCTC CTGACGAAAC ATTTGCGAGT CAAGCTCGCG TGGAAGTTTC GGTGCTCAAA
CGTGTCGAAG GTGGTTCGAG TCGCGTAGTG AAGATGTTCG AGCATTTAAA GTTTCGCGGT
CGATTGTGCG TTGTGTTTGA GCTGCTTCAC ATCAACTTGT ACGAGTTCTT GGAAGCCCGC
GCGTTCGCGA AGTTAGATAT TCAACACGTC CGCCATATCG CTCGGCAAAT GGTGGATGCG
CTCGTGTACC TCAAACACAT GCAAGTGGTG CACTGTGACA TCAAACCCGA AAACATTTTG
CTAGAACATC CTGGCTCGTT CGATGTCAAA CTCATCGACT TCGGAAGCGC GTGTTTTCAG
GGAAAACAAG TGTATACGTA CATTCAGAGC CGATTCTACC GCGCTCCGGA AGTCATGCTC
GGAATCGATT ACGGCCATCC CATTGACGTA TGGAGCTTGG CGTGCGTTCT CGCCGAGCTT
GCCACTGGCA AAACTTTATT TGTGGGCGAC GATGAGGCAC GCCAGTTAAG CGCGATCACC
TCGCGGATTG GGCCTCCTCC ACGCCGCATT CTTAGCTCGG CAGCACACTC CGATCGCCGA
GTCGACTTTC ATGTGTGCGA GTCGTCGTTT GCGAGAAGTC GACGAGACGA CAAACGTAGC
GATCGGACAT CCTCAAAGCA CAAGCCGCGA AGCAAGCGCA CGAAAGTCAT CGATATCGAC
GACGATCGGT TCAACGCATT TCTACTACGA GCGCTGCACT GGAATCCGTC TCGACGGTTG
ACGCCCGACG CCGCTCGGCG GCATAGTTTC CTGCAAAAAC GTCAAGCCGT CGTTGGTGAG
GCCGCAGTCG ACGACGCGGT GCGCACCGGG CGCGAGCTTC AAACCACGAG ATGA
 
Protein sequence
MSIVEAMQKF WKRLRVMDRN DILRYKHVWY VPTLEHLSDD TQTLSHGSGA SSGDEDDAFR 
IRKGEHVAFR FEMFNSLGAG NFGQVVRCFD HKYKREVALK LISPDETFAS QARVEVSVLK
RVEGGSSRVV KMFEHLKFRG RLCVVFELLH INLYEFLEAR AFAKLDIQHV RHIARQMVDA
LVYLKHMQVV HCDIKPENIL LEHPGSFDVK LIDFGSACFQ GKQVYTYIQS RFYRAPEVML
GIDYGHPIDV WSLACVLAEL ATGKTLFVGD DEARQLSAIT SRIGPPPRRI LSSAAHSDRR
VDFHVCESSF ARSRRDDKRS DRTSSKHKPR SKRTKVIDID DDRFNAFLLR ALHWNPSRRL
TPDAARRHSF LQKRQAVVGE AAVDDAVRTG RELQTTR