Gene OSTLU_27007 details


Gene Information

Locus tag: OSTLU_27007
Symbol:
ID: 5005068
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009367
Strand:
Start bp: 51023
End bp: 53244
Gene Length: 2222 bp
Protein Length: 524 aa
Translation table:
GC content: 61%
IMG OID: 640420489
Product: predicted protein
Protein accession: XP_001420891
Protein GI: 145353158
COG category: [R] General function prediction only
COG ID: [COG2334] Putative homoserine kinase type II (protein kinase fold)
TIGRFAM ID:
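
The Gene Length field is consistent with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. That convention is an assumption here; the record does not state it. A minimal Python sketch of the arithmetic, with hypothetical function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def cds_length_nt(protein_length_aa: int) -> int:
    """Nucleotides needed to encode a protein, counting one stop codon."""
    return 3 * (protein_length_aa + 1)


print(gene_length(51023, 53244))  # 2222
print(cds_length_nt(524))         # 1575
```

With these values, the 524 aa protein accounts for 1575 nt including the stop codon, so the 2222 bp record is longer than the open reading frame alone.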


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0000261622
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.260585
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CGCGATCGAT CGAGACGGTC GAGTCGTTCG CGCGCGCGAC GGACGCGAAC GCGACGGACG 
CGACGGGAAT CGAACGAATC GGCGCGAACG AAGGAAAGGA AAGGAAAGGA AGGATCGAAT
CGAATCGAAT CGAATCGAAT CGAATCGAAT CGCGACGGAA ACGCGACGAA AGGGCGACGA
AAGGTCGACC GCGGCGATGG TGCGAGCGAC GGGGCGCGCG CGCGCGGAAC GCGGGACGCG
GGAGGCGCGC GCGCGACGGC GAGAGTCGAG CGAGGGTGGT TCGCGGGACG AGGACGCGGC
GACGACGACG GCGACGACGA GCGGACGCGA GACGGGCGCG CGCGAGACGG AAACGGCGGC
GCGAACGCGC GAGGGGAATG ACGAAGAGGG ACGGGCGGGC GTCGAGGGCG AGGAGACGGC
GGGACGCGAT GGGGAGGAGG AGGGACGCGG CGGCGACGGA GACGCGAGGG AGGGAGACGC
GCTCACGGGG GAGGAGTTGG ACGTGTTGAA GGCGAACACG ACGGGCACGA AGCACGTTCG
AGGGATGGGG ATGAATGGAC ACGTCATGCT CGAGGCGACG ACGTCGAGCG GTGACGGGAG
GGGCGGAGAT CAGCCGCCGG CAAAGGCGCG GAAAATCGTC GCCGCGCCGA AAGGGTTGGG
ACCGCACGCG ATGAAGAATG TGCCTTCGAG CGTGTTGGAC GAGTCGGAGA GGAAATTGAG
ACGACCGAAG GTGCAAACGC ACGAGGTGTA TGGATTACTG CGGGCGCATT ACGATTTAGG
CGACATCGAC CGCGAATCTT TGAAGGAGCT TCCGTCGTAC GACGATAAGA ATTGGTACTT
TTGCGCGTAC ACGGACGATC CCAAGACGGG CGAGAAGGTC AAGCACGAGT ACGTCGCCAA
GGTACACAAC GGTATGGACT CCTCTGGAGT CAGTCGCGGC GTACTCGCGG CGCAAGAGCG
TATCATAGGG TACCTCGCCG CGCACGGCGT CGAGGTTCCA AACGTCGTCA AGTCGAAACT
CGCGATGTAC ACCACGCCTG GTCCGAACGG GACGGTGAAT CTCGTCTCCG AGGGCACGCC
CGGCGCGAAG CGCGAGTTTT GCACGCGCGT GACGTTTCTC GCCAGCAACC AAGTCGCGCA
CACCATGCGC GTGTTGACGT ATGTGCGAGG CAAAACCATC GTGCAGGTGC CGATGCCGCA
CTCGATCGAC TTTGTGCGAA GGAGCGGCTT ATTCGTCGGA CAAGTGTGCC ACGCGTTGTC
CAAGTGGCCT ACGCCGAAAG AGATGGAAAT CCAGCAGGAT CGGCGCGAAC TGGGAGACGC
CGTTACGCTG CCTTACGTAG AGCAGCACGC GATGTGCTGG AAGGTGCTCA CAAGTCGCTC
GCGCTTGTGG GATCTTCGCT TCTTCATGGA CGTGCAGCAC TTCATGACGG ATCTCATCAT
GCGCGAACTC TTCGAAGACG AAACGCGCAT CATGATGTGC AACACCGTCT TCAACGCCTT
CAAACACTTG GTGCTTCCGG TGGCGGACAA ACTTCGCATC GGCATCTTGC ACAACGACTT
GAACGAGCAA AACATCATCG CGTTGGAATC AGGGCCAGAT CCCGTCAAGT TTCCAGCGAA
TGTAAAGTTT GCTGCGATTG ATTTTGGAGA CGTCGTCGTG TCGTGGCGCG TAAACGAAAT
TGCCGTCGCG TGCGCGTACT GCGCTCTGGA CAAGGAAGAC CCCGTGCACG ACATGTCGAT
GATGCTGGCG GGGATTCAAT CCGTCTATCC GCTCACGCCG CTCGAGATGC GCGTCTTGCC
GTGTCTCATC GCCGCTCGTT TGGTCACGTC GCTCATCATG GGCATGTACT CGTACCATAT
GCAAATTGTG GGTGAACAAA ACGATCACGC GTCTGACGCC GCCGCCGGGC AAGCGAACTC
GAACGCGGAC CCGAGCGGCG AACCAGCCAG CGCGGATACG CCGAGGGGCA ACGTGTACGT
ACTCACGACT CAAAAATCTG CTTGGACTGC GCTGACTCGC ATCTTAACCG TCGGCGCGGA
GACGATGTTC AAGAGATTCA TCATCGATGC CTACTCTGAA GGGTCGAGCG TCGTCGTCCC
GCAAGCGGCT GGGTATCCCG CCTAGACGGT TCGCGCGGAC GCCGCGATCG CGACGTATGA
GGACGACGAG TCGATGATGC GCGCGAAAAA CTAATAAATG ATAACATTTA ATGTGCAACA
GT
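
A GC content figure like the one in the Gene Information fields can be recomputed from a sequence printed in 10-base blocks. The sketch below is one plausible way to do it, not the database's own method. The function name is hypothetical, and it assumes the value is simply the fraction of G and C bases over all bases.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two 10-base blocks of the gene sequence above, as a small usage example.
print(gc_content("CGCGATCGAT CGAGACGGTC"))
```

Passing the full gene sequence, with its spaces and line breaks, gives a fraction that can be compared with the reported percentage after rounding.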
 
Protein sequence
MNGHVMLEAT TSSGDGRGGD QPPAKARKIV AAPKGLGPHA MKNVPSSVLD ESERKLRRPK 
VQTHEVYGLL RAHYDLGDID RESLKELPSY DDKNWYFCAY TDDPKTGEKV KHEYVAKVHN
GMDSSGVSRG VLAAQERIIG YLAAHGVEVP NVVKSKLAMY TTPGPNGTVN LVSEGTPGAK
REFCTRVTFL ASNQVAHTMR VLTYVRGKTI VQVPMPHSID FVRRSGLFVG QVCHALSKWP
TPKEMEIQQD RRELGDAVTL PYVEQHAMCW KVLTSRSRLW DLRFFMDVQH FMTDLIMREL
FEDETRIMMC NTVFNAFKHL VLPVADKLRI GILHNDLNEQ NIIALESGPD PVKFPANVKF
AAIDFGDVVV SWRVNEIAVA CAYCALDKED PVHDMSMMLA GIQSVYPLTP LEMRVLPCLI
AARLVTSLIM GMYSYHMQIV GEQNDHASDA AAGQANSNAD PSGEPASADT PRGNVYVLTT
QKSAWTALTR ILTVGAETMF KRFIIDAYSE GSSVVVPQAA GYPA
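
The protein sequence can be checked against the gene sequence by translating from the ATG that opens the reading frame. The record leaves both Translation table and Strand blank, so this sketch makes two assumptions: the standard genetic code (NCBI translation table 1) applies, and the gene sequence is shown in coding orientation. The names are hypothetical.

```python
BASES = "TCAG"
# Standard genetic code (assumed), in the order TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Thirty bases copied from the gene sequence, starting at the initiating ATG.
print(translate("ATGAATGGAC ACGTCATGCT CGAGGCGACG"))  # MNGHVMLEAT
```

The output matches the first ten residues of the protein sequence above. Translating the gene sequence from that ATG onward and comparing the result with the full protein sequence would extend the same check.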