Gene OSTLU_42574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_42574 
Symbol 
ID5003330 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp10498 
End bp11802 
Gene Length1305 bp 
Protein Length421 aa 
Translation table 
GC content58% 
IMG OID640418751 
Productpredicted protein 
Protein accessionXP_001419042 
Protein GI145349233 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0510] Predicted choline kinase involved in LPS biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.202848 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCACCG ACCGAAGACA CTCGAGCGCG AGCGAAGAAA ACTCGGGGGC GAGCACGACA 
GACTTTCAAG ACGAAGACGC GCCGACGTCG AGAAGCGCGA CGCCCGCGCC CGGACGATTG
AACGTCGAGC TCGACGTCGA TCGCGAACAA CGCGATGAGT ATCAAGGCGT GAAATCCATC
GTGCGAAACA CCGTGCGAGG GTGGGCAAAC GTCGAAAACG CGGCGCTCGA GGTGAGCCCG
GTGCGAGGCG GGATCACGAA CGCGCTGTTC AAGGTTCGTC TGGCGCAAGA CGCGGCGCCG
ACGACGACGA AGGATCCGAT CGCACGCGCG GTGGTGGTGA GAGTTTTTGG CAAGGGTACC
GATCAATTCA TCACTCATCG CAAAGTACAA GGCGAGACGT CGCACGTTTT GAACGAACAC
GGGTTCGGGG CAAAAGTGCT CGGCGTTTTT TCAAATGGGT TGGTTGAAGA GTTCATCGAA
GCCGAGAGTG TGGCTCCGGA GGAGTTGGCG AACGGAGGGA TTTTGCTTCG ACGAGTCGCG
GCGCAGATGC GACGCTTGCA CAAGGAAGTG GCGCCGGATT TAGTGCCTCG CGCCGCGGCT
GGCGAGACCA TCGCGCGCGC CCGAGCCAAC GCTATCTGGG ACACGCTTCA GTTGTGGTTC
GACTTGGCGT ACGGTGTTGC CAATGATCCG ACCATTTTCA AGAATGACGC GCGCAAAGAG
TCGATTTTGG CATCGTTGAA GATCGATTCG GAATCGCGTC AAATGCTGTT CGAAGTCATT
CGCGCGAGGT GCGAAGCCGT GAACAGTCAG ACAGTGTACT GTCACAACGA CATTCACGCC
GGTAACTTTT TGCTGAACAG AAAGACGGAC AACCTGACGC TCATCGATTA CGAGTACGCC
GACTACGGTC CCCGTGCGTT TGACATGGCC AATCTGTTTT GCGAATTCGC CGGGTTCGAG
TGCAACTACG ATCAGTTTCC GACGTGCGAA CTTCGCCGCG AGTTTTACTC GGCGTACTTG
CACACCACGG TCGATGCGGA GATTGACGCG CTCGAAGCGG AAGTCGCGGC GTGGACGCCC
GTGACGCACG CATTCTGGGC GCTCTGGGCG GTGATTCAAG CCAAGTATAG CGCCATCGAT
TTTGACTTTT TGGGTTTCGC CGCGATGCGC ATGAAGGTGT TTTACGCCTC TGCTCTCGCG
CCGAGTGAGT GGGTGCCGAC GAACGCCGCG CTCGGTGGAC AGCACGGCAC GCCGGAGAAG
AGTGTGGGTT GGAATGCCAC GGCTGAGGGA AACGTCGTGC TTTGA
 
Protein sequence
MVTDRRHSSA SEENSGASTT DFQDEDAPTS RSATPAPGRL NVELDVDREQ RDEYQGVKSI 
VRNTVRGWAN VENAALEVSP VRGGITNALF KVRLAQDAAP TTTKDPIARA VVVRVFGKGT
DQFITHRKVQ GETSHVLNEH GFGAKVLGVF SNGLVEEFIE AESVAPEELA NGGILLRRVA
AQMRRLHKET IARARANAIW DTLQLWFDLA YGVANDPTIF KNDARKESIL ASLKIDSESR
QMLFEVIRAR CEAVNSQTVY CHNDIHAGNF LLNRKTDNLT LIDYEYADYG PRAFDMANLF
CEFAGFECNY DQFPTCELRR EFYSAYLHTT VDAEIDALEA EVAAWTPVTH AFWALWAVIQ
AKYSAIDFDF LGFAAMRMKV FYASALAPSE WVPTNAALGG QHGTPEKSVG WNATAEGNVV
L