Gene OSTLU_50659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_50659 
Symbol 
ID5004191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp580139 
End bp581271 
Gene Length1133 bp 
Protein Length347 aa 
Translation table 
GC content62% 
IMG OID640419612 
Productpredicted protein 
Protein accessionXP_001420220 
Protein GI145351730 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0524] Sugar kinases, ribokinase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTCGCGCGAT CGCGTCGCGA GTCGACCCGA CCCGCGCGCA CACCGGGACG CGTCGCGACG 
ATGGCGCCGC ACGCGAATTT GCGCCTGCTC GGCATGGGGA ACCCGCTGCT GGACATCTCC
GTCGCGTGCG AGGACGACGC GCTGCTGAAA AAGTACGACT TGAAGCTGAA TGACGCGATC
TTGGCGGAGG CGAAGCACGC GCCGCTGTAC GAGGAGATGG CGACGCACGG AGACGTGGAG
TACATCGCGG GAGGCGCGAC GCAAAATACC ATCCGCGTCG CGCAGTGGAT GATGCAGCGA
GAGGGCGCGA CGGCGTACAT GGGGTGCGTG GGAGAGGATA AGTTTGCGAC GCAGATGCGG
GCGTCGTGCG AGAACGACGG GGTGCTCGCG AATTACATGG TGGACGCGTC CACGCCGACG
GGGACGTGCG CGGTGATCGT GAAGGATGGC GAGCGATCGC TGTGCGCGGC GCTGAACGCG
GCGAATAATT ACAAGGCGGA ACACTTGGAC GCGAGCGAAA ATTTCGCCCT CGTGGAACGC
GCCGATTTTT ATTACATGGC TGGTTTCTTC ATGACGGTGA GCCCGGAGAG CATCATGCGC
GTCGCCAAGC ACGCGTGCGA GAATAAGAAG ACGTTCATGA TGAACCTCAG CGCGCCGTTC
TTGATGCAAG TGCCGCCGTT CCTGGCGACG CTCATGGAGG CGCTCCCGTA CGTGAACATC
TTGTTCGGTA ACGAATCCGA AGCCGTCACG TTTGCCGAAT CTCAATCCTG GGACACCAAG
GACATCAAGG AAATCGCTCT CAAGATTTCC GCCATGCCCG TGGCGGAAGG CAAGCCGTCT
CGCACGGTTG TCATCACGCA AGGTTGCGAC CCGACCGTCG TCGCGCGCGA CGGCGCCGTC
GAAGAGTACG CCGTCATCCC GCTCGCCAAG GAAGACTTGG TGGATACCAA CGGCGCGGGT
GATGCTTTTG TCGGTGGCTA CATCTCGCAA CTCGTGCAAG GCGCGGACGT CGCCAAGTGC
TGCGCCGCGG GTAACTACGC CGCGAACAAG ATCATCCAAG AGTCTGGCTG CAAGTGCCCC
GGAGTGCCGT CTTTCACCGC GTAATCCGCC TCGACGAGAT TTGATTAGAC AGT
 
Protein sequence
MAPHANLRLL GMGNPLLDIS VACEDDALLK KYDLKLNDAI LAEAKHAPLY EEMATHGDVE 
YIAGGATQNT IRVAQWMMQR EGATAYMGCV GEDKFATQMR ASCENDGVLA NYMVDASTPT
GTCAVIVKDG ERSLCAALNA ANNYKAEHLD ASENFALVER ADFYYMAGFF MTVSPESIMR
VAKHACENKK TFMMNLSAPF LMQVPPFLAT LMEALPYVNI LFGNESEAVT FAESQSWDTK
DIKEIALKIS AMPVAEGKPS RTVVITQGCD PTVVARDGAV EEYAVIPLAK EDLVDTNGAG
DAFVGGYISQ LVQGADVAKC CAAGNYAANK IIQESGCKCP GVPSFTA