Gene OSTLU_41900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_41900 
Symbol 
ID5004967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009367 
Strand
Start bp299866 
End bp300933 
Gene Length1068 bp 
Protein Length356 aa 
Translation table 
GC content69% 
IMG OID640420388 
Productpredicted protein 
Protein accessionXP_001420964 
Protein GI145353316 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0524] Sugar kinases, ribokinase family 
TIGRFAM ID[TIGR02152] ribokinase 


Plasmid Coverage information

Num covering plasmid clones66 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0459054 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCGCG CGCGTCGCGC GCGGCGCCTC GCGCTCGGCG CCGCGCTGAC CGCGCTCGCG 
CGCGTCGCGC TCGCGCGCGC CGACCGCGCG TCGATCGCCG TCGTGGGGAG CGTGAACGCC
GACGTCATCG CGCGCGTCGG AACATCGGTC CCGCGCCGCG GAGAAACGCG CGCGGCGACG
GCGGCGCGCG TCGAGCGCGC GTGCGGAGGG AAGGGAGCGA ATCAAGCGGT CGCGGCGTCG
CGATTGACGT GCGAGCGCGG CGGGAAGGCG GCGTTCGTGG GACGATTCGG GGAGGACGAG
GCGGGGACGA CGCTGCGACG CGCGCTGCGG CGAGAGACGG ATGTGTCCGG GAGCGTCACG
ATGGCGCGAG GGAGCGGAAT CGAGACGGGA ATGGGGTTCG TGATGCTGAC GGACGATGGA
TCGCCGAGCG CGGTGGTCGT GGGGGGGGCG AACGCGCTCG GATGGAACGC GGACGACGAG
GCGCTGGCGA GCGAGTTTCG CGAGGCGCTG CGAGGCGCGA AGACGGTGAT GCTGCAGCGG
GAGGTGCCGG AACGCGTGAA CGTGATAGCG GCGACGGTTG CGAGGGAAAT CGGTGTGCGA
ACGATCGTGC TGGACGCGGG AGGCTCGTTT CACGCGGCGG ATGAGGCGTT GTTGGCGCTC
GTGGATTACG TCGCGCCGAA CGAGAGCGAA CTCGCGGGAA TGGCGGGAAT GCGCGTGGAG
GAGGTTTCTT CTGGGGACGA AGCGGTCGTT AAGGCGGCGA GAATCGTCGC AGGAACGTCG
CGCGTCGCCG TTCTGTGCAC GCTGGGGAGT CGAGGATCGT TGCTCGTGAG AGGCGAAGAC
GTCACGAGAG TCGAATCGAT GGAACTTCCG GCCGGCGCCA AAGAAGTGGA CGCCACCGCG
GCTGGTGATG CGTTTCGCGC CGCGTTCGCG GTGGCGATGG CTGAAAATAA AAACGAGCGC
GACGCCATGA AGTTCGCCAG CGCCGCGGGG GCTTTGGCGG TGACGAAGTT GGGTGCGATG
CCAAGCCTTC CATTTCGGAG CGACGTGGAT GAATTTCTTG GCGTTTCA
 
Protein sequence
MRRARRARRL ALGAALTALA RVALARADRA SIAVVGSVNA DVIARVGTSV PRRGETRAAT 
AARVERACGG KGANQAVAAS RLTCERGGKA AFVGRFGEDE AGTTLRRALR RETDVSGSVT
MARGSGIETG MGFVMLTDDG SPSAVVVGGA NALGWNADDE ALASEFREAL RGAKTVMLQR
EVPERVNVIA ATVAREIGVR TIVLDAGGSF HAADEALLAL VDYVAPNESE LAGMAGMRVE
EVSSGDEAVV KAARIVAGTS RVAVLCTLGS RGSLLVRGED VTRVESMELP AGAKEVDATA
AGDAFRAAFA VAMAENKNER DAMKFASAAG ALAVTKLGAM PSLPFRSDVD EFLGVS