Gene OSTLU_38341 details

Gene Information

Locus tag: OSTLU_38341
Symbol:
ID: 5003942
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009364
Strand:
Start bp: 570652
End bp: 571980
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table:
GC content: 64%
IMG OID: 640419363
Product: predicted protein
Protein accession: XP_001420039
Protein GI: 145351340
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1070] Sugar (pentulose and hexulose) kinases
TIGRFAM ID:
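
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon; neither is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 570652
END_BP = 571980


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming an unspliced CDS with one trailing stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1329 442
```

Under those assumptions the result agrees with the Gene Length (1329 bp) and Protein Length (442 aa) fields.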


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.300966
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.435151
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGCGCG CGCGCGCGAG CGCGAGCGAT GGGTGGTTGT ACTGCGGCTT GGACTTTGGG 
ACGTCGAGCG CGCGCCTGGC GCTCGTGGAC GATCGAGGCG CGCTCGTCGG CGTGCGAACG
AGGGCGTACG ACGACGCGAA CGCGAGCGTC GCGCGGGCGT GGGAGCGCGC GCTGTTCGAG
CTGCTCGAGG ACGCGATGGA CGCGGAGGAG CGCGAGCGGT GCCAAGGCGT GGCGGTGGAC
GGCACGAGCG GGACGGTGGT CATCGTCGAC GCGCGCGACG GGCGAGCGCT GCGAGAACCG
TACATGTATA ACGAAACGTT TCCAGACGAA GTAGAACGCG TGCGAGCGCT GAGGAACGGG
CCGGGGAAGG ATTCGACGGA GAGCGCGTCG AGCGCGGCGT GTAAACTTTC GAGATGGTTT
CGCGTGGACG CGGAGGGAGA CAGAGAGCAC GCGGCGCTGT TGCATCACGC GGATTGGTTG
GCGTATCTGC TGCACAAGAA GATGGGCATG AGTGATTTCA ATAACGCGTT GAAGCTTGGG
TTCGATCCAG CGCCGGGGGT GGAGGCGTTT CCGGGATGGT TGCGAGACGC GCCGTTTGGG
TACATGTTGC CGACGGACGT TCGCGCGCCG GGAACGTCGT TCGGCGTCAT GGACGCCGAT
GTGGCGAAGC GGTTAGGATT TCCGTCGACG TGTGAAGTCA TCGCAGGGAC GACGGATAGC
GTCGCCGCGT TCGTGGCGTC GAAGGCGGCC GAATCGGGAG AGTGCGCGAC GAGTCTCGGG
AGCACGCTCG CGTTGAAACT CATCTCCGAC ACGCGCGTCG ACGACCTGAG CTCGGGCGTG
TACTCGCACC GTCTCAACGG TCGGTGGCTC GTGGGCGGGG CGTCGAATCT GGGAGGATGG
ATTTTACGCA GATTCTTTTC CAACGACGCC CTCGAGTCGC TGAGCGAGAA AATAGCAAAC
GAAGGTTACG TCGCGACGGA GGATTATTTC GACGGGGTGA TGCTAGGTTT CGGTCTGAGC
GTCGACGAGG CGTCGGCGAT CGTGGAAAAG TCACGACCGG CGGACGACGC GCAATTCGTG
GTGAACATTC TCAGTTCCAT CGCCAACGTC GAGGCGAGAT GCTACGAGCG CATGCGAACG
CTCGGGGCGT CGCACGGCGC GCGCAAAGTG TACACCGCGG GAGGTGGGGC GAAGAACGGC
GTGTGGAGTG GCATGCGCTC GAAAGCCATG GGAGATATCC CTGTCGAACG ATCGGCGTGC
GACGAAGCCG CGTACGGCGC GGCGCTCCTC GCTCGACAGG GAAGGAAACG GTTATCCGGC
TACATTTAA
 
Protein sequence
MRRARASASD GWLYCGLDFG TSSARLALVD DRGALVGVRT RAYDDANASV ARAWERALFE 
LLEDAMDAEE RERCQGVAVD GTSGTVVIVD ARDGRALREP YMYNETFPDE VERVRALRNG
PGKDSTESAS SAACKLSRWF RVDAEGDREH AALLHHADWL AYLLHKKMGM SDFNNALKLG
FDPAPGVEAF PGWLRDAPFG YMLPTDVRAP GTSFGVMDAD VAKRLGFPST CEVIAGTTDS
VAAFVASKAA ESGECATSLG STLALKLISD TRVDDLSSGV YSHRLNGRWL VGGASNLGGW
ILRRFFSNDA LESLSEKIAN EGYVATEDYF DGVMLGFGLS VDEASAIVEK SRPADDAQFV
VNILSSIANV EARCYERMRT LGASHGARKV YTAGGGAKNG VWSGMRSKAM GDIPVERSAC
DEAAYGAALL ARQGRKRLSG YI
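
As a rough consistency check between the gene and protein sequences, the sketch below translates space-grouped DNA text and computes its GC fraction. The Translation table field is empty in this record, so the standard genetic code is an assumption here; the helper names are illustrative only.

```python
from itertools import product

# Standard genetic code (assumed), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base groups and the final line of the gene sequence above.
print(translate("ATGCGGCGCG CGCGCGCGAG CGCGAGCGAT"))  # prints: MRRARASASD
print(translate("TACATTTAA"))                          # prints: YI
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.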