Gene OSTLU_25086 details


Gene Information

Locus tag: OSTLU_25086
Symbol:
ID: 5003861
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009363
Strand:
Start bp: 614902
End bp: 616276
Gene Length: 1375 bp
Protein Length: 438 aa
Translation table:
GC content: 63%
IMG OID: 640419282
Product: predicted protein
Protein accession: XP_001419856
Protein GI: 145350953
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0524] Sugar kinases, ribokinase family
TIGRFAM ID:
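
As a quick consistency check on the coordinates above, the following minimal Python sketch recomputes the gene length from the start and end positions. It assumes the coordinates are 1-based and inclusive, which the record does not state. It also compares the span with the number of bases needed to encode a 438 aa protein plus one stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 614902
end_bp = 616276
protein_length_aa = 438

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Bases needed to encode the protein plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)              # 1375
print(coding_bp)                   # 1317
print(gene_length_bp - coding_bp)  # 58
```

Under that assumption the result agrees with the listed gene length of 1375 bp, and the span is 58 bp longer than the 1317 bp needed for the coding region and its stop codon.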


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.300966
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.110064
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCTCGA CGCGAGTCGC CGCGCACGCG CACGTCGAGC GGCGACGCGC GTGTGGAGAG 
TGCGCGCACA GTGGTCGTGC GTCGTCCCGC GCGCGCGCGG GCGCGCGCGA GCGAGAGGTG
CGACGAAGGG TGGCGCGGCG CGCGAGTCGA GAGTACGACG TCGTCGCGCT CGGTAACCTG
TGCGTGGACG TGTTACTGCC GCCCGGCCCG ATCCCAGACG CGACGTCGCT GAAGACGACT
AAAACACTCG GTGAACTCGC GAGGACGGCG CCGGCGCGAG AGTCGTGGGA ACTGGGCGGG
AATTGTAATT TTTTAATCGC GGCGTCGAGG CTGGGCTTGC GAGCGTCGTG CGCGGGACAC
GTCGGAAACG ATGAATACGG CAAGTTTTTG ATCGATGAGC TCGCGCTGGA GGGAATTGAT
CACGTGGAAT TGATTCCAGG AGACGATCAG GGCGTGCGCG TGAGCGCTTT GGCCGAGACG
TTGATTTGTT TCGTGTTGAG CGACGGCGCC GGTTCGCACG CGTTTTGTAG TAGGTACGAT
TTGGGCCCGT GGCCGCTGAT GCGGGACGTG AGCGACGTCT CTAACGAAGC GCGCGAGGCG
TTGCGTTCGT GTCGAGCGGT GTTTCTCAAC GGTTTTGTGT TCGACGAGCT CAAGCCTCAG
GCTGTAGCGC AGGCGCTCAA ATTGGCCAAG GGGAACGGCG CGGGGGTGTT TTTCGATCCG
GGGCCTCGCG CGTTTACGTT TGTCGACGAG ACGAATCCGT CACGCATGGA GGCATTGAGA
GTGGCGCTGG AAAATTCCGA CGTCGTGCTC GCGACCGAGG AAGAACTCGC AGCGCTCACG
GGCGTGCGTG CGAATGCGCC GCCCACGGAC TACGCCGCGG CTGTGTTCGA CTTTCCGGGA
TCCGCGGCGG AGTGGGTCGT CGTGAAGCTC GGTCCCGAAG GCGCGATGGT CGTCACGCGC
GACGGTCAAA GCGCGCGCGT CGGTTGTCCA CGCGTGAAAG TCGGCGACAC CGTGGGGTGC
GGTGATAGCA GCGCGGGCGC GTACGTCTTA GGATACCTGC GAAAGCAAGC CGACGACGCG
TTGGATTTGA GCGAAGTCTT GCAAACCACC GCGACGCTCG CGACGCACGT GGGAAGCGCC
ACGGCGATGA ACATCGGCGC CGGTCGAAAC GTCGCGAAAG CAGAGACCGT GCTCGAGCTC
TTAGACGCGG CGGTGGACGG TAAGACCGAG GGGGTCGATC GAGGCACGGC GTCGCGCGCG
CAAGCGATTC TTCGCGAGTC GATGAACGAG TCGATGAAAC AACAACAAGC GCGATAATTA
TTGCATCCCA ATCACTAGCG GGTTTCATAA ATACGACTAG ACATATCGTT ACGCG
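
The record lists a GC content of 63% but does not say how the figure was derived. A minimal Python sketch of one common way to compute such a percentage is shown here; the function name `gc_content` is hypothetical, and the sample input is only the first two 10-base blocks of the gene sequence above, not the full sequence.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in seq, ignoring whitespace."""
    # The record prints the sequence in space-separated blocks, so strip them.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence, as printed in the record.
sample = "ATGATCTCGA CGCGAGTCGC"
print(gc_content(sample))  # 60.0
```

Passing the complete gene sequence to the same function gives a value that can be compared with the listed 63%.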
 
Protein sequence
MISTRVAAHA HVERRRACGE CAHSGRASSR ARAGAREREV RRRVARRASR EYDVVALGNL 
CVDVLLPPGP IPDATSLKTT KTLGELARTA PARESWELGG NCNFLIAASR LGLRASCAGH
VGNDEYGKFL IDELALEGID HVELIPGDDQ GVRVSALAET LICFVLSDGA GSHAFCSRYD
LGPWPLMRDV SDVSNEAREA LRSCRAVFLN GFVFDELKPQ AVAQALKLAK GNGAGVFFDP
GPRAFTFVDE TNPSRMEALR VALENSDVVL ATEEELAALT GVRANAPPTD YAAAVFDFPG
SAAEWVVVKL GPEGAMVVTR DGQSARVGCP RVKVGDTVGC GDSSAGAYVL GYLRKQADDA
LDLSEVLQTT ATLATHVGSA TAMNIGAGRN VAKAETVLEL LDAAVDGKTE GVDRGTASRA
QAILRESMNE SMKQQQAR
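
The relationship between the gene sequence and the protein sequence can be illustrated with a short Python sketch. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field in the record is empty, and it reads codons from the first base of the sequence as printed. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Assumption: standard genetic code (NCBI table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate DNA from its first base, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence, as printed in the record.
print(translate("ATGATCTCGA CGCGAGTCGC CGCGCACGCG"))  # MISTRVAAHA
```

The output for this fragment matches the first ten residues of the protein sequence listed above.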