Gene OSTLU_26023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_26023 
Symbol 
ID5004138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp106982 
End bp108022 
Gene Length1041 bp 
Protein Length346 aa 
Translation table 
GC content66% 
IMG OID640419559 
Productpredicted protein 
Protein accessionXP_001420078 
Protein GI145351421 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0524] Sugar kinases, ribokinase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGAC TCTCCGTCTT CGGTTCCGTG AACGTCGACC ACGCGTTCGC GGTCGACGCG 
TTTCCGCGCC CGGGCGAGAC CGTGGGGGGC GATAGCGCGT ACGCCAGGCT CGCGGGCGGG
AAAGGGGCGA ATCAGGCGTG CGCGGCGGCG CGCGCGGGCG TCGAAGTCGA AGTGCACTTT
TGGGGACGCG TCGGAAGAGA CGACGATTTT AGCGTCGCGC AGCTCGAGCG CGCGGGCGTG
ATCGCGCGGG AAGTCACGCG CGACGATGCG TTGCCCACTG GGTCCGCGTG CGTGCTGTGC
GAGCGCGCGT CGGGGGAAAA CATGATCGTC GTGTGCGCCG GCGCGAACGA CGCGGCGCGC
GCGCGCTTTA GAGAGGGCGA CGAGAGCGCG GTTTTGGCGC TTCAGGGAGA AGTTCCGGTG
AGCGCAAACT TAGACGCGGT GCGGATGATG CGAACGGTGA ACCCTCGGTG CGTCGTGGTG
TTGAATTACG CGCCAGCGAC GGAGATATCG TTGGAATTGA TAGAGGAGGT GGATTGCGTC
GTCGTCAACG AAAGCGAAGC TCGAGCAATC TCAGAGGCGT ACGAGATTCT CGACGAGCCG
ACGCGGGCGA GCGATCGACC GCAGCGGAAT CGAGCGATTG TGTACACCAT GGGCGCGGCT
GGTGCGCGAT TGGTGATGAC GCGAGATCTG ACGTTTCGCG AGGCGTTTGT TCGCGACGGC
GATGGAAATT CCCAAGACGA AGCGCGCGTC GAAGCGATGC GTTTCGGCGA CGACGAAACC
GTCATCGATA CCGTCGGTGC GGGAGATGCG TTTGTCGGTG CCTTCTGCGC CGCGATGGCG
AGCGACGCGA CGTGTGCGGA CGCGTTGCGT TTAGCGTCCG CCGCCGGCGG TTTGACGTGC
CTCTCGAGCG GCGCTCGCGC GGACGTGTCG CGCGCGCGCG AGCGAGCGCT TCCCATCCCG
TGCGTCCTTC GAGGTGCGCT CAAGAATAAA AACGTCGAGC GCGCGCCCCT TCGCGCGCTG
CTTCTCGACG ATGCGCTGTA A
 
Protein sequence
MPRLSVFGSV NVDHAFAVDA FPRPGETVGG DSAYARLAGG KGANQACAAA RAGVEVEVHF 
WGRVGRDDDF SVAQLERAGV IAREVTRDDA LPTGSACVLC ERASGENMIV VCAGANDAAR
ARFREGDESA VLALQGEVPV SANLDAVRMM RTVNPRCVVV LNYAPATEIS LELIEEVDCV
VVNESEARAI SEAYEILDEP TRASDRPQRN RAIVYTMGAA GARLVMTRDL TFREAFVRDG
DGNSQDEARV EAMRFGDDET VIDTVGAGDA FVGAFCAAMA SDATCADALR LASAAGGLTC
LSSGARADVS RARERALPIP CVLRGALKNK NVERAPLRAL LLDDAL