Gene OSTLU_4642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_4642 
Symbol 
ID5003657 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp265504 
End bp266544 
Gene Length1041 bp 
Protein Length347 aa 
Translation table 
GC content61% 
IMG OID640419078 
Productpredicted protein 
Protein accessionXP_001419540 
Protein GI145350279 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1087] UDP-glucose 4-epimerase 
TIGRFAM ID[TIGR01179] UDP-glucose-4-epimerase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.609905 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.210663 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GCGTTCGCGT TCGCGAGCGA TCGCAAGGCG CACGCGCTGG TGACGGGCGG CGCGGGATTC 
ATCGGGTCGC ACTGCGCCGA GGCGCTGCTG CGGCGCGGCT ACGCGGTGAC GACGGTGGAT
AACATGAGCC GAGGGAACGC GGGCGCGGTC GAGGCGCTGC GAAGGATGGC GCCGAAGGGA
AGCCTGCGAG CGGTGCGAGG GGATCTGGGC GTCGTCGAGG ACGTGGACGC GGCGTTCGGG
AACACGAACA TGCCGGTGGA CGCGGTGTTT CACTTCGCGG CCATCGCGTA CGTGGGGGAG
TCGATGGCGG ATCCGGTGAG GTATTACTCG AACATCACGA CGAACACGGT GAATTTATTG
CGAGTGATGC AGGCGAAAGA TGTGAGGAAG ATGATTTACA GCTCGACGTG CGCGACGTAC
GGGAACGTGG AGAAGTTGCC CATCACCGAG TCGACGCCGA CGAGGCCGAT TAATCCGTAC
GGCAAGTCCA AGTTGTACGC CGAAAACGCG ATCAAGGATT ACGCGCTGGC GAATCCAAAG
TTTAAGGCGT CGATTTTGCG GTATTTCAAC GTGTTCGGGG GCGATCCCGA GGGCGTGTTG
GGCGAGTTGC CGCGCGCGGA GTTGCGCGAG CACGGGAGAA TTTCCGGCGC GTGCTTCGAC
GCGGCGATGA AGAACATCGA CAAGCTCACG GTGATGGGGA CGAAGCACCC GACGCGGGAC
GGGACGACGA TACGAGACTT TGTGCACGTC GTAGATTTAG TGGACGCGCA CATAGCGGTG
GCGGAAAAGA ACAAATTTGA TAATCCTCCG TCGTTGTACA ACGTCGGCAC GGGGAGCGGC
GTGAGCATGC GAGAGTTCGT GGAGACGTGT AAAAAGGTGA CGGGCGTCGA CATAGAGATT
CACTATCGCG CTGAACCTCG GCCCGGAGAT TACGCCGAGG TGTACGCGAA CGTGGACAAG
ATCAAACACG AGCTCGGGTG GGAGGCAAAG TACACGGATT TGCACGAGAG CCTGACGCAC
GCGTGGAAGT TTAGAAAAAC G
 
Protein sequence
AFAFASDRKA HALVTGGAGF IGSHCAEALL RRGYAVTTVD NMSRGNAGAV EALRRMAPKG 
SLRAVRGDLG VVEDVDAAFG NTNMPVDAVF HFAAIAYVGE SMADPVRYYS NITTNTVNLL
RVMQAKDVRK MIYSSTCATY GNVEKLPITE STPTRPINPY GKSKLYAENA IKDYALANPK
FKASILRYFN VFGGDPEGVL GELPRAELRE HGRISGACFD AAMKNIDKLT VMGTKHPTRD
GTTIRDFVHV VDLVDAHIAV AEKNKFDNPP SLYNVGTGSG VSMREFVETC KKVTGVDIEI
HYRAEPRPGD YAEVYANVDK IKHELGWEAK YTDLHESLTH AWKFRKT