Gene OSTLU_42345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_42345 
Symbol 
ID5003274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp257895 
End bp258953 
Gene Length1059 bp 
Protein Length352 aa 
Translation table 
GC content64% 
IMG OID640418695 
Productpredicted protein 
Protein accessionXP_001419325 
Protein GI145349820 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1087] UDP-glucose 4-epimerase 
TIGRFAM ID[TIGR01179] UDP-glucose-4-epimerase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.252628 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGACG TCCGCGTCCT CGTCACCGGC GGCGCCGGGT ACATCGGCAC GCACGCGTGC 
GTGCAGCTCC TGCTCGCCGG CGCGTCCGTC GTGGCGATCG ATAACTTTGA CAATTCGTGC
GCCGAGGCGG TCGAGCGCGT GCGCGCGATC GTCGGCGAGC GGCGCGCCGC GCGGTTGACG
TTTCGCGAGT GCGATTGCCG CGACGCCGAG GCGCTGGAGG ACGTGTTCGC GACGTGCGGG
ACGGTGCGCG CGGTGATCCA CTTCGCCGGG CTCAAGGCGG TGGGGGAGAG CGTGGCGAAG
CCGCTGCTGT ACTATGAGAA TAACATTCGG AGCACGCTGA CGCTGTGCGA GACGATGGCG
AGGCACGGAT GCAAGACGCT GTGCTTTAGC TCGAGCGCGA CGGTGTACGG GGAACCGGCG
TCGGTGCCGT GCACGGAGGA TTTCCCGACG GCGGCGCTGA ATCCGTACGG ACGGACGAAA
TTGTTCATCG AGCACATTCT GAGCGATTTG CAAAAGAGCG ACGGCGAGTG GCGAGTGGCG
CTGTTGAGGT ACTTTAATCC GGTCGGCGCG CACGAGAGCG GAACGCTGGG GGAGGATCCG
AAGGGGATTC CGAATAATTT GATGCCGTTC GTGCAGCAGG TGGCGGTGGG GCGAAGAGCG
GAGTTGAGCG TGTTCGGAAA CGACTATCCG ACGAAGGACG GCACGGGACG ACGGGATTAC
ATTCACGTCG TCGATTTGGC GGATGGGCAC GTCGCGGCGG TGAAAAAGCT CACCACCGAT
CCTAACGCGG GGTTGATCAC CGTGAATCTC GGGACGGGGA CGAGCACGAG CGTGTTGGAG
CTCGTCGCCG CGTTTGAAAA GGCGTCTGGG AAAAAGATTC CGTGCAAGAT GGTCGCGCGT
CGCGAGGGCG ACGCCGCGGA GGTGTACGGC GCCACGCAAA AGGCGTTTGA AGTTCTCGGC
TGGCGCGCCG AGCGCACTAT CGAAGACTGC TGCAAAGATC AGTGGAAGTG GGCGAGCGCG
AATCCATACG GGTACCTGGG CAAGCCCGAC GACGAGTGA
 
Protein sequence
MDDVRVLVTG GAGYIGTHAC VQLLLAGASV VAIDNFDNSC AEAVERVRAI VGERRAARLT 
FRECDCRDAE ALEDVFATCG TVRAVIHFAG LKAVGESVAK PLLYYENNIR STLTLCETMA
RHGCKTLCFS SSATVYGEPA SVPCTEDFPT AALNPYGRTK LFIEHILSDL QKSDGEWRVA
LLRYFNPVGA HESGTLGEDP KGIPNNLMPF VQQVAVGRRA ELSVFGNDYP TKDGTGRRDY
IHVVDLADGH VAAVKKLTTD PNAGLITVNL GTGTSTSVLE LVAAFEKASG KKIPCKMVAR
REGDAAEVYG ATQKAFEVLG WRAERTIEDC CKDQWKWASA NPYGYLGKPD DE