Gene P9303_25441 details


Gene Information

Locus tag: P9303_25441
Symbol: galE
ID: 4776228
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2236629
End bp: 2237675
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 56%
IMG OID: 640088065
Product: UDP-glucose-4-epimerase
Protein accession: YP_001018540
Protein GI: 124024233
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1087] UDP-glucose 4-epimerase
TIGRFAM ID: [TIGR01179] UDP-glucose-4-epimerase
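
The Start bp, End bp, Gene Length, and Protein Length fields are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch that assumes 1-based inclusive coordinates and a CDS ending in exactly one stop codon; the function names are hypothetical and do not belong to any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length(2236629, 2237675)
print(length)                  # 1047
print(protein_length(length))  # 348
```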


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.8826
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGAGT TGTTGATCAC GGGTGGTGCC GGGTTCATTG GCAGTCACAC CTGCCTTGTG 
CTTCTGGAAG CCGGCCACAG GCTGGTGATA CTCGACAACT TTTCCAATAG CTCAGCTATT
GCCTCAAAAA GGGTAGCCGA ACTTGCTGGA GTGGCCGCTC AAGAGCGAAT GCTCGTATTG
GAGGGCGATA TCCGCAGTAG CAACGATCTA GATCGCGCTT TCAACTCAAT GGAAAACGGC
ATCGCGGCTG TGGTGCACTT CGCCGGGCTC AAGGCAGTTC ATGAATCAGT TCAGTTACCA
CTGAAATACT GGGATGTGAA TGTGGCTGGC AGCCGCTGCC TACTGGAAGC CATGCAACGG
CATAACTGCC GCACGATTGT TTTCAGCAGC AGTGCAACGC TTTATGGCTA TCCCGAGCAA
ATCCCAATCC CTGAAACGAC TAGGGTGCAA CCGATCAATC CATATGGTCA AAGCAAGGCA
GCGGTGGAGC AGTTGCTAGA TGACCTGGCC TGCAGCGAAC CTGGTTGGCG CATTGCCCGA
TTGCGCTATT TCAATCCAGT TGGAGCCCAC CCCAGCGGCT GCATCGGCGA AGATCCCAAG
GGAACACCCA ACAACCTTTT CCCTTTTGTG AGCCAAGTAG CAGTAGGCCG GCGAGCGGAA
CTCCAAGTAT TTGGAGCCGA TTGGCCTACA CCCGATGGGA GTGCTGTGCG CGACTACATC
CATGTGATGG ACTTAGCTGA GGGACACCGA GCTGCACTGG AGGTCCTGCA ACGAGAGGAA
CCGCAACTCC TCACTCTTAA TCTCGGCAGT GGCAAAGGCC ACTCGGTGTT GGAAGTTGTG
CAGGCCTTTG AAAAGGCAAG CGGCCAACCC GTTCCATACA GCATTAACCA GCGCCGCGCT
GGAGATGCCG CTTGTAGCGT CGCAGATCCA AGCCTGGCCG CCGAGCGGTT GGGATGGTCC
ACGCAGCGCA GCCTGTCAGA CATGTGCCGC GACAGTTGGA ATTGGCAGAA GGCCAATCCA
CAGGGCTATA GCCAGAAACA ACAATGA
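
The gene sequence is listed in blocks of 10 bases, so the whitespace has to be stripped before any computation. As a minimal sketch with hypothetical helper names, GC content could be computed as follows. The GC content field in the record refers to the full 1047 bp sequence, whereas this example uses only the final line of the listing.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Final line of the gene sequence listing (illustrative input only).
last_line = clean_sequence("CAGGGCTATA GCCAGAAACA ACAATGA")
print(len(last_line), round(gc_content(last_line), 1))
```

Pasting the whole listing into `clean_sequence` gives the input needed to compare against the reported value.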
 
Protein sequence
MAELLITGGA GFIGSHTCLV LLEAGHRLVI LDNFSNSSAI ASKRVAELAG VAAQERMLVL 
EGDIRSSNDL DRAFNSMENG IAAVVHFAGL KAVHESVQLP LKYWDVNVAG SRCLLEAMQR
HNCRTIVFSS SATLYGYPEQ IPIPETTRVQ PINPYGQSKA AVEQLLDDLA CSEPGWRIAR
LRYFNPVGAH PSGCIGEDPK GTPNNLFPFV SQVAVGRRAE LQVFGADWPT PDGSAVRDYI
HVMDLAEGHR AALEVLQREE PQLLTLNLGS GKGHSVLEVV QAFEKASGQP VPYSINQRRA
GDAACSVADP SLAAERLGWS TQRSLSDMCR DSWNWQKANP QGYSQKQQ
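
The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below is a plain-Python illustration with hypothetical names, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; special handling of non-ATG start codons is omitted because this gene starts with ATG, and ambiguous bases such as N are not handled.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks and final line of the gene sequence listing.
print(translate("ATGGCCGAGT TGTTGATCAC"))          # MAELLI
print(translate("CAGGGCTATA GCCAGAAACA ACAATGA"))  # QGYSQKQQ
```

The two outputs match the first six and last eight residues of the protein sequence above.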