Gene Smed_4998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4998 
Symbol 
ID5318719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1513014 
End bp1514006 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content54% 
IMG OID640776780 
ProductUDP-glucose 4-epimerase 
Protein accessionYP_001313712 
Protein GI150377116 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1087] UDP-glucose 4-epimerase 
TIGRFAM ID[TIGR01179] UDP-glucose-4-epimerase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.163773 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0224202 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATTT TGGTAACAGG TGGCGCCGGA TACATCGGGA GCCACATGGT TTGGTGCCTT 
CTCGACGCGC ACGAAGACGT TGTTGTCCTT GATCGTCTCT CCACTGGGTT TCGCTGGGCG
GTAGCGCCGG AAGCCAAATT TTATGAAGGC GATATCGGCG ATTCTGAGCT TCTGAACAGG
ATTTTTGCTA GCCATGATAT TGAAGCAATC ATCCACTTTG CCGGGTCAGT CGTGGTTCCC
GAGTCTGTCG CCGATCCTTT GACGTACTAT GATAATAACA CGGTCAAGTC ACGGGCGCTG
ATCGCGTCAG CAGTGAAAGC CAAAATCAAG TATTTTGTTT TTTCTTCGAC CGCAGCCGTC
TATGGTACTC CAGACGGAAA CGGCCCGGTC AATGAAGCCG CGCCTTTACG GCCGGAATCG
CCGTATGGTT CGTCCAAGCT GATGACCGAG ATAATGCTCA AGGACGCGGC GTTTGCTCAT
GACATCACAT ACACGGTGCT GCGCTATTTT AACGTCGCGG GCGCAGACGT TCATGGGCGC
ACAGGCCAAT CAACCGCAGG CGCTACGCAC CTAATCAAGG TCGCCTGCGA AGCTGCATTG
GGGAAACGCA ACGGAATTGA CGTTTACGGC GCCGATTATC CCACTCCTGA TGGCACTTGC
ATCCGTGACT TTATCCACGT CACCGACCTG GTAAACGCAC ATTTAAGGGC CCTGGAGCGG
ATGCGGGCAG GAGGCAGCTC CATTGTCGCG AACTGCGGAT ATGGCCGAGG CTTTTCAGTT
CTGGACGTCT TGCATCAGGT GAAGCAAGCA TCCGGCGTCG ACTTCCCCGT AAGAATTGTC
GAGAGGCGCC CGGGTGATGC TGTATCCGTT GTGGCAGATC CGATGAGGAT TACCCGAGAA
CTTGCCTGGG AGCCTTGCCA CGATGACCTT AACTTCATCG TACGAACCTC GCTGGATTGG
GAGTCTCGTT TAAGCCGGAG AAATACATAT TAA
 
Protein sequence
MAILVTGGAG YIGSHMVWCL LDAHEDVVVL DRLSTGFRWA VAPEAKFYEG DIGDSELLNR 
IFASHDIEAI IHFAGSVVVP ESVADPLTYY DNNTVKSRAL IASAVKAKIK YFVFSSTAAV
YGTPDGNGPV NEAAPLRPES PYGSSKLMTE IMLKDAAFAH DITYTVLRYF NVAGADVHGR
TGQSTAGATH LIKVACEAAL GKRNGIDVYG ADYPTPDGTC IRDFIHVTDL VNAHLRALER
MRAGGSSIVA NCGYGRGFSV LDVLHQVKQA SGVDFPVRIV ERRPGDAVSV VADPMRITRE
LAWEPCHDDL NFIVRTSLDW ESRLSRRNTY