Gene Smed_3888 details

Gene Information

Locus tag: Smed_3888
Symbol:
ID: 5318682
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 345706
End bp: 346827
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 63%
IMG OID: 640775700
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_001312633
Protein GI: 150376037
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1088] dTDP-D-glucose 4,6-dehydratase
TIGRFAM ID:
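
The reported gene and protein lengths follow from the start and end coordinates listed above. The sketch below shows the arithmetic, assuming the coordinates are 1-based and inclusive (an assumption that is consistent with the reported 1122 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(345706, 346827)
print(length)                  # 1122
print(protein_length(length))  # 373
```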


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00197458
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGCC GGACCGTCGA AAAAAGTCGG CATCTGGCGA CGCCTGCGCT CTCCGGAAAG 
GCGGGGCCGA TCCTTGTCGT TGGAGGCAGC GGCTTCGTCG GGAGCAATCT CGCCGACAGC
TTTCTCCGTG ACGGCGAGCA CGTGATCGTT CTCGACAATC TCAGCCGCCC GGGCGTCGAG
AGAAACCTCG AATGGCTGGT GGAGAGCCAC GGGCGCGCCG TCGAGGCCGT CACCGCCGAC
ATCCGCGATC TCGCCGCAAT CCAACCTGCC TTCAGGAATG CCAAGGCCGT ATTCCATTTT
GCGGCTCAGA CGGCGGTGAC CACAAGTCTC CAGCAGCCGA CCGAGGATTT CGAGACGAAT
GCGCGTGGCA CGCTCAACGT GCTCGAAGCC GCACGTCTGG CTGGGCGCAG CTCGCCCGTG
ATCTTCGCCA GCACCAACAA GGTCTATGGC GCGCTCGAGC ACATGGAAAT GAGAGACGTG
CAGGGCCGCT ACATGCCGGT CGACGAGGCG ACGCGCGCGC ATGGCGTCGG CGAGGCGCAA
CCGCTCGATT TCTGCACGCC CTATGGCTGT TCGAAGGGCG TTGCCGATCA GTATGTGCTG
GACTATGCCC GCTCCTTTGG CTTGCCGACC GCCGTTCTCA GAATGAGTTG CGTCTATGGC
CCGAGGCAGT TCGGCACAGA GGATCAGGGC TGGGTAGCGC ACTTCCTCAT ACGGGCGCTC
GCCGGCGAGC CGATCTCGAT CTATGGCGAC GGCAAGCAGG TCCGCGACAT CCTCCATGTC
ACCGATGCGG TCGCCGCTTA TAGAGCGGTT CTTAAAGCGA TCGACGGTCT GAAGGGTCGC
GCTTTCAATT TGGGAGGCGG ACCCGACAAT GCCGTCAGCA TTGTCGAGGT GCTGAACGAG
ATCGAGATCT TGACGGGCCG TCGGCTCTCT ACCGGGAAAA GCGACTGGCG CGCCGGCGAC
CAGCTGTATT TTGTGGCCGA TACGCGCGCG ATCGCCGATG CGGTCGGCTG GAGGGCGGGA
ATGGCTTGGC GCGAGGGGCT GCGCAATCTT TACGCCTGGC TTGTCGAAGA TCGGGTCGGC
GTCGGAAATA TCCGGCGTCA ACAGCGGAGG GTCCCTGCAT GA
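
A GC content figure such as the one reported in the gene information can be recomputed from a sequence laid out in 10-base blocks like the one above. The helper below is a minimal sketch: `gc_content` is a hypothetical name, and it assumes the input contains only whitespace and the letters A, C, G, T.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)


print(gc_content("ATGC"))  # 50.0
```

Pasting the full gene sequence into `gc_content` gives the percentage for the whole CDS, which can then be compared with the rounded value in the record.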
 
Protein sequence
MSGRTVEKSR HLATPALSGK AGPILVVGGS GFVGSNLADS FLRDGEHVIV LDNLSRPGVE 
RNLEWLVESH GRAVEAVTAD IRDLAAIQPA FRNAKAVFHF AAQTAVTTSL QQPTEDFETN
ARGTLNVLEA ARLAGRSSPV IFASTNKVYG ALEHMEMRDV QGRYMPVDEA TRAHGVGEAQ
PLDFCTPYGC SKGVADQYVL DYARSFGLPT AVLRMSCVYG PRQFGTEDQG WVAHFLIRAL
AGEPISIYGD GKQVRDILHV TDAVAAYRAV LKAIDGLKGR AFNLGGGPDN AVSIVEVLNE
IEILTGRRLS TGKSDWRAGD QLYFVADTRA IADAVGWRAG MAWREGLRNL YAWLVEDRVG
VGNIRRQQRR VPA
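
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard amino-acid string; translation table 11 uses the same codon-to-residue assignments as the standard code and differs only in the start codons it permits, so this table should suffice here. `translate` is an illustrative helper, not a library function, and it assumes the input is already in frame and contains only A, C, G, T.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence
print(translate("ATGAGCGGCC GGACCGTCGA AAAAAGTCGG"))  # MSGRTVEKSR
```

The output matches the first ten residues of the protein sequence.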