Gene Smed_3887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3887 
Symbol 
ID5318681 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp344603 
End bp345709 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content63% 
IMG OID640775699 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001312632 
Protein GI150376036 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00157391 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAAG TACTTGTTAC CGGCGGCTGC GGTTTCATTG GCCGGCATGT GGTCGAAGAG 
CTTCTCTCTG GGAACTACGA GTTGCGCATC CTCGATGCGC TGAACGAACA GGTCCATGCG
GACGCCCAGA TCACCCTGCC ACCGGAAGTA GATATGCGGC GCGCCGATAT ATGCGACGCT
GACGCGGTGA AGAGCGCACT GAAGGACGTC GATCACGTCA TTCATCTCGC TGCGGAGGTC
GGCGTCGGCC AATCCATGTA CGAGATCTCC CGCTATGTGG GCGTGAACGA CCTGGGCACC
GCCGTCCTGC TCGAGGCGAT GATCGGCATG CCGATCCGGC GGATCGTCGT CGCCTCTTCC
ATGAGCGTTT ACGGCGAGGG CTTGTACGAG ACTGCGGCCG GGGAACGCCG CGCGCATATC
CGGCGGTCTC CCGCCGAGAT CAAGACGGGT GCGTGGAACC CGTCCGGCCC CGATGGCGAG
AGCCTGAAGC CGATCGCGAC GGACGAGAAC AAGCCGGTCG ATCTTGCCTC CATCTACGCA
CTCACGAAAT ATGCGCAGGA GCGTCAGGTC CTTATATTCG GTGAAGCCTA TGGCATGGAC
GCGGTGGCGC TTCGGCTTTT CAACGTCTAC GGCGCCGGTC AGGCACTCTC CAATCCCTAT
ACCGGAGTGC TTGCCAACTT CGCCTCTCGA CTTGCCAACG GGCAGGCGCC GATGGTCTTT
GAGGACGGCC GTCAGAAGCG CGACTTCGTC CATGTTCGCG ACGTCGCGCG CGCTTTCCGG
CTTGCGCTCG AACAGCCGCA CGCCGCGGGC CACGTCATCA ATATCGGCAG CGGGCACGCT
TACGCGATCG CTGACATCGC CTCGCTGCTT GCCGATGCGA TGGGCGTGCC GGAGATCGGG
CCGGAGATCA TGCACAAGGC TCGTTCAGGA GATATCCGCA ATTGCTTCGC GGATATTTCC
AAGGCGCGCG ACCTCCTCGG CTTCGAGCCG GCGCACCGGC TTGAAGATTC GCTCGCGGAT
TTCGCCCAAT GGGTCCGCAG TGCCGGCGCG ATCGACCGCG GCGCCGAGAT GAAGCGCCAC
TTGGAAGCAC GGGGGCTGGT TCTATGA
 
Protein sequence
MTKVLVTGGC GFIGRHVVEE LLSGNYELRI LDALNEQVHA DAQITLPPEV DMRRADICDA 
DAVKSALKDV DHVIHLAAEV GVGQSMYEIS RYVGVNDLGT AVLLEAMIGM PIRRIVVASS
MSVYGEGLYE TAAGERRAHI RRSPAEIKTG AWNPSGPDGE SLKPIATDEN KPVDLASIYA
LTKYAQERQV LIFGEAYGMD AVALRLFNVY GAGQALSNPY TGVLANFASR LANGQAPMVF
EDGRQKRDFV HVRDVARAFR LALEQPHAAG HVINIGSGHA YAIADIASLL ADAMGVPEIG
PEIMHKARSG DIRNCFADIS KARDLLGFEP AHRLEDSLAD FAQWVRSAGA IDRGAEMKRH
LEARGLVL