Gene Smed_3587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3587 
Symbol 
ID5318578 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp15166 
End bp16146 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content63% 
IMG OID640775402 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001312335 
Protein GI150375739 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTGC TTGTGACGGG TTCTGCCGGC AGGGTCGGCT CTTTTGTTGT CAGGCAATTG 
CTTGAAGCGG GACACCAGGT GAGGGGCTTC GATCTGCGCT CGGCATTGAT CGATGATGCC
GGCTTTGACG AAGTCGTTGG CGCATTCGAC GACCGCGAGG CGGCGGCGCG CGCCTGTGAG
GGTATCGACG CCGTACTCCA CCTCGGAGCC TTCATGTCCT GGCTGGCGAG CGACCGCGAC
CGGTTGTTCC AAGCCAATGT CGAAGGGACG CGCATCGTGG CCGAGGCGGC CGTCGCGGCA
AAGGTCGGCC GCTTCGTCTT TGCAAGTTCC GGGGAGGTCT ATCCGGAGAA CAGGCCGGAA
TTCCAGCCGA TTACGGAGGA TCATCCGAAG GAGCCACTGT CTCCTTACGG TCTCACCAAG
CTTCTCGGCG AGGAACTGGT GACGTTCCAG GGACGCGTAT CGTCGATGGA GACGGTGATC
CTGCGCTTCT CGCATACGCA GAACGCCGAT GAGCTGCTCG ATCCCGACAG CTTCTTTTCC
GGACCGCGCT TCTTCCTTCG CCCGAAAATC AAGCAGCAGG AGGCTTTCGG CAACAGGACT
GCGGCCGATC TTCTGAGGGC GGCGGATCCG GGACGGCCGG CACTCGTCCT GACACGCAAC
GAGAACGGCC GGCCGTTCAA GATGCACATC ACCGACACCC GCGACATGGC CCGGGGCGTT
CTCCTCGCGC TGTCCCATCC GAAGGCGGCG GGTGGCGTCT TCAATCTGGG AGCGACCGAT
CCTGTTGACT TCGCCGAGGT CCTGCCGGTA ATGGCCGACA GGATCGGGTG GCCGCTCGTG
ACGGTCGATC TTCCCGGCTC CGGCGTGTGG TATCACACTT CCAACCAGCG CATTCGTGAA
GCGCTCGGCT TCGAGCCGAA GTGGCCGATC ATGCGAATGC TGGACGAGGC GGTAGCGGCA
TGGACGACAA GGCAGTCTTA G
 
Protein sequence
MKVLVTGSAG RVGSFVVRQL LEAGHQVRGF DLRSALIDDA GFDEVVGAFD DREAAARACE 
GIDAVLHLGA FMSWLASDRD RLFQANVEGT RIVAEAAVAA KVGRFVFASS GEVYPENRPE
FQPITEDHPK EPLSPYGLTK LLGEELVTFQ GRVSSMETVI LRFSHTQNAD ELLDPDSFFS
GPRFFLRPKI KQQEAFGNRT AADLLRAADP GRPALVLTRN ENGRPFKMHI TDTRDMARGV
LLALSHPKAA GGVFNLGATD PVDFAEVLPV MADRIGWPLV TVDLPGSGVW YHTSNQRIRE
ALGFEPKWPI MRMLDEAVAA WTTRQS