Gene Smed_5081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5081 
Symbol 
ID5319383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp28956 
End bp29861 
Gene Length906 bp 
Protein Length301 aa 
Translation table11 
GC content63% 
IMG OID640776861 
Product5-dehydro-4-deoxyglucarate dehydratase 
Protein accessionYP_001313793 
Protein GI150377198 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase 
TIGRFAM ID[TIGR03249] 5-dehydro-4-deoxyglucarate dehydratase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.302501 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCGCCTG AAGAAATCAA GTCGCGTGTC GGTTCGGGGC TCTTGTCCTT TCCGGTCACG 
CACTTCACGT CGGATTACAA GCTCAATCTC GAAAGCTACC GGCGTCATGT GGAGTGGCTT
TCGGGCTTCG GGGCCGCGGC CCTGTTTGCC GCCGGCGGCA CCGGCGAGTT CTTTTCGCTC
TCGCCGGATG AAGTGGGTGA GGTCACCCGT GCGGCGAAGG ACGTATCGGG CGAGGTGCCG
ATCATTGCGG GTTGCGGCTA TGGCACGTCC CTTGCGGTCG AGACGGCGAA AATAGTCGAG
GCGGCGGGCG CCGACGGCAT TCTCCTGCTG CCGCACTATC TCACCGAAGC GCCGCAGGAA
GGCATCTACG CTCATGTGAA GGCCGTATGC GATTCAACAG GTCTCGGGGT CATTCTCTAC
AACCGCGCCA ATTCCATCGC GAATGCCGAC ACGGTTGCGC GCCTGGCTGA GGCCTGCCCC
AACCTGATCG GCTTCAAGGA CGGTACCGGC AAAGTCGACC TCGTGCGCCA CGTGACGGCC
AAGCTCGGCG ACCGGCTCTG CTACATAGGC GGAATGCCGA CCCACGAGCT CTTCGCAGAA
GGCTTCAACG GCGTCGGCGT TACCACCTAT TCGTCGGCGG TGTTCAATTT CGTGCCGGAG
CTGGCACAGC GCTTCTATCG GGCAATGCGG GCCGGCGACA AGGCGGTGAT GGAAGGGATC
CTTCAGACGT TCTTTTTCCC GTTTGCAGCC CTGCGCGACC GCAAGGCCGG TTATCCGGTC
TCCATCATCA AGGCGGGCGT GGAGCTTGCC GGCTTTGCGC CCGGCCCGGT GCGCCCGCCC
CTGGTCGATC TGACCGGCGA AGAGCGGGAG ATATTGCAGG GGCTGATAGA AGCGTCGCGC
AACTGA
 
Protein sequence
MSPEEIKSRV GSGLLSFPVT HFTSDYKLNL ESYRRHVEWL SGFGAAALFA AGGTGEFFSL 
SPDEVGEVTR AAKDVSGEVP IIAGCGYGTS LAVETAKIVE AAGADGILLL PHYLTEAPQE
GIYAHVKAVC DSTGLGVILY NRANSIANAD TVARLAEACP NLIGFKDGTG KVDLVRHVTA
KLGDRLCYIG GMPTHELFAE GFNGVGVTTY SSAVFNFVPE LAQRFYRAMR AGDKAVMEGI
LQTFFFPFAA LRDRKAGYPV SIIKAGVELA GFAPGPVRPP LVDLTGEERE ILQGLIEASR
N