Gene Smed_3598 details

Gene Information

Locus tag: Smed_3598
Symbol:
ID: 5318432
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 25022
End bp: 26170
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 62%
IMG OID: 640775412
Product: galactonate dehydratase
Protein accession: YP_001312345
Protein GI: 150375749
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
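
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates, an unspliced CDS, and a gene length that includes the stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(25022, 26170)
    print(length, protein_length_aa(length))
```

With the Start bp and End bp values from this record, the two results match the listed Gene Length (1149 bp) and Protein Length (382 aa).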


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.729698
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATCA CCAAGCTGAC GACCTATATC GTTCCGCCGC GCTGGCTCTT TCTCAAGATC 
GAAACCGACG AGGGCGTCGT CGGGTGGGGA GAGCCCGTGG TAGAGGGGCG CGCCCTGACG
GTCGAGGCCG CCGTTCACGA GCTGTCTGAC TATCTCGTAG GCAGGGACCC CTTCCTCATC
GAGGATCATT GGAACGTCCT TTATCGCGGC GGATTCTACC GCGGCGGTGC CATACATATG
AGTGCGCTGG CCGGCATTGA CCAGGCGCTA TGGGACATCA AGGGCAAGGC CCTTGGGCAG
CCGGTCCATT CCTTGCTCGG CGGGCAATGC CGCGACAAGA TCAAAGTCTA TTCCTGGATC
GGCGGCGACC GACCGAGCGA CGTCGCGAAC AATGCGCGCG ACGTTGTTGC CCGCGGTTTC
AAGGCGATAA AGCTCAACGG CTGCGAGGAG ATGCAGATCG TCGACACCAA CGAGAAGATC
GATAGGGCGG TCGAGACGAT CGGAACGATC CGCGACGCGA TCGGCCCCAA TATCGGCATC
GGCGTCGACT TCCACGGTCG TGTGCATAGG CCCATGGCGA AGGTTCTCGC CAAGGAACTG
GAACAGTTCA AGCTGATGTT CATCGAGGAG CCGGTCCTGT CGGAGAACCG CGAGGCCCTG
AGGGAGATCG CCAACCATTG CTCGACCCCG ATCGCGCTCG GCGAAAGGCT CTATTCGCGC
TGGGACTTCA AATCTGTTCT CTCCGACGGC TTCGTCGACA TCATTCAGCC GGATCTTTCG
CACGCCGGCG GGATCACCGA ATGCCGCAAG ATCGCCGCGA TGGCCGAGGC CTACGATGTG
GCGCTGGCGC CCCACTGCCC GCTCGGGCCG ATTGCACTTG CCGCCTGCCT GCAGGTCGAC
GCGGTCAGCT ATAACGCCTT CATCCAGGAG CAGAGCCTCG GCATCCACTA CAACGAGGCG
AACGACATCC TCGATTACAT CTCCAACAAG GACGTCTTCG CCTATGAGGA CGGCTTCGTT
TCCATTCCTC AGGGACCCGG TCTCGGCATC GAGGTGGACG AGGCCTATGT GATGGAACGC
GCGAAGGAGG GGCATCGCTG GCGCAACCCG GTCTGGCGCC ATTCGGATGG CAGCGTGGCC
GAATGGTGA
 
Protein sequence
MKITKLTTYI VPPRWLFLKI ETDEGVVGWG EPVVEGRALT VEAAVHELSD YLVGRDPFLI 
EDHWNVLYRG GFYRGGAIHM SALAGIDQAL WDIKGKALGQ PVHSLLGGQC RDKIKVYSWI
GGDRPSDVAN NARDVVARGF KAIKLNGCEE MQIVDTNEKI DRAVETIGTI RDAIGPNIGI
GVDFHGRVHR PMAKVLAKEL EQFKLMFIEE PVLSENREAL REIANHCSTP IALGERLYSR
WDFKSVLSDG FVDIIQPDLS HAGGITECRK IAAMAEAYDV ALAPHCPLGP IALAACLQVD
AVSYNAFIQE QSLGIHYNEA NDILDYISNK DVFAYEDGFV SIPQGPGLGI EVDEAYVMER
AKEGHRWRNP VWRHSDGSVA EW
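
The sequences above are listed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not part of the original record: the helper names are hypothetical, and the CDS check is deliberately simplified (it only accepts ATG as a start codon, although translation table 11 also permits alternative starts).

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked listing into one string by removing spaces and line breaks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, standard stop codon at the end."""
    stop_codons = {"TAA", "TAG", "TGA"}
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stop_codons


if __name__ == "__main__":
    # Toy input in the same blocked layout; paste the gene sequence here instead.
    toy = clean_sequence("ATGAAG ATC\nTGA")
    print(toy, looks_like_cds(toy), gc_percent(toy))
```

Run on the gene sequence from this record, the `gc_percent` result can be compared with the listed GC content of 62%, and the cleaned length with the listed Gene Length.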