Gene Smed_2832 details

Gene Information

Locus tag: Smed_2832
Symbol:
ID: 5323702
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 2957165
End bp: 2958181
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 64%
IMG OID: 640791777
Product: fumarylacetoacetate (FAA) hydrolase
Protein accession: YP_001328497
Protein GI: 150398030
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway)
TIGRFAM ID:
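
The lengths listed above can be recomputed from the start and end coordinates as a quick consistency check. This is a minimal sketch, not part of any database API: the function names are illustrative, and it assumes 1-based inclusive coordinates and a single terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


length = gene_length(2957165, 2958181)
print(length, protein_length(length))  # 1017 338
```

Both values agree with the Gene Length and Protein Length fields of the record.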


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.526344
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTCG CGACGCTGAA AGATTCCACC CGCGACGGCA AGCTCGTCGT GGTCTCCAAA 
GATCTGACCC GCTGCTCCGA GGTGGGCCAC ATCGCCCGCA CGCTGCAGGC GGCGCTCGAC
GATTGGGTGC ATGCGGGTCC GCGGCTGGAG CGCGTCGCCG AGGGGATCGA GACCGGCTCA
CAGCCGACGA TGCGGTTCCA TGAACACCAC GCCGCCTCGC CGCTGCCGCG TGCCTTTCAG
TGGGCGGACG GATCAGCCTA CGTCAATCAC GTGGAACTTG TCAGAAAGGC GCGCAATGCC
GAGATGCCGG CGAGCTTCTG GACCGACCCG CTGATCTATC AGGGCGGCTC GGACAGTTTC
CTCGGACCAC GCGATCCCAT TCTCATTACC GACGAGGCCT GGGGCATCGA CATGGAAGGT
GAGGTCGCAG TCATTGTCGA CGACGTGGCC ATGGGCGCGA CGCTCGATGA GGCGAAGGCC
GCGATCCGGC TGATCATGCT CGTCAACGAC GTCTCCTTGC GGGGGCTCAT TCCCGGCGAG
CTTGCCAAGG GATTCGGCTT CTATCAATCG AAGCCATCCT CGGCCTTCTC GCCGGTCGCG
GTAACGCCGG AGGAGCTTGG TGAGGCCTGG GATGGCGGGA AGTTGCATCT CCCCCTTCAT
GTCGATCTTA ATGGTGAGCC CTTCGGCCGG GCCGATGCCG GCGTCGACAT GACTTTCGAC
TTTCCTTCGC TGATCGTGCA CGCCGCGCGA ACGAGGCCCC TTTCGGCCGG TACCATCATC
GGCTCGGGCA CCGTCTCCAA CAAGCTCGAC GGCGGCCCGG GCAAGCCGGT CTCCAAAGGC
GGGGCAGGCT ATTCGTGCAT TGCCGAGTTG CGCATGATCG AGACGATCGA GGGCGGTGCG
CCGAAAACGC AGTTCCTGAA GGCCGGCGAC GTCGTCCGCA TCGAGATGAA GGATCGCGCC
GGCCATTCGA TCTTCGGCGC CATCGAGCAG AAGGTCGGCA AATACGAACG CGGCTGA
 
Protein sequence
MKLATLKDST RDGKLVVVSK DLTRCSEVGH IARTLQAALD DWVHAGPRLE RVAEGIETGS 
QPTMRFHEHH AASPLPRAFQ WADGSAYVNH VELVRKARNA EMPASFWTDP LIYQGGSDSF
LGPRDPILIT DEAWGIDMEG EVAVIVDDVA MGATLDEAKA AIRLIMLVND VSLRGLIPGE
LAKGFGFYQS KPSSAFSPVA VTPEELGEAW DGGKLHLPLH VDLNGEPFGR ADAGVDMTFD
FPSLIVHAAR TRPLSAGTII GSGTVSNKLD GGPGKPVSKG GAGYSCIAEL RMIETIEGGA
PKTQFLKAGD VVRIEMKDRA GHSIFGAIEQ KVGKYERG
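
The gene and protein sequences above are printed in blocks of ten characters. The following minimal sketch shows one way they could be cleaned up and cross-checked: it strips the spacing, computes a GC fraction, and translates a coding sequence. All function names are illustrative. It assumes that translation table 11 uses the standard codon-to-amino-acid assignments (it differs from table 1 only in the permitted start codons), and it does not handle alternative start codons.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first, second, third base each cycle through TCAG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    cds = clean(cds)
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record above.
first_line = "ATGAAACTCG CGACGCTGAA AGATTCCACC CGCGACGGCA AGCTCGTCGT GGTCTCCAAA"
print(translate(first_line))  # MKLATLKDSTRDGKLVVVSK
```

Applied to the full gene sequence, `translate` would be expected to reproduce the protein sequence listed above, and `gc_fraction` to land close to the reported GC content.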