Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_2832 |
Symbol | |
ID | 5323702 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 2957165 |
End bp | 2958181 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640791777 |
Product | fumarylacetoacetate (FAA) hydrolase |
Protein accession | YP_001328497 |
Protein GI | 150398030 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.526344 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTCG CGACGCTGAA AGATTCCACC CGCGACGGCA AGCTCGTCGT GGTCTCCAAA GATCTGACCC GCTGCTCCGA GGTGGGCCAC ATCGCCCGCA CGCTGCAGGC GGCGCTCGAC GATTGGGTGC ATGCGGGTCC GCGGCTGGAG CGCGTCGCCG AGGGGATCGA GACCGGCTCA CAGCCGACGA TGCGGTTCCA TGAACACCAC GCCGCCTCGC CGCTGCCGCG TGCCTTTCAG TGGGCGGACG GATCAGCCTA CGTCAATCAC GTGGAACTTG TCAGAAAGGC GCGCAATGCC GAGATGCCGG CGAGCTTCTG GACCGACCCG CTGATCTATC AGGGCGGCTC GGACAGTTTC CTCGGACCAC GCGATCCCAT TCTCATTACC GACGAGGCCT GGGGCATCGA CATGGAAGGT GAGGTCGCAG TCATTGTCGA CGACGTGGCC ATGGGCGCGA CGCTCGATGA GGCGAAGGCC GCGATCCGGC TGATCATGCT CGTCAACGAC GTCTCCTTGC GGGGGCTCAT TCCCGGCGAG CTTGCCAAGG GATTCGGCTT CTATCAATCG AAGCCATCCT CGGCCTTCTC GCCGGTCGCG GTAACGCCGG AGGAGCTTGG TGAGGCCTGG GATGGCGGGA AGTTGCATCT CCCCCTTCAT GTCGATCTTA ATGGTGAGCC CTTCGGCCGG GCCGATGCCG GCGTCGACAT GACTTTCGAC TTTCCTTCGC TGATCGTGCA CGCCGCGCGA ACGAGGCCCC TTTCGGCCGG TACCATCATC GGCTCGGGCA CCGTCTCCAA CAAGCTCGAC GGCGGCCCGG GCAAGCCGGT CTCCAAAGGC GGGGCAGGCT ATTCGTGCAT TGCCGAGTTG CGCATGATCG AGACGATCGA GGGCGGTGCG CCGAAAACGC AGTTCCTGAA GGCCGGCGAC GTCGTCCGCA TCGAGATGAA GGATCGCGCC GGCCATTCGA TCTTCGGCGC CATCGAGCAG AAGGTCGGCA AATACGAACG CGGCTGA
|
Protein sequence | MKLATLKDST RDGKLVVVSK DLTRCSEVGH IARTLQAALD DWVHAGPRLE RVAEGIETGS QPTMRFHEHH AASPLPRAFQ WADGSAYVNH VELVRKARNA EMPASFWTDP LIYQGGSDSF LGPRDPILIT DEAWGIDMEG EVAVIVDDVA MGATLDEAKA AIRLIMLVND VSLRGLIPGE LAKGFGFYQS KPSSAFSPVA VTPEELGEAW DGGKLHLPLH VDLNGEPFGR ADAGVDMTFD FPSLIVHAAR TRPLSAGTII GSGTVSNKLD GGPGKPVSKG GAGYSCIAEL RMIETIEGGA PKTQFLKAGD VVRIEMKDRA GHSIFGAIEQ KVGKYERG
|
| |