Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4970 |
Symbol | |
ID | 5318033 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 1484747 |
End bp | 1485592 |
Gene Length | 846 bp |
Protein Length | 281 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640776752 |
Product | 5-oxopent-3-ene-1,2,5-tricarboxylate decarboxylase |
Protein accession | YP_001313684 |
Protein GI | 150377088 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0811524 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTTC TTCGTTATGG CGAACCGGGC CAGGAAAAGC CGGGTCTTCT CGGTTCCGAC GGCATCATCC GCGATCTTTC CGGCCAGGTG CCCGATCTCG CCGCCGGCGC CCTCGATCCG AGCAAGCTCA CCGAACTGGC GAAGCTCGAT GTCGAGGCGC TGCCTGCCGT CAACGGTAAT CCGAGACTCG GACCCTGCGT CGCAGGCACC GGCAAGTTCA TCTGCATCGG TCTCAACTAT TCCGACCACG CCGCCGAAAC CGGCGCGACC GTTCCGCCGG AACCGATTAT TTTCATGAAG GCCACCTCGG CTATCGTCGG GCCGAACGAC GACCTGATCC TGCCGCGCGG TTCGGAAAAG ACCGACTGGG AAGTGGAACT CGGCATCGTC ATCGGCAGGA CGGCGAAATA TGTGAGCGAA GCAGAGGCAC TCGATTATGT CGCCGGCTAT TGCACCGTTC ACGACGTTTC GGAACGCGCA TTCCAGACCG AGCGTCATGG TCAATGGACG AAGGGCAAGT CCTGCGACAC CTTTGGGCCC ACCGGACCGT GGCTCGTAAC CAAGGACGAG GTCGAGGACC CGCAAAACCT CGCAATGTGG CTGAAGGTCA ATGGCGAGAC GATGCAGGAC GGCTCGACGA AGACGATGGT CTACGGCGTC GCCTATCTCG TCTCCTACCT CTCCCAGTTC ATGTCGCTGC ACCCTGGTGA CGTCATTTCC ACCGGCACGC CGCCGGGCGT CGGCATGGGC ATGAAGCCAC CGCGCTATCT GAAGGCAGGC GACGTGGTAG AGCTCGGCAT CGAGGGCCTC GGCAGCCAGA AACAACGCGT ACGCGCGGAC GACTGA
|
Protein sequence | MKFLRYGEPG QEKPGLLGSD GIIRDLSGQV PDLAAGALDP SKLTELAKLD VEALPAVNGN PRLGPCVAGT GKFICIGLNY SDHAAETGAT VPPEPIIFMK ATSAIVGPND DLILPRGSEK TDWEVELGIV IGRTAKYVSE AEALDYVAGY CTVHDVSERA FQTERHGQWT KGKSCDTFGP TGPWLVTKDE VEDPQNLAMW LKVNGETMQD GSTKTMVYGV AYLVSYLSQF MSLHPGDVIS TGTPPGVGMG MKPPRYLKAG DVVELGIEGL GSQKQRVRAD D
|
| |