Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4659 |
Symbol | |
ID | 5319334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 1169940 |
End bp | 1171187 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640776457 |
Product | imidazolonepropionase |
Protein accession | YP_001313389 |
Protein GI | 150376793 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.563946 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0326494 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAGAA ACGACAACCC AGGCCCTGTC AGAACCAGCC TCTGGCGTAA CGCACGCCTT GCGACGCTTC GCGAAGATCT GGGCACCCTC GGAGTCGTCG AGAGCGGGGT CGTCGCCGCT CGCGGCGAGC GGATCGTTTA TGCCGGCCCC GAGGCCGGTC TCACCTCGGA GCTTGCGCGC GCAGACCAGA TCTTCGATTG CGAAGGCCGC TGGGTAACCC CCGCGTTGAT CGACTGCCAC ACACATATCG TTCATGGCGG CAACCGGGCC CGGGAATTCC AGCTCCGCCT CGAAGGCGCG ACCTATGAGG CGATCGCCCG CGCCGGAGGC GGCATCGCTT CGACCGTCGA GGCGACCAAC GCCCTTTCCG TCGACGAACT TGTGGCGGCC GCACTGCCAC GGCTCGACGC CCTGCTCGCG GAGGGCGTTT CGACCGTCGA GGTAAAGTCG GGCTATGGTC TCAACGTCGA GACCGAGCTC AAGATGCTGC GCGCCGCCCG CCGGTTGGAG ACCCTGCGCC CGGTGCGCAT CGTCACCAGC TATCTCGCAG CCCATGCGAC CCCGCCGGGA TATCAGGGCC GAAACGGCGA CTACATCGCC GAGGTCGTCC TGCCGGGCCT CGCTGCAGCG CATACGGAAG GGCTTGTGGA TGCCGTCGAC GGATTCTGCG AAGGCATAGC CTTTTCGCCG GCGGAGATCG CCTTGGTCTT CGACAAGGCG AAGTCGCTCG GCCTTCCCGT GAAGCTTCAC GCCGAACAGC TTTCCGATCT CGGCGGCGCA AAGCTCGCTG CTTCCTACAG CGCTCTTTCC GCCGACCACC TCGAATATCT CGACGCTGCG GGCGCCGCCG CCATGGCAAA GGCCGGCACG GTCGCCGTCC TGCTGCCCGG CGCTTTCTAC ACCCTCCGGG AAAAGCAGCT TCCACCCGTC GAAGCACTCC GCGCGGCGGG GACGCGCATG GCTATCGCCA CCGATTGCAA TCCCGGAACC TCGCCGCTCA CCTCGCTGCT GCTCACAATG AACATGTCCG CGACGCTCTT CCGCCTGACG TTGGAGGAAT GTCTCGCCGG AGTTACTCGC GAGGCCGCCC GTGCACTCGG GGTGCTCGAC GAGACCGGTA CGATCGAAGC CGGCAAGTCC GCGGACCTTG CGATCTGGAA CATCGATCAA CCGGCGGAGC TGATCTACCG CGTCGGCTTC AATCCCCTGT GCGAGCGTGT TTTCAAGGGC GAAAGGGTTT CCCGATGA
|
Protein sequence | MDRNDNPGPV RTSLWRNARL ATLREDLGTL GVVESGVVAA RGERIVYAGP EAGLTSELAR ADQIFDCEGR WVTPALIDCH THIVHGGNRA REFQLRLEGA TYEAIARAGG GIASTVEATN ALSVDELVAA ALPRLDALLA EGVSTVEVKS GYGLNVETEL KMLRAARRLE TLRPVRIVTS YLAAHATPPG YQGRNGDYIA EVVLPGLAAA HTEGLVDAVD GFCEGIAFSP AEIALVFDKA KSLGLPVKLH AEQLSDLGGA KLAASYSALS ADHLEYLDAA GAAAMAKAGT VAVLLPGAFY TLREKQLPPV EALRAAGTRM AIATDCNPGT SPLTSLLLTM NMSATLFRLT LEECLAGVTR EAARALGVLD ETGTIEAGKS ADLAIWNIDQ PAELIYRVGF NPLCERVFKG ERVSR
|
| |