Gene Information |
Locus tag | Smed_4106 |
Symbol | |
ID | 5318995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 568942 |
End bp | 572580 |
Gene Length | 3639 bp |
Protein Length | 1212 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640775913 |
Product | 5-oxoprolinase (ATP-hydrolyzing) |
Protein accession | YP_001312846 |
Protein GI | 150376250 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit [COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit |
TIGRFAM ID | |
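The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 568942
END_BP = 572580


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3639, matching "Gene Length"
print(protein_length(length_bp))  # 1212, matching "Protein Length"
```

Under these assumptions, 3639 bp corresponds to 1213 codons, and dropping the terminal stop codon gives the 1212 aa reported for the protein.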
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.264245 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTGA CGGGAACTTG GGATTTCTGG GTGGATCGCG GCGGCACCTT CACCGACGTC ATCGGCCGCG ATCCGTCGGG CGCCCTGCGC GCATTGAAGG TTCTTTCGGA AAACCCCGGT GCCTATCGCG ATGCGGCGGT CCACGGCATC CGGCTGCATC TCGGCCTTGG CTCCGGCGAG CCGGTGCCGC ACGGCCTCAT CGGCGAGGTC AGGATGGGCA CCACCGTGGC CACCAATGCG CTGCTGGAGC GCAAGGGCGA GCGGCTTGGG CTCGTCACGA CGCGCGGGTT CCGCGATGCG CTCCGGATCG GCTATCAGGA ACGCAAGAAG ATTTTCGCGA CCGAGATCAT CAAGCCGGAG GCACTCTATT CTGGTGTCGT GGAGCTTGAC GAACGAGTGC TTGCGGATGG CACGGTTGAG CTGCCGCTGG ACGAGGCGGC GGCGCGGCGG GCGCTCGAAA GCTTGAAGGC GGATGGTTAC GGTGCCGTCG CCATCGTCCT GCTGCACGCT TACAAATATC CCGCTCACGA GATGACGGTC GCCCACCTCG CGCGCTCGAT CGGGTTCGAG CAGGTATCTG TCAGTCACGA GGTCTCGCCG CTCGTCAAAT ATGTCGGCCG CGGCGATACG ACGGTGATCG ATGCCTATCT TTCGCCCGTG CTCGGCCGGT ACGTGGCACA GGTTTCGCAA GAACTCGACG TGGGCCGCTC CGGCGCCCGC GTCATGTTCA TGATGTCGTC CGGCGGACTG ACGGCGGCGG AGATGTTCCA GGGCAAGGAC GCAATCCTTT CCGGGCCGGC GGGCGGCGTC GTCGGCCTTG CGCGCACGGG CGAGGCGGCC GGCTTCGACC GCGTCATCGG CTTCGACATG GGCGGAACGT CGACGGACGT CGCCCATTTT GACGGGGAGT ACGAGCGCGC CTTCGAGACC GAGGTTGCCG GTGTGCGGGT GCGCGCGCCG ATGATGCTGA TCCACACGGT TGCTGCCGGC GGAGGCTCGA TCCTCCATTT CGACGGGGGT CGCTTCCGCG TGGGCCCGGA TTCGGCCGGC GCCAATCCAG GCCCCGCCTG CTACCGCAAC GGCGGACCGC TCGCCGTCAC CGACGCAAAT GTCATGCTGG GCAAGTTGCT GCCCGAGCAT TTTCCGGCGA TCTTCGGACC GGAGCAGAAC CAGCCCCTCG ACGTCGAGAC GGTGCGCGAG CGTTTCGCGG CCCTTGCGGC CGAGATCGGC GACGGGCGCA GCCCCGAAGA TGTCGCTGAC GGTTTCATCC GCATTGCGGT TGCGAACATG GTCGAGGCGA TCAAGAAGAT TTCGGTGAGC CGCGGCTATG ACGTGACGCG TTATGCGCTC AACTGCTTTG GCGGTGCCGG CGGGCAGCAC GCCTGCCTCG TGGCCGATGC GCTCGGCATG AAGAGCATAC TCCTGCATCC GATGTCTGGT CTTCTTTCGG CTTACGGCAT GGGCCTCGCC GATATCAGAG CCACGCGGCA GAAAGCACTG GGCGTCGCGC TGGACGAGGC GGCACCGCAC GCGATTCTGG AGCTCGGTCA CAAACTGCAA TCGGAATGCC TGGTTGAACT GGAGGCGCAG GGGATCGCCC TGGAGCGGAT CCGCACGCAT TTGCGCGCCC ATATTCGCTA TGCCGGCACC GATACGGTTC TGCCGGTCGA AGCGGCCTTT CCCGATCAGG ACGACCAGGC GCGGCTCCGC CGTGACTTCG AGCATTTACA CAAGCGCCGC TTCGGTTTCA TCGCCGAGAA CAAGGCTCTG ATGATCGATG CCGTCGAAGT CGAAACGGTG 
GGTGGCGGGG CTGCAGAAAT GGAGGCGGAG GGCCTTGCCG CCACGGCCGG TGAGCTTGTT CCCGACCAGC GAATCCGTTT CTATTCGCAA GGCATGTTTC ACGACGCGCC GGTGGCGCTG CGTTCGCAAG TCCAGCCCGG CCAGAAACTC GCCGGTCCTG CGATCATTAT CGAGCCCAAC CAGACGATCG TGGTCGAAGA CGGCTGGCAG GCGGAGTTGA CGACGAAGGA CCACATCGTG CTGAGACGCG TCAAGGCGCT GCCCGAGCGT ACCGCGATAG GCACGAAGGC AGATCCGGTC ATGCTGGAAA TCTTCAACAA CCTCTTCATG TCCATCGCCG AGCAGATGGG CGTGACGCTG CAGAACACGG CTTATTCCGT CAACATCAAG GAGCGGCTCG ATTTCTCCTG TGCTGTCTTC GACAGCGGGG GCAGTCTGGT CGCCAATGCG CCGCACATGC CGGTACATCT GGGCTCCATG GATGCCTCGG TTGCGACTGC CATCCGCGAA AACCCGGTGA TCCATCCGGG CGACGTGTTC CTGATCAATG CACCCTACAA TGGCGGCACG CATCTTCCTG ACCTGACGGT TTGCACGCCG GTCTTCGACG ATGCGGGCCG CGAGATCCGC TTCTGGGTCG CAAGCCGGGG CCATCATGCG GATATCGGAG GTATCTCGCC GGGCTCCATG TCGCCGCTCG CCACCGATAT CGAAGAGGAG GGCGTCTATA TCGACAATTT CAGGCTCATC GACCGCGGCC GATTCTGCGA GGAGGGACTC GAGAAGCTCC TGACCGGCGC GCGTTACCCG GTGCGCAATC TCCTTCAGAA CGTCAACGAT CTCAAAGCCC AGGTCGCGGC CAACGAGAAG GGTGTCGCGG AACTCAAAAA GATGATCGCG CAGTTCGGCG AGGACGTGGT CGAGGCCTAT ATGGGCCACG TTCAGGACAA TGCGGCCGAA AGCGTGCGCC GCGTGATCGA TCAGTTGCCG GATGGCGAAT TCTGCTACGA GATGGACCAG GGTTGCCGCA TCGTCGTGAA AATCTCCATC GATCGGGGGA GCCGCGAAGC AACGGTCGAT TTCACCGGAA CGTCGGAGCA GCGCGCCGAC AATTTCAATG CGCCGGAGCC GGTGACGCGC GCCGCCGTGC TCTATGTCTT TCGGGTGCTG GTAGAGGCCG ACATCCCGAT GAATGCCGGC TGCCTGAGGC CCATCCGCAT CGTCATCCCG GACGGCACCA TGCTTTCGCC ACGCTACCCG GCGGCGGTCG TCGCCGGCAA TGTCGAGGTC AGCCAGGCGG TCACCAATTG TCTCTTCGGT GCGGTCGGGG CGCAGGCAGC CGCGCAGGGA ACGATGAACA ATCTGACGTT CGGCAATGCC GAGTACCAAT ATTACGAGAC GATATGCTCC GGCGCGCCGG CCGGTCCCGG CTATGACGGC GCCGATGCGG TCCACACCCA TATGACCAAT TCACGTCTGA CCGACCCGGA GATTCTCGAG ACGCGCTTCC CGGTAGTTCT TGAAGATTTT CACATCCGCA AGGATTCCGG CGGGTGCGGC AAGTGGTCGG CCGGCGATGG CACCGAGCGA ACGATCCGGG CGCGCCAGCG GCTCGACTTC GCGATCCTTT CCGGCCATCG GCGTGTCCGG CCCTTCGGGC TGAAGGGCGG AGAGCCGGGC GAACCGGGCC GCAACTATGT TCGCCGCAAT GACGGTCGCA TCGAAGAACT GCCTGGCTCG GCTCATACGG TGCTCGAAGC GGGCGAGGCC TTTACAGTGG TGACGCCGAC GGGCGGCGGC TATGGGAAGG 
ATGAGGGGCC GGAGGAAGGG CAGTTCTAG
|
Protein sequence | MSLTGTWDFW VDRGGTFTDV IGRDPSGALR ALKVLSENPG AYRDAAVHGI RLHLGLGSGE PVPHGLIGEV RMGTTVATNA LLERKGERLG LVTTRGFRDA LRIGYQERKK IFATEIIKPE ALYSGVVELD ERVLADGTVE LPLDEAAARR ALESLKADGY GAVAIVLLHA YKYPAHEMTV AHLARSIGFE QVSVSHEVSP LVKYVGRGDT TVIDAYLSPV LGRYVAQVSQ ELDVGRSGAR VMFMMSSGGL TAAEMFQGKD AILSGPAGGV VGLARTGEAA GFDRVIGFDM GGTSTDVAHF DGEYERAFET EVAGVRVRAP MMLIHTVAAG GGSILHFDGG RFRVGPDSAG ANPGPACYRN GGPLAVTDAN VMLGKLLPEH FPAIFGPEQN QPLDVETVRE RFAALAAEIG DGRSPEDVAD GFIRIAVANM VEAIKKISVS RGYDVTRYAL NCFGGAGGQH ACLVADALGM KSILLHPMSG LLSAYGMGLA DIRATRQKAL GVALDEAAPH AILELGHKLQ SECLVELEAQ GIALERIRTH LRAHIRYAGT DTVLPVEAAF PDQDDQARLR RDFEHLHKRR FGFIAENKAL MIDAVEVETV GGGAAEMEAE GLAATAGELV PDQRIRFYSQ GMFHDAPVAL RSQVQPGQKL AGPAIIIEPN QTIVVEDGWQ AELTTKDHIV LRRVKALPER TAIGTKADPV MLEIFNNLFM SIAEQMGVTL QNTAYSVNIK ERLDFSCAVF DSGGSLVANA PHMPVHLGSM DASVATAIRE NPVIHPGDVF LINAPYNGGT HLPDLTVCTP VFDDAGREIR FWVASRGHHA DIGGISPGSM SPLATDIEEE GVYIDNFRLI DRGRFCEEGL EKLLTGARYP VRNLLQNVND LKAQVAANEK GVAELKKMIA QFGEDVVEAY MGHVQDNAAE SVRRVIDQLP DGEFCYEMDQ GCRIVVKISI DRGSREATVD FTGTSEQRAD NFNAPEPVTR AAVLYVFRVL VEADIPMNAG CLRPIRIVIP DGTMLSPRYP AAVVAGNVEV SQAVTNCLFG AVGAQAAAQG TMNNLTFGNA EYQYYETICS GAPAGPGYDG ADAVHTHMTN SRLTDPEILE TRFPVVLEDF HIRKDSGGCG KWSAGDGTER TIRARQRLDF AILSGHRRVR PFGLKGGEPG EPGRNYVRRN DGRIEELPGS AHTVLEAGEA FTVVTPTGGG YGKDEGPEEG QF
|
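The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field could be cleaned, checked for GC content, and translated is shown below. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons); the helper names are hypothetical, and only the first and last few blocks of the gene sequence are used to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; stop codons appear as '*'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three and last three blocks of the gene sequence above.
head = "ATGAGCTTGA CGGGAACTTG GGATTTCTGG"
tail = "ATGAGGGGCC GGAGGAAGGG CAGTTCTAG"

print(translate(head))                    # MSLTGTWDFW
print(translate(clean(tail)[-9:]))        # QF*
print(gc_content("ATGAGCTTGA CGGGAACTTG"))  # 0.5
```

The translated prefix and suffix agree with the start (MSLTGTWDFW) and end (QF) of the protein sequence in the record. Running `gc_content` on the full gene sequence would be the way to compare against the reported 64%.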
| |