Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5232 |
Symbol | |
ID | 5319534 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 191348 |
End bp | 192652 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640777009 |
Product | amidohydrolase |
Protein accession | YP_001313941 |
Protein GI | 150377346 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGCCG CGGCTCTCGT GCTCGGACTT TGCGCCACGG CCAGCGCCGA TGACGTGCTG TTCGAGAATG TCCGCATCTT TGACGGAAAA GGTGCTGTTC TTTCGGCGCC GTCGAATGTT CTGGTCAAGG GCAATGTAAT CGCGGCGATT GCGACAAACC CGATTCAAGC CGAAGGCGCT GAGCGAATTG CCGGCGACGG GCGAACGCTG ATGCCGGGCC TGATCGATGC GCACTGGCAT GCCATGTTGG CCGCGTCCAG TCCGGCGGAG GCAATGGGCG ATCTCAGCTT CGCCAGTATC CTGGCCGGCG AGGAGGCGAC GGATACCCTG ATGCGCGGCT TCACCACTGT GCGTGATGTG GGCGGACCAG CCTTCGGCCT CAAGCGTGCG ATCGATCGGG GCATCATCCC CGGACCGCGT ATCTATACCT CCGGTGCCAT GATCACAGTT ACGAGCGGAC ATGGCGATTT CCGCCAGTTC ACCGACCTGC CGCGGAGAAC CGGCGGTCCG CTCACCCCTA TGGAGACGGT CGGGGGTGCA ATGGTCGTCG ACAGCCCGGA CGAGGTGCGA ATGCGCGTCC GCGAACAGTT CATGCAGGGA GCCGTTCTGA TCAAGCTAAC CGCCGGCGGC GGGGTGTCTT CACCCTTCAG CCCGCTCGAC GTCACCACCT TCACCGAGCC AGAATTGCGG GCCGCCGTTG AGATCGCCGA GAATTGGGGC ACCTACGTTG CCGCGCATGC CTTCACCTCT GACGCGATTC GGAAAGCGAT CGCGGCTGGT GTGAAGTGCA TCGAGCACGG CTTCCTCATG GATGAAGCCA CCGCCAGGCT GATTGCTGAA AAGGACGTTT GGCTGAGCTT GCAGCCGCTT CCCGAATTGA TGAGGACCGG CCTCCGGGAG GGTTCGGTCG AGCGCGCCAA GGCAGATGAG GTTTGGCCAG GCATCGGCAG AACATACGAA CTCGCAAAGA AGTACAAGAT TAAGACCGCG TGGGGCACGG ATGTTCTGTT CTCCCGCGCG CTGGCGAAGC AGCAGGGAGC AATACTGGCC TCGCTCGTGC GTTGGTACAC GCCTGCCGAG GCGCTCGTTA TGGCGACCGG GACCAATGCC GAACTGCTGG CGCTGTCGGG CCAGCGCAAC CCCTACCCGG GAAAGCTGGG CGTTGTCGAG GAAGGCGCTC TCGCGGACCT CCTGCTCGTC GAAGGCAATC CGCTGGAGAA CATCGACCTG GTGGCTGATC CTGCCAAAAG CTTCAAGATC ATCATGAAAG ACGGCATCAT CTACAAAAAC GCGCTGACTG AATGA
|
Protein sequence | MLAAALVLGL CATASADDVL FENVRIFDGK GAVLSAPSNV LVKGNVIAAI ATNPIQAEGA ERIAGDGRTL MPGLIDAHWH AMLAASSPAE AMGDLSFASI LAGEEATDTL MRGFTTVRDV GGPAFGLKRA IDRGIIPGPR IYTSGAMITV TSGHGDFRQF TDLPRRTGGP LTPMETVGGA MVVDSPDEVR MRVREQFMQG AVLIKLTAGG GVSSPFSPLD VTTFTEPELR AAVEIAENWG TYVAAHAFTS DAIRKAIAAG VKCIEHGFLM DEATARLIAE KDVWLSLQPL PELMRTGLRE GSVERAKADE VWPGIGRTYE LAKKYKIKTA WGTDVLFSRA LAKQQGAILA SLVRWYTPAE ALVMATGTNA ELLALSGQRN PYPGKLGVVE EGALADLLLV EGNPLENIDL VADPAKSFKI IMKDGIIYKN ALTE
|
| |