Gene Information |
Locus tag | Rleg2_5926 |
Symbol | |
ID | 6977313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | + |
Start bp | 344038 |
End bp | 345300 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643393379 |
Product | imidazolonepropionase |
Protein accession | YP_002278197 |
Protein GI | 209546307 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
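The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon. Neither convention is stated in the record, and the function names are illustrative.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Protein length implied by a CDS length, assuming the stop codon is counted."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length // 3 - 1


# Values copied from the record above.
length = gene_length_bp(344038, 345300)
protein = expected_protein_length_aa(length)
print(length, protein)
```

Under these assumptions the computed values agree with the reported Gene Length (1263 bp) and Protein Length (420 aa).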
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.103117 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.194871 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGGA ACAATTTTTC TGAAGGTACC GCATCCGCCG AGGTCAGGCC GGCACTGTGG CGCAATGCGC GCCTGGCGAC GCTTGCGCCT GACAAAGCCG GGCTCGGCAT CGTCGAAAAA GGCGCCGTTC TGATCGAAAA CGGCCGCATC GCCTTTGCCG GCGCCGAAAG CGAGCTGCCG GCATCGGCCA TCGAACATTC CGAGATCGTC GACCTCGAAG GCCGCTGGGT AACGCCCGGC CTCGTCGACT GCCACACCCA TATCGTTCAC GGCGGCAATC GCGCCCGCGA GTTCGAGATG CGCCTTGAAG GCGCGACCTA TGAAGAGATC GCGCGGGCCG GCGGCGGCAT CGTCTCCTCG GTTCAGGCGA CCAATGCGCT GTCGGTCGAG GAACTCGTCG CTACGGCGCT GCCGCGGCTC GATACGCTGC TTGCTGAAGG CGTGACCACG GTCGAGATCA AATCCGGCTA TGGCCTCAAC CGAAGCGGTG AAGTGAAGAT GCTGCAGAGC GCCCACCTGC TCGGCCATGT CAGGCCGGTC CGCGTCGCCA CCAGCTATCT CGGCGCCCAT GCGACGCCGG TCGAATATAA GGGCCGCAAC GGCGACTATC TCGACGATGT CGTGCTGCCC GGCCTCGACG ACATGTATAA TCTCGGCCTT GCCGACGCTG TCGACGGCTT CTGCGAAGGC ATCGCTTTTT CGACGGCCGA GATCGCCCGC GTCTTCGACA AGGCCAAGGC GCTCGGCCTG CCGGTCAAGC TGCATGCCGA GCAGCTCTCC AATCTCGGCG GCGCCAAGCT GGCGGCCGCC TATGGCGCGC TGTCCGCCGA TCATCTGGAA TATCTCGACG AGGAAGGCGT TGCCGCAATG GCCACCGCCG GCACCGTCGC CGTGCTTCTG CCCGGCGCCT TCTATGCCAT TCATGAAAAG CAGAAGCCGC CGGTGGAGGC GCTGCGCCGG GCCGGCGTGC CGATCGCGAT CGCCACCGAC TGCAATCCGG GCACCTCGCC GCTCACCTCG ATGCTGCTGA CCATGAACAT GTCGGCGACG CTTTTCGGCC TCACCGTCGA GGAGTGCATC GCCGGCGCCA CCCGCGAAGG CGCGCGCGCG CTCGGCCTCC TCGGCGAGAC CGGCACGCTC GAAGCCGGCA AATCCGCCGA TCTCGCCGTC TGGAATATCG AGAGCCTCGC CGAGCTCGTC TACCGCATCG GCTTCAACCC ACTTCACGCA CGCGTCTTCA AGGGCGAAAG GAACGGTCGA TGA
|
Protein sequence | MTGNNFSEGT ASAEVRPALW RNARLATLAP DKAGLGIVEK GAVLIENGRI AFAGAESELP ASAIEHSEIV DLEGRWVTPG LVDCHTHIVH GGNRAREFEM RLEGATYEEI ARAGGGIVSS VQATNALSVE ELVATALPRL DTLLAEGVTT VEIKSGYGLN RSGEVKMLQS AHLLGHVRPV RVATSYLGAH ATPVEYKGRN GDYLDDVVLP GLDDMYNLGL ADAVDGFCEG IAFSTAEIAR VFDKAKALGL PVKLHAEQLS NLGGAKLAAA YGALSADHLE YLDEEGVAAM ATAGTVAVLL PGAFYAIHEK QKPPVEALRR AGVPIAIATD CNPGTSPLTS MLLTMNMSAT LFGLTVEECI AGATREGARA LGLLGETGTL EAGKSADLAV WNIESLAELV YRIGFNPLHA RVFKGERNGR
|
| |
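The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how such a field might be cleaned and its GC content computed is shown below. The helper names are hypothetical, and only the first two blocks of the gene sequence are used as sample input, so the result is not comparable to the 65% reported for the full gene.

```python
def join_blocks(blocked_sequence: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(blocked_sequence.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# First two display blocks of the gene sequence, copied from the record.
sample = join_blocks("ATGACCGGGA ACAATTTTTC")
print(len(sample), sample[:3], gc_percent(sample))
```

Passing the complete Gene sequence field through `join_blocks` and `gc_percent` would give a value to compare against the GC content field.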