Gene Information |
Locus tag | Rleg_4634 |
Symbol | |
ID | 8015378 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4757049 |
End bp | 4758329 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644827209 |
Product | allantoate amidohydrolase |
Protein accession | YP_002978409 |
Protein GI | 241207313 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases |
TIGRFAM ID | [TIGR01879] amidase, hydantoinase/carbamoylase family |
|
|
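The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length(4757049, 4758329)
print(length, protein_length(length))  # prints "1281 426"
```

Both results agree with the Gene Length (1281 bp) and Protein Length (426 aa) fields in the record.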
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.60427 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.118111 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGCA ATCTTCCCGT CAATGCCAGC CGGATCGCTG AAGACATCGA TGCGCTGGCC GGGATTACCG AGCCGGGGCA TCCCTGGACG CGGCGGGCGT TCTCGCCACT CTTTCTCGAA GGCCGGGCCT ATATCGACGC GCGGATGAAG GCGGCGGGGC TGGAAACGCG GGTCGATGCC GCCGGCAATC TGATCGGCCG GCGGACTGGC CGGAAACCGT GGCTCGGCAC GATCATGGTC GGCTCGCACT CCGACACGGT GCCGGACGGC GGCCGCTTCG ATGGCATTGC CGGCGTGATC TCGGCGCTGG AGGTGGCGCG CGCGCTTGTT GACCAGAATA TCGAGCTCGA TCACGATCTC GAAATCGTCG ATTTTCTTGC CGAGGAGGTC AGCATCTTCG GCGTGTCCTG CATCGGCAGC CGCGGCATGA CCGGCCAACT GCCGGAGGTC TGGCTTTCGC GCGTCAGCGA CGGAGGCGAC CTGGCAGAGG GCATCGCGCA GGTCGGTGGC CGACCCTATG TGCTGATGCA GCAGAACAGG CCCGATATAG CCGGCTTTCT GGAGCTTCAT ATCGAACAGG GCCCGGTGCT CGAAGCCGAA AAGGAGGATA TCGGCATCGT CACCGCGATA TCAGGCATCA CCCGGATCGA GATCACCGTC GAAGGGCGGG CCGACCATGC CGGCACGACG CCGATGGACC GGCGGGCGGA TGCGTTGGTG GCGGCATCAC AGCTGGTGCT CGACATCCGC AACGCCGCCG CAGAACTTGC CAAAACGCCG GGGCATTTCG CAGCGACGGT CGGCGAATTC CGGATCGAGC CGAATGCCGC CAATGTCGTG CCGTCGAAGG TGGTGCTGTT GATCGATGGC CGTGCCGAAA TCCGTGCCGA CATGGAGGCA TTCTGCCGCT GGCTCGACGG TCATGTCGAG AAGCTGGCGG CGGCCTATGG CGTGACGATC AAGACCCCGA ACCGGGTTTC CGACAATCAG CCGACGCCTG GTGATGCCGG GCTGCTGTCG ACCTTGGAGG CTGCCTGCGA ACGGGTCGGC GCAAAACATC GGCGCATGGC CTCCGGCGCT GGGCACGATA CGGCCTGGAT CGCCAAGGTG GCGCCGGCAG CGATGATCTT CGTGCCCTGC CGGGGAGGCC GCAGCCATTC GGCCGATGAA TGGGCTGAGA ATGACGATAT CGCGCTCGGC GCCGCCGTGC TGTTCGAGGC GGTGCGCGAG ATGGACACGA GCTTGAATCA GGAGAGGACC GATGGGACGC ATACTCGTTG A
|
Protein sequence | MSRNLPVNAS RIAEDIDALA GITEPGHPWT RRAFSPLFLE GRAYIDARMK AAGLETRVDA AGNLIGRRTG RKPWLGTIMV GSHSDTVPDG GRFDGIAGVI SALEVARALV DQNIELDHDL EIVDFLAEEV SIFGVSCIGS RGMTGQLPEV WLSRVSDGGD LAEGIAQVGG RPYVLMQQNR PDIAGFLELH IEQGPVLEAE KEDIGIVTAI SGITRIEITV EGRADHAGTT PMDRRADALV AASQLVLDIR NAAAELAKTP GHFAATVGEF RIEPNAANVV PSKVVLLIDG RAEIRADMEA FCRWLDGHVE KLAAAYGVTI KTPNRVSDNQ PTPGDAGLLS TLEAACERVG AKHRRMASGA GHDTAWIAKV APAAMIFVPC RGGRSHSADE WAENDDIALG AAVLFEAVRE MDTSLNQERT DGTHTR
|
| |
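The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch, assuming plain-text input copied from this record. The helper names are hypothetical, and the stop-codon set is the standard TAA/TAG/TGA set used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped: str) -> str:
    """Remove the display whitespace and return one upper-case string."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Toy input in the same grouped layout (not the gene above).
toy = clean_sequence("ATGGCC TGA")
print(toy, ends_with_stop(toy))
```

Applying `clean_sequence` to the Gene sequence field and passing the result to `gc_fraction` should give a value close to the reported GC content, though the record only states that figure to the nearest percent.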