Gene Information |
Locus tag | Rleg2_4363 |
Symbol | |
ID | 6983137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 4528653 |
End bp | 4529933 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643399091 |
Product | allantoate amidohydrolase |
Protein accession | YP_002283847 |
Protein GI | 209551930 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases |
TIGRFAM ID | [TIGR01879] amidase, hydantoinase/carbamoylase family |
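
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Rleg2_4363
length_bp = gene_length(4528653, 4529933)
print(length_bp, protein_length(length_bp))
```

For this record the result agrees with the listed Gene Length (1281 bp) and Protein Length (426 aa). The strand field does not affect the length calculation, only the orientation in which the sequence is read.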
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0103838 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGCA ATCTGCCCGT CAATGCCGGC CGGATCGCCG GAGATATCGA GGCGCTGGCC GCCATCACCG AGCCGGGGCA CCCTTGGACA CGGCGGGCCT TCTCGCCGCT CTTCCTCGAA GGCCGGGCCT ATATCGAAGC GCGAATGAAG GCGGCGGAGT TGGAAACGCG GATCGATGCC ACCGGCAATC TGATCGGCCG GCGAACGGGC CGCAAACCCT GGCTCGGCAC CATCATGGTC GGTTCGCATT CCGAAACGGT GCCGGACGGC GGCCGTTTCG ACGGCATTGC CGGGGTGATC TCCGCCCTGG AGGTGGCGCG TGCACTGAGC GACCAGGCGA TCGAACTCGA CCACGATCTC GAAATCGTCG ACTTCCTTGC CGAGGAGGTC AGCATCTTCG GCGTCTCCTG CATCGGCAGC CGTGGGATGA CCGGGCAATT GCCGGAAGCC TGGCTTTCCC GGATCAGCGA CGGGCGCGAC CTTGCCGAGG GCATCGCTGA GGTAGGCGGC GAGCCCGGCG TGCTGGCGCA GCAGAAGCGG CCGGATCTCG CCGGATTTTT GGAGCTTCAT ATCGAGCAGG GACCGGTGCT CGAAGCGGAA AGAGAGGATA TCGGCATCGT CACCGCGATT GCAGGCATTA CCCGCATCGA GATCACCGTC GAGGGGCGGG CCGATCATGC CGGCACGACA CCAATGGATA GGCGGGCGGA TGCGCTGGTG GCGGCCGCCC AGCTGGTGCT CGACATCCGC AACGCCGCCG CTGAGCTTGC CAAAACACCG GGCCACTTCG CGGCGACGGT CGGCGAATTC AGGATCGAGC CGAATGCCGC CAATGTCGTG CCTTCGAAAG TGGTGCTGCT GATCGACGGC CGCGCCGAGA TCCGTGCCGA CATGGAAGCC TTCTGCCGCT GGCTCGACGG CCATGTCGAA AAGCTGGCCA CCGCCTATGG CGTCACGATC AGAACGCCGA ACCGGGTGTC CGACAATATG CCGACACCCG GCGATGCCGG ACTGCTTTCG ACCTTGGAGG CTGCCTGCGA ACGCGTCGGC GCCAAACACC GGCGCATGGC CTCCGGCGCG GGACACGATA CGGCCTGGAT CGCCAAGGTG GCGCCGGCGG CGATGATCTT CGTGCCCTGC CGGGAGGGCC GCAGCCATTC CGGCGACGAA TGGGCGGAGA ATGACGATAT CGCGCTCGGC GCCGCCGTGC TGTTCGAGGC GGTGCGCGAG ATGGACAAGG ATTTGACGCG GGAGAAGGCC GATGGGACGC ATACTGGTTG A
|
Protein sequence | MSRNLPVNAG RIAGDIEALA AITEPGHPWT RRAFSPLFLE GRAYIEARMK AAELETRIDA TGNLIGRRTG RKPWLGTIMV GSHSETVPDG GRFDGIAGVI SALEVARALS DQAIELDHDL EIVDFLAEEV SIFGVSCIGS RGMTGQLPEA WLSRISDGRD LAEGIAEVGG EPGVLAQQKR PDLAGFLELH IEQGPVLEAE REDIGIVTAI AGITRIEITV EGRADHAGTT PMDRRADALV AAAQLVLDIR NAAAELAKTP GHFAATVGEF RIEPNAANVV PSKVVLLIDG RAEIRADMEA FCRWLDGHVE KLATAYGVTI RTPNRVSDNM PTPGDAGLLS TLEAACERVG AKHRRMASGA GHDTAWIAKV APAAMIFVPC REGRSHSGDE WAENDDIALG AAVLFEAVRE MDKDLTREKA DGTHTG
|
| |
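
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is an assumption-laden illustration rather than the database's own method: it uses a hand-built codon table (translation table 11 assigns the same amino acids as the standard code and differs only in permitted start codons), strips the spaces that separate the 10-base blocks, and stops at the first stop codon. The names `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; identical for the standard code and table 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence 5'->3', stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence as listed in the record
print(translate("ATGAGCCGCA ATCTGCCCGT CAATGCCGGC"))
```

Translating the first three blocks gives the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the whole protein sequence and the GC content field to be checked in the same way.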