Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_0629 |
Symbol | |
ID | 6979345 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 651433 |
End bp | 652677 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643395341 |
Product | allantoate amidohydrolase |
Protein accession | YP_002280152 |
Protein GI | 209548235 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases |
TIGRFAM ID | [TIGR01879] amidase, hydantoinase/carbamoylase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.223457 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCATC GTTCGATCGA TCCTGCGCGT CTGCTCGGAC GCATCGCCGA GCTCGGCGCG ATCGGCCGCG ATGCCGAGGG AAGGCTTGTC CGCTTGGCGG CGTCCGATAG CGAAAAGCTC GGGCGCGATC GGTTCGTCTC CTGGCTTGAG GACGCCGGGC TGGAAGTTGC TGTTGACCGG ATCGGCAATA TCTTCGGCAT CTGGAATGGC GTGGGTGGGG GTGAAAAGCC GATCATGATC GGCTCCCACA TCGACACGGT GATCAATGCC GGCATCTATC ATGGATGTTA TGGCGTGCTG GCTGGCCTCG AAGTGATCCA GACGCTGAAA AGCGAAGGGT TCGAACCGCG GCATCCCATT GTCGTTGCCG CCTTCACCAA TGAGGAAGGT GTGCGCTATG CACCGGATAT GATGGGTTCG CTAGTCTATT CCGGCGGCCT CGATGTCGCC GCGGCACTCG AGACGGTGGG CACGGACGGT ACGGTGCTCG GCGATGAACT GGCGCGGATC GGCTATGCCG GCAGCCATGC GCCCGGCTTC ATGACGCCGC ACGCCTATGT CGAACTGCAT ATCGAACAGG GTCCTGTTCT CGAGAGGGAG GGCGTGCCGG TCGGCGCGGT CGAGAACCTG CAGGGTATTT CCTGGCAGAA GGTGACGATC GACGGCGATG CCAACCATGC CGGCACGACG CCGATCTCGA TGCGCAGGGA CGCCGGCTAC GCGGCTGCCC GCGTCATCAC CTTTCTGCGC GACCGCGCGA AAGCGTCGAA CACCCCGACG GTCGCGACCG TCGGCTGCAT TGCTTTTGAG CCGAACGCCA TCAATGTCAT TCCTTCACGC GCGACCTTCA CGGTCGATCT GCGCGATCCG GACGAAGACC GGCTGAAGGA AGAGAAAAAT GCGCTGGCCG CATTCCTCGA ACTGCTTTCG GCCGAGGAAG GGGTTGGCGT ATCCGTCGAA CGGCTGGCGC GCTTCGAGCC GGTCAAGTTC GATCAGGCGA TCGTTCGCCA GATCGAGGTG ACCGCCAGGG ATCGCGGCCT GGCGTGCAAG CGGATGACAT CGGGCGCCGG CCATGACGCC CAGATGATTG CCCGCATCGC GCCTGCTGCG ATGATCTTCG TGCCGAGCCA CGGCGGCATC AGTCACAATC CGAAGGAATT CACATCAGAT ACGGAGCTTG TCGCGGGGGC GAATATCCTC CTCGACGTCG TTTGCGGACT TGCAACAGGG GAACTGCCGA GATGA
|
Protein sequence | MSHRSIDPAR LLGRIAELGA IGRDAEGRLV RLAASDSEKL GRDRFVSWLE DAGLEVAVDR IGNIFGIWNG VGGGEKPIMI GSHIDTVINA GIYHGCYGVL AGLEVIQTLK SEGFEPRHPI VVAAFTNEEG VRYAPDMMGS LVYSGGLDVA AALETVGTDG TVLGDELARI GYAGSHAPGF MTPHAYVELH IEQGPVLERE GVPVGAVENL QGISWQKVTI DGDANHAGTT PISMRRDAGY AAARVITFLR DRAKASNTPT VATVGCIAFE PNAINVIPSR ATFTVDLRDP DEDRLKEEKN ALAAFLELLS AEEGVGVSVE RLARFEPVKF DQAIVRQIEV TARDRGLACK RMTSGAGHDA QMIARIAPAA MIFVPSHGGI SHNPKEFTSD TELVAGANIL LDVVCGLATG ELPR
|
| |