Gene Rleg_4634 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4634 
Symbol 
ID8015378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4757049 
End bp4758329 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content65% 
IMG OID644827209 
Productallantoate amidohydrolase 
Protein accessionYP_002978409 
Protein GI241207313 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01879] amidase, hydantoinase/carbamoylase family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.60427 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.118111 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGCA ATCTTCCCGT CAATGCCAGC CGGATCGCTG AAGACATCGA TGCGCTGGCC 
GGGATTACCG AGCCGGGGCA TCCCTGGACG CGGCGGGCGT TCTCGCCACT CTTTCTCGAA
GGCCGGGCCT ATATCGACGC GCGGATGAAG GCGGCGGGGC TGGAAACGCG GGTCGATGCC
GCCGGCAATC TGATCGGCCG GCGGACTGGC CGGAAACCGT GGCTCGGCAC GATCATGGTC
GGCTCGCACT CCGACACGGT GCCGGACGGC GGCCGCTTCG ATGGCATTGC CGGCGTGATC
TCGGCGCTGG AGGTGGCGCG CGCGCTTGTT GACCAGAATA TCGAGCTCGA TCACGATCTC
GAAATCGTCG ATTTTCTTGC CGAGGAGGTC AGCATCTTCG GCGTGTCCTG CATCGGCAGC
CGCGGCATGA CCGGCCAACT GCCGGAGGTC TGGCTTTCGC GCGTCAGCGA CGGAGGCGAC
CTGGCAGAGG GCATCGCGCA GGTCGGTGGC CGACCCTATG TGCTGATGCA GCAGAACAGG
CCCGATATAG CCGGCTTTCT GGAGCTTCAT ATCGAACAGG GCCCGGTGCT CGAAGCCGAA
AAGGAGGATA TCGGCATCGT CACCGCGATA TCAGGCATCA CCCGGATCGA GATCACCGTC
GAAGGGCGGG CCGACCATGC CGGCACGACG CCGATGGACC GGCGGGCGGA TGCGTTGGTG
GCGGCATCAC AGCTGGTGCT CGACATCCGC AACGCCGCCG CAGAACTTGC CAAAACGCCG
GGGCATTTCG CAGCGACGGT CGGCGAATTC CGGATCGAGC CGAATGCCGC CAATGTCGTG
CCGTCGAAGG TGGTGCTGTT GATCGATGGC CGTGCCGAAA TCCGTGCCGA CATGGAGGCA
TTCTGCCGCT GGCTCGACGG TCATGTCGAG AAGCTGGCGG CGGCCTATGG CGTGACGATC
AAGACCCCGA ACCGGGTTTC CGACAATCAG CCGACGCCTG GTGATGCCGG GCTGCTGTCG
ACCTTGGAGG CTGCCTGCGA ACGGGTCGGC GCAAAACATC GGCGCATGGC CTCCGGCGCT
GGGCACGATA CGGCCTGGAT CGCCAAGGTG GCGCCGGCAG CGATGATCTT CGTGCCCTGC
CGGGGAGGCC GCAGCCATTC GGCCGATGAA TGGGCTGAGA ATGACGATAT CGCGCTCGGC
GCCGCCGTGC TGTTCGAGGC GGTGCGCGAG ATGGACACGA GCTTGAATCA GGAGAGGACC
GATGGGACGC ATACTCGTTG A
 
Protein sequence
MSRNLPVNAS RIAEDIDALA GITEPGHPWT RRAFSPLFLE GRAYIDARMK AAGLETRVDA 
AGNLIGRRTG RKPWLGTIMV GSHSDTVPDG GRFDGIAGVI SALEVARALV DQNIELDHDL
EIVDFLAEEV SIFGVSCIGS RGMTGQLPEV WLSRVSDGGD LAEGIAQVGG RPYVLMQQNR
PDIAGFLELH IEQGPVLEAE KEDIGIVTAI SGITRIEITV EGRADHAGTT PMDRRADALV
AASQLVLDIR NAAAELAKTP GHFAATVGEF RIEPNAANVV PSKVVLLIDG RAEIRADMEA
FCRWLDGHVE KLAAAYGVTI KTPNRVSDNQ PTPGDAGLLS TLEAACERVG AKHRRMASGA
GHDTAWIAKV APAAMIFVPC RGGRSHSADE WAENDDIALG AAVLFEAVRE MDTSLNQERT
DGTHTR