Gene Rleg2_4363 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4363 
Symbol 
ID6983137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp4528653 
End bp4529933 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content66% 
IMG OID643399091 
Productallantoate amidohydrolase 
Protein accessionYP_002283847 
Protein GI209551930 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01879] amidase, hydantoinase/carbamoylase family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0103838 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGCA ATCTGCCCGT CAATGCCGGC CGGATCGCCG GAGATATCGA GGCGCTGGCC 
GCCATCACCG AGCCGGGGCA CCCTTGGACA CGGCGGGCCT TCTCGCCGCT CTTCCTCGAA
GGCCGGGCCT ATATCGAAGC GCGAATGAAG GCGGCGGAGT TGGAAACGCG GATCGATGCC
ACCGGCAATC TGATCGGCCG GCGAACGGGC CGCAAACCCT GGCTCGGCAC CATCATGGTC
GGTTCGCATT CCGAAACGGT GCCGGACGGC GGCCGTTTCG ACGGCATTGC CGGGGTGATC
TCCGCCCTGG AGGTGGCGCG TGCACTGAGC GACCAGGCGA TCGAACTCGA CCACGATCTC
GAAATCGTCG ACTTCCTTGC CGAGGAGGTC AGCATCTTCG GCGTCTCCTG CATCGGCAGC
CGTGGGATGA CCGGGCAATT GCCGGAAGCC TGGCTTTCCC GGATCAGCGA CGGGCGCGAC
CTTGCCGAGG GCATCGCTGA GGTAGGCGGC GAGCCCGGCG TGCTGGCGCA GCAGAAGCGG
CCGGATCTCG CCGGATTTTT GGAGCTTCAT ATCGAGCAGG GACCGGTGCT CGAAGCGGAA
AGAGAGGATA TCGGCATCGT CACCGCGATT GCAGGCATTA CCCGCATCGA GATCACCGTC
GAGGGGCGGG CCGATCATGC CGGCACGACA CCAATGGATA GGCGGGCGGA TGCGCTGGTG
GCGGCCGCCC AGCTGGTGCT CGACATCCGC AACGCCGCCG CTGAGCTTGC CAAAACACCG
GGCCACTTCG CGGCGACGGT CGGCGAATTC AGGATCGAGC CGAATGCCGC CAATGTCGTG
CCTTCGAAAG TGGTGCTGCT GATCGACGGC CGCGCCGAGA TCCGTGCCGA CATGGAAGCC
TTCTGCCGCT GGCTCGACGG CCATGTCGAA AAGCTGGCCA CCGCCTATGG CGTCACGATC
AGAACGCCGA ACCGGGTGTC CGACAATATG CCGACACCCG GCGATGCCGG ACTGCTTTCG
ACCTTGGAGG CTGCCTGCGA ACGCGTCGGC GCCAAACACC GGCGCATGGC CTCCGGCGCG
GGACACGATA CGGCCTGGAT CGCCAAGGTG GCGCCGGCGG CGATGATCTT CGTGCCCTGC
CGGGAGGGCC GCAGCCATTC CGGCGACGAA TGGGCGGAGA ATGACGATAT CGCGCTCGGC
GCCGCCGTGC TGTTCGAGGC GGTGCGCGAG ATGGACAAGG ATTTGACGCG GGAGAAGGCC
GATGGGACGC ATACTGGTTG A
 
Protein sequence
MSRNLPVNAG RIAGDIEALA AITEPGHPWT RRAFSPLFLE GRAYIEARMK AAELETRIDA 
TGNLIGRRTG RKPWLGTIMV GSHSETVPDG GRFDGIAGVI SALEVARALS DQAIELDHDL
EIVDFLAEEV SIFGVSCIGS RGMTGQLPEA WLSRISDGRD LAEGIAEVGG EPGVLAQQKR
PDLAGFLELH IEQGPVLEAE REDIGIVTAI AGITRIEITV EGRADHAGTT PMDRRADALV
AAAQLVLDIR NAAAELAKTP GHFAATVGEF RIEPNAANVV PSKVVLLIDG RAEIRADMEA
FCRWLDGHVE KLATAYGVTI RTPNRVSDNM PTPGDAGLLS TLEAACERVG AKHRRMASGA
GHDTAWIAKV APAAMIFVPC REGRSHSGDE WAENDDIALG AAVLFEAVRE MDKDLTREKA
DGTHTG