Gene Rleg_4780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4780 
Symbol 
ID8007033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp152050 
End bp153297 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content65% 
IMG OID644821710 
Productallantoate amidohydrolase 
Protein accessionYP_002972970 
Protein GI241113135 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01879] amidase, hydantoinase/carbamoylase family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.105367 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCGC GATCCATCGA TGCGGCGCGT CTGCTTTGGC GCATCAGGAC GCTCGGCGAA 
ATCGGCCGGG ATAGCGACGG CCGGCTCGTG CGGCTGGCAG CTTCTGATGC CGAAAAACTC
GGCCGCGACC AATTCGTCGT ATGGATCGAG GACGCAGGGC TCGCCGTCGC CGTCGATCGC
ATCGGCAACA TCTTCGGCAT CTGGAAACCG GACGGCGTCG CCGACGAAGC GCCCCTGCTG
CTCGGCTCGC ATATCGACAC CGTCATCGGC GCCGGTATCT ATGACGGCTG CTACGGCGCG
CTATCCGGTC TGGAAGTCAT CGAGACGCTG AAGGCCGAAG GCCTGGCGCC ATCCCGGCCG
ATCGTCGTGG CGGCCTTCAC CAATGAGGAA GGTGCGCGCT ACGCGCCCGA TATGATGGGG
TCGCTGGTCT ATGCCGGCGG TCTCGATGTC GACGCGGCTC TTGCCACCAT TGGCACCGAC
GGGACGATAC TTGGCCAGGA GCTCGAGCGG ATCGGCTATG CCGGCGAACA TGAGCCCGGC
TTCCTCAGGC CGCACGCCTA TATCGAGCTG CATATCGAGC AAGGCCCGGT CCTCGAACGC
GAAGGCATTC CGGTCGGCGC CGTGGAAGAC CTTCAAGGCA TCTCCTGGCA GAGGGTGACC
ATCACCGGCG ATGCCAATCA CGCCGGAACA ACGCCGATCT CCATGCGCCG AGACGCCGGG
CATGCCGCCG CGCGAGTCGT CATCTTCCTG CGCGAGCGGG CGAAGGCTTC GAACACGCCG
ACGGTCGCGA CAGTCGGCTG CATGCGCTTC GAACCTGATG TCATCAACGT GATCCCGTCG
CGGGCAACCT TCACCGTCGA CCTTCGCGAT CCGGACGAGG ATCGCCTCAG AGAAGAGGAG
ACCGCGCTCA CCAACTTCCT GGAGATTCTA TCAACCGAGG AGCAGGTCGG CATATCGGTG
GAAAGGCTTG CCCGGTTCGA GCCTGTGAAG TTCGACCAAG GGATCGTCGG CCTCATCGAA
AAGGCTGCGC GGGACCGGGG TCTCGCCTGC CGGCGGATGA CCTCCGGCGC CGGCCACGAC
GCGCAGATGA TTGCCAGGAT CGCCCCGTCG GCGATGATCT TCGTCCCGAG CATCGGCGGG
ATCAGCCATA ACCCGAGGGA ATACACGGCC GACGAAGATC TCGTTGCCGG AGCGAACATC
CTGCTGGATG TCGTTCGCCA GCTCGCCAAG GAAGGACTGC CGGCATGA
 
Protein sequence
MTARSIDAAR LLWRIRTLGE IGRDSDGRLV RLAASDAEKL GRDQFVVWIE DAGLAVAVDR 
IGNIFGIWKP DGVADEAPLL LGSHIDTVIG AGIYDGCYGA LSGLEVIETL KAEGLAPSRP
IVVAAFTNEE GARYAPDMMG SLVYAGGLDV DAALATIGTD GTILGQELER IGYAGEHEPG
FLRPHAYIEL HIEQGPVLER EGIPVGAVED LQGISWQRVT ITGDANHAGT TPISMRRDAG
HAAARVVIFL RERAKASNTP TVATVGCMRF EPDVINVIPS RATFTVDLRD PDEDRLREEE
TALTNFLEIL STEEQVGISV ERLARFEPVK FDQGIVGLIE KAARDRGLAC RRMTSGAGHD
AQMIARIAPS AMIFVPSIGG ISHNPREYTA DEDLVAGANI LLDVVRQLAK EGLPA