Gene Rleg2_3036 details

Gene Information

Locus tag: Rleg2_3036
Symbol:
ID: 6981781
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 3095765
End bp: 3097018
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 63%
IMG OID: 643397746
Product: allantoate amidohydrolase
Protein accession: YP_002282529
Protein GI: 209550612
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID: [TIGR01879] amidase, hydantoinase/carbamoylase family
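
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive (the usual GenBank convention; this page does not state it) and that the final codon is a stop codon that is not counted in the protein. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3095765
end_bp = 3097018

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1254 417
```

Both results agree with the Gene Length (1254 bp) and Protein Length (417 aa) listed in the record.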


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.376517
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGTGGCAG CACCAGGCGA AAACATGCGC GTCAATGGCG ACCGTCTCTG GGACAGTCTC 
ATGGACATGG CCAAGATCGG CCCCGGCATT GCGGGCGGCA ACAATCGCCA GACGCTGACG
GATTCGGATG CCGAGGGCCG AAGCCTTTTC AAGACATGGT GCGACGAAGC GGGCCTCACC
ATGGGCATCG ACCAGATGGG CACGATGTTC GCTACCCGCC CCGGCACCGA TCCCGATGCC
CTGCCCGTCT ATGTCGGTTC GCATCTCGAC ACTCAGCCGA CCGGCGGCAA ATATGACGGC
GTGCTCGGCG TGCTGGCTGC CCTCGAAGTC GTGCGCACCA TGAACGATCT CGGCATCAAG
ACCAAACATC CCATTGTCGT CACCAATTGG ACGAACGAGG AAGGGGCACG TTTTGCCCCG
GCCATGCTGG CCTCAGGCGT CTTTGCCGGC GTGCACAGCC TCGACTTTGC CTATAATCGC
AAGGATCCCG AGGGCAATCT CTTCGGCGAC GAACTGAAAC GCATCGGCTG GCTCGGCGAC
GAAGAGGTCG GTGCCCGCAA GATGCACGCC TATTTCGAAT ATCACATCGA GCAGGGTCCG
ATCCTCGAGG CCGAAGACAA GCAGATCGGC GTCGTCACCC ACTGTCAGGG CCTCTGGTGG
CTGGAATTCA CGCTGACCGG CAAGGAAGCC CATACCGGCT CGACGCCGAT GAACATGCGC
GTCAATGCCG GGCTTGCCAT GTCGCGCATC CTGGAAATGG TTCAAGGCGT GGCGATGGGC
GAGCAGCCGG GCGCCGTCGG CGGTGTCGGG CAGGTGTTCT TCTCGCCGAA TTCGCGCAAC
GTGCTGCCCG GCAAGGTCGT CTTCACCGTC GACATCCGCT CGCCCGACAA GGCCAAGCTC
GACCGCATGC GGGCAAAGAT CGAGGCGGAA GCGCCAAAGA TCTGCGATGC TTTGGGTGTC
GGCTGTTCCG TCGAGGCGAT CGGCCATTTC GCGCCTGTTA CCTTCGACGA AAAGCTCGTC
AGCTCGGTCC GCTCCGCCGC CGAGCGCCTC GGCTACAGCC ACATGAACCT CATCTCGGGC
GCCGGCCACG ACGCCTGCTG GGCCGCCAAG GTCGCCCCTG CGACGATGGT CATGTGCCCC
TGCGTCGGCG GTCTGTCGCA CAATGAAGCG GAAGACATTT CCAAGGAATG GGCGACGGCG
GGCGCCGATG TTCTGTTCCA TGCGGTGGTG GAGACGGCGG AGATTGTTCC GTGA
 
Protein sequence
MVAAPGENMR VNGDRLWDSL MDMAKIGPGI AGGNNRQTLT DSDAEGRSLF KTWCDEAGLT 
MGIDQMGTMF ATRPGTDPDA LPVYVGSHLD TQPTGGKYDG VLGVLAALEV VRTMNDLGIK
TKHPIVVTNW TNEEGARFAP AMLASGVFAG VHSLDFAYNR KDPEGNLFGD ELKRIGWLGD
EEVGARKMHA YFEYHIEQGP ILEAEDKQIG VVTHCQGLWW LEFTLTGKEA HTGSTPMNMR
VNAGLAMSRI LEMVQGVAMG EQPGAVGGVG QVFFSPNSRN VLPGKVVFTV DIRSPDKAKL
DRMRAKIEAE APKICDALGV GCSVEAIGHF APVTFDEKLV SSVRSAAERL GYSHMNLISG
AGHDACWAAK VAPATMVMCP CVGGLSHNEA EDISKEWATA GADVLFHAVV ETAEIVP
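
The gene and protein sequences are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the nucleotide sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table is the standard NCBI amino-acid assignment, which translation table 11 shares with table 1; table 11's alternative start codons are not handled here, which does not matter for this gene because it begins with ATG.

```python
# NCBI codon ordering: first, second and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGTGGCAG CACCAGGCGA AAACATGCGC"
print(translate(first_blocks))  # prints: MVAAPGENMR
```

The output matches the first block of the protein sequence. Passing the complete gene sequence to `gc_fraction` and `translate` in the same way would allow the GC content and Protein Length fields to be checked against the record.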