Gene Rleg_4779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4779 
Symbol 
ID8007032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp150893 
End bp152053 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content62% 
IMG OID644821709 
Productamidohydrolase 
Protein accessionYP_002972969 
Protein GI241113134 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.488683 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0993236 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAATCG ACAAGGATGC GCTGCATGCG GAAATGACAG CGTGGAGACG CGATCTCCAC 
GCTCATCCCG AATTTGGCTT CGAGGAGCGG CGGACATCCG CCTTCGTTGC GGCCAAGCTG
CGGGAATTCG GCTTCGACGA GGTCACCGAG GGCATCGGCG GCACCGGCGT CGTCGGAACG
CTGAAACGCG GCAACGGCAA TCGCGCCATT GCCCTGCGTG CCGATATGGA TGCGCTCAGG
ATCAACGAAC AGGCGGAGCT CTCGCACCGG TCCCAAAACC CGGGAATCAT GCATGCTTGC
GGTCACGACG GCCACACCGC CATGCTGCTC GGCGCAGCAA AGGTCCTGGC CGGGGAAGGC
GGTTTCGACG GCACGGTACG CTTCATCTTC CAGCCTGCAG AAGAATGGGG CAAAGGCGCG
CTGGCAATGA TCGCCGATGG GCTCTTCGAA AGATTCCCCT TCGACGAGAT CTACGGCATC
CACAACATGC CGGGGATCGA CATTGGCCGC TTCCATACGC GTCCTGAAGC GATCATGTCC
GCCGAGGACA ATTTCGAGAT TACGCTGACC GGCGTCGGCG GCCACGCCGC CCGGCCTCAC
TGGGGCAATG AAGTGCTCGT CGCGGCCTGC GCGCTCGTGA CCAATCTGCA GACCATCGTC
TCGCGGCGAC TGGATCCGGC CGACATCGCC GTCGTCTCCG TCACTGAGCT GATCACCGAC
GGCACGAGGA ATGCGCTTCC CGGCTTCGCC CGCATTCTGG GCGACGCCCG CAGCTTTCGC
TCGGAGATCA GCGAGACGAT CGAGAAGCAG ATGCGCGTGA TCGCCGAGGG TACCGCCATG
ACGCACAACA TCAAGGCTGA CGTCGTCTAC ACCAGGGAAT TCATCCCTCT CATGAACGAT
CCGTCGTTGA CGGAGGAGGC CTTGAGCGTC GCACGCGATC TGTACGACGC TTCAAATGTC
GCCATCGCGA GCAAGCCCAT GACCGGATCC GAAGACTTCG CGCAGTTCCT TACGCGGGTT
CCGGGCTGTT TCGTGTTCCT TGGCAACGGC GAGCATTCGC CGCCACTTCA TAACCCGACC
TATGACTTCA ACGATGCCGG CCTCCTGCAT GGGGCAAACT TCCACGCAGG GATTGTGCGT
CGACGGCTTC AGACAAGCTG A
 
Protein sequence
MTIDKDALHA EMTAWRRDLH AHPEFGFEER RTSAFVAAKL REFGFDEVTE GIGGTGVVGT 
LKRGNGNRAI ALRADMDALR INEQAELSHR SQNPGIMHAC GHDGHTAMLL GAAKVLAGEG
GFDGTVRFIF QPAEEWGKGA LAMIADGLFE RFPFDEIYGI HNMPGIDIGR FHTRPEAIMS
AEDNFEITLT GVGGHAARPH WGNEVLVAAC ALVTNLQTIV SRRLDPADIA VVSVTELITD
GTRNALPGFA RILGDARSFR SEISETIEKQ MRVIAEGTAM THNIKADVVY TREFIPLMND
PSLTEEALSV ARDLYDASNV AIASKPMTGS EDFAQFLTRV PGCFVFLGNG EHSPPLHNPT
YDFNDAGLLH GANFHAGIVR RRLQTS