Gene Rleg_1994 details

Gene Information

Locus tag: Rleg_1994
Symbol:
ID: 8013030
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1991694
End bp: 1992887
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 65%
IMG OID: 644824581
Product: Serine-type D-Ala-D-Ala carboxypeptidase
Protein accession: YP_002975813
Protein GI: 241204717
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1686] D-alanyl-D-alanine carboxypeptidase
TIGRFAM ID:
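
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS whose last codon is a stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1991694, 1992887)
print(length)                  # 1194
print(protein_length(length))  # 397
```

Both results agree with the Gene Length (1194 bp) and Protein Length (397 aa) fields.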


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0979633
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0234131
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAAGC CCGCGCTTCG CCTGCTTGCA TGCCTGATGC CGTTCGCCAC CGGCGCCTTT 
GCAGCCGATG GCGGTGCCGC CGGTTTCGCC ACCAAGGCGG CGCAGGCCTA TATGATCGAG
GCCGCCACCG GCACGGTGCT TCTCGCCAAA AACGAGGATC AGGGCTTTTC GCCGGCCTCG
CTCGCCAAAC TGATGACGAT GGATCTCGCC TTCGAGGCGC TGACCAAGGG CCAGATCACG
CTCGACACCG AATATCCCGT TTCCGAATAT GCCTGGCGGA CGGGCGGCGC GCCGTCGCGG
ACGGCAACAA TGTTTGCCAG CCTCAAATCG CGCGTGCGCG TCGAGGACCT GATCAAGGGC
GTCGCCATCC AGGGCGCCAA CGACAGCTGC ATCATCCTCG CCGAAGGCAT GGCCGGAAGC
GAGCAGCAAT TTGCCGTGTC GATGACGCGG CGCGCCCGCG AGCTCGGCAT GGAGAAGGCC
GAGTTCGGAA ATTCCACCGG CCTTCCCGAC GGCAAGAGCA AGGTGACGGC ACGCGAGATG
GTGACGCTCG CCGCCGCCCT CCAGCAGACC TATCCGAACC TCTATCCCTA TTTCGCGCAG
CCGGATTTCG AGTGGAACAA GATCTTCCAG CGCAACCGCA ATCCGCTGCT CGGGCTCGAT
CTCGGCGCCG ATGGGCTGGC GACGGGCTTT ACCGAGGGCG AGGGCTATTC GATCGTCGCT
TCGGTTGAGC GTGACGGCCG GCGGCTTTTT GTGGCGCTTG CCGGCATCGC CTCCGACAAG
GAGCGGACGG AGGAAGCCAA ACGCGTACTC GAATGGGGGC TGACGGCCTT CGAGAACCGG
CAGATCTTCG GCGAGAAGGA AGTGATTGGT GCTGCCAGCG TCTATGGCGG CACGGCGCGT
ACCGTCGACC TCGTCGCCAA GGCGCCGGTC AGCGTCTATA TCCCGATCAG CAATCCCGAC
CGGCTGTCGG CGCGCATCAT CTATCGCTGG CCGCTGACGG CGCCGGTCAA GCCGGATACC
CAGGCAGGAA CGCTGAGGAT TTTCGCAGGC AGCCGGCTGC TCAGGGAAGT GCCGCTTTAT
ACCGTGCAGG CAGTCGGCGA GGGATCGCTC AGCAGCCGGG CGGTCGATGC CATGCTGGAA
CTCGGCGAAT CGCTGTTCTT CTCCTGGCTC TGGGACAAGC CCGCGCCCGT CTGA
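
A GC content figure such as the one reported for this gene can be recomputed from a sequence laid out in 10-base blocks like the one above. This is a minimal sketch: the record does not say how its percentage was computed or rounded, so the helper name `gc_content` and the choice to ignore whitespace and letter case are assumptions.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence: 3 G + 2 C out of 10 bases.
print(gc_content("ATGCTGAAGC"))  # 50.0
```

Passing the full gene sequence, spaces and line breaks included, gives the value for the whole gene.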
 
Protein sequence
MLKPALRLLA CLMPFATGAF AADGGAAGFA TKAAQAYMIE AATGTVLLAK NEDQGFSPAS 
LAKLMTMDLA FEALTKGQIT LDTEYPVSEY AWRTGGAPSR TATMFASLKS RVRVEDLIKG
VAIQGANDSC IILAEGMAGS EQQFAVSMTR RARELGMEKA EFGNSTGLPD GKSKVTAREM
VTLAAALQQT YPNLYPYFAQ PDFEWNKIFQ RNRNPLLGLD LGADGLATGF TEGEGYSIVA
SVERDGRRLF VALAGIASDK ERTEEAKRVL EWGLTAFENR QIFGEKEVIG AASVYGGTAR
TVDLVAKAPV SVYIPISNPD RLSARIIYRW PLTAPVKPDT QAGTLRIFAG SRLLREVPLY
TVQAVGEGSL SSRAVDAMLE LGESLFFSWL WDKPAPV
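
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below relies on the fact that table 11 assigns the same amino acids to sense codons as the standard code and differs only in which codons may act as starts. The helper `translate` is illustrative and does not handle alternative start codons, which is harmless here because this gene starts with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence (10 codons).
print(translate("ATGCTGAAGC CCGCGCTTCG CCTGCTTGCA"))  # MLKPALRLLA
# Last five codons of the gene sequence, ending in the TGA stop.
print(translate("CCCGCGCCCGTCTGA"))  # PAPV
```

The two outputs match the first ten and last four residues of the protein sequence above.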