Gene Rleg_4181 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4181 
Symbol 
ID8014971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4276231 
End bp4277289 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content63% 
IMG OID644826751 
Productpeptidase M19 renal dipeptidase 
Protein accessionYP_002977961 
Protein GI241206865 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.351187 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATTCG TATTTGACGG TCACAACGAC GTTCTCCTTC GACTCTGGAC ACATTCAAAA 
GACGGCAGCG ACCCGATCGC GGAATTCGCA GACGGCACGA CGGTCGGCCA TATCGATGCG
CATCGGGCTA GAGAGGGCGG TCTTTCAGGC GGCCTCTGTG CCATCTATAT TCCCTCGGGC
GATCTCGTCT TCGCCGATCC GGATGCCCAC GGCCGCTATA TGACGCCGAT GGCGGCCCCT
CTCGATCCGC TGCCTTCGCT TGCCATCGCC AATGAAATGG CGGCGATTGC GCTGCGGCTC
GACCAGGCCG GCGCCTGGCG GCTCTGCCGG ACGGTGAAGG ATATCCGCGG CGCCATGGCG
GACGACATTT TCGCCGCCGT CATGCATATG GAAGGCTGCG AGGCGATCGG CGCCGATCTC
TCGGCGCTCG AGGTGTTCTA CGCGGCAGGG CTGCGGTCGC TCGGGCCGGT CTGGAGCCGG
CACAATGTCT TCGGTCACGG CGTGCCCTTC GCCTTCCCGA TGTCGCCGGA CACGGCGCCG
GGCCTTACCG ATGCCGGCTT CGCGCTGGTC AGGGAATGCA ATCGCCTCGG CATCCTGATC
GACCTTGCCC ATATCACCGA GAAGGGTTTC TGGGACGTGG CGAAGAAGAC GGACCAGCCG
CTGGTCGCCA GCCATTCCAA TGCCCACGCC CTGACGCCGG TCGCGCGCAA CCTGACGGAC
AGGCAGCTCG ATGCGATCCG CGAAAGCCGC GGGCTCGTCG GCATCAACTA TGCCACCGCC
ATGTTGCGTG CCGACGGCCG CTCAGACAGC GACACGCCGC TTGCCGACAT GATCCGCCAT
ATCGACTATC TTGTGAATCG CATCGGCATC GACTGCGTGG CGCTCGGATC GGACTTCGAC
GGGGCCACCA TTCCGGAGGA AATCGGTGAT GCAGCCGGTA ATCAGAAGCT GATTGCCGCT
CTCAGAGAGG TTGGCTATGC TGACGCCGAC CTGGCAAAAC TTGCCCGTGA AAACTGGCTT
CGCATTCTGG CCCAGGCTTG GCGGGAGGAC CACGCCTAA
 
Protein sequence
MQFVFDGHND VLLRLWTHSK DGSDPIAEFA DGTTVGHIDA HRAREGGLSG GLCAIYIPSG 
DLVFADPDAH GRYMTPMAAP LDPLPSLAIA NEMAAIALRL DQAGAWRLCR TVKDIRGAMA
DDIFAAVMHM EGCEAIGADL SALEVFYAAG LRSLGPVWSR HNVFGHGVPF AFPMSPDTAP
GLTDAGFALV RECNRLGILI DLAHITEKGF WDVAKKTDQP LVASHSNAHA LTPVARNLTD
RQLDAIRESR GLVGINYATA MLRADGRSDS DTPLADMIRH IDYLVNRIGI DCVALGSDFD
GATIPEEIGD AAGNQKLIAA LREVGYADAD LAKLARENWL RILAQAWRED HA