Gene Rleg2_3853 details

Gene Information

Locus tag: Rleg2_3853
Symbol:
ID: 6982616
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 3995568
End bp: 3996626
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 63%
IMG OID: 643398575
Product: peptidase M19 renal dipeptidase
Protein accession: YP_002283341
Protein GI: 209551424
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog
TIGRFAM ID:
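
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3995568
end_bp = 3996626

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1059 352
```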


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.417509
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAATTCG TATTTGACGG CCACAACGAC GTTCTCCTTC GACTCTGGAC ACATTCAAAA 
GACGGCAGCG ATCCGATCTC GGAATTTGCA GACGGCACGA CGGTCGGGCA TATCGATGCG
CGCCGGGCCA GGGAGGGCGG CCTTTCGGGC GGTCTTTGCG CCATCTACAT TCCCTCCGGC
GACCTGGTCT TTGCCGATCC GGATGCTGAC GGCCGTTATA TCACGCCGAT GGCGGCGCCT
CTCGATCCGC TGCCGTCCCT TGCCATCGCC ACTGAAATGG CGGCGATCGC GTTGCGGCTC
GACCAGGCCG GCGCCTGGCG GCTCTGCCGG ACGGTGAAGG ATATCCGCGG CGCCATGGCG
GACGATATTT TTGCCGCCGT CCTGCATATG GAAGGCTGCG AAGCGATCGG CGCCGATCTT
GCGGCGCTCG AAGTCTTCTA CGCAGCCGGG CTGCGGTCGC TCGGGCCGGT CTGGAGCCGG
CACAATGTCT TCGGTTACGG CGTGCCCTTC GCCTTTCCGA TGTCGCCGGA CACGGCACCC
GGCCTCACCG ATGCCGGTTT TGCGCTGGTG CGGGAATGCA ATCGCCTCGG TATCGTGATC
GACCTTGCCC ACATCACCGA GAAGGGCTTC TGGGACGTGG CGAAGACGAC GGACCAGCCG
CTGGTCTCCA GCCATTCCAA TGCCCATGCG CTGACGCCGG TCGCGCGCAA TCTGACCGAC
AGGCAGCTCG ATGCGATCCG CGAAAGCCGC GGGCTCGTCG GCATCAATTA TGCCACCGCC
ATGCTGCGTC CCGACGGCCG CTCGGACAGC GATACACCGC TTGCCGACAT GATCCGCCAT
ATCGACTATC TGGTGAACCG CATCGGCATC GATTGCGTCG GCCTCGGATC GGACTTCGAC
GGGGCCACAA TTCCTGAGGA AATCGGCGAT GCAAGCGGCA ATCAGAAGCT GATTGCCGCT
CTCAGGGAGG TTGGTTATGG TGAGGCCGAT CTCACGAAAC TTGCCCGTGA AAATTGGCTT
CGCATCCTCG CACAAGCCTG GCGCGAGGAC GACGCCTAA
 
Protein sequence
MQFVFDGHND VLLRLWTHSK DGSDPISEFA DGTTVGHIDA RRAREGGLSG GLCAIYIPSG 
DLVFADPDAD GRYITPMAAP LDPLPSLAIA TEMAAIALRL DQAGAWRLCR TVKDIRGAMA
DDIFAAVLHM EGCEAIGADL AALEVFYAAG LRSLGPVWSR HNVFGYGVPF AFPMSPDTAP
GLTDAGFALV RECNRLGIVI DLAHITEKGF WDVAKTTDQP LVSSHSNAHA LTPVARNLTD
RQLDAIRESR GLVGINYATA MLRPDGRSDS DTPLADMIRH IDYLVNRIGI DCVGLGSDFD
GATIPEEIGD ASGNQKLIAA LREVGYGEAD LTKLARENWL RILAQAWRED DA
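
The sequences above are displayed in space-separated blocks of 10 residues. As a minimal sketch in plain Python with no external libraries, the following shows how one such block-formatted row could be cleaned, translated, and summarised for GC content. The function names (`clean`, `translate`, `gc_fraction`) are hypothetical helpers introduced for illustration. The codon table is the standard one, whose amino acid assignments are shared by translation table 11 (the two differ in permitted start codons).

```python
# Standard codon table in TCAG order; translation table 11 uses the
# same amino acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence, copied from the record.
first_row = "ATGCAATTCG TATTTGACGG CCACAACGAC GTTCTCCTTC GACTCTGGAC ACATTCAAAA"
dna = clean(first_row)
print(translate(dna))  # MQFVFDGHNDVLLRLWTHSK
print(round(gc_fraction(dna), 2))
```

Applying the same helpers to the full gene sequence gives values that can be compared against the listed Protein Length and GC content fields.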