Gene Rleg_4032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4032 
Symbol 
ID8014838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4108335 
End bp4110422 
Gene Length2088 bp 
Protein Length695 aa 
Translation table11 
GC content62% 
IMG OID644826601 
ProductPeptidyl-dipeptidase Dcp 
Protein accessionYP_002977812 
Protein GI241206716 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTCTT CCCATGATTT CAATCCAGCG CTGGTGACCT GGGACGGTCT TCACGGTCTG 
CCGCGTTTCG ATGCCTTGAA CGACGGCGAT TTCGCCGCGG CCTTCGAAGC CGCACTTGCC
GCGCATGAAA GGGAGATCGA CGAGATTGCC GGAAACGGCG ATGCGCCGAC ATTCGACAAT
ACGGTCGTGG CGCTGGAGAT TGCCGGAGAC GCACTGTCGC GCGTTTCGTC GCTATTCTGG
AACAAGGCGG GCGCGCATAC CAATGATGTG ATCCAGGCGC TGGAGCGGGA GATCTCGCCG
AAGATGTCGC GCCACTATTC GAAGATCGGG ATGAATGCAG CGCTTTTTGC CCGCATCGAC
ACGCTCTGGG AGAGCCGCGA CAGCCTGGGC CTGACGCTCG AACAGACACG GGTGCTGGAA
CGCCACTGGA AAGGCTTCGT CAAATCGGGC GCCAAGCTTG AGAAGGCCGA GCAGGAAAAA
CTCGCCACGA TCAATGAAAA GCTTGCCGGC CTCGGGACGC AATTCGGCCA GAACGTGCTG
GCCGACGAAA AGGCCTGGGC ACTTGTGCTT TCCGATGGCG CCGAGCTCGA GGGCCTGCCG
GAATTCCTGC GCGACGCGAT GGCGGGGGCA GCGCGCGAAC GCGGCGAGGA GGGCAAATAT
GCGGTGACGC TGTCACGCTC GATCATCGAG CCGTTCCTCA CCTTTTCGGA GCGCCGTGAT
CTACGCGAGC AGGCTTTCAA GGCGTGGGTG GCGCGCGGTG AAAATGATGG TGAGACGGAC
AATCGCGCCG TCATCAGGGA AACGCTGGCG CTGCGCCATC AGGTGGCGAC GCTGCTGGGC
TACGGCAATT TCGCCGAGCT GAAGCTCGAC AATACGATGG CGAAGACGGC GGAGGCCGTA
AACGGCTTGT TGCGGGCTGT CTGGGCGCGA GCGGTGAAAC GCGCCGGCGA GGAGGAGATC
GATATCGCGG CAATGATTGC CGAGGAGGGC CGGAACCACG AGGTCATGCC CTGGGACTGG
CGCCACTATG CCGAAAAGAT CCGGGCGCGG AAGTTCGACT TCTCCGAGAC CGAACTCAAG
CCTTATCTGC AGCTCGAGAA GATCATCGAA GCCTGCTTCG ACGTGGCCGG CCGGCTGTTC
GGCATCCGGG CCGTCGAGAA GAAGGGCGTG GCCGCCTATC ACCCTGACGT TAGGGTGTTC
GAAATCAGGG ACCGCGAGGA CAAGCTCGTC GCCCTCTTCC TCGGCGATTA TTTCGCCCGC
AGTTCGAAAC GCTCAGGCGC CTGGATGAGC TCGCTGCAAT CGCAGCACAG GCTGGAGCTG
AAGAACGGTC GCCATGGCGA ATTGCCGATC ATCTATAATG TCTGCAACTT CGCCAAGCCC
GCGGAAGGCA AGCCGGCGCT GCTGTCGCTC GACGATGCCC GCACGCTGTT CCATGAATTC
GGCCATGCGC TGCACGGCAT GCTTTCGAAC GTCACCTATC CCTCGGTTGC GGGCACCGGC
GTCTCTCGCG ATTTCGTCGA GTTGCCGTCG CAGCTCTATG AGCACTGGTT GACGGTGCCC
GCTATCCTGA AGCGTTACGC CGTGCATGTC GAAACCGGCG AGCCGATGCC GCAGGCCCTG
CTCGACAAGG TGCTTGCCGC CCGGACCTTC AATGCCGGCT TCAATACCGT CGAGTTCACC
TCGTCGGCGC TGGTCGACAT GGCGTTTCAC ACGAGAACGG CCGTCGAGGA CCCGATGGCG
GTGCAGGCCG AGGTACTGGC CGAGATCGGC ATGCCGAAGT CGATCGTCAT GCGCCATGCG
ACGCCGCACT TCCAGCACAT CTTTTCGGGA GGCTATTCGG CCGGCTATTA CTCCTACATG
TGGTCGGAGG TGCTCGACGC CGACGCCTTT GCCGCCTTCG AGGAGACGGG AGACGCCTTC
AACGGCGAGA TGGCGCGCAA GCTCAAGGAC AATATCTATT CCGTCGGCGG TTCGGTCGAT
CCGGAAGACG CCTACAAGGC CTTCCGCGGC AAGCTGCCGA GCCCGGATGC GATGCTTGTC
AAAAAGGGAC TTTCGACCTT CGAGGAATTG ACAGGCAGCG ACGCCTAA
 
Protein sequence
MSSSHDFNPA LVTWDGLHGL PRFDALNDGD FAAAFEAALA AHEREIDEIA GNGDAPTFDN 
TVVALEIAGD ALSRVSSLFW NKAGAHTNDV IQALEREISP KMSRHYSKIG MNAALFARID
TLWESRDSLG LTLEQTRVLE RHWKGFVKSG AKLEKAEQEK LATINEKLAG LGTQFGQNVL
ADEKAWALVL SDGAELEGLP EFLRDAMAGA ARERGEEGKY AVTLSRSIIE PFLTFSERRD
LREQAFKAWV ARGENDGETD NRAVIRETLA LRHQVATLLG YGNFAELKLD NTMAKTAEAV
NGLLRAVWAR AVKRAGEEEI DIAAMIAEEG RNHEVMPWDW RHYAEKIRAR KFDFSETELK
PYLQLEKIIE ACFDVAGRLF GIRAVEKKGV AAYHPDVRVF EIRDREDKLV ALFLGDYFAR
SSKRSGAWMS SLQSQHRLEL KNGRHGELPI IYNVCNFAKP AEGKPALLSL DDARTLFHEF
GHALHGMLSN VTYPSVAGTG VSRDFVELPS QLYEHWLTVP AILKRYAVHV ETGEPMPQAL
LDKVLAARTF NAGFNTVEFT SSALVDMAFH TRTAVEDPMA VQAEVLAEIG MPKSIVMRHA
TPHFQHIFSG GYSAGYYSYM WSEVLDADAF AAFEETGDAF NGEMARKLKD NIYSVGGSVD
PEDAYKAFRG KLPSPDAMLV KKGLSTFEEL TGSDA