Gene Smed_3177 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3177 
Symbol 
ID5324056 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3340859 
End bp3342922 
Gene Length2064 bp 
Protein Length687 aa 
Translation table11 
GC content62% 
IMG OID640792125 
Productpeptidyl-dipeptidase Dcp 
Protein accessionYP_001328836 
Protein GI150398369 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGACA TCGCTTCGCT CAACCCCGCC CTCGTGAACT GGACCGGACA TGAGGGGCTG 
CCCCGGTTCG AGGCGGTAAG GGACGAGGAT TTCGGACCTG CCTTCGATGC GGCGCTTGCC
GCCCATGAGG CGGAGATCGA CGCTATCGCC CGTAATGCGG AACCCCCAAG TTTCGGGAAT
ACCGTCGTGG CGCTGGAAAT CGCCGGGGAT GAGCTGTCTC GCGTCTCCGC GCTCTTCTGG
AGCAAGGCGG GAGCCCACAC GAACGAAACG ATCCAGGCAC TGGAGCGCGA GATCGCGCCC
AAGATGTCGC GTCATTATTC GAAAATCGGC ACAAACCCAG CCCTCTTCAG CCGGATCGAC
GCGCTATGGG AGCGCCGGGA AGACCTCGGG CTCGATGTCG AGGCCATGCG CGTTCTGGAG
CGGCACTGGA AGAGCTTCGT CAAGTCGGGT GCCAAGCTCG ATAAGGCCGA TCAGGACCGC
CTCGCGGCAA TCAACGAGAA GCTCGCCAGC CTCGGGGCGC GGTTTGGCCA GAATGTTCTC
GCGGACGAGA AGGCCTGGGC GCTGCTGCTG TCGAGCGAAG AGGAACTGGC AGGGATTCCT
GATTTTCTGC GTGATGCTAT GGCGGCTGCG GCGCGCGAGC GCGGCGAGGA AGATAAGTAT
GCGGTCACGC TGTCTCGCTC GATTATCGAG CCTTTCCTGA CTTTCTCCGA AATACGAGAG
TTGCGTGAGC AGGCGTTCAG GGCATGGGCG GCGCGCGGCG AAAATGGCGG CGAAACCGAC
AATCGCGGCA TCATCGGCGA AACGCTGTCT CTGCGGGCGG AGAAGGCCGA GCTGCTCGGC
TACGAGAACT ACGCGGCACT GAAGCTCGAC AACACCATGG CGAAGACTCC CGACGCCGTG
AACGGCCTGC TGATGCAGGT TTGGGAAAAG GCCGTGGCGC GTGCCCGGGA AGAGGAGGCG
GACCTCGCCC GGCTGATCGC GGAAGAGGGC CGCAACCACG AGGTCATGCC CTGGGACTGG
CGCCACTATG CGGAAAAACT CAGGGCGCAG AGGTTCAGCT TTTCGGAAAG CGAACTGAAA
CCTTATCTGC AGCTCGAGAA GATCATCGAT GCCTGTTTCG CTATCGGCAA GCGCCTCTTC
GGCCTCACCG CAGTCGAGAA GAAGGGCATT CCCGCCTATC ACCCCGATGT GCGCGTCTAC
GAGATCCGCG ACGCCTCCGG CAAGCTTACG GCGCTTTTCC TCGGCGATTA CTTCGCGCGT
CCCTCCAAGC GCTCCGGAGC CTGGATGAGC TCATTTCAGT CGCAGCACAG GCTGGCGCTC
AAGAATGGCG AAACCGGCGA GATCCCGATC GTCTACAATG TCTGCAACTT CGCGAAGCCC
GCAGAAGGCA AGCCGGCTCT GCTCTCGATC GATGACGCCC GCACGCTCTT CCACGAATTC
GGTCATGCGC TGCACGGCAT GCTTTCAAAC GTCACCTATC CGTCCGTTTC GGGCACCGGT
GTCGCACGTG ATTTCGTCGA ACTGCCGTCG CAGCTTTACG AACATTGGCT GACGGTGCCG
GAGGTCCTGC GGACCTATGC GGTGCATTAC CGGACCGGTG AACCCATGCC GCAGGCATTG
CTCGACAAGG TGCTGGCGGC AAGGACGTTC AATTCAGGCT TCGCTACCGT CGAGTTCACC
GCTTCGGCGC TCGTCGACAT GGCCTATCAC ACGACGGAAG GGGTTGGCGA TCCGATGGCG
CTCGAGAAGG CGACGCTGGA TAAGATCGGC CTGCCCAAAT CGATCGTGAT GCGCCACCGC
AGCCCGCATT TCCTGCATGT CTTCTCCGGC GACGGTTACT CGGCCGGCTA TTACTCCTAC
ATGTGGTCGG AGGTGCTGGA CGCCGACGCC TTTGCAGCCT TCGAGGAGAC CGGCGATCCC
TTCGACCCGG CGACGGCGGC GAGATTGAAG GACAACATCT ATTCGGTCGG CGGCTCCGTC
GATCCGGAAG ACGCCTACAA GGCTTTCCGC GGCAAGCTGC CGAGCCCTGA AGCGATGCTG
GGCAAGAGGG GGCTTGCGGC TTAG
 
Protein sequence
MNDIASLNPA LVNWTGHEGL PRFEAVRDED FGPAFDAALA AHEAEIDAIA RNAEPPSFGN 
TVVALEIAGD ELSRVSALFW SKAGAHTNET IQALEREIAP KMSRHYSKIG TNPALFSRID
ALWERREDLG LDVEAMRVLE RHWKSFVKSG AKLDKADQDR LAAINEKLAS LGARFGQNVL
ADEKAWALLL SSEEELAGIP DFLRDAMAAA ARERGEEDKY AVTLSRSIIE PFLTFSEIRE
LREQAFRAWA ARGENGGETD NRGIIGETLS LRAEKAELLG YENYAALKLD NTMAKTPDAV
NGLLMQVWEK AVARAREEEA DLARLIAEEG RNHEVMPWDW RHYAEKLRAQ RFSFSESELK
PYLQLEKIID ACFAIGKRLF GLTAVEKKGI PAYHPDVRVY EIRDASGKLT ALFLGDYFAR
PSKRSGAWMS SFQSQHRLAL KNGETGEIPI VYNVCNFAKP AEGKPALLSI DDARTLFHEF
GHALHGMLSN VTYPSVSGTG VARDFVELPS QLYEHWLTVP EVLRTYAVHY RTGEPMPQAL
LDKVLAARTF NSGFATVEFT ASALVDMAYH TTEGVGDPMA LEKATLDKIG LPKSIVMRHR
SPHFLHVFSG DGYSAGYYSY MWSEVLDADA FAAFEETGDP FDPATAARLK DNIYSVGGSV
DPEDAYKAFR GKLPSPEAML GKRGLAA