Gene Smed_3733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3733 
Symbol 
ID5318661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp177034 
End bp178005 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content63% 
IMG OID640775546 
Productmembrane dipeptidase 
Protein accessionYP_001312479 
Protein GI150375883 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.632106 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTATCG ATGCACTTCA ATGTGGTCAT TTCGACCGCG GGTCCTTCGA GGCGCTAAGG 
CGGGGCGGCT ACAGTGCGGT GACGCCGACG CTCGGCTTCT GGGAAGGGAC GATGGAGTCT
CTCGACTCGC TTGCCCGCTG GCGAGACATG GAGCGCGAGA ACGCCGACCT GATCCTTATT
GCCAGGACCG CTGCCGACAT CGAGCGCGCC GAAACGGAAG GCAAGCTGGC GGTCGTGCTC
GGCTACCAGA ACTCGAACCT GTTCGAGGAC CGCATCGCCT TCGTTGAATT CTTCGCCGAG
CTCGGCGTCC GCGTGGTCCA GCTGACCTAC AACAACCAGA ACGAACTCGG CGGCTCCTGT
TACGAGGAGA ATGACAGTGG CCTTGCTCGG TTCGGCCGCG ATGTCGTACG GGAAATGAAC
CGCGTCGGCA TGCTGGTCGA TCTCTCCCAT GTCGGCGACC GGACGACTCT CGACGCCATC
GAATGGTCGG AAAGGCCGGT TGCGATCACG CATGCCAATG CCGCTTCGCT TTTTGCCCAC
AAGCGCAACA AGTCGGACAA GGTGATCAAG GCTCTTGCCG AACGCGGCGG TGTCATTGGG
TGCGTCGCCT ACCGGAACAT CACGCCCGAC GCCGCCTGCG CCACCGTCGA CGGCTGGTGC
GAGATGGTCG CCCGCACCGT CGACATAGCC GGCATCGACC ATGTCGGCAT CGGCACCGAC
ATTTCGCACA ACCACACCCC GCGCGACTAC GACTGGATGC GCAAGGGCCG CTGGACCCGC
TCGGTTCAGT ATGGTGCAGG CTCGCCGGAG CGGCCTGGCG CGGTGGCGAA GCCGGAATGG
CTGCTCAAGC CGGAAAACCT GCAGGATGTC GCCGCGGCAC TGCTGCGCGC CGGCTTCAAT
CAGGAGGAAG CGAACAAGAT CCTTCGCGGC AACTGGCTCC GTCTTTACGC GGAGGTTTTC
CGTCCGAACT GA
 
Protein sequence
MIIDALQCGH FDRGSFEALR RGGYSAVTPT LGFWEGTMES LDSLARWRDM ERENADLILI 
ARTAADIERA ETEGKLAVVL GYQNSNLFED RIAFVEFFAE LGVRVVQLTY NNQNELGGSC
YEENDSGLAR FGRDVVREMN RVGMLVDLSH VGDRTTLDAI EWSERPVAIT HANAASLFAH
KRNKSDKVIK ALAERGGVIG CVAYRNITPD AACATVDGWC EMVARTVDIA GIDHVGIGTD
ISHNHTPRDY DWMRKGRWTR SVQYGAGSPE RPGAVAKPEW LLKPENLQDV AAALLRAGFN
QEEANKILRG NWLRLYAEVF RPN