Gene Information |
Locus tag | Smed_3733 |
Symbol | |
ID | 5318661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 177034 |
End bp | 178005 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640775546 |
Product | membrane dipeptidase |
Protein accession | YP_001312479 |
Protein GI | 150375883 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog |
TIGRFAM ID | |
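The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not say so, but the numbers agree with that reading) and that Protein Length excludes the terminal stop codon. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 177034
END_BP = 178005


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


gene_length = span_length(START_BP, END_BP)
print(gene_length)                  # 972, matches "Gene Length"
print(protein_length(gene_length))  # 323, matches "Protein Length"
```

Under these assumptions, 178005 - 177034 + 1 = 972 bp, and 972 / 3 = 324 codons, of which the last is the stop codon, leaving 323 residues.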
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.632106 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTATCG ATGCACTTCA ATGTGGTCAT TTCGACCGCG GGTCCTTCGA GGCGCTAAGG CGGGGCGGCT ACAGTGCGGT GACGCCGACG CTCGGCTTCT GGGAAGGGAC GATGGAGTCT CTCGACTCGC TTGCCCGCTG GCGAGACATG GAGCGCGAGA ACGCCGACCT GATCCTTATT GCCAGGACCG CTGCCGACAT CGAGCGCGCC GAAACGGAAG GCAAGCTGGC GGTCGTGCTC GGCTACCAGA ACTCGAACCT GTTCGAGGAC CGCATCGCCT TCGTTGAATT CTTCGCCGAG CTCGGCGTCC GCGTGGTCCA GCTGACCTAC AACAACCAGA ACGAACTCGG CGGCTCCTGT TACGAGGAGA ATGACAGTGG CCTTGCTCGG TTCGGCCGCG ATGTCGTACG GGAAATGAAC CGCGTCGGCA TGCTGGTCGA TCTCTCCCAT GTCGGCGACC GGACGACTCT CGACGCCATC GAATGGTCGG AAAGGCCGGT TGCGATCACG CATGCCAATG CCGCTTCGCT TTTTGCCCAC AAGCGCAACA AGTCGGACAA GGTGATCAAG GCTCTTGCCG AACGCGGCGG TGTCATTGGG TGCGTCGCCT ACCGGAACAT CACGCCCGAC GCCGCCTGCG CCACCGTCGA CGGCTGGTGC GAGATGGTCG CCCGCACCGT CGACATAGCC GGCATCGACC ATGTCGGCAT CGGCACCGAC ATTTCGCACA ACCACACCCC GCGCGACTAC GACTGGATGC GCAAGGGCCG CTGGACCCGC TCGGTTCAGT ATGGTGCAGG CTCGCCGGAG CGGCCTGGCG CGGTGGCGAA GCCGGAATGG CTGCTCAAGC CGGAAAACCT GCAGGATGTC GCCGCGGCAC TGCTGCGCGC CGGCTTCAAT CAGGAGGAAG CGAACAAGAT CCTTCGCGGC AACTGGCTCC GTCTTTACGC GGAGGTTTTC CGTCCGAACT GA
|
Protein sequence | MIIDALQCGH FDRGSFEALR RGGYSAVTPT LGFWEGTMES LDSLARWRDM ERENADLILI ARTAADIERA ETEGKLAVVL GYQNSNLFED RIAFVEFFAE LGVRVVQLTY NNQNELGGSC YEENDSGLAR FGRDVVREMN RVGMLVDLSH VGDRTTLDAI EWSERPVAIT HANAASLFAH KRNKSDKVIK ALAERGGVIG CVAYRNITPD AACATVDGWC EMVARTVDIA GIDHVGIGTD ISHNHTPRDY DWMRKGRWTR SVQYGAGSPE RPGAVAKPEW LLKPENLQDV AAALLRAGFN QEEANKILRG NWLRLYAEVF RPN
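As a rough illustration of how the gene and protein sequences relate, the following sketch translates a nucleotide string codon by codon and computes its GC fraction. It uses the standard codon assignments, which translation table 11 shares for elongation. Table 11 also permits alternative start codons, which this sketch does not handle; that is not an issue here because the gene begins with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the spaces between 10-character blocks are treated as display formatting only.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGATTATCG ATGCACTTCA ATGTGGTCAT"))  # MIIDALQCGH
# Last two blocks of the gene sequence, ending in the TGA stop codon.
print(translate("CGTCCGAACT GA"))  # RPN
```

The two outputs match the first ten and last three residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the whole record to be compared against the Protein sequence and GC content fields.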