Gene Information |
Locus tag | Smed_5170 |
Symbol | |
ID | 5319472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 123669 |
End bp | 124706 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640776948 |
Product | peptidase S58 DmpA |
Protein accession | YP_001313880 |
Protein GI | 150377285 |
COG category | [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3191] L-aminopeptidase/D-esterase |
TIGRFAM ID | |
|
|
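The length fields in the record above are consistent with its coordinates. The sketch below shows the arithmetic, assuming inclusive 1-based coordinates and an unspliced CDS that ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon is counted in the gene length but is not translated.
    return gene_len_bp // 3 - 1


length = gene_length_bp(123669, 124706)
print(length, protein_length_aa(length))  # 1038 345
```

With Start bp 123669 and End bp 124706 this gives 1038 bp and 345 aa, matching the Gene Length and Protein Length fields.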
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.935854 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.481324 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACGG CACGGGAACT GGGACTGATA CCGCAGGGCC GGCTGGCGCC GGGGCCGGGC AATGAAATAA CCGACGTTGC GGGCGTTTCG ATCGGCCCCC GGTCGTTGCG CGGCGAGGGC GTGTACACAG GGATCACTGC CATCGTCCCG CATCCGGGCG ATATCTTCCG CGTCAAGCCG CGGGCGGCGG TGGAGGTCAT CAACGGCTTC GGCAAATCGG CGGGGCTGAT GCAGGTGGCC GAACTCGGCA CGCTCGAGAC GCCGATCCTG CTCACCAACA CCTTTTCCGT ATCCGCCTGT ACGGAGGCGC TTATCCGCCG CGCCGTCGCC GCCAATCCGG CAATCGGGCG ACAGACCTCG ACCGTCAATG CAGTGGTCTG CGAATGCAAC GACGGCAGCA TCAACGACAT TCAGGCGCTG GCGGTAACGC CAGCCGATGC CGAGGCGGCG CTTGACGCCG CCCGCATGGG GCCAGTGGAG CAGGGGGCGG TGGGTGCCGG CTCCGGCATG ACGGCTTTCG GCTTCAAGGC GGGTATCGGC ACGGCGTCGC GGCGCATGCG GATAGGCAAG CGCGACTTCA CGCTGGGCAC GCTAGTTCTC GCAAATTTCG GGGCGGCCGG CGATCTCGTC CTGCCGGACG GGCGCCGACC CGATCCGAAA GTGCCGGCCA AGCCCGAATG TGGCTCCGTT ATCGTCGTCA CGGCGACGGA CCTTCCTTTG GCCGATCGCC AGTTGCAGCG GATCACGCGG CGGGCCGGCG CGGGTCTTGC CCGCCTCGGC GCCTTCTGGG GCCATGGCAG CGGCGATGTC GCCCTCTGTT TCACCACTGC CGATCCGGTG GAGCACGAGC CGCCCGCATC CTTCGCCACG CAGGAGCGGA TCGCTGACGC TCACATCGAC ATCGCTTTCC GCGCCGCCGC CGAGACGACG CAGGAGGCGG TCCTGAACGC GCTCTGCATG GCGCCCGCCA TGGCGGGCCG CAGCGGCCGG ATCATTCCTT CTCTTGCCGA CTGGCTGAGG AAGAACAGCG TTTCATGA
|
Protein sequence | MKTARELGLI PQGRLAPGPG NEITDVAGVS IGPRSLRGEG VYTGITAIVP HPGDIFRVKP RAAVEVINGF GKSAGLMQVA ELGTLETPIL LTNTFSVSAC TEALIRRAVA ANPAIGRQTS TVNAVVCECN DGSINDIQAL AVTPADAEAA LDAARMGPVE QGAVGAGSGM TAFGFKAGIG TASRRMRIGK RDFTLGTLVL ANFGAAGDLV LPDGRRPDPK VPAKPECGSV IVVTATDLPL ADRQLQRITR RAGAGLARLG AFWGHGSGDV ALCFTTADPV EHEPPASFAT QERIADAHID IAFRAAAETT QEAVLNALCM APAMAGRSGR IIPSLADWLR KNSVS
|
| |
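The GC content field can be checked against the gene sequence. The sketch below computes the fraction of G and C bases, ignoring the spaces that split the sequence into 10-base blocks. How the database computes or rounds its figure is not stated in the record, so treat this as an assumption. The function name is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two 10-base blocks of the gene sequence in this record.
# Paste the full sequence to compare against the reported GC content.
prefix = "ATGAAGACGG CACGGGAACT"
print(f"{gc_content(prefix):.0%}")
```

A short prefix will not necessarily match the value for the whole gene. Only the full sequence is comparable to the GC content field.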