Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5236 |
Symbol | |
ID | 5319538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 196826 |
End bp | 198037 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640777013 |
Product | peptidase M20 |
Protein accession | YP_001313945 |
Protein GI | 150377350 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2195] Di- and tripeptidases |
TIGRFAM ID | [TIGR01883] peptidase T-like protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCATGA TGGACAACTT CGCAATTCCC TTGGATACGG CTGCTGCCGT AGAACACCTC ATGAGGTTTC TCTCCGTAGA GGGAGTAACG GGCAAGGAGG CGGACATCGC GGCGTCGGTC ATCGAAGCTC TCAAGGCCGT GGGGGTTTCT GGAGAAAACA TCCGCTTCGA CGACGCCAAC GAGAGGATCC CTTTACCGAC CGAAACGGGA AACCTGATAG TCGATCTTCC GGGCACCCGA CCCGGCCCTC GGCTGCTGTT CTCCACGCAT CTCGACACTG TACCCTTGTG TGCAGGAGCA AAACCACTGC GTGACGGCAA TCGCATAGTG TCGGACGGCA CAACCGCCCT CGGCGGGGAC GCGCGCACAG GGGTTGCTCT TCTGGTGGTC GTGGCGGAAA CGCTGATCAA GCATAGCCTC CCGCACCCGC CGATCACCCT TCTTTTTACC GTTCGCGAGG AGAGCGGGCT CCACGGCGCC CGCGAACTCG ACCCGGCCGT CCTTGGCGGA CCGGTGATGT GCGTCAACGT CGACGGCCAG CTCGCATCGG ACCTCATCAT TGGAGCAGTG GGGCAGGAGA ACTGGGAGGC AGAAATCGTG GGACGGGCGT CCCACGCCGG AGTCGCGCCG GAAACGGGGA TATCCGCAAC GCTCGTGGGC GCCCTCGCCC TGGCTGCGGC CTATGCGGCA GGCTGGTTCG GAAAGATCGA GAAATCCGAC GATCGCGGCA CAAGCAACAT CGGAATTTTC GGCGGGAAGG ACGGCATGGC AGCCGGCGAC GCGACGAACG TCGTCACCGA CTACGCTTTC CTGAAAGGCG AGGCCCGCAG TCCGGAACCG GCCTTCGCAA AGATGATAGC CGAAGGCTAT GAAGCTGCTT TCGAGAAGGC GAAGCAGGCG GTGAAGGATC GCAACGGAGA GACCGCCCGC GTGACCTTCA TTCACCACAC AGCTTATCCA CCATTCAAGC TCAACGAGAA TTCGCCGGCT GTGGTTCGCG CAGCCAAGGC GATGAAGTTG CTCGGACTCG AGCCGAACTA TCTGTTCTCA AATGGAGGTC TCGACGCCAA TTGGCTGGAC AAGCACGGCG TTCCTACGGT CACGATCGGT GCAGGTCAGG CGGAAATCCA TACCGTGAAC GAGTATGTTG ACCTTAGGGA GTATGAGAAA GGATGTCGGC TTGGCGTGCT GCTTGCAACG ATGCCGGAAT AG
|
Protein sequence | MVMMDNFAIP LDTAAAVEHL MRFLSVEGVT GKEADIAASV IEALKAVGVS GENIRFDDAN ERIPLPTETG NLIVDLPGTR PGPRLLFSTH LDTVPLCAGA KPLRDGNRIV SDGTTALGGD ARTGVALLVV VAETLIKHSL PHPPITLLFT VREESGLHGA RELDPAVLGG PVMCVNVDGQ LASDLIIGAV GQENWEAEIV GRASHAGVAP ETGISATLVG ALALAAAYAA GWFGKIEKSD DRGTSNIGIF GGKDGMAAGD ATNVVTDYAF LKGEARSPEP AFAKMIAEGY EAAFEKAKQA VKDRNGETAR VTFIHHTAYP PFKLNENSPA VVRAAKAMKL LGLEPNYLFS NGGLDANWLD KHGVPTVTIG AGQAEIHTVN EYVDLREYEK GCRLGVLLAT MPE
|
| |