Gene Information |
Locus tag | Smed_6502 |
Symbol | |
ID | 5320805 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009622 |
Strand | + |
Start bp | 191590 |
End bp | 192570 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640778054 |
Product | restriction endonuclease |
Protein accession | YP_001314986 |
Protein GI | 150378392 |
COG category | [V] Defense mechanisms |
COG ID | [COG1715] Restriction endonuclease |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.843252 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGAGAAAAC TAAGTGCAGA GGAATGCGAT GCACTCGATC TTGAGCTCAC TAAATTGCGG GCGGACGAGA GCCTCGATCC GCAGGAGTCG GAATTGGTCC TGGGCCGCGT ATTGGAGCCG CTGTTTGGCA TCGAAGGCTA CCGGGTTGAG CACACCGGCG GATTGAACGA TCAGGGAATC GATTTTCGCG CCTTGCGTAT GGCGGATGAG ACCAGTTCTG CACCCGCCGA GACTATCGGA GTTCAGGCCA AGTTTTACAG GAAGGCTGTT CGCCGTGTGC CGATGGGGGA GGTGCAGAAG TTGATCGGCG CTGCGTTATT GCAGGACCTC ACCCGCGTCG TGCTCGTGAG CAACGGCGAA TTTTCGCGCG AGGCTCATGC CGCCGTCGAG AAAAGCTTGC CGCTTCGAAT CGAATTGCTC GACATTAGCG GAATGCGTGG CTGGATCAGC CGCCTGCGCG AAGAGAAGGT CGACGTGGAG GCTGAGGTTC GGATCATGCT GCGCGATCTC AGCAGCGGGC TAGCGCGGTT GATCGCGAAG AGTCCGGATG CCTTGGATCA TCTCGAATGG CGCATGGTCG AGCAAGTCGT CGCAGAGGTG TTCGAGGGGT TGGGTTTTGT CGTGACGCTT ACGCCGGGGT CGAAGGATGG CGGAAAGGAC GTCATTCTGA CGTGCACGGT GAAGGGTAAG CTCGCGGAGT ATTACGTTGA GATCAAACAT TGGCGTTCAT CCACGAAGGT CGGATCGATT GCCGTGGAAA AGCTCTTGAA GGTCATCGTC GAGGAAAAGA AAGACGGTGG ACTCTTCCTG TCGACATACG GCTTCACGTC GAATGCATTC GAGCAGCTCA CGACTATTGA CAAGCAAAAA CTAAAATTCG GCGATCAGGA GAAGATCGTC ACCTTTTGCC AAACATACGT CAAAGCCAAG GCTGGACTCT GGTCGCCGCC GGAGAACCTC ACCGAGGTGC TTTTCGCGTG A
Protein sequence | MRKLSAEECD ALDLELTKLR ADESLDPQES ELVLGRVLEP LFGIEGYRVE HTGGLNDQGI DFRALRMADE TSSAPAETIG VQAKFYRKAV RRVPMGEVQK LIGAALLQDL TRVVLVSNGE FSREAHAAVE KSLPLRIELL DISGMRGWIS RLREEKVDVE AEVRIMLRDL SSGLARLIAK SPDALDHLEW RMVEQVVAEV FEGLGFVVTL TPGSKDGGKD VILTCTVKGK LAEYYVEIKH WRSSTKVGSI AVEKLLKVIV EEKKDGGLFL STYGFTSNAF EQLTTIDKQK LKFGDQEKIV TFCQTYVKAK AGLWSPPENL TEVLFA
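The fields in this record can be cross-checked against one another. The sketch below is a minimal illustration, not part of the original record: it recomputes the gene length from the start and end coordinates, derives the protein length from the gene length, and translates the first 30 bases of the gene sequence. Translation table 11 uses the same amino acid assignments as the standard code, and an alternative start codon such as TTG is read as methionine at the initiation position, which is why the gene starts with TTG while the protein starts with M. The function name `translate` and the `START_CODONS` set (a subset of the table 11 start codons) are assumptions made for this example.

```python
from itertools import product

# Amino acid assignments shared by the standard code and translation table 11,
# listed in NCBI order (each base position cycles through T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: only the most common table 11 start codons are listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, reading a start codon at position 0 as M."""
    dna = dna.replace(" ", "").upper()
    codons = [dna[i:i + 3] for i in range(0, len(dna) - 2, 3)]
    protein = []
    for idx, codon in enumerate(codons):
        if idx == 0 and codon in START_CODONS:
            protein.append("M")  # initiator methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# Values copied from the record above.
start_bp, end_bp = 191590, 192570

# Coordinates are inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# One codon per residue, minus the terminal stop codon.
protein_length = gene_length // 3 - 1

# First three 10-base blocks of the gene sequence.
prefix = translate("TTGAGAAAAC TAAGTGCAGA GGAATGCGAT")

print(gene_length, protein_length, prefix)
```

The computed values agree with the Gene Length (981 bp) and Protein Length (326 aa) fields, and the translated prefix matches the first block of the protein sequence (MRKLSAEECD).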