Gene Information |
Locus tag | Smed_5437 |
Symbol | |
ID | 5319739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 395591 |
End bp | 398272 |
Gene Length | 2682 bp |
Protein Length | 893 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640777200 |
Product | type III restriction protein res subunit |
Protein accession | YP_001314132 |
Protein GI | 150377537 |
COG category | [K] Transcription; [L] Replication, recombination and repair |
COG ID | [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | |
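The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(395591, 398272)
print(length)                  # 2682, matching the Gene Length field
print(protein_length(length))  # 893, matching the Protein Length field
```

Under these assumptions both values agree with the record: 2682 bp and 893 aa.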
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.463119 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATTGC ATCCAGGCTT TCCCAACGAT CCGCACACCG TTCTCGACCC TTCGATCCGA TGGTTTCCGG CCGACGAGGC AATGCGCGAA ACCAGCATGG ACAAGCTTAT GCCGCCGCTC GTGGCGGCGC TGCGCCAGAA GGTGAAAGAC TTCCGTGACG GCGGTTATGT CGGCGCGACC GATACCAGCC GGAGCCTGCT CAATTGGTGG TTCAATACGC CCCACCTAAT CCCCCAGACT GACGGCAACA TGACTGAGTT TCAGTATTTC TTCGCACAGA GGGAATCCCT GGAAACCATT ATCTATCTTT ATGATGTTGT CGGCGTGAAC GACAAATTCG ACCTAATGCG CTTCGACAGT AGCGGGGCCG TTTCCGCCAA CATGTTCGAT GAAACATGGC GTCGTTTCGT TGTGAAAATG GCGACAGGGG CGGGGAAAAC CAAGGTGCTT AGTCTGGCCC TGGCATGGAG CTTCTACCAC AAGCTCTACG AGCCCGACTC CCGGCTGGCC CGCAACTTCC TTGTGATTGC ACCTAACATC ATCGTGCTGG ACCGCATCTA TAAGGACTTC CAGGGACTGC GGTTGTTTTT CGACGACCCG GTTATCCCCG ATAACGGCTT TGATGGCCGT AACTGGCGCG ATGATTTCCA GTTGACGCTG CATGTGCAGG ACGAGGTGCG TATCACGCAT CCGACCGGCA ACATCTTTCT CACCAACATT CACCGGGTTT ATGGAGGCGA GGACATCCCT GCATCGCCAG AAGACGACAA CTCGATGGAT TATTTCCTTG GCACCCGACC GACCGGCGCG ACGACGGATT CCCAGGTGGA CCTCGGTATG ATCGTCCGTG ACATCGATGA ACTGATGGTG CTCAACGACG AGGCCCATCA CATTCACGAC TCGCGGCTCG CATGGTTCAA GTCGATTGAG GACATCCACA ACCGACTGTT GCAAAAAGGG TCTGCCCTAT CGCTACAAGT AGACGTGACG GCGACCCCGA AGCACAACAA CGGCGCGATC TTCGTGCAGA CCGTGGCCGA TTATCCGCTA GTCGAAGCCA TTTCGCAGAA TGTCGTGAAG CATCCCGTGC TGCCAGACGC TGCCAGCCGA GCCAAGTTGT CCGAGCGACA GAGCGCCAAA TACACCGAGA AACACGCTGA CTATATTGAC CTCGGGGTGA TCGAATGGCG CAAGGCCTAT GCCGAACACG AGAAGGTGGG CAAGAAAGCC ATCCTGTTCA TCATGACTGA CGACACGCGC AACTGCGATG ATGTTGCGGA CTATCTTGAA GGCAACTACG CCGATCTCAA AGGCGCAGTG CTAGTCATTC ACACCAAAGC CAACGGCGAA ATCTCCGAAT CCACCTCAGG CAAGGCCAAG GAAGAACTGG AAAAGCTCAG GAAGCAGGCG AACGAGATCG ATGACGCCGC CAGCCCCTAC AAGGCCATTG TATCGGTCCT GATGCTGAAA GAGGGGTGGG ACGTACGGAA TGTGACTACC ATTGTCGGAC TACGGGCCTA TACCTCGAAG AGCAACATTC TCCCCGAACA GACGCTGGGG CGTGGCCTCC GCAAGATGTA TCCTGGTGGC ATCGAAGAGT ATGTCAGTGT TGTTGGCACC GACGCCTTTA TGGAGTTTGT CGAGTCCATC CAAGCCGAGG GCGTCGAGCT TGAGCGCCAG GCGATGGGCC AGGGGACGAA GCCGAAGACC CCCCTCGTGA TAGAGGTGGA GAAAGACAAC GAGAAGAAGG ACATAGATGC CCTAGATATC GAGATTCCGG TGATGACCCC ACGGAGCTAC 
CGCGAGTACA AGTCACTGGG CGACCTTGAT ATAACGGCCT TTGCCCATCA GCGCGTTCCT TACCGAACTT TCAGCGAGGA GCAACAGCGC GAGATTGTGT TCAAGGACAT CACGACTGGC GCGATAACGC ACACCACCAT ACTCGACACG GCAGGAATTG CCGATTACCG CAGCGTCCTG GGGTATTTCG CCCAGACCGT CATGAAGGAA TTGCGGCTGA TCAGCGGCTA CGACATTCTT TACGGGAAAA TCAAAGCCTT CGTGCAGTCG GAATTGTTTG ACTGCGAAGT TGACCTCGAC AGCGCCAACA CCTTGCGGAA TCTATCCGAA TTGTCCGCGA CGAAAACGCT TATCGAAAAT TTCAAGAAAG CCATCAACGC CCTAACGGTA AAGGACAAGG GCGACGCGGA AATCCGCGAC ACTATTAAGC TGCGGCAGAC CCGGCCCTTC GTGACAAAGG AGCAGGGCTA TCTAGTGCCG AAGAAAAGCG TCTTCAACCG GATTATTGGC GACAGCCATC TTGAACTAAT TTTTGCCAGT TTCCTTGAGT CCTGCTCCGA TGTGGTTGCT TATGGAAAGA ACTACCTCGC AGTGCATTTT AAGATTGATT ACGTCAACGC CGAAGGGAAT ATCTCCAACT ACTATCCCGA CTTTCTGGTG AAGCTACCTG ACAAGCGGAC TGTGATCGTT GAAACCAAGG GTCTTGAGGA CTCAGACGTA CCGCTGAAGA TGGAGCGTCT GAAGCAATGG TGCGAGGATA TCAATCGGGT GCAGGCCGAC GTAACCTATG ATTTTGTTTA CGTTGACCAG GAAAGCTTTG AAAAATACAG TCCGAAGTCA TTTTCGGAGC TTGTTGAGAA TTTCACACAA TACAAGGTGT GA
|
Protein sequence | MALHPGFPND PHTVLDPSIR WFPADEAMRE TSMDKLMPPL VAALRQKVKD FRDGGYVGAT DTSRSLLNWW FNTPHLIPQT DGNMTEFQYF FAQRESLETI IYLYDVVGVN DKFDLMRFDS SGAVSANMFD ETWRRFVVKM ATGAGKTKVL SLALAWSFYH KLYEPDSRLA RNFLVIAPNI IVLDRIYKDF QGLRLFFDDP VIPDNGFDGR NWRDDFQLTL HVQDEVRITH PTGNIFLTNI HRVYGGEDIP ASPEDDNSMD YFLGTRPTGA TTDSQVDLGM IVRDIDELMV LNDEAHHIHD SRLAWFKSIE DIHNRLLQKG SALSLQVDVT ATPKHNNGAI FVQTVADYPL VEAISQNVVK HPVLPDAASR AKLSERQSAK YTEKHADYID LGVIEWRKAY AEHEKVGKKA ILFIMTDDTR NCDDVADYLE GNYADLKGAV LVIHTKANGE ISESTSGKAK EELEKLRKQA NEIDDAASPY KAIVSVLMLK EGWDVRNVTT IVGLRAYTSK SNILPEQTLG RGLRKMYPGG IEEYVSVVGT DAFMEFVESI QAEGVELERQ AMGQGTKPKT PLVIEVEKDN EKKDIDALDI EIPVMTPRSY REYKSLGDLD ITAFAHQRVP YRTFSEEQQR EIVFKDITTG AITHTTILDT AGIADYRSVL GYFAQTVMKE LRLISGYDIL YGKIKAFVQS ELFDCEVDLD SANTLRNLSE LSATKTLIEN FKKAINALTV KDKGDAEIRD TIKLRQTRPF VTKEQGYLVP KKSVFNRIIG DSHLELIFAS FLESCSDVVA YGKNYLAVHF KIDYVNAEGN ISNYYPDFLV KLPDKRTVIV ETKGLEDSDV PLKMERLKQW CEDINRVQAD VTYDFVYVDQ ESFEKYSPKS FSELVENFTQ YKV
|
| |
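The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the code below strips the block spacing and translates DNA codon by codon. It builds the codon table by hand rather than using a library; translation table 11 assigns the same amino acids to codons as the standard table and differs only in which start codons it allows, which this sketch ignores. The names `clean`, `translate`, and `CODON_TABLE` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid map in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_blocks: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq_blocks.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate(clean("ATGGCATTGC ATCCAGGCTT TCCCAACGAT")))  # MALHPGFPND
# The last twelve bases give the final residues, followed by the TGA stop.
print(translate(clean("TACAAGGTGT GA")))  # YKV
```

Both fragments reproduce the start and end of the protein sequence listed in the record.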