Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_0957 |
Symbol | |
ID | 5321798 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 1031233 |
End bp | 1034067 |
Gene Length | 2835 bp |
Protein Length | 944 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640789895 |
Product | ribonuclease |
Protein accession | YP_001326645 |
Protein GI | 150396178 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1530] Ribonucleases G and E |
TIGRFAM ID | [TIGR00757] ribonuclease, Rne/Rng family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.545354 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0139882 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGAGA AAATGCTTAT CGATGCGTCT CACTCAGAGG AGACGCGCGT CGTTGTCGTA CGCGGAAACC GCATAGAAGA ATTCGATTTC GAATCGGAAC ATAAAAAGCA GATCCGCGGC AATATTTACC TGGCCAAGGT AACAAGAGTG GAGCCGTCGC TCCAGGCAGC TTTCGTCGAC TACGGCGGCA ATCGCCACGG ATTTCTGGCC TTCGCCGAGA TCCATCCCGA CTATTATCAG ATCCCCCTCG CCGATCGTCA GGCCCTGTTG AAAGCGGAAG CGGAAGAAGC GCGCCGCGAA GACGACATCG AGCCGATGGA GACCTCGGCC GAACGGAAAT CTGCGTCTTC TACCGATTCG GTGGCCGAAG TGGTGGCACA GGCCGAAGCC GCCGAGGCGG CCGGAGAGGC CGCTGAGGCG CAAGCCGCTG CGGAGAAGCC CAAGGCCAAG GCCAAGCCGA AGCGTACACG CCGTGCGAAG CCGAAGGCAA GCGAGGAGAC CGCGGCGAGC CAGGAGTCGG AGGCGCCTGA AGGTGGCCCG TCTGAAGACG GCGAAAGCGA CAAGAGCGAA ATGGCCGCGA TCGTCGAAAC CGACTCGATC TCCGAGGATG TCGACGCGCG CCGCCAGCGC GACGACGACG ATGACGACGA CAATCACGAC GGCGAAAAGG AAATCATCGA ATCCGTCGGC GCCGAAGACG CCATGGAAGA GGTGGCGGAG CGCCAGTTCC GCAAACCGCG CAAGCAGTAC CGGATTCAGG AAGTGATCAA GCGTCGGCAG ATCCTGCTGG TGCAGGTCGC GAAAGAAGAA CGCGGCAACA AGGGTGCGGC ACTCACGACC TATCTTTCGC TCGCCGGCCG CTACTCCGTG CTCATGCCGA ACACGGCGCG CGGTGGCGGC ATCTCTCGCA AGATCACCAA CCTGCAGGAC CGCAAGCGCC TGAAGGAGAT CGCCCGCGGC CTCGACGTGC CGCAGGGCAT GGGTGTGATC CTGCGCACCG CCGGAGCCAA TCGCACCAAA GTCGAGATCA AGCGTGATTT CGAATATCTC ATGCGTCTGT GGGAGAACGT CCGGACGCTG ACCCTGAACT CGACGGCCCC CTGCCTTGTT TACGAGGAGG GCAGCCTGAT CAAGCGATCG ATCCGCGACC TCTACAACAA GGACATAAGC GAAATCGTCG TCTCGGGCGA GGAAGGCTAC AAGGAAGCCA AGGGCTTCAT GAAGATGCTG ATGCCCAGCC ACGCGAAGGT CGTGCAGCCC TATCGCGACG TTCACCCGAT CTTCTCGCGC TCCGGCATCG AAGCGCAACT CGACCGCATG CTGCAGCCGC AGGTGACGTT GAAGTCGGGC GGCTACATCA TCATCAATCA GACCGAAGCG CTGGTTTCCA TCGACGTGAA CTCCGGTCGT TCGACGCGGG AGCATTCGAT CGAGGACACT GCTCTCCAGA CGAACCTCGA GGCAGCCGAG GAAGTAGCGC GGCAATTGCG CCTGCGCGAC CTCGCCGGCC TCATCGTGAT CGACTTCATC GACATGGAAG AGAAGCGGAA CAACCGCTCG GTCGAGAAGA AGCTCAAGGA TTGCCTCAAG AACGATCGCG CCCGCATCCA GGTCGGCCGG ATCTCGCATT TCGGCCTGCT CGAAATGTCT CGCCAGCGAA TCCGTGCCTC GGTGCTGGAA AGCACGATGC AAACCTGCCC GCACTGCAAC GGCACGGGCC ACATTCGTTC GCAATCCTCT GTCGCTTTGC ATGTGCTGCG CGGCATCGAG GAACACCTGC TCAAGAACAC CACGCATGAC ATCAGCGTAA GAACGACCCC CGACATCGCG CTCTATCTGC TCAATCAGAA ACGCAGCTCG ATTACGGACT ATGAACAGCG CTTCGGTGTT TCAATCTTCA TTGAAGCGGA CGCCCATGTC GGCGCCCAAC ACTTCGCGAT CGATCGCGGT GAACCGGTCG AAAATCCGGT CAGGATCGAC CAGATTCTGC AGTTCGAGCC GGAACCGGAG GAAGAAGAAG AAGAAGAAGA GGTCCTGATC GAAGAAGAAC TCGACGACGA GGAAGCGGAA GAGACCGCTG CCGAGCATCA GGACCAACAG AAGGGGCAGC CCGACGATCA GGGCGGCCGC AAGCGCAAGC GCCGCCGCCG CCGGCGCGGC AAGGGCGCGG GTCAGGGAGC GGACGCAGTT TCCGCAGAGT CTGCCGATGC GGCTGAAGAC GAGGCCGGCG ACGAGGCCGC CGCGGACGAC GCCGAGGATG AACTCGACGC TGCGGACTCC CTGAATGGCG ATAGTGAGCA GAAGCGCAAG CGTCGCCGGC GTGGCAAGCG CGGCGGACGC CGCAATCGTC AGGAAGAACT GGCTGGTTCC GAAGGCGAAA CCGGCGCGAC GGCAGAAACC GTCGAAGAGC CTGGTGAAGA GTCCGTCGAC AGTACTGCAG CAGAGATTCC AGCAGTCTCT GCGGCTGAGG AAGCGCCGGC CGACGTCGCT GCTGCTGCCG CCGTCGAGGG AGTGGCCGAA GAGCCGAAGC CGGCCAAACC GCGCCGCAGC CGTAAGAAAA CGACCAAGAC CGAGGAACCG GCGGTGAATG CCGACGAGGT CGCCGAAAAG GTGGTCGAGG CTCAGCCGGA ACCCGCAGAG CCTGCGCCTC AAATAGTCGA GGAAGCAGTC GAGGAAGCGG TCGAGGACCT CGAAGGCGCC AAGCCCGCGC GCTCAAATCG AGACCTGTCG GCGATCGCAT CCAAGCCGGT CGTGACGTCC AGCAAAGGCG AGAGCGAAGA AGAGCCGACA AAGCCCAAAA AGGGCGGCTG GTGGCAGCGC CGCGGTTTCT TCTGA
|
Protein sequence | MAEKMLIDAS HSEETRVVVV RGNRIEEFDF ESEHKKQIRG NIYLAKVTRV EPSLQAAFVD YGGNRHGFLA FAEIHPDYYQ IPLADRQALL KAEAEEARRE DDIEPMETSA ERKSASSTDS VAEVVAQAEA AEAAGEAAEA QAAAEKPKAK AKPKRTRRAK PKASEETAAS QESEAPEGGP SEDGESDKSE MAAIVETDSI SEDVDARRQR DDDDDDDNHD GEKEIIESVG AEDAMEEVAE RQFRKPRKQY RIQEVIKRRQ ILLVQVAKEE RGNKGAALTT YLSLAGRYSV LMPNTARGGG ISRKITNLQD RKRLKEIARG LDVPQGMGVI LRTAGANRTK VEIKRDFEYL MRLWENVRTL TLNSTAPCLV YEEGSLIKRS IRDLYNKDIS EIVVSGEEGY KEAKGFMKML MPSHAKVVQP YRDVHPIFSR SGIEAQLDRM LQPQVTLKSG GYIIINQTEA LVSIDVNSGR STREHSIEDT ALQTNLEAAE EVARQLRLRD LAGLIVIDFI DMEEKRNNRS VEKKLKDCLK NDRARIQVGR ISHFGLLEMS RQRIRASVLE STMQTCPHCN GTGHIRSQSS VALHVLRGIE EHLLKNTTHD ISVRTTPDIA LYLLNQKRSS ITDYEQRFGV SIFIEADAHV GAQHFAIDRG EPVENPVRID QILQFEPEPE EEEEEEEVLI EEELDDEEAE ETAAEHQDQQ KGQPDDQGGR KRKRRRRRRG KGAGQGADAV SAESADAAED EAGDEAAADD AEDELDAADS LNGDSEQKRK RRRRGKRGGR RNRQEELAGS EGETGATAET VEEPGEESVD STAAEIPAVS AAEEAPADVA AAAAVEGVAE EPKPAKPRRS RKKTTKTEEP AVNADEVAEK VVEAQPEPAE PAPQIVEEAV EEAVEDLEGA KPARSNRDLS AIASKPVVTS SKGESEEEPT KPKKGGWWQR RGFF
|
| |