Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_0686 |
Symbol | |
ID | 5321523 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 738473 |
End bp | 740779 |
Gene Length | 2307 bp |
Protein Length | 768 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640789623 |
Product | RNA-binding S1 domain-containing protein |
Protein accession | YP_001326377 |
Protein GI | 150395910 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.194488 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCACGA CCGCGAGACC CATCGCCGCC ATCATTGCCA GCGAAATCAA GGCGAGCGCC GCACAGGTTG CAGCCGCCGT CGAATTGTTG GACGGTGGTG CGACCGTGCC TTTCATCGCC CGCTACCGCA AGGAAGCCAC GGGCGGGCTC GACGACACGC AGCTGCGCGT CCTGTCGGAG CGCCTCACCT ATCTGCGCGA GCTCGAGGCG CGGCGTGCAT CGATCCTCGA TTCGATCCGC AGCCAGGACA AGATGACCGA TGAACTCGAA CGCAAGATCG AGGGCGCGGT TACCAAAGCG GAGCTCGAAG ACATCTACCT GCCCTTTAAG CCGAAGCGCC GCACCAAAGC GGAGATCGCG CGCGAGCGGG GCCTTGGCGC GCTGGCGGAC GCCATCCTTT CCGACCGTTC GACAGCGCCT GCCGAACGTG CTGCCGCCTT TCTCACCGCC GATGTGCCGG ACGCCAAGGC CGCACTCGAT GGTGCCCGCG ACATCATCGC CGAGGGCATG ACCGAGAATG CCGAGCTTCT TGGGCAGCTG CGGAGCTACA TGCGGGGTGC AGCCTTCCTG CGCGCCCGCG TCGTGGGTGG CAAACAGGAG TCCGGCGCAA AGTTCTCCGA CTATTTCGAT CACTCCGAGC GATGGGCGAC GGTGCCCGGC CACCGCGCGC TTGCGATGCT TCGCGGCTGG AACGAGGAGG TCCTGTCCGT GGACATCGTT GTCGACGAGG ATGCGGCGCC TTCGCATAGA CCGATCGAGC GTATGATCGC GGCTGCCTAC AGCGTTGGCG ACCGGCTGCC CGGTGACAAA TGGCTGCTGG AGGTCATCGG CTGGACCTGG CGCGTGAAGC TTTCGCTTTC GCTATCACTG GACCTGATGC GCGAGCTTCG AGAGCGCGCC GAGGAGGAGG CCATCCGCGT CTTCGCGCGC AATCTCAAGG ACCTGCTGCT TGCAGCGCCT GCGGGCTCGC GTCCGACCAT GGGCCTCGAC CCTGGCATCC GGACGGGCGT CAAGGTCGCG ATCGTCGACG GTACCGGCAA GCTCCTCGAC ACGAGCACGG TCTATCCATT TCCGCCGAAG AACGACATTC GCGGGACTCA GGCAGAGCTT GCGGCGCTTG TGCGCAAACA CAAGGTGGAA CTTATCGCGA TCGGCAACGG CACCGGGAGC CGTGAAACCG AAAAGCTCGT CGCGGATATG CTCGCTGCCA TGCCGGCGCC GAAGCCGACA AAGGTGATCG TCTCCGAGGC GGGAGCATCG GTCTATTCCG CTTCGGAGAC GGCGGCTGCC GAGTTCCCCG GCCTCGACGT GTCCTTGCGG GGTGCCGTTT CCATCGCGCG CCGGCTCCAG GATCCCTTGG CCGAGCTCGT GAAAATCGAA CCGAAGTCGA TCGGGGTCGG TCAATATCAG CACGACGTTG ATCAGTCGAA GCTCAGCCGT TCGCTGGATG CCGTCGTCGA GGATGCCGTG AACGCGGTCG GGGTCGATCT CAACACCGCC TCGGCGCCGC TGCTGGCGCG CGTTTCGGGG TTGGGTAAGT CCTCGGCGGA GGCGATCGTT GCGCATCGCG ATGCGCTGGG CGCCTTCGCC AGCCGCAAGG ACCTTTTGAA AGTGCCTCGC CTCGGTGCCC GCACTTTCGA ACAATGCGCC GGCTTCCTCA GGATCACGAA CGGCTCCGAG CCGCTCGATG CGTCTGCGGT GCATCCGGAG GCTTATGGAG TCGCAAAGAA GATCGTCGCC GCCTGCGGTC GTGATGTGCG TTCTCTGATG GGCGACAGCG CCGAACTGAA GAAGCTCGAT CCCCGCATTT TCGTCGATGA ACGCTTCGGC CTGCCAACTG TCAAGGACAT TCTCGCTGAA CTGGAGAAAC CGGGTCGCGA TCCACGCCCG AGCTTCAAAA CGGCCACCTT CGCGGAAGGG ATCGACGACA TCAAGGATCT GAAGGTCGGC ATGCGCCTTG AGGGCACGGT GACCAATGTC GCCGCCTTCG GCGCTTTCGT CGATATCGGC GTGCATCAGG ACGGTCTCGT CCATGTGTCG CAATTGGCCG ACCGCTTCGT CAAGGATCCG CATGAGGTCG TCAAAGCCGG GGACGTCGTG CACGTCCGTG TCACGGAGGT CGATGTGGCG CGAAAGCGCA TTGGACTGTC GATGCGCAAG GAAGGTGGCG CCGAAACGCC CCGAGAGGCG AGGGGGACGG CGCCGAACGG CGGCAATCGG GGTGCGCCCG CTCGACCGCA GCCGCCGCAG CAGCCGGCGC AAGGCGCCTT CGGTGAAGCG CTGATCGCGG CGATGAAGAA AAAATGA
|
Protein sequence | MVTTARPIAA IIASEIKASA AQVAAAVELL DGGATVPFIA RYRKEATGGL DDTQLRVLSE RLTYLRELEA RRASILDSIR SQDKMTDELE RKIEGAVTKA ELEDIYLPFK PKRRTKAEIA RERGLGALAD AILSDRSTAP AERAAAFLTA DVPDAKAALD GARDIIAEGM TENAELLGQL RSYMRGAAFL RARVVGGKQE SGAKFSDYFD HSERWATVPG HRALAMLRGW NEEVLSVDIV VDEDAAPSHR PIERMIAAAY SVGDRLPGDK WLLEVIGWTW RVKLSLSLSL DLMRELRERA EEEAIRVFAR NLKDLLLAAP AGSRPTMGLD PGIRTGVKVA IVDGTGKLLD TSTVYPFPPK NDIRGTQAEL AALVRKHKVE LIAIGNGTGS RETEKLVADM LAAMPAPKPT KVIVSEAGAS VYSASETAAA EFPGLDVSLR GAVSIARRLQ DPLAELVKIE PKSIGVGQYQ HDVDQSKLSR SLDAVVEDAV NAVGVDLNTA SAPLLARVSG LGKSSAEAIV AHRDALGAFA SRKDLLKVPR LGARTFEQCA GFLRITNGSE PLDASAVHPE AYGVAKKIVA ACGRDVRSLM GDSAELKKLD PRIFVDERFG LPTVKDILAE LEKPGRDPRP SFKTATFAEG IDDIKDLKVG MRLEGTVTNV AAFGAFVDIG VHQDGLVHVS QLADRFVKDP HEVVKAGDVV HVRVTEVDVA RKRIGLSMRK EGGAETPREA RGTAPNGGNR GAPARPQPPQ QPAQGAFGEA LIAAMKKK
|
| |