Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_1839 |
Symbol | |
ID | 5322697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 1917125 |
End bp | 1918558 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640790777 |
Product | hypothetical protein |
Protein accession | YP_001327509 |
Protein GI | 150397042 |
COG category | [R] General function prediction only |
COG ID | [COG5565] Bacteriophage terminase large (ATPase) subunit and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.244964 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.210765 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCAC CCCACCCATC CGAAAGAACC GTTGCGTTCG GCCTCTCCGC GATGCTCAGG GAACAGATGT CGCTGATGGC GGAACTCCAC CGGCGGCAAC GAACGAATAT CCTTTCCAGC TATCAGCCCT ATGCCAAGCA GCGGGAGTTC CATGCCGCCG GCGCAACCTT TCGCGAGAGG CTGTTCATGG CGGGCAACCA GCTCGGCAAG ACGCTTGCGG GCGCGGCGGA GGCAGCCATG CATCTGACCG GCCGCTATCC CGAATGGTGG CAGGGAAGAC GGTTCGACCG GCCCGTCGCA ATGCTCGCGG GCTCGGAGTC CTATGAGCTG ACACGCGACG GGGTGCAGCG GTTGCTGATA GGCCCGCCTC TGAATGAAGA TGAATGGGGC ACCGGATTCG TGCCCAAGGC AACGATCCAG GCGACGACGC GCCGCTCCGG CGCTTCCGGG GCTCTCGACA GCGTAACGGT GCGGCACGTT GCAGGCGGAG CCTCGACGCT GCTTTTCAAA GCTTACGAGC AGGGACGGGC CAAGTGGCAG GCCAACACGG TGGACTATGT CTGGTTCGAC GAGGAGCCGC CTGAGGACGT CTATTTCGAA GGGATCACCC GCACCAATGC GACCCGCGGT TCCATCGCCG TGACCTTCAC GCCGCTCAAG GGCCTGAGCG CCGTGGTGGC CAGATACCTG ATGGAAAAGT CGGCGGACCG CGAGGTCACC ACCATGACGA TCGAGGATGC GGAACATTAT ACGCCCGAGG AGCGCCGGCG GATCATCGAC AGCTATCCCG CCCATGAGCG CGAGGCGCGC ACCAAGGGCG TGCCGGCTCT CGGCTCCGGA CGGATCTTTC CCGTAACCGA GGAGAGCATT CGTGCCGATC CGTTCGATAT ACCGAAGCAC TGGGTCCAGA TCGGCGGACT CGACTTCGGC TGGGACCATC CTTTCGCGGC TGTCGGCTGC GCCTGGGACC GGGATGCTGA TGTCTTCTAT GTGACCAAGC TCTATCGCGA GCGGGAATCG ACGCCGATCA TCCACGCGGC AGCCCTCAAA CCCTGGGGCG GAACCTTGCC CTGGGCGTGG CCCCATGACG GGTTGCAGCA TGACAAGGGC AGCGGCGAGC AACTGGCGGC CCAGTACCGG GCACAGGGGC TGGCGCTTCT TCCCGAAAGG GCGACCTTCG ACGACGGCAC GAACGGCGTC GAAGCCGGGC TTTCCGACAT GCTGCAGCGG ATGCAGACCG GGCGCTGGAA GGTGTTTTCC ACCTGCACGG AATGGTTCGA GGAATTCCGC CTGTATCACC GCAAGGACGG CAGGATCGTC AAGGAGCGCG ACGACCTCCT CGCCGCCTCG CGCTACGCGC TGATGATGAA GCGCCATGCA CGGGCAATCG GCGGCAACGC AAACTGGAAA TTCACCGCCC GAAAGGTTCT CTGA
|
Protein sequence | MSAPHPSERT VAFGLSAMLR EQMSLMAELH RRQRTNILSS YQPYAKQREF HAAGATFRER LFMAGNQLGK TLAGAAEAAM HLTGRYPEWW QGRRFDRPVA MLAGSESYEL TRDGVQRLLI GPPLNEDEWG TGFVPKATIQ ATTRRSGASG ALDSVTVRHV AGGASTLLFK AYEQGRAKWQ ANTVDYVWFD EEPPEDVYFE GITRTNATRG SIAVTFTPLK GLSAVVARYL MEKSADREVT TMTIEDAEHY TPEERRRIID SYPAHEREAR TKGVPALGSG RIFPVTEESI RADPFDIPKH WVQIGGLDFG WDHPFAAVGC AWDRDADVFY VTKLYRERES TPIIHAAALK PWGGTLPWAW PHDGLQHDKG SGEQLAAQYR AQGLALLPER ATFDDGTNGV EAGLSDMLQR MQTGRWKVFS TCTEWFEEFR LYHRKDGRIV KERDDLLAAS RYALMMKRHA RAIGGNANWK FTARKVL
|
| |