Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_1931 |
Symbol | |
ID | 5322789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 1983662 |
End bp | 1984828 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640790868 |
Product | hypothetical protein |
Protein accession | YP_001327600 |
Protein GI | 150397133 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00772625 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCACCGCG TCATCAAGAG ACGCGCCAAC GGTTCGCAAA CCGTCTACAC CTTCTACACA AGGTCCCGGA ACACTAAGGA CGCGTGGCCG TCGATCGCCC TTCCGGAACC GCTTGAGAAG GAGTTCTCCG AGCGCCTGTC GATCTGTGAA GCCATGGCCC GCGATGAGAA GGGCTTTCTA CTGGACGGCA AGCGGCTACC GGACCTGAAG AGTAAAGAGT TTTGGCCCGA GGCCACGAAG GCGCACGAAG CATTCATCCG CCGCGGTCGC CAGGGCATCA AGGATTTCAA GGCGCTCGTC GAAGCCTTCC AGAGCGAGAC CAACCCCTTC TGGACCAAGC TGGCGGCTTC CACTCAGCGC GGCTACCGAA CCTCTGGCGA CATCATCAAG GAGACATGGG GAGACGACCT TCCCGTCGAC TTGACGACGG TCGACGCGCA GGACGCGATA GACGCCCTAG GCGAGACGCC GGCGAAAGCA AACCAGTTCC GAGCCTTCCT GTCCCGCCTG ATGGCGTGGG GCGCCTCCCG AGGCTACTGC AAGACCAACG TCGTGGAGAT GACGGAAAAG ATACCGGGCG GCGAGCCGTG GGTGCCGTGG CCGAACTGGG CTTTTGAGAT CCTGCTGGAG CACGCACCGT TCCACATGCA GATGATCGCC ATGTCGGCAT TCTTCACCGG GCAGCGCCAG GGCGACGTGC TGGCTATGAC GAAGCCGAAG GCCGGCGAGA ACACGATCGC CGTCCGCGCG CAGAAGACGG GAAACACGGT TTGGATTCCG ATCCACTTCG CCTATCGGAA ATGGATCGAT CGCGTGCCGA CGTCCGATAG CGTGATGCTG CACGCCGGCG CTCGCGCCAC GTCATACAAG AGCCCCGACG GTTTCCGGAC CGAATGGCAG AAGCTCATGG CGAAGGACGC GTTCAAGCCG TTCCGAGAAA ACCGGATCGT CTTCCACGGT CTGCGCAAGA ACGCGGTGAT CAATTTGCTG GAGGTTGGCT GCACCGAGAA CCAGGTGGGC GCGATCTGCA ACATGTCGGC GCAGATGGTG CAGCATTACG GCCGAGAGGT GGTTTTGAGG AGCCTCGCGA AGGACGCGAT GAAGCTCATG GAAGCACGCT GGAGCGAGAT CGAGCCGGCC GCTTTCAGGA ACAAGAACGG AACGTGA
|
Protein sequence | MHRVIKRRAN GSQTVYTFYT RSRNTKDAWP SIALPEPLEK EFSERLSICE AMARDEKGFL LDGKRLPDLK SKEFWPEATK AHEAFIRRGR QGIKDFKALV EAFQSETNPF WTKLAASTQR GYRTSGDIIK ETWGDDLPVD LTTVDAQDAI DALGETPAKA NQFRAFLSRL MAWGASRGYC KTNVVEMTEK IPGGEPWVPW PNWAFEILLE HAPFHMQMIA MSAFFTGQRQ GDVLAMTKPK AGENTIAVRA QKTGNTVWIP IHFAYRKWID RVPTSDSVML HAGARATSYK SPDGFRTEWQ KLMAKDAFKP FRENRIVFHG LRKNAVINLL EVGCTENQVG AICNMSAQMV QHYGREVVLR SLAKDAMKLM EARWSEIEPA AFRNKNGT
|
| |