Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_1010 |
Symbol | |
ID | 5321855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 1081123 |
End bp | 1082133 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640789952 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_001326698 |
Protein GI | 150396231 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.21505 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCAGA AAAATTGGCA GGAATTGATC AAGCCGAACA AGGTGGAGTT CGCCTCCTCC GGCCGCACCA AGGCAACGTT GGTAGCGGAA CCGCTTGAGC GGGGCTTTGG TCTGACGCTC GGCAACGCGC TTCGTCGCGT GCTTTTGTCG TCGCTGCGCG GTGCCGCTGT AACCGCGGTT CAGATCGACG GCGTATTGCA TGAGTTCTCT TCTATCCCGG GCGTCCGGGA AGACGTGACC GACATCGTGC TCAACATCAA GGAAATCGCC ATCAAGATGG ATGGCGACGA CGCGAAGCGC ATGGTCGTGC GCAAGCAGGG CCCTGGCGTT GTTACCGCCG GTGACATTCA GACGGTTGGC GATATCGAGA TCCTCAACCC GAACCACGTG ATCTGCACGC TCGACGAGGG CGCGGAAATC CGCATGGAGT TCACCGTCAA TAACGGCAAG GGCTATGTAC CGGCTGACCG TAACCGCTCG GAAGACGCGC CGATCGGGCT CATTCCGGTG GATAGCCTGT ACTCTCCGGT CAAGAAGGTC TCCTACAAGG TGGAAAACAC CCGTGAAGGC CAGGTTCTCG ACTATGACAA GCTGACGATG TCCATCGAGA CCGATGGCTC CGTCACGGGC GAAGATGCGA TCGCGTTCGC GGCCCGCATC CTTCAGGACC AGCTGTCGGT CTTCGTTAAC TTCGACGAGC CGCAGAAGGA AACCGAGGAA GAGGCGGTCA CCGAACTCGC CTTCAATCCG GCGCTTCTCA AGAAGGTCGA CGAACTGGAG CTTTCCGTCC GCTCGGCCAA CTGCCTGAAG AACGACAACA TCGTCTATAT CGGCGACCTC ATTCAGAAGA CCGAAGCAGA AATGCTCCGC ACGCCGAATT TTGGTCGCAA GTCGCTCAAC GAGATCAAGG AAGTTCTCGC TTCCATGGGC CTGCATCTCG GCATGGAAGT GCCGTCCTGG CCGCCGGAGA ACATCGAAGA TCTCGCCAAG CGATACGAAG ACCAATATTA A
|
Protein sequence | MIQKNWQELI KPNKVEFASS GRTKATLVAE PLERGFGLTL GNALRRVLLS SLRGAAVTAV QIDGVLHEFS SIPGVREDVT DIVLNIKEIA IKMDGDDAKR MVVRKQGPGV VTAGDIQTVG DIEILNPNHV ICTLDEGAEI RMEFTVNNGK GYVPADRNRS EDAPIGLIPV DSLYSPVKKV SYKVENTREG QVLDYDKLTM SIETDGSVTG EDAIAFAARI LQDQLSVFVN FDEPQKETEE EAVTELAFNP ALLKKVDELE LSVRSANCLK NDNIVYIGDL IQKTEAEMLR TPNFGRKSLN EIKEVLASMG LHLGMEVPSW PPENIEDLAK RYEDQY
|
| |