Gene Smed_1010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1010 
Symbol 
ID5321855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1081123 
End bp1082133 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content58% 
IMG OID640789952 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_001326698 
Protein GI150396231 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.21505 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCAGA AAAATTGGCA GGAATTGATC AAGCCGAACA AGGTGGAGTT CGCCTCCTCC 
GGCCGCACCA AGGCAACGTT GGTAGCGGAA CCGCTTGAGC GGGGCTTTGG TCTGACGCTC
GGCAACGCGC TTCGTCGCGT GCTTTTGTCG TCGCTGCGCG GTGCCGCTGT AACCGCGGTT
CAGATCGACG GCGTATTGCA TGAGTTCTCT TCTATCCCGG GCGTCCGGGA AGACGTGACC
GACATCGTGC TCAACATCAA GGAAATCGCC ATCAAGATGG ATGGCGACGA CGCGAAGCGC
ATGGTCGTGC GCAAGCAGGG CCCTGGCGTT GTTACCGCCG GTGACATTCA GACGGTTGGC
GATATCGAGA TCCTCAACCC GAACCACGTG ATCTGCACGC TCGACGAGGG CGCGGAAATC
CGCATGGAGT TCACCGTCAA TAACGGCAAG GGCTATGTAC CGGCTGACCG TAACCGCTCG
GAAGACGCGC CGATCGGGCT CATTCCGGTG GATAGCCTGT ACTCTCCGGT CAAGAAGGTC
TCCTACAAGG TGGAAAACAC CCGTGAAGGC CAGGTTCTCG ACTATGACAA GCTGACGATG
TCCATCGAGA CCGATGGCTC CGTCACGGGC GAAGATGCGA TCGCGTTCGC GGCCCGCATC
CTTCAGGACC AGCTGTCGGT CTTCGTTAAC TTCGACGAGC CGCAGAAGGA AACCGAGGAA
GAGGCGGTCA CCGAACTCGC CTTCAATCCG GCGCTTCTCA AGAAGGTCGA CGAACTGGAG
CTTTCCGTCC GCTCGGCCAA CTGCCTGAAG AACGACAACA TCGTCTATAT CGGCGACCTC
ATTCAGAAGA CCGAAGCAGA AATGCTCCGC ACGCCGAATT TTGGTCGCAA GTCGCTCAAC
GAGATCAAGG AAGTTCTCGC TTCCATGGGC CTGCATCTCG GCATGGAAGT GCCGTCCTGG
CCGCCGGAGA ACATCGAAGA TCTCGCCAAG CGATACGAAG ACCAATATTA A
 
Protein sequence
MIQKNWQELI KPNKVEFASS GRTKATLVAE PLERGFGLTL GNALRRVLLS SLRGAAVTAV 
QIDGVLHEFS SIPGVREDVT DIVLNIKEIA IKMDGDDAKR MVVRKQGPGV VTAGDIQTVG
DIEILNPNHV ICTLDEGAEI RMEFTVNNGK GYVPADRNRS EDAPIGLIPV DSLYSPVKKV
SYKVENTREG QVLDYDKLTM SIETDGSVTG EDAIAFAARI LQDQLSVFVN FDEPQKETEE
EAVTELAFNP ALLKKVDELE LSVRSANCLK NDNIVYIGDL IQKTEAEMLR TPNFGRKSLN
EIKEVLASMG LHLGMEVPSW PPENIEDLAK RYEDQY