Gene Smed_2223 details


Gene Information

Locus tag: Smed_2223
Symbol:
ID: 5323084
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 2299716
End bp: 2301770
Gene Length: 2055 bp
Protein Length: 684 aa
Translation table: 11
GC content: 60%
IMG OID: 640791161
Product: RNA polymerase sigma factor RpoD
Protein accession: YP_001327890
Protein GI: 150397423
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
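
The length fields in this record can be cross-checked against the coordinates. The sketch below is only an illustration and rests on two assumptions the record does not state: that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2299716
end_bp = 2301770

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 2055 684
```

Under those assumptions the result agrees with the reported Gene Length (2055 bp) and Protein Length (684 aa).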


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.909568
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAACAA AAGTCAAAGA AAACGAAGAA GCCGACGTTG AACGTGAAGG TGCGCCGGAC 
GGTCCGCTTC TCGATCTTTC CGACGACGCT GTCAAGAAGA TGATCAAGGC CGCCAAGAAG
CGCGGCTATG TGACGATGGA CGAGCTAAAC TCCGTTCTGC CTTCCGAGGA AGTGACTTCC
GAGCAGATCG AGGACACGAT GTCCATGCTT TCCGACATGG GCATCAACGT CATCGAAGAC
GAAGAAGCCG AGGAAGCTGG TGGCAGTGAC GACGATGACA GCAGCGACGA TGCTGAAAGC
GAAGGCGGCG AACTCGCGCC TGCCAGCGGC ACCGCGCTTG CGACCAGCAA GAAAAAAGAG
CCGACCGACC GTACCGACGA TCCCGTGCGC ATGTATCTGC GCGAGATGGG CTCGGTCGAG
CTATTGTCGC GCGAAGGCGA AATCGCCATT GCCAAGCGCA TCGAGGCTGG CCGTGAAACG
ATGATCGCCG GGCTCTGTGA GAGCCCGCTT ACTTTCCAGG CGTTGATTAT CTGGCGCGAC
GAATTGAACG AAGGCCAGAC GCTGCTGCGC GAGATCATCG ATCTCGAGAC GACCTATTCC
GGCCCGGAAG CCAAGGCTGC TCCGCAGTTT CAGAGCCCGG AAAAGATCGA AGCCGACCGC
AAGGCGGCAG AAGAGAAGGA AAAGGTCCGC AAGACTCGCG CTGCGGCCAA CGACGACGAC
ATCACCAATG TCGGCGGCGA AGGCCAGCCC GCCGAGGAAG AGGAGGACGA CGACGACGAG
TCGAACCTCT CGCTCGCCGC GATGGAAGCG GAACTGCGCC CGCAGGTGAT GGAGACGCTG
GACGTCATCG CCGAGACTTA CAAGAAGCTC CGCAAGCTGC AGGACCAGCA GGTCGAGGCG
CGCCTTGCCG CGACCGGAAC TCTGTCGCCG GCACAGGAGC GCCGCTATAA GGAACTGAAG
GACGAGCTGA TCAAGGCGGT GAAGTCGCTG TCGCTCAACC AGAACCGCAT CGACGCTCTG
GTCGAGCAGC TTTACGACAT CTCCAAGCGC CTGACGCAGA ACGAGGGCAG GCTGCTGCGC
CTGGCCGAAT CCTATGGTGT CAAGCGCGAG GCTTTCCTGG AGCAGTATTC CGGCGCCGAG
CTCGATCCGA ACTGGATGAA GTCGATCAGC AATCTCGCGG GCAAAGGCTG GAAGGAGTTT
GCCAGGGCGG AGAACCAGAC GATCCGCGAC ATCCGCCAGG AGATCCAGAA TCTCGCGACG
GAGACCGGCA TTTCCATCGC CGAATTCCGC CGCATCGTGT CCATGGTGCA GAAGGGCGAG
CGCGAAGCAC GCATTGCCAA GAAGGAAATG GTCGAGGCAA ACCTCCGTCT CGTGATTTCG
ATCGCCAAGA AGTACACGAA CCGCGGGCTT CAGTTCCTCG ATCTCATCCA GGAAGGCAAT
ATCGGCCTCA TGAAGGCGGT TGACAAATTC GAATACCGCC GCGGCTACAA GTTCTCGACC
TATGCGACCT GGTGGATCCG ACAGGCGATC ACCCGTTCGA TCGCCGACCA GGCCCGCACG
ATCCGCATCC CGGTGCACAT GATCGAGACG ATCAACAAGA TCGTCCGTAC CTCGCGCCAG
ATGCTTCACG AGATCGGCCG CGAGCCGACG CCGGAGGAAC TGGCGGAAAA GCTCGCAATG
CCGCTCGAGA AGGTGCGCAA GGTTCTGAAG ATCGCCAAAG AGCCGATCTC GCTCGAAACC
CCCGTTGGCG ACGAGGAAGA TTCGCATCTC GGCGATTTCA TCGAGGACAA GAATGCGCTG
CTGCCGATCG ACGCGGCCAT TCAGGCGAAC CTCAGAGAGA CGACGACTCG CGTGCTCGCC
TCGCTCACGC CGAGAGAGGA GCGTGTGCTG CGCATGCGTT TCGGCATCGG CATGAACACC
GACCACACGC TGGAAGAAGT CGGCCAGCAG TTCTCGGTCA CCCGCGAACG CATCCGGCAG
ATCGAAGCCA AGGCGCTGCG CAAGCTCAAG CATCCGAGCC GCTCGCGCAA GCTGCGCTCG
TTCCTGGACA GCTGA
 
Protein sequence
MATKVKENEE ADVEREGAPD GPLLDLSDDA VKKMIKAAKK RGYVTMDELN SVLPSEEVTS 
EQIEDTMSML SDMGINVIED EEAEEAGGSD DDDSSDDAES EGGELAPASG TALATSKKKE
PTDRTDDPVR MYLREMGSVE LLSREGEIAI AKRIEAGRET MIAGLCESPL TFQALIIWRD
ELNEGQTLLR EIIDLETTYS GPEAKAAPQF QSPEKIEADR KAAEEKEKVR KTRAAANDDD
ITNVGGEGQP AEEEEDDDDE SNLSLAAMEA ELRPQVMETL DVIAETYKKL RKLQDQQVEA
RLAATGTLSP AQERRYKELK DELIKAVKSL SLNQNRIDAL VEQLYDISKR LTQNEGRLLR
LAESYGVKRE AFLEQYSGAE LDPNWMKSIS NLAGKGWKEF ARAENQTIRD IRQEIQNLAT
ETGISIAEFR RIVSMVQKGE REARIAKKEM VEANLRLVIS IAKKYTNRGL QFLDLIQEGN
IGLMKAVDKF EYRRGYKFST YATWWIRQAI TRSIADQART IRIPVHMIET INKIVRTSRQ
MLHEIGREPT PEELAEKLAM PLEKVRKVLK IAKEPISLET PVGDEEDSHL GDFIEDKNAL
LPIDAAIQAN LRETTTRVLA SLTPREERVL RMRFGIGMNT DHTLEEVGQQ FSVTRERIRQ
IEAKALRKLK HPSRSRKLRS FLDS
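
The relationship between the two sequence blocks above can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced this record: the function names (`clean`, `translate`, `gc_content`) are hypothetical, and the codon lookup assumes that translation table 11 uses the same amino acid assignments as the standard genetic code (the two tables differ in which codons may act as start codons). Only the first and last lines of the gene sequence are used as input here.

```python
# Codons ordered by base T, C, A, G at each position (the usual NCBI layout).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq):
    """Drop the spaces and line breaks used for display and upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Raises ValueError if a codon contains a character other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna):
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and final line of the gene sequence in this record.
first_bases = "ATGGCAACAA AAGTCAAAGA AAACGAAGAA"
last_line = "TTCCTGGACA GCTGA"

print(translate(first_bases))  # prints: MATKVKENEE
print(translate(last_line))    # prints: FLDS
```

The two outputs match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_content` would give the fraction to compare with the reported GC content.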