Gene Smon_1043 details

Gene Information

Locus tag: Smon_1043
Symbol:
ID: 8600771
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptobacillus moniliformis DSM 12112
Kingdom: Bacteria
Replicon accession: NC_013515
Strand:
Start bp: 1132196
End bp: 1133503
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 31%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: YP_003306383
Protein GI: 269123806
COG category:
COG ID:
TIGRFAM ID:
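
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1132196
end_bp = 1133503

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # 1308 bp
print(protein_length, "aa")   # 435 aa
```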


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATAAAA AATTTAAAAT ATTAATATCT GTAATTTTTT TAGGTATTAC TGTATTTTCT 
TGTGGGACTA AAAATAGTAA AGATGAAGTA GTTACAATTA AATATTGGTC TTTCCCTAAC
TTTAATGCAG ATTCTGAATT AAAAACACCA GAAGAATTTG ACATGGCTTT AATAAAAGCT
TTTGAAGAAG CTAATCCAGA TATTAAAGTT GAATATCAAA AAATAGATTT TACAGATGGA
CCAGCAAAAC TTGAAACATC TATTATTTCA AAATCTAATC CTGATGTTAT TATAGATGCA
CCAGGTAGAG TAATAGATTG GGCTAAAAAA GGATATTTAG TTCCTTTTGA TATAGATACA
TCTATTTATT CTAACACAAT AGTTTCAGCT GCAAGTCATG AAGGAAAATT GTATCTATAT
CCTTTAGGAA CTGCACCATT TGTTATGGCA TTTAATAAGG TAATTACAGA TAAATTAGGT
CTTACTCACA TGTTGCCATT AGATAGAGAA GGAAGAAATT GGACTGTTGA AGAATTTGAA
GCTCTGTTAA TGGCTATTAA AGAAAAAGAT CCAAGTATAG ATCCAATAAT ATTTTTCAAT
AAAACACCAG ATGGTAGTCA TGGATCAAGA TCTTTTGTTT TAAACTTATT TGATACTTGG
CTTACAGATA AAGATATAAC TAAATATATT GTTAATAATG AAAGAGGAGT TAAAGGTTTA
GAATGGGCTA AAAAAGCACA TGATATGGGA CTTTTAGGTG ATGGTGCTTC TTCAGAAGCA
AGAGATGCAT TGGAAGCATT TAGAAGTGGT CTTGCAGCAG GAACTATGAT TTATTCACCA
GGTTTAAATG CTATAAGTTC TAACCAACAA GCTAAGGCAG AAGGTAGATT AGATCCAGTT
TATGTTGCTA TGCCAAATAA TGGAGGGCAG GCTAAATATG AATTATTATT AGCAGGAGCT
GCTGTATTTA ATAATAATGA TGAGGCAAAG ATAGAAGCTT CTAAAAAATT TGTAGATTTT
GTAATAAATG ATCCAGTGTG GGGACAAAGA GCTCTTAAAG CAACAAGAAA CTTCTCACCA
GTTGGAAAAA CAGGATTATA TGGAGATGAT GAGGAAACTA AGTTTATAGA AAATATAAAC
AGTAATGGAA ATTATGGTCC TTATTACAAC ACTATAGATG GTTTTGCTCA AATGAGACCA
TTATGGTCAA ATATGGTTCA AGCTGTATTA AATGGTCAAA TAAGTCCAAA AGCTGGATTA
GATAAATTTG TTATAGATGC AACTAAAGCA ATGGAAGATG CTAAATAA
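
The GC content field in Gene Information can be recomputed from a sequence laid out in 10-base blocks like the one above. This is a minimal sketch, assuming GC content is simply the share of G and C bases among all bases; the record does not state how its own figure was computed or rounded, and `gc_content` is a hypothetical helper name.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the spaces and line breaks used to lay out the
    sequence in 10-base blocks) is removed before counting.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the full gene sequence in place of this first line.
first_line = "ATGTATAAAA AATTTAAAAT ATTAATATCT GTAATTTTTT TAGGTATTAC TGTATTTTCT"
print(f"{gc_content(first_line):.1f}% GC over the first 60 bases")
```

Running the function on the complete gene sequence, rather than on the first line alone, gives the value to compare with the GC content field.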
 
Protein sequence
MYKKFKILIS VIFLGITVFS CGTKNSKDEV VTIKYWSFPN FNADSELKTP EEFDMALIKA 
FEEANPDIKV EYQKIDFTDG PAKLETSIIS KSNPDVIIDA PGRVIDWAKK GYLVPFDIDT
SIYSNTIVSA ASHEGKLYLY PLGTAPFVMA FNKVITDKLG LTHMLPLDRE GRNWTVEEFE
ALLMAIKEKD PSIDPIIFFN KTPDGSHGSR SFVLNLFDTW LTDKDITKYI VNNERGVKGL
EWAKKAHDMG LLGDGASSEA RDALEAFRSG LAAGTMIYSP GLNAISSNQQ AKAEGRLDPV
YVAMPNNGGQ AKYELLLAGA AVFNNNDEAK IEASKKFVDF VINDPVWGQR ALKATRNFSP
VGKTGLYGDD EETKFIENIN SNGNYGPYYN TIDGFAQMRP LWSNMVQAVL NGQISPKAGL
DKFVIDATKA MEDAK
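
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (the two tables differ in which codons may act as start codons). `CODON_TABLE` and `translate` are hypothetical names introduced for this example.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid
# assignments; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 48 bases of the gene sequence.
print(translate("ATGTATAAAA AATTTAAAAT ATTAATATCT"))  # MYKKFKILIS
print(translate("GATAAATTTG TTATAGATGC AACTAAAGCA ATGGAAGATG CTAAATAA"))  # DKFVIDATKAMEDAK
```

Both fragments reproduce the corresponding ends of the protein sequence, including the stop at the final TAA codon.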