Gene Smon_1115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmon_1115 
Symbol 
ID8600844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptobacillus moniliformis DSM 12112 
KingdomBacteria 
Replicon accessionNC_013515 
Strand
Start bp1231887 
End bp1233677 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content30% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003306453 
Protein GI269123876 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA TATTAACAGC GAGTTTAGCA CTTGCAAGTT TAGGACTTTT GGTATCGTGT 
GGTATAGGAA AATCTAAATC AGAATCAGGA GAAGATTCTA AAATTAGTTT AAGTTTTCCA
AGAGAGTATA AAAATCCAGG AATAGTTGAT GAAAATGCAG AATATAAAGT TGCACTTGTT
GTTTCATCAC CTTTTAAAGG ATTATTATCT AGTTTACATT ACACTGATGC TTATGATTTA
GAAATATTAG CTTTCATAAA CGAAGAAATC TTATGGAAAA ATGAAGATTT TGAAACATTA
AATGTTGAAG GTGGATTAGC AACATTATCA TTAGATTTAA AGAATAAAAA AGTAATATTT
AAATTTAAGG AAGGATTAAA ATGGTCAGAT GGACATCCTT TAGGAGCAGA TGATTTAATC
TATACTTTAG AAGTTTTAGG GCATAAAGAT TATACAGGAA TTAGATATGA TGAATCTGAA
CATAGTAAAA TAGTTGGAAT GACTGAATAC CATTTAGGTA AGGCTAATAA AATTTCAGGA
GTTACTAAAG TTTCTGATAC AGAAGTACAT ATACAATTAA CAGAGAATAA TGCTAAAGTT
GTTACTAGTG GAGGACCGAT TACAGGTATA GGAAGTATAC TTCCAAAACA TTATTTATCA
GATATACCTA TGAAAGATTT AGAAAGTTCT GATAAATTAA GAAAAAATCC TCTTTCAAAT
GGACAATTTG TTATAAAAAA CATAGTACCA GGAGAAGCAG TAGAATTAGT ACCTAATGAA
TACTATCATT TAGGAAAAGT AAAAGTTGGA AAGGTAACAA TTAAGACTAT TGCACCACAA
TTAGTAGTTG AAAGTATGAA ACAAGGGGAG TATCATTCAT ATATAGGAGT GCCTGGTAGT
TCTTATGATA AATATTCTGT TTTAGATAAC TTAGCACTTA TTGGTAGACC AGATCTTTAT
TATTCATATT TAGGATTCAA TTTAGGTCAT AGAGATAATG AGAAAAAAGA AAATATAATG
GATAGAGATA CACCATTACA AGATGTTAAT TTAAGAAAGG CTTTAGCTTA TGCATTAGAT
GTTGATCAAG CAGCTCAAGC TTTCTATAAT GGATTAAGAA CTAGAGCTAA TGGGCAAACT
CCACCAATAT TTAAAAAATT CTATGATTCA ACAAATGTTG GATATCCATA TAATCCTGAA
AAGGCAAAAG AATTATTAAA GAAAGCAGGA TATGAAGATA CAGATGGAGA TGGATTTGTT
GATAAAGATG GTAAGAGATT AAAACTTAAA TTTGCATCTA TGTCTGGATC AGATGTCGCT
GAACCATTAG CACAATTCTA TATACAAAAT TGGAAAGAAA TTGGTGTTGA TGTTGAGTTA
ACAAACGGAA GATTACTTGA ATTTCAATTA TTCTATGAAA AAGTAAAAGC AAATGATCCA
GATATTGATA TGTATGCTGC TGCATGGGGA GTAGGAACAT CACTTGATCC TTCACAATCA
AATTCAAGAT ATGCTGCATT TAATATGACA CGTTTTGTAA GTGAAGAAAA TGATAAATTA
CTTGCTGCAA TTTCAGATGA CAAAGGATTA GAAGATCCAA ATTATAAGGC AGAAGCATAT
AAGAAATGGC ATAAATATTA TTTAGATCAA GCAGCTGAAG TACCATTAAT GTTTAATTAC
AAGATATCAC CAGTTAATAA AGCAATAAAG AGTGCTAATT CATATACAGA TACTGCAAGA
AATGTAATAT TTGAAGGTGT TGTAAGTGCA CCAATTAAAG CAACTAACTA A
 
Protein sequence
MKKILTASLA LASLGLLVSC GIGKSKSESG EDSKISLSFP REYKNPGIVD ENAEYKVALV 
VSSPFKGLLS SLHYTDAYDL EILAFINEEI LWKNEDFETL NVEGGLATLS LDLKNKKVIF
KFKEGLKWSD GHPLGADDLI YTLEVLGHKD YTGIRYDESE HSKIVGMTEY HLGKANKISG
VTKVSDTEVH IQLTENNAKV VTSGGPITGI GSILPKHYLS DIPMKDLESS DKLRKNPLSN
GQFVIKNIVP GEAVELVPNE YYHLGKVKVG KVTIKTIAPQ LVVESMKQGE YHSYIGVPGS
SYDKYSVLDN LALIGRPDLY YSYLGFNLGH RDNEKKENIM DRDTPLQDVN LRKALAYALD
VDQAAQAFYN GLRTRANGQT PPIFKKFYDS TNVGYPYNPE KAKELLKKAG YEDTDGDGFV
DKDGKRLKLK FASMSGSDVA EPLAQFYIQN WKEIGVDVEL TNGRLLEFQL FYEKVKANDP
DIDMYAAAWG VGTSLDPSQS NSRYAAFNMT RFVSEENDKL LAAISDDKGL EDPNYKAEAY
KKWHKYYLDQ AAEVPLMFNY KISPVNKAIK SANSYTDTAR NVIFEGVVSA PIKATN