Gene Smon_1116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmon_1116 
Symbol 
ID8600845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptobacillus moniliformis DSM 12112 
KingdomBacteria 
Replicon accessionNC_013515 
Strand
Start bp1233864 
End bp1235642 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content27% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003306454 
Protein GI269123877 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGG GATTATCAGT AAGTTTAATA TTAGCTTCTT TAGGATTGTT AATGTCTTGT 
TCACCAGGTA ATTCAAGAAC AGAAAAAATA AGTAAGGTAG ACTTAAACTT TCCATTAGAA
TATACAAATG AAGCACCAGA AATAGAAGGA GCAACATATA CAATTGCTAT GGTTACATCT
AGTCCATTTA AAGGAATTTT TTCACCATTA CATTATCAAG ATAATAATGA TGGAAACATT
ATTCAATTGA TAAATGAGGA ATATACATGG AAAAATGAAG ATTTTGAAAT ATTAGATGTA
GAAGGTGGAA TTGCTACTTT ATCATTAGAT ACTGAAAAAA ATCAAATGGT TATTAAATTT
AAAGAAGGTC TAAAATGGTC AGATGGACAT CCTATGGGTG TTGATGATTT AATTTATACT
TTTGAGGTAG TTGCTCATCC TGATTATACA GGAGTTAGAT TTTCACCAAT AGATTATTAT
AAGATAAAGG GATTAAAAGA ATATAATGAA GGTAAAGCTT CTAAGATTTC AGGATTAGAG
AAAATTTCAG ATACTGAATT AAGAATTAAT GTAGATGAAA TATCATCTAA AGTAATTACT
GGTGGAGGTC CATTAGTTCC TAACTTATTA CCAAAACATG ATTTAGAAAA TATACCTGTT
AAGGATTTAG AATATTCAGA TAAAATTAGA AAAAATCCTG TAGGAAATGG TAAATATGTA
ATTAAAAATA TAGTTCCTGG AGAATCTATA GAATTAGTTC CAAATGAATA TTATCATTTA
GGAAAACCAA AAGTTGCAAA AATTATTTTA AAAACAATTA ATCCACAATT AGCTGTTGAA
TCTTTAAAAA ATGGAGATTT CTTCCAATAT TTAGATTTAC CACAAGATTC ATATGATAAA
TATAAGGATT TAAGTAATAT AACTGTAATA GGAAGACCTG ATTTATATAT ACAATATTTA
GGATTTAATT TAGGGCATTT TGATAAAAGT AAAAATGAAA GTGTAACTGA CAGAGATACA
CCATTACAAG ATATTAATGT AAGAAAAGCT TTATCATATG CATTAAATAT AGATGAAATA
GCTACAGCTT ACTATAATGG TCTAAGACAA AGAGCTAATG GTCATACTCC ACCTATATTT
AAAAAATATT ATGATGAAAC TTTAGAAGGA TATCCATATA ATCCAGAAAA AGCAAGAGAA
TTATTGAAAA AAGCAGGGTA TGAAGATACT AATGGAGATG GAATTGTTGA TAAAGATGGT
AAAAATTTAG AATTGAAATT TGCAACTATG GCAGGTTCAG ATGTTGCTGA ACCAATTGCA
CAAGCATTTT TACAATATTG GAAAGAAATT GGAGTTAATG TTACATTAAC TACAGGAAGA
TTACTAGACT TTAATTTATT CTATGATAAA TTAGAAGCAA ATGAAGATGA TATAGATATA
TATATGGCAG CATGGGGAGT AGGAACATCT TTAGATCCTT CAGCTTCAAA AGCTAGAGAT
GCTATGTTTA ACTTTACTCG TTTCACTAGT GAAGAAAATG ATAAATTATT AGAAGCAGTT
TCAAGCTCTA AAGGTCTTTC TGACTTAGAA TATAAGGTTA AGGCATATAA AGATTGGCAA
AAATATTTCG TTGAACAAGC AGTAGAAGTA CCATTAATGT TTAGATATAA GGTATCACCT
GTAAATAAAA ATGTTAAACA TGATAATTTA TATACAGATA CTAAGAGATC TAATATAATT
GAATCTGTTG TAAATTCTTT ACCTGAAAAG GCAAAATAA
 
Protein sequence
MKKGLSVSLI LASLGLLMSC SPGNSRTEKI SKVDLNFPLE YTNEAPEIEG ATYTIAMVTS 
SPFKGIFSPL HYQDNNDGNI IQLINEEYTW KNEDFEILDV EGGIATLSLD TEKNQMVIKF
KEGLKWSDGH PMGVDDLIYT FEVVAHPDYT GVRFSPIDYY KIKGLKEYNE GKASKISGLE
KISDTELRIN VDEISSKVIT GGGPLVPNLL PKHDLENIPV KDLEYSDKIR KNPVGNGKYV
IKNIVPGESI ELVPNEYYHL GKPKVAKIIL KTINPQLAVE SLKNGDFFQY LDLPQDSYDK
YKDLSNITVI GRPDLYIQYL GFNLGHFDKS KNESVTDRDT PLQDINVRKA LSYALNIDEI
ATAYYNGLRQ RANGHTPPIF KKYYDETLEG YPYNPEKARE LLKKAGYEDT NGDGIVDKDG
KNLELKFATM AGSDVAEPIA QAFLQYWKEI GVNVTLTTGR LLDFNLFYDK LEANEDDIDI
YMAAWGVGTS LDPSASKARD AMFNFTRFTS EENDKLLEAV SSSKGLSDLE YKVKAYKDWQ
KYFVEQAVEV PLMFRYKVSP VNKNVKHDNL YTDTKRSNII ESVVNSLPEK AK