Gene Smon_1393 details

Gene Information

Locus tag: Smon_1393
Symbol:
ID: 8601139
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptobacillus moniliformis DSM 12112
Kingdom: Bacteria
Replicon accession: NC_013515
Strand:
Start bp: 1528045
End bp: 1531095
Gene Length: 3051 bp
Protein Length: 1016 aa
Translation table: 11
GC content: 26%
IMG OID:
Product: alpha amylase catalytic region
Protein accession: YP_003306711
Protein GI: 269124134
COG category:
COG ID:
TIGRFAM ID:
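
The coordinate and length fields above are mutually consistent, which can be checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1528045
END_BP = 1531095


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3051
print(protein_length(length_bp))  # 1016
```

Under these assumptions the results agree with the listed gene length (3051 bp) and protein length (1016 aa).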


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAATATA TTTTAGGAAT TTACAAAAAT GGTAAATTTT TTAAAGAAGT AAAATTAAAT 
AAGAAAAATG ATAAATATAT ATATAAATGG GCAAAGGTAG ATGTTGCTTC ATATTTTTTT
ACTATAACTA AAGTAGATAA TAATGAGAGA GAAAATTTTA ATATTACTTA CAATCATACT
AGTCCATTTG CATCTGAATT TAGAGCATAT GCAGATAGTA AAAGTACTCC TAAATCTATT
GCAGGTATAC AATCAGGAAT AGATATATTA GTAGAATTTA ATCCACAAGA TTTTAGTTTT
TTATTACAAA AGAAGAGGTT TATAAAATTT AATATAGATA TATCTAAGTT TGGATTATCT
AACCCAAAAG TATTTGAAAT ATCTGGTGAT TTCAATAATT GGCACCCAGA AACAGAACCT
ATAAATCACA TAAAAGATAA TATGTATGAA GTAATTTTAA ATGTAGAAAA TGGATTTTAT
GAATATAAGT TTTTAATAGA CCATAAATGG TATCCTGAAA AAAATGAAAT ATTAGTTGTA
GGAGAAAATG GAAATCTCTT TCCTAAAGGC ATATTAGGTA GTGGTAATTT TATTAATGAA
ATTAGTAAAA ATATCAATAC TGGCAGTAAA ATAACCGCTA TATTACATGA TGCTAAAAAG
CTAATATATT TCAATAAAAT TACAGAAAAA GAATTTGAAA TTAGTATTAG AACTCAAAAA
TCTGATGTAG AAAGAGTATA TATATCAGTA ATTCCATATA CTAAAGATGG TAAGGGATTA
AATAGAATTT ATGAACTTGA AAGATATATT GATTATACTA ATTCTTTTGA CTATTTTAAA
AGAATGCTTA CTTTTTCAGA AGATGTTGAA AGTTTTGAAT ATTTTTTCAT ACTAGAAGAT
GGAGGAATAA AAAACTATTA TGGTGTAAAT GGACTTGAAG AGGAATTAAG TAAACCTCTA
ATTTATTCAA AAAAGGAAAA TGAGGATATC TTCTATATCC CATATTGGTC AAGAGAAGCT
ATATGGTATA ACATATTTCC TGATAGATTC TATAATGGTG ATATGTATAA TGATCCAATT
TTTAATGAAT TTGGTCCTGA AAAATTTAAA AAAAATTCAA ATCATGAAAG TAAATTTGTT
AAAGATTATA GATGGAACAG TAATGAATGT GCATTAGAAT TTGAAAGAAA TAGATGGTGT
TCTGATTTCA GTGAAAGAAC CAATTGGGAA ATACATATGG AATCAAATAT AGATTATTCA
CTTAAATATG CAAGAATGTA TGGTGGAGAT TTAAAGGGTA TTAAAGAGAA GATTCCATAT
CTTAAAAAAT TAGGTGTAAA TGCAGTATGG TTAAATCCTG TATTTTACTC TTTTCAAAAT
CATAAATATG GGGCTAACGA TTTTAGGCAT ATATCTCCTG ATTTAGGAAC TATTAGAACT
AGTGGTAAGC TTCATGATGT ATATATAGAT CCAAAAAATA GATACGGTAA CAAAAGTTAT
CTTGATGTAC TTGGTAAAGA TTCTGTAAAT AATTCAGAAC TTGAATTACT TGAAGTTAAT
TTAATAGGTG AGAATAAAGG TAAAAATGGA TATTTTGAAA CTGATGATCC CAGTAGTTGG
GTTTGGACAG AATCTGATTT AATAATGGTA GATTTAATTA AAGAATTACA TAAAAATGGT
ATTAGAGTAA TATTTGATGC TGTGTTTAAT CATAGTAGTA ACTACAATTT TGCATTTAAT
TTAGCTCTTG CTGAAGGAAA AAATTCAAAA TATACAAATT GGTATAAGTT TAATGATTAC
TCTAATTACA AAGAAGTTAC AGAAGATATG AGTGAAGAAG AGGCATATAA CACTGTAAAT
CTTAATAGAA CTAATCTTAA ATACAATGGT TGGGCTGGAT TTGACACATT ACCTGTTTTT
GATAGTTTTA ATGAGGATTG GAAAGACTAT ATCTTTAATA TTACAAGAAA ATGGATGCTA
GGTCCTGATG GTAAAGAACA CAGTAATTGG CAGGAAGATG ATGGTATAGA TGGTTGGAGG
CTTGATGTTC CTAATAATCT GGAAAATCAA ATGTTTTGGA AAGAATGGCG ACATGTTGTA
AAAAAATGTA AAAAAGAAGC ATATATTACT GCTGAACTTT GGTCTAATGC TGGTGATGAT
ATTAACTCAG GAGATAAATT TGATGCTGTA ATGAACTATG AATGGCTTAA AGCTGTAATA
GGTTATTTTA TTAATCAAGG TAAAGATTTT AATGAAAGCT ATAAGCTAAG TGCTGAACAA
TTTTTATTAG AATTAAGAGA AAAAAGAACA TGGTATCCAC TTCAATCTGT ACAAGCCTGT
CAAAACTTAA ATGGATCACA TGATACTGAT AGACTTTATT CAAGAATAAT AAATGATAAA
ATAGGTAGAA ATTTAGAAGA AGGTAAACAA TATGAAAAAG GATATAATAT TATTAGACCT
GATCTTGCTA GAAACTACCA CCCAAATACT ACTATTGATT GGAAAAATTA TCATATTAAG
CCTAAAGATA TTCTTAAATT AATATCAATA TTCCAAATGA CATATATAGG AGCTCCTATG
CTTTATTACG GAGATGAAGT TGGAATGTGG GGAGCAACAG ATCCATATTG TAGAAAACCT
ATGCTTTGGG ATGAGTTTTG GTATGATAAT GAAAAAAATA CTTCATATGT TAATAATGGT
GAAATATATT CTCAAAAGCC AGATATGGAT TTATTTGAAT GGTATAGAAA AATAATTAAG
ATACGTAAAG AACATAAATC TCTTGTATAT GGTAGAATAA AACAGGTATA TTTTGATAAT
GAAAAAGATA TTATTGCATA TGAGAGATAT GATAAGGAAG AAAGTATAAT AATTGTGTTA
AATAATAGTT TTCAAGATCA TAATGACATT ATTATTACAT CTTTCCATAA CGATAAAACA
TATCTTGATT TATTGTCAGG TAAAAAAATT AAAAGTGATA TAAATGGTAA AATGAAATTT
GATATTAAAG CTAAAAAAGG ATACATATTT TATTTATCAA AAAGGGAATA A
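
A GC content figure such as the one listed in the record can be derived from a sequence printed in this space-grouped layout. The following is a minimal sketch, not necessarily how the database computes its value: it strips the grouping whitespace and counts G and C bases. The helper name `gc_content` is hypothetical, and the example input is only the first line of the gene sequence above, so its result is not expected to equal the value for the whole gene.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for grouping."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above, used only as a short example.
first_line = "ATGGAATATA TTTTAGGAAT TTACAAAAAT GGTAAATTTT TTAAAGAAGT AAAATTAAAT"
print(f"{gc_content(first_line):.0%}")
```

Passing the full gene sequence (all lines joined) to the same function gives the value to compare against the GC content field.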
 
Protein sequence
MEYILGIYKN GKFFKEVKLN KKNDKYIYKW AKVDVASYFF TITKVDNNER ENFNITYNHT 
SPFASEFRAY ADSKSTPKSI AGIQSGIDIL VEFNPQDFSF LLQKKRFIKF NIDISKFGLS
NPKVFEISGD FNNWHPETEP INHIKDNMYE VILNVENGFY EYKFLIDHKW YPEKNEILVV
GENGNLFPKG ILGSGNFINE ISKNINTGSK ITAILHDAKK LIYFNKITEK EFEISIRTQK
SDVERVYISV IPYTKDGKGL NRIYELERYI DYTNSFDYFK RMLTFSEDVE SFEYFFILED
GGIKNYYGVN GLEEELSKPL IYSKKENEDI FYIPYWSREA IWYNIFPDRF YNGDMYNDPI
FNEFGPEKFK KNSNHESKFV KDYRWNSNEC ALEFERNRWC SDFSERTNWE IHMESNIDYS
LKYARMYGGD LKGIKEKIPY LKKLGVNAVW LNPVFYSFQN HKYGANDFRH ISPDLGTIRT
SGKLHDVYID PKNRYGNKSY LDVLGKDSVN NSELELLEVN LIGENKGKNG YFETDDPSSW
VWTESDLIMV DLIKELHKNG IRVIFDAVFN HSSNYNFAFN LALAEGKNSK YTNWYKFNDY
SNYKEVTEDM SEEEAYNTVN LNRTNLKYNG WAGFDTLPVF DSFNEDWKDY IFNITRKWML
GPDGKEHSNW QEDDGIDGWR LDVPNNLENQ MFWKEWRHVV KKCKKEAYIT AELWSNAGDD
INSGDKFDAV MNYEWLKAVI GYFINQGKDF NESYKLSAEQ FLLELREKRT WYPLQSVQAC
QNLNGSHDTD RLYSRIINDK IGRNLEEGKQ YEKGYNIIRP DLARNYHPNT TIDWKNYHIK
PKDILKLISI FQMTYIGAPM LYYGDEVGMW GATDPYCRKP MLWDEFWYDN EKNTSYVNNG
EIYSQKPDMD LFEWYRKIIK IRKEHKSLVY GRIKQVYFDN EKDIIAYERY DKEESIIIVL
NNSFQDHNDI IITSFHNDKT YLDLLSGKKI KSDINGKMKF DIKAKKGYIF YLSKRE
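
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence using translation table 11. A minimal sketch of that translation follows, assuming an unambiguous A/C/G/T sequence that starts in frame. Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its set of permitted start codons. This sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order (third base varies
# fastest). Translation table 11 shares these assignments with the standard code.
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGAATATA TTTTAGGAAT TTACAAAAAT"))  # MEYILGIYKN
```

The output matches the first ten residues of the protein sequence listed above.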