Gene Smon_0110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmon_0110 
Symbol 
ID8599808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptobacillus moniliformis DSM 12112 
KingdomBacteria 
Replicon accessionNC_013515 
Strand
Start bp115556 
End bp117409 
Gene Length1854 bp 
Protein Length617 aa 
Translation table11 
GC content32% 
IMG OID 
ProductPTS system, fructose subfamily, IIC subunit 
Protein accessionYP_003305482 
Protein GI269122905 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTAA CAGAATTATT AAGAGAAAAT CAGGTTATCT TCAATTTAAA TGCAGATAAT 
AAAAAAGATG CAATAATTGA AATGGCAAAG GTATTTAAAC CAGATGTAAT TAATGATCAA
GAAAAATTCA TTGAAGACTT ATTTGCAAGA GAATCATTAT CACATACTGC ACTAGAGCTT
GGAGTTGCAA CTCCACATGC TAAATCTCGT GGGGTAAGTA AACCAGCATT GGTTATAGCC
ATAAAAAAAG AGGGTATAGA TTTTAGTGAA GGTCAAGAGG ATAAATCAAA GCTATTTTTT
ATGATAGCTG TTCCTGAAAA TGAGGGTAAT TTACATATAG ATATATTAAC TAAACTTGCA
GATGTAATGT TAGATAATGA TAAACTAAAT GCATTATTAA ATTCTACAAG CTATGATGAA
GTTATAGATA TTATAGAAAA GGAAAAAATT ATGGAAAATA AAGAAAGTGA AAAATTTGTA
GTAGCTGTAA CAGCATGTCC TACAGGTATA GCTCATACAT TTATGGCAAA AGATGCTTTA
ATTAAAGCAG CTAAAGAATT AGGAGTGAAT ATTAAGGTTG AAACTAATGG GACAAATGGA
AGAAAAGATG AAATTACTAA AGAGGATTTA GAAAAAGCAA GTGGAGTAAT ACTTGCTATA
AATAAGAGTG TTAATGAAGA AAGATTTAAT GGATATAAGG TAATAAAGGT TGGAGCAAAA
GACGGTATTA ATAAAGCAAA AGAGTTAATT TTAGATACTT TATCTGGTAA GGGAACTATT
GCTAATTTTG AAAGTTCTGG AAATTCTACT TTTATGAATA ATGGTAAAAA AGGTATGTAT
AATCACTTAA TGTCAGGTGT TTCATACATG TTACCATTAG TAATAAGTGG TGGAATATTA
ATAGCACTTG CTTTCTTATT TGATAGTTTA GCAGGAAATT CTAATGTTGG TGGAGGATTT
GGATCTACTT CTAAACTTGC AGCGACATTT ATGCAAATAG GTGGAGCAGC TTTCGGATTA
TTTGTTCCTA TACTTGCAGG ATATGTTGCA TATAGTATAG GTGAAAAATC ATCTCTTGCA
GCAGGACTTG TAGCTGGAGC TCTTGCATCA AGTGGTGGTT CAGGATTTTT AGGAGCATTA
GTTGGTGGAT TATTTGCAGG ATATGTAACT AAATTTTATT CTAAGGTTAC TTCAAATATT
AAAAAACAAT TACAAGGAAT TAATCTTATA CTATTTACAC CTGTTATAAC AGTTTTACTT
ACAGGGCTTG TTATGCTATT TTTATTAAAT CCTATGGTTA GTGGTATTAA TACTGGAATA
ACTAATTTCC TTGAAAGTAT GAGTGCAAGC TCAAGAATAC TTTTAGGTGC ATTACTTGGT
GGTATGATGG CTGTAGATAT GGGTGGACCA GTTAATAAAG CAGCATATGT ATTTGGTACA
GGAACATTAG CTGCAACAGT TTCTACTGGT GGTTCATCAG CTATGGCGGC AGTTATGGCA
GGGGGTATGG TTCCTCCACT TGCAATAGCT ATTTCAACTA CTGTATTTAA GAATAAGTAC
AATAAGGAAG AAAGAGAAGC AGGACTTTCA AATTATATAA TGGGGATTTC ATTTATAACA
GAGGGGGCAA TACCATTTGC AGCTGCAAAT CCTTTAAGAG TATTACCTGG AGCAATAATA
GGTGCAGCAA TTTCAGGAGC TTTAACTATG TTATTTAATA TTAAAATACC AGCTCCTCAT
GGAGGAATAC TTGTAATGTT CTTAAGTTCT AACTTCTTCT TATACTTACT TGCAATAGTA
ATAGGTTCTA TAGTGGGAGC AATTATTTTA GGGCTTTTAA AAGAAAAAAG ATAA
 
Protein sequence
MKLTELLREN QVIFNLNADN KKDAIIEMAK VFKPDVINDQ EKFIEDLFAR ESLSHTALEL 
GVATPHAKSR GVSKPALVIA IKKEGIDFSE GQEDKSKLFF MIAVPENEGN LHIDILTKLA
DVMLDNDKLN ALLNSTSYDE VIDIIEKEKI MENKESEKFV VAVTACPTGI AHTFMAKDAL
IKAAKELGVN IKVETNGTNG RKDEITKEDL EKASGVILAI NKSVNEERFN GYKVIKVGAK
DGINKAKELI LDTLSGKGTI ANFESSGNST FMNNGKKGMY NHLMSGVSYM LPLVISGGIL
IALAFLFDSL AGNSNVGGGF GSTSKLAATF MQIGGAAFGL FVPILAGYVA YSIGEKSSLA
AGLVAGALAS SGGSGFLGAL VGGLFAGYVT KFYSKVTSNI KKQLQGINLI LFTPVITVLL
TGLVMLFLLN PMVSGINTGI TNFLESMSAS SRILLGALLG GMMAVDMGGP VNKAAYVFGT
GTLAATVSTG GSSAMAAVMA GGMVPPLAIA ISTTVFKNKY NKEEREAGLS NYIMGISFIT
EGAIPFAAAN PLRVLPGAII GAAISGALTM LFNIKIPAPH GGILVMFLSS NFFLYLLAIV
IGSIVGAIIL GLLKEKR