Gene Smon_1416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmon_1416 
Symbol 
ID8601162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptobacillus moniliformis DSM 12112 
KingdomBacteria 
Replicon accessionNC_013515 
Strand
Start bp1556689 
End bp1558542 
Gene Length1854 bp 
Protein Length617 aa 
Translation table11 
GC content32% 
IMG OID 
Productprotein of unknown function DUF87 
Protein accessionYP_003306728 
Protein GI269124151 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAGTC GTGAAATAGG GAAGATTACT TCAGTAGGCA TTCATGGAGT AATAGCTGAT 
GTAAATTCTG ATTTAGGAAA CTATATAAAT ACAATTGATG GAATCCTTTT TGTCGGAGAG
GTTGGATCAT ATGTATCAAT TTATGAGATA GGAAGAACTG TTATAGCAGA AATAATTGGA
GTAGATGAGA AGACTCAGTT AATTAATTCA AGGGAAATGA TCAAACCAAA TAGTAAAAGG
CAGGTTTACT TAAATTTGAT TGGAGAAATA GTTGAGGATA AATTCCAATT TGGAGTATCA
AAAATGCCAT TGATTTTCTC AACTGTTTAT ATTGTTTCAC AAAAGGAATT GATTACTATG
CTTGAAGTTG GTAAAGAAGA AATAAAGATT TCTGAAGAGT CTAATAAAAC TCGAGCGATT
TTGCTTACAA TAGGGAAATC AGTAATATTT CCGGATTACG ATGTTAAAAT AAATATTGAT
AAGTTCTTTG GGTTTCATTT TGCTGTTTTT GGAAATACTG GGGCAGGTAA GTCTAATACT
GTAGCTAGAA TTTTACAGAA TGTTTTTGTT AAAGATCATT ATTCGGCTAA AGGGGCGAAA
TTTGTAATAA TTGATTCTAA TGGTGAATAT AACAAGGCTT TTTCGAAGTT AAATGAAATT
AATCAAGATA TTAAACATTC TCTAATGATT GCAGATGAAG ATATTGATTC AAAGTTCGAA
ATACCAGTTT GGGCGTTATC AGCAGACGAT TGGGCAACAT TACTACATGC TTCTGAAAAA
ACGCAAATGC CTGTATTAAA AAGAGCGATA GACATTGCAC GAGTATTTTA TAGTTCTGAT
GAAACTAATC AGGAACTACG GAACCACATT CTTGCATCCA CATTACTGGG TATTATTCAG
AGTTCAGATT CTTCTCCATC TAAGTCGGAT AAACTTAAAG CTATAGTAAC AAAATTTGGA
ACTAATGAAA TTAAAATGGA TTCAGTTTTA TCGAATTCTA AAACATTAAG GCAATCCATG
AATATAAATT ATGGTTCAAT GCCCGATGAG GAAGCTGTTA TTTCATTTTT ATCTAATCAT
CTAAATCAAG AATTAATAAC AGAAAATATC ACACGATCAA TGGTTCCGTA TAGTTTAGAA
GATTTTAGCC AAGCGGTTGA GTTTGCGACT CTGTATGAAG GGAGTATTAG TTCACAGAGA
ATACAAGAAT ATACTGCAAC TTTAATGACC CGATTGAATA CCATTCAGGA AGGAATCCAA
GGACGCATTC TCTCGAGAAC AACATATAAT ACTATTGATG ATTATATAGA TATGTTATTG
GGTGAAAACC AAATAGTGGA TCTTGATATT AGCACACTGG ACGATGCTTC AGCAGAGGTT
GTAACAAAAG TTTTGGCTAA ACTTTTATTA GATTATTTGA AGAGAAGAGA AATAAAAGCA
GATTCACCGA TAAATTTTAT AATCGAAGAA GCACATAGAT TCATAAAAAA CGAAGCAAAT
TATGGAGCGG TTGGATATAA TATTTTTGAA AGAATTGCTA AAGAAGGTCG CAAATTTGGA
ATGCTTTTGG GAATATCATC TCAAAGACCA AGTGAATTGT CTAAAACAGT AGTATCACAG
TGTAGTAATT TTATTGTTCA TCGTGTACAA AACCCGGATG ATTTGCAATA TATATCTAGA
ATGGTTCCAT ACATAAATCA GAATATGATA GAAAGGCTTA CTTATCTTCA GACAGGAAAT
GCATTGGTTT TTGGTAGTGC AATAAATCTT CCGACATTAA CTAAATTTGC TCAAGCGAAT
CCTACAACAG ATAGTGATAA TGCAAAAATA TCAGAAAAAT GGTACATTGA ATAA
 
Protein sequence
MSSREIGKIT SVGIHGVIAD VNSDLGNYIN TIDGILFVGE VGSYVSIYEI GRTVIAEIIG 
VDEKTQLINS REMIKPNSKR QVYLNLIGEI VEDKFQFGVS KMPLIFSTVY IVSQKELITM
LEVGKEEIKI SEESNKTRAI LLTIGKSVIF PDYDVKINID KFFGFHFAVF GNTGAGKSNT
VARILQNVFV KDHYSAKGAK FVIIDSNGEY NKAFSKLNEI NQDIKHSLMI ADEDIDSKFE
IPVWALSADD WATLLHASEK TQMPVLKRAI DIARVFYSSD ETNQELRNHI LASTLLGIIQ
SSDSSPSKSD KLKAIVTKFG TNEIKMDSVL SNSKTLRQSM NINYGSMPDE EAVISFLSNH
LNQELITENI TRSMVPYSLE DFSQAVEFAT LYEGSISSQR IQEYTATLMT RLNTIQEGIQ
GRILSRTTYN TIDDYIDMLL GENQIVDLDI STLDDASAEV VTKVLAKLLL DYLKRREIKA
DSPINFIIEE AHRFIKNEAN YGAVGYNIFE RIAKEGRKFG MLLGISSQRP SELSKTVVSQ
CSNFIVHRVQ NPDDLQYISR MVPYINQNMI ERLTYLQTGN ALVFGSAINL PTLTKFAQAN
PTTDSDNAKI SEKWYIE