Gene Smon_0185 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmon_0185 
Symbol 
ID8599883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptobacillus moniliformis DSM 12112 
KingdomBacteria 
Replicon accessionNC_013515 
Strand
Start bp224120 
End bp225568 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content29% 
IMG OID 
Productpeptidase U34 dipeptidase 
Protein accessionYP_003305549 
Protein GI269122972 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA CATTAAGTTT AGGATTATTA CTTTTTTCTA TACAAGCATT AGCTTGTACA 
GGGTTTATTG CAGGTCGTGA TGCAACTGTT GATGGATCTA TGATTTATGC TAGAAATGAA
GATCTTAAAG GAGTTAATCC TAAAAAATTT ATGATAGTTG AAGCCAAAAA ATATAAGTCA
GGGGAAATGT TCATAAATCC AGATACAGGA TTTAAATGGT CTTACCCTAA AAAAGCATTA
AGACATAGCT TAATTCCTGA TGCTGATCCG AGTTATGGAA TATTTGGAGA AGCTGGGATT
AATGAACTAG GAGTAAGTGT TTCTGCAACA GTTTCAGCAA ATGCTAATGA ACAAATTCTT
AAATTAGATC CATATGTAAA AGGTGGATTA ACAGAACCAG ATATAGCTTC ACTTGTATTA
ATGCAGGCTA AAACTGCAAG ACATGCAGTA GAAATTATTG CAGAAATAGT TGAAAAAGTT
GGGGCTGGAG AAGGTAATGT TATAGTATTT GCAGATAAAA ATGAAATGTG GTATATGGAA
ATTTATACTG GACATCAATA TACAGCAGTT AAAGTACCTA ATGATGTCTA TGCAGTAATT
CCAAATGCTT TCTATTTAGG AAGTTATAAT TTTAAAGATA AAGATGTAAT TTCTTCTAAA
GATATAGAAA AATTACCTGA AAGTAATGGA CTAGCAAAGA CAGTTGATGG TAAATTCCAT
TTAGCTTTAA CATATAGGGA AATACATTCT AAATATAACT ATGATAGAAT AGCTATGGGA
CAAAATCTTT TCTGTCCATT ATTACCAGTA GAACATAATA GTGAAATAGC TTATGAACTT
TTTAGAAAAC CAGATAAGAA AATATCTGTT AAAGATGTAA TGAATTTTTT AAGATACAGA
TATACAGGGA CAGAATATGA TGTAGATACT GTTGATAATA AAGGGAAAAT AAGAGCTATA
GGAACAGATA CTAATTTAGA AGCACATATA TTCCAAATTA GAGAAAATGC TCCTACAGTT
ATGTGGTTAG CTATGGGAAC AGTAGAACAT TCAGTATTTG TTCCATATTA TGAATATATA
ACTAAGACAC ATAAGAGCTA CACAGCAGAT GCAAAAGAAT TTAATAGAGA TTCTATGTAT
TGGGCAATGA AAGGACTTCA TATACTTGCA AGAGAAGATA GAATCAGATA TGGTTTAGGT
GTAAAAACTT ATTGGGATAA AGTGGAAAAT GATTTTATTA CAATGTTAAA AGAAGAAGAT
AAGATTATAA ATTCTAAAAA AGGATTAGAT AAAATTAGAT ATGCAAACAA ACTTGGATTG
ATGAAAGCAG AAAATGTTAA AAAAGATGCT GATAAAATGT TTAATCAATT GATGTATTTT
AAAGGCAGTA TGACAGATAT GACAATTCAA GGTAAAAATA AGAATGCCAA ATTCGAGTAT
AAAAAATAA
 
Protein sequence
MKKTLSLGLL LFSIQALACT GFIAGRDATV DGSMIYARNE DLKGVNPKKF MIVEAKKYKS 
GEMFINPDTG FKWSYPKKAL RHSLIPDADP SYGIFGEAGI NELGVSVSAT VSANANEQIL
KLDPYVKGGL TEPDIASLVL MQAKTARHAV EIIAEIVEKV GAGEGNVIVF ADKNEMWYME
IYTGHQYTAV KVPNDVYAVI PNAFYLGSYN FKDKDVISSK DIEKLPESNG LAKTVDGKFH
LALTYREIHS KYNYDRIAMG QNLFCPLLPV EHNSEIAYEL FRKPDKKISV KDVMNFLRYR
YTGTEYDVDT VDNKGKIRAI GTDTNLEAHI FQIRENAPTV MWLAMGTVEH SVFVPYYEYI
TKTHKSYTAD AKEFNRDSMY WAMKGLHILA REDRIRYGLG VKTYWDKVEN DFITMLKEED
KIINSKKGLD KIRYANKLGL MKAENVKKDA DKMFNQLMYF KGSMTDMTIQ GKNKNAKFEY
KK