Gene Smon_1023 details


Gene Information

Locus tag: Smon_1023
Symbol:
ID: 8600751
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptobacillus moniliformis DSM 12112
Kingdom: Bacteria
Replicon accession: NC_013515
Strand:
Start bp: 1114858
End bp: 1116555
Gene Length: 1698 bp
Protein Length: 565 aa
Translation table: 11
GC content: 27%
IMG OID:
Product: oligoendopeptidase, M3 family
Protein accession: YP_003306365
Protein GI: 269123788
COG category:
COG ID:
TIGRFAM ID:
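
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1114858
END_BP = 1116555
GENE_LENGTH_BP = 1698
PROTEIN_LENGTH_AA = 565


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


print(span_length(START_BP, END_BP))            # 1698
print(expected_protein_length(GENE_LENGTH_BP))  # 565
```

Under these assumptions both values agree with the record: 1116555 - 1114858 + 1 = 1698 bp, and 1698 / 3 - 1 = 565 aa.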


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTTTTTA AAGACTTTGA ATACAAAAGA GTTAATATTG AAGAGTTAAA AAAAACATAT 
ACAAAAATTA AAGAAGAGCT TGAAAATGCT GAAAATGCAC AAAAAGCAAT AGAACTTTAT
TATAAGCTTG ATGAGATTAA TAAAGAATTT TCAACTAATT ATTCATTAGT TTATGTTAGA
AATAGTATAG ATACTACAGA TGAATTTTAT TCTAAAGAAA AGGCTTTTTA TGATGAAAAT
ATACCATTAA TATCAGAATT TAGTAATGAT TTAGGAATAG CTTTTGCTAA TTCTAAGTTT
AGATCTGAAT TAGAAGAAAA ATTTGGTAAA CTTGCATTTC AAAAAATAGA TTTAGAATTA
AAATTATTCA AAAATGAAAT TATTGAAGAC TTACAAGAAG AAAATAAGTT AGTAACAGAG
TATGTAAAAT TAACATCAAG TGCTAAAATA TTATTTGATG GAGAAGAAAG AAATCTTTCA
GAAATGGTTC CATATACTCA AGATGTAGAT AGAAATACAA GAAAAGAAGC TATAATAGCT
GTAGGAAAAT TCTTTGAAGA TAATATGGCA GAATATGATA GAATATATGA CATGTTAGTT
AAAGTAAGAG ATAGAATAGC TAAAAAACTT GGATATAAGA ACTTTGTTGA AGTTGGATAT
TTAAGAATGG GAAGACTAGA TTACAATGCA GAAGATGTTG CAAATTATAG AAAACAAATA
AAAGAAAACG TAGTTCCACT ATATGTTGAA TTGAGAAAAA GACAAGAAAA GAGAATAAAG
GTTGATAAAT TAAAATATTA TGATGAAGGA ATGGCATTTT TAACAGGTAA TCCAACTCCT
AAAGGTGATA GAGCATGGAT GGTTGAAAAA GCACAAATTA TGTATAAAGA ATTATCTCCT
GAAACACATG AATTTTTCTC AAAAATGGTT GAACAAGAAT TACTTGATTT AGATAGTAAA
AAAGGAAAAC AAGGTGGAGG ATATTGTACA TCTTTTGATT CATATAGCAT GCCATTTATA
TTTGCAAACT TTAATGGTAC AGCACATGAT GTTGAAGTTT TAACACATGA GGCAGGACAT
GCATTTCAAT CATATCAAGC TATGAGAAAT GTTGATATAT CTTCATATTA TTGGCCTACA
TCAGAATCAG CAGAAATTCA TTCTATGAGT ATGGAATTTT TAACTTGGCC ATGGATGGAA
TCATTCTTTA AGGAAGATAT AGATAAGTTC AAATATCATC ATTTAAGTGG TGCATTCTTA
TTTATACCTT ATGGAGCTTT AGTTGATGAG TTTCAACATT TTGTGTATGA AAACCCTAAT
GTAACACCAG AAGAAAGAAG AATGAAATGG TTAGAACTTG AAAAAGAATA TTTACCTACA
AGAGATTATG ATGGAATAGA ATCATATCTA AAAGGATTAT TCTGGTTTAA ACAAGGTCAT
ATATTTGAAA TACCATTTTA TTATATAGAT TATACTTTAG CACAAGTAAT TGCATTACAA
ATGTGGAAGT TAAATGGAGA AAATAGTAAA TTAGCATGGG AAAAATATAT GAGATTATGT
ACTCTTGGAG GATCTAAAAC TTTCCTTGGA TTATTAGAAG ATGTTAAATT AGACAATCCA
TTTGAAAATG GAAGTATAGC AAAAATTATT ACACCAGTAA AAGAGTTTTT ATCAACAATT
AATGATGAAA ATTTATAA
 
Protein sequence
MVFKDFEYKR VNIEELKKTY TKIKEELENA ENAQKAIELY YKLDEINKEF STNYSLVYVR 
NSIDTTDEFY SKEKAFYDEN IPLISEFSND LGIAFANSKF RSELEEKFGK LAFQKIDLEL
KLFKNEIIED LQEENKLVTE YVKLTSSAKI LFDGEERNLS EMVPYTQDVD RNTRKEAIIA
VGKFFEDNMA EYDRIYDMLV KVRDRIAKKL GYKNFVEVGY LRMGRLDYNA EDVANYRKQI
KENVVPLYVE LRKRQEKRIK VDKLKYYDEG MAFLTGNPTP KGDRAWMVEK AQIMYKELSP
ETHEFFSKMV EQELLDLDSK KGKQGGGYCT SFDSYSMPFI FANFNGTAHD VEVLTHEAGH
AFQSYQAMRN VDISSYYWPT SESAEIHSMS MEFLTWPWME SFFKEDIDKF KYHHLSGAFL
FIPYGALVDE FQHFVYENPN VTPEERRMKW LELEKEYLPT RDYDGIESYL KGLFWFKQGH
IFEIPFYYID YTLAQVIALQ MWKLNGENSK LAWEKYMRLC TLGGSKTFLG LLEDVKLDNP
FENGSIAKII TPVKEFLSTI NDENL
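
The gene sequence starts with GTG while the protein starts with M, which is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), where GTG is an accepted alternative start codon. The following is a minimal sketch of that translation. The helper `translate_cds` and its `has_start` parameter are hypothetical names, and only the first and last sequence blocks from the record are used as input.

```python
from itertools import product

# Amino acids for table 11 in NCBI order (bases cycle T, C, A, G; third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons permitted by table 11; at position 1 they are read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str, has_start: bool = True) -> str:
    """Translate a DNA fragment in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and has_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_block = "GTGGTTTTTA AAGACTTTGA ATACAAAAGA GTTAATATTG AAGAGTTAAA AAAAACATAT"
last_block = "AATGATGAAA ATTTATAA"

print(translate_cds(first_block))                   # MVFKDFEYKRVNIEELKKTY
print(translate_cds(last_block, has_start=False))   # NDENL
```

The two outputs match the first 20 and last 5 residues of the protein sequence shown above. Passing the full gene sequence to the same function should reproduce the whole protein, assuming the record is internally consistent.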