Gene Smon_0117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmon_0117 
Symbol 
ID8599815 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptobacillus moniliformis DSM 12112 
KingdomBacteria 
Replicon accessionNC_013515 
Strand
Start bp122992 
End bp124848 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content24% 
IMG OID 
ProductHyaluronate lyase 
Protein accessionYP_003305487 
Protein GI269122910 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATGAGT TATTATTAAA AAGAAGAGAG TATTTAATAG GAAATTTTAA GGATTTACCA 
TTAAATAAGA GAAAACAAAT AGAAGAAATA CAAGAGAAAA ATATAGAAAA ATTAGAATAT
TTAGAAAATT TAAACATAGC TGAGGTAAAG TTAAAATACA ATAATATATT AGAACTTGCT
AAGGCATATA ATCAGGTTGG AAATGTTAGA TATAGGGATG AAAAAATTAA AGTCATTATA
CTTAAAACAT TAAAATTATT AAGAATAACT TACTATAACC TATCATCAGT AGAGAAAGTA
AATTGGTGGC AATGGGAAAT AGGAATACCT TTATTACTAA ATGATATATT TATATTAATG
AATGAAAAAG ATTTTGATTT TGAAAAAGAA GAAAATTTAA AAACTAGTAT ATATTTTCAA
AAAGATCCAA GGTATTCAGG TAATAATCCT GTGGCGACAC ACCCAAGTAA AAAGCCTTTT
AGAATATCTA CGGGTGGAAA TAGAGTTGAT ACAGTTAAAG TATCATTATT TCGATCCATA
TTATTAAATA ATGAAGAGGA ATTGAAACTT GCTCTTAATT CACTTCCAGA AGTTTGGAAA
TGTAGAGAAA AAATAAATAG GATAGAAACC GATACACAAA GAGATGGCTT CTATAATGAC
GGCTCATTTA TTCAACATGG AAGTTTAGCT TACAATGGAA CATATGGTAA TGTCTTATTA
CAAGGTATAG GAGAAATACT TTATGTTATA GGGGATAGTA AATATTTAAA ATATCTTGGA
GATATATATA GCTTGAAAGA TATAATACTT AATAGCTATA AGCCATTTAT GTATAAAGGT
TCATTCCCTG ATATGTTAAA TGGTAGAGCT ATTACAAGAG AAAATTCATC TGATAAAACT
ATAGGGCATA TGTTATTAAA TTCTATAATG CTAATATCAT GTGGTTTAAA TGATGAAGAA
TTAAAAAATT TAGTTGCAAG TGAAATATTA AAATATGAGG ATTATTCATA TTTTGATAAA
GAACTTTCAC CTTTTATGTA TGATTTAGTT AAAAAAAATA TACATAATAG GAAAAAAGAA
GAATATGGAA AGATAATAAA AGTCAGTAAT ATTATGAATA GGGTCTTTAT TAAAGATGAC
AAAAAGGCTA TAGCTATTGC AGGTCATAGT GAAAATATAT CAAATTATGA AAGCATTAAT
GGTGAAAATA CAAAAGGTTG GTATACAGGA GATGGGATGA TATATCTCTA CACTAGTGAT
GTAACATATA CCAATTATTG GAATAATTCC GACACGCGAT ATATGTCAGG AACTACAGAA
GTTTATGAAG ATTTAAATGG TATAAATACA TCACAGATTT TAAATGTGAA TATGAGTAGT
GCCAAGATAG TTAAAGCCAT AGAAAAAGAT AATAAGATGA TATTTTTTAT GGAATTTGAA
AATCATAATA AGAGTTTAAA AATGTATAAA TCATATGTAT ATACAGGTAA GAAACTTATT
TGTTTAAACA CAAATATTGA TACAAAAGAA AAGATATATA CAACAATTGA CAATAGGCTA
TATAAAGAAA AACCTAAAAT TGTAATGGAA GATAAAAGGA TATTAATTAA TGATTTAATA
TTTAATATAA TTACAGATCA TAAATTTAAT TTTGATATAA AAGAAAGTGA ATTTGGATAT
TTTGTAAAAA TATGGATAGA ACATAAATAT AATGAAAATT TGTATTATGA AATAATATTT
GAATATGATG ATAAAACATC ACTAATAGAG GATAATAAAG AAAATATAAT AATAAGAAAT
GGTAATGAAA AATATTTAAT AAATACAAAA GAGAAAGAGG TGTTGAGATT TGAATAA
 
Protein sequence
MYELLLKRRE YLIGNFKDLP LNKRKQIEEI QEKNIEKLEY LENLNIAEVK LKYNNILELA 
KAYNQVGNVR YRDEKIKVII LKTLKLLRIT YYNLSSVEKV NWWQWEIGIP LLLNDIFILM
NEKDFDFEKE ENLKTSIYFQ KDPRYSGNNP VATHPSKKPF RISTGGNRVD TVKVSLFRSI
LLNNEEELKL ALNSLPEVWK CREKINRIET DTQRDGFYND GSFIQHGSLA YNGTYGNVLL
QGIGEILYVI GDSKYLKYLG DIYSLKDIIL NSYKPFMYKG SFPDMLNGRA ITRENSSDKT
IGHMLLNSIM LISCGLNDEE LKNLVASEIL KYEDYSYFDK ELSPFMYDLV KKNIHNRKKE
EYGKIIKVSN IMNRVFIKDD KKAIAIAGHS ENISNYESIN GENTKGWYTG DGMIYLYTSD
VTYTNYWNNS DTRYMSGTTE VYEDLNGINT SQILNVNMSS AKIVKAIEKD NKMIFFMEFE
NHNKSLKMYK SYVYTGKKLI CLNTNIDTKE KIYTTIDNRL YKEKPKIVME DKRILINDLI
FNIITDHKFN FDIKESEFGY FVKIWIEHKY NENLYYEIIF EYDDKTSLIE DNKENIIIRN
GNEKYLINTK EKEVLRFE