Gene Smon_1041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmon_1041 
Symbol 
ID8600769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptobacillus moniliformis DSM 12112 
KingdomBacteria 
Replicon accessionNC_013515 
Strand
Start bp1129738 
End bp1131558 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content26% 
IMG OID 
ProductDNA mismatch repair protein MutL 
Protein accessionYP_003306381 
Protein GI269123804 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAAG TGATAAATAT GGCTCTTATA AAAATATTGG ATGATAGTAT TTCAAATATT 
ATTGCAGCTG GAGAAGTAGT AGAAAATCCA GCGAGTATGA TAAAAGAACT TCTTGAAAAT
TCATTAGATG CAGAGGCAAG CTCTATACAA ATAGAAGTAT TAAATGGTGG AATATATGTA
AAAATATCAG ATAATGGAAA AGGTATGAGT AGAGAAGATG TTATTTTAAG TATAGAAAGG
CATGCTACTT CTAAGATATC GACTAAAGAA GATATATTTA ATTTACAAAC ATATGGATTT
AGAGGAGAAG CTCTAGCTTC AATTGCAGCT GTTTCAAAAA TTTCAATTAG CAGTAAAACA
AAAGATGAAA AAATTGGAAC AATAGTTAAT GCATATGCTG GAGTAGTTAG AAAGTATGAA
AATTTTACTA GAACTACAGG AACAGAGATA GAAATAAGGG ATTTATTCTA TAACACACCA
GCTAGAAAAA AGTTTTTAAG AAAAGAAAGC ACAGAATATA GTAAGATTAA AGACATAGTT
CTTAAAGAGG CTCTTGCTAA TCCAAATGTA GCCATAACCC TATATATAGA TGGAAAAGCT
ACTTTAAAAA CCACTGGTAA TGGTATGGAA AATACAATTT TTGAACTTTT CGGTAAAAAT
GTTCTTAAGA ATTTAGAAAA ATTTGAATAT GGATATTTAG GAAATGTTGA AATTTTAAGG
TCATCAAAGG AATACATATA TACTTATGTT AATGGTAGAT ATGTTAAATC CAATTTACTA
GATAGAGCAG TTATAGATGC ATATTATACT AAATTAATGA AAGGTAAATA TCCTTTTGTA
ATATTAATGT ATGATGTAAA TCCTAAAGAA ATAGATGTTA ATGTACATCC AAGTAAGAAA
ATGATAAAAT TTTCTGATGA AAAGATAGTG TATAATGATA TTAAAAGATC TATTGATAAT
TTCTTTTATG AATTTGATAG AAGAACTTGG CAACCTACAT TAATACCAAA AACTAATGAA
GTAGTTGTAG ATAATACAGA AACAGAATAC ATACCTATAA ATATTTTTTC AAAAGATGAG
GTAAAAGAAG AAGTTATTCC TGAAACTTTA ACTTTTAAAG AAGAAAAAGA TTTAGAAGTA
AGAGAACCTA AATTAATATA TGAAAATCCT TTTTATGAAG AAGAGGAAGA AATTAAAGTA
GAAAATAAGG TGGAATACTT AAAACCATTT TTTGAGAAAA AAGAAGGGGA AGTAAAATAT
TATGAAGTAT TAGGTCAAAT ATTTGATACA TATATATTAG TTAATAGAGA TAATAATTTA
GAAATATATG ATCAACATAT TATACATGAA AGACTGCTTT ATGAAGAACT TATGTCTAGT
TTTGAGAAAA AGAATATTGG TTCTCAAATA CTTCTTTTAC CTGAATTAAT AGATGTTAGT
CCAGTTGATA AAGATATCAT ATTTAATAAT ATGGACACAT TTGAAAAACT TGGTTTTGAA
ATTGATGAAA TTTCAAATAA TCAAATAGCT TTAAGAGCAG TACCTAATTT TAACTTTAGA
GAAAGTATAA AAAATATTTT AGAGAATATA TTAGTTGATT TAAAGAGTAA AAATAAGGTT
GGAGATATTA GAGAAAAAAT AATAATATCT ATGTCATGTA GAGGAGCTAT TAAAGCGGGG
CAAAAATTAA ATATGCAGGA AATGCAAAAT ATGGTAAGAA GATTACATGA AGTAGGTAAG
TATACTTGCC CTCATGGAAG ACCTATTATA TCAAAAATAT CTAAGTATGA TTTAGATAAA
ATGTTTGGTC GTGTGAAATA A
 
Protein sequence
MKEVINMALI KILDDSISNI IAAGEVVENP ASMIKELLEN SLDAEASSIQ IEVLNGGIYV 
KISDNGKGMS REDVILSIER HATSKISTKE DIFNLQTYGF RGEALASIAA VSKISISSKT
KDEKIGTIVN AYAGVVRKYE NFTRTTGTEI EIRDLFYNTP ARKKFLRKES TEYSKIKDIV
LKEALANPNV AITLYIDGKA TLKTTGNGME NTIFELFGKN VLKNLEKFEY GYLGNVEILR
SSKEYIYTYV NGRYVKSNLL DRAVIDAYYT KLMKGKYPFV ILMYDVNPKE IDVNVHPSKK
MIKFSDEKIV YNDIKRSIDN FFYEFDRRTW QPTLIPKTNE VVVDNTETEY IPINIFSKDE
VKEEVIPETL TFKEEKDLEV REPKLIYENP FYEEEEEIKV ENKVEYLKPF FEKKEGEVKY
YEVLGQIFDT YILVNRDNNL EIYDQHIIHE RLLYEELMSS FEKKNIGSQI LLLPELIDVS
PVDKDIIFNN MDTFEKLGFE IDEISNNQIA LRAVPNFNFR ESIKNILENI LVDLKSKNKV
GDIREKIIIS MSCRGAIKAG QKLNMQEMQN MVRRLHEVGK YTCPHGRPII SKISKYDLDK
MFGRVK