Gene Mfla_0834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMfla_0834 
Symbol 
ID4000287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacillus flagellatus KT 
KingdomBacteria 
Replicon accessionNC_007947 
Strand
Start bp876854 
End bp879508 
Gene Length2655 bp 
Protein Length884 aa 
Translation table11 
GC content55% 
IMG OID637937734 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_544943 
Protein GI91775187 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.63588 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCATGGAG AGCATGATCG TGGCGGCGTT ATAATCAAGG TTTGTGATGA TGTGCTGGCG 
CTGGCCGGCA AGCATGAACG GGCAATCAAT TTGACTAAGT TAGACAATAC CTTGGAACAA
CATACCCCGA TGATGCGCCA GTATCTTGGT ATCAAGGCGC AATATCCCGA TATGCTGGTG
TTTTATCGCA TGGGGGATTT CTACGAGTTA TTCCATGATG ATGCGGAAAA AGCATCGCGC
CTGCTCGGCA TCACGCTGAC CAAGCGTGGC AGTTCCAACG GCGAGCCTAT CCGCATGGCA
GGTGTGCCTT ATCATGCGGC GGAGCAGTAT TTAGCCAAGC TTGCCAAGCT GGGGGAGGCG
GTTGCCATCT GCGAGCAGGT GGGCGACCCT GCCAAGAGCA AGGGGCCCGT GGAGCGCCAG
GTCACACGTA TTCTGACCCC GGGGACGTTG ACCGACGCGG CCTTGCTGGA CGATACGCGT
GACAACCTGC TGCTTGCCAT TGCACATGGC GAAGGTGTAT TAGGGTTGGC ACGCATCAAT
CTTGCTTCAG GCCGGTTCAT TCTGAGTGAA ATAACGCCCG GACTATTGGC GCAGGAGTTG
GAGCGCATCA GTCCGGCTGA AATTCTTTAT CCCGATGATT TTTACCATAT GGCGCTGGAG
CAGGTGAAAT GCCCAAAGAA GCGTCTGGCG CCTTGGCAGT TCGACCTGGA TTCATCCATC
CAGACCTTGA CCAAGCAGTT CAGCACCTAC GATCTGGATG GCTTCGGCTG TGCTCATATG
CTGGCCGCCA TCATGGCGGC TGGCGCCTTG CTTGATTATG TCAAGCATAC GCAGCGTACG
AGCCTGCCAC ATATCCAGTC CCTGATGGTT GAGCAGGGAA GCCAGTTCAT CCAGCTGGAT
GCTGCGACAC GCCGCAACCT GGAAATCGAC CAGACCTTGC GTGGTGAATC TTCGCCCACG
TTGTACTCTT TATTGAATAC CACCGTCACT GCCATGGGTG CGCGTTTGCT GCGCTCTTGG
CTGCATCATC CTTTGCAGCA TCAAGCAGAT ATTCAGGCAC GCCTGCAGGC GGTGAAGGTT
TTGCAGGCGC AATATGATGG ATTGCGCCCC TTATTGCGCA ATGTCGGGGA CATCGAGCGC
ATGGCAGCAA GGGTGGCTTT GAAAACGGCG CGACCCCGGG ATTTGTCCGG TTTGCGCGAC
AGTCTGCAAC AGCTTCCCGC ACTGCAACGA GTGTTGCGGC CTGAAGATTC CGCACTGTTG
CACTCCTTGC AACAACAACT GGATGTGCCA CAAGCAGCGC TTGATATGCT GATTGCTGCG
ATCAAGGATG AGCCGGCTGC TGTGCTGCGC GAAGGCGGCG TGATCGCCGA TGGTTTTGAT
GCCGAGCTGG ATGAGTTACG CGCCATACAG AGCAATTGCG GGGAGTTTCT CCTTCAGTTT
GAAGCGCAGG AGCGCGAGCG CAGTGGTATC AGCAACCTCA AGGTAGAGTA CAACAGCGTG
CATGGCTTTT ACATCGAAAT CAGCCGTGCG CAATCTGAGA ATGTACCGGC CGAATACCGC
CGTCGCCAGA CGTTGAAGAA TGTCGAGCGC TATATCACCC CTGAGCTGAA AACCTTCGAG
GATAAAGTAC TGTCCGCCAA TGAGCGTGCC TTGGCGCGTG AAAAGTTCTT GTTCGATGAA
TTGTTAGGGA ATTTGCAACC CGCATTGGCT GCCTGGCAAC GCAATGCAGA GGCTGTGGCG
CAGCTGGATG TGCTGGCAAC GTTTGCCGAG CGCGCAGATG TTCTGAAGTA TGTTGCGCCT
CAGTTCAGCA GCGAAGCCGG ATTGGATATC GTGGATGGTC GCCATCCCGT GGTGGAGCAG
CTTGCCCAAC CTTTCATTGC CAATAGTGTT TCATTATCGC CTTACCGTCA GCTTTTGCTG
ATTACTGGTC CGAATATGGG CGGTAAGTCA ACCTATATGC GGCAGACGGC TTTGATCGTG
CTGCTTGCAC ACTGTGGATG CTTTGTTCCG GCGAAATCGG CGAGGATAGG ACCTATAGAC
CGTATTTTCA CTCGCATCGG CGCATCAGAC GATTTGGCCG GGGGGCGCTC TACTTTTATG
GTGGAAATGA CCGAGACAGC GAACATCCTG CACAATGCCA CTGAGCGCAG CCTGGTGCTG
CTGGATGAGA TTGGACGCGG TACCTCGACA TTTGACGGGT TATCCCTGGC ATGGGCAGTG
GCCCGACAGT TGCTGGAGAA GAACCGTAGC TATACCTTGT TTGCCACCCA TTACTTCGAG
CTAACCAGAA TCAGCGAAGA GTTCAAGCAT GCAGCCAATG TGCATCTGGA TGCGGTTGAG
CACGGTGATG GGATTGTGTT CCTGCATAAT GTGGAGGAAG GGCCAGCAAG TCAGAGCTAT
GGTTTGCAGG TGGCCCAGTT GGCAGGCATT CCACGCACTG TGGTGAATGC CGCCAAGCGC
AAACTGGTGC AGCTTGAGCA AAGCCAGGTC ATGCAACAAT CATTGGCGGG GCAGGGCGAT
ATGTTTGTTG CTGCCAATGT CGAGCCAGAG CCCGCCACAC ATCCGGTGGT ATCCGAGTTG
GAAAGCATCG ATCCTGACAG CCTCACCCCC AGGCAGGCAC TGGATGTTTT ATACAAATTG
AAAAAATTAA TATAA
 
Protein sequence
MHGEHDRGGV IIKVCDDVLA LAGKHERAIN LTKLDNTLEQ HTPMMRQYLG IKAQYPDMLV 
FYRMGDFYEL FHDDAEKASR LLGITLTKRG SSNGEPIRMA GVPYHAAEQY LAKLAKLGEA
VAICEQVGDP AKSKGPVERQ VTRILTPGTL TDAALLDDTR DNLLLAIAHG EGVLGLARIN
LASGRFILSE ITPGLLAQEL ERISPAEILY PDDFYHMALE QVKCPKKRLA PWQFDLDSSI
QTLTKQFSTY DLDGFGCAHM LAAIMAAGAL LDYVKHTQRT SLPHIQSLMV EQGSQFIQLD
AATRRNLEID QTLRGESSPT LYSLLNTTVT AMGARLLRSW LHHPLQHQAD IQARLQAVKV
LQAQYDGLRP LLRNVGDIER MAARVALKTA RPRDLSGLRD SLQQLPALQR VLRPEDSALL
HSLQQQLDVP QAALDMLIAA IKDEPAAVLR EGGVIADGFD AELDELRAIQ SNCGEFLLQF
EAQERERSGI SNLKVEYNSV HGFYIEISRA QSENVPAEYR RRQTLKNVER YITPELKTFE
DKVLSANERA LAREKFLFDE LLGNLQPALA AWQRNAEAVA QLDVLATFAE RADVLKYVAP
QFSSEAGLDI VDGRHPVVEQ LAQPFIANSV SLSPYRQLLL ITGPNMGGKS TYMRQTALIV
LLAHCGCFVP AKSARIGPID RIFTRIGASD DLAGGRSTFM VEMTETANIL HNATERSLVL
LDEIGRGTST FDGLSLAWAV ARQLLEKNRS YTLFATHYFE LTRISEEFKH AANVHLDAVE
HGDGIVFLHN VEEGPASQSY GLQVAQLAGI PRTVVNAAKR KLVQLEQSQV MQQSLAGQGD
MFVAANVEPE PATHPVVSEL ESIDPDSLTP RQALDVLYKL KKLI