Gene Mfla_1175 details


Gene Information

Locus tag: Mfla_1175
Symbol:
ID: 4001095
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacillus flagellatus KT
Kingdom: Bacteria
Replicon accession: NC_007947
Strand:
Start bp: 1218415
End bp: 1221357
Gene Length: 2943 bp
Protein Length: 980 aa
Translation table: 11
GC content: 64%
IMG OID: 637938076
Product: HsdR family type I site-specific deoxyribonuclease
Protein accession: YP_545284
Protein GI: 91775528
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
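
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the gene length includes a single untranslated stop codon; the page does not state either convention, so both are assumptions.

```python
# Values copied from the Gene Information fields above.
start_bp = 1218415
end_bp = 1221357

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 2943
print(gene_length % 3)  # 0
print(protein_length)   # 980
```

Under these assumptions the computed values agree with the listed Gene Length (2943 bp) and Protein Length (980 aa).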


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.770188
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTACGC TCAAGATCAG CGAAGCGGGC ACAGTGCAGT TCCCGATGGT GAAGCACGCG 
GTAGAGATCG GCTGGACGGC AATTACGCCG GATGATGCAC GCGCGAAGCG AGGCGGTGAG
GCAGGAGCTT TCTTCCGCGA CGTACTGGAA GCCAAACTCG CCGCGTTCAA CCCCTGGATG
TCCGCCGACG CCGTGCGCTC GGTGGTGGAA ACCCTGGACG CGCTGCCGGC CACCATCGAG
GGCAACCGCG AGCTGCTGGC CTGGCTGCGC GGCGAACGCC AGTGGTACGA CGAGACCGAG
AAGCGCCATC GACCAGTGAC GGTGATCGAC TTCGAGCACG TGGCGGATAA CGTCTTCCAT
GTGACCTGGG AGTGGAAGAT CAAGCCGCCC GCGCGCAAGG GCAACCGGGC CGACGTGATG
TTCGTCGTCA ACGGCGTGCC GGTGGTCATC GTCGAGCACA AGAACCCGAA GGACGGCGAC
GCCATCGAGC GCGCCATCAA GCAGCTGCGC CGCTACGAGC TCGAAACGCC GGAGCTGCTG
GCGACGGCCC AGTTGTTCAA CGTGACGCAC CTGCTCGATT ACTGGTACGG CGTGACCTGG
AACGCCAACC GGCGCGACAT GGCGCGCTGG AAACAGGCGC CGGAGGAAAC CTACCGCTTT
GCGGTGCAAG CCTTCTTCGA GCCGACCGAC TTCCTGCGCA CCCTGCGGCA CTGGATCTTG
TTCTACGTGC AGGACGGCGA GACGCGCAAG TCGGTGCTGC GCCAGCACCA GCGGCGCGCC
TGTGAGGCCA TCCTGAACCG CTGCGCCGAC CCGACGAAGA CACGTGGCCT CATCTGGCAC
ACCCAGGGCT CGGGCAAGAC CTTCACCCTG CTGACCGCCG CTCGCCTGAT CCTGGAGGAC
AAGGCGCGCT TCGCCAACGC AACGGTGGTG CTGGTGGTGG ACCGCACCGA GCTGGAAGGT
CAGTTGAAGG GCTGGGTCGA GCGCCTGCTC GGCGAGATGC AGAGCCAGGA CATCGCGGTC
CGGCGGGCCA ACAACAAGGC CGAACTTCAG TCCCTGCTGG ACGCCGACTT TCGCGGCCTG
ATTATCTCGA TGATCCACAA GTTCGAGGCC ATCCGCAAAG ACAGCTGCCT GCGCGACAAC
GTCTACGTGT TCATCGACGA AGCGCACCGG TCGGTCGCCA AGGACCTCGG CACCTATCTG
ATGGCGGCCG TGCCGAAGGC CACCATCATC GGTTTCACCG GCACGCCGAT TGCGCGCACG
TCGCAAGGCG AAGGTACGTT CAAGATCTTC GGCCGGGAGG ACGAACAGGG CTACCTGGAC
AAGTACTCGA TCAAGGAGTC CATCGAGGAC GAGACCACCC TGCCGATCAA ACACGTGATG
GCGCCCAGCG AGATGACGGT GCCGGCCGAA CGGCTGGACA AGGAGTTCTT CGCGCTGGCC
GAGGGCGAGG GCGTGACCGA TGTCGAGGAA CTGAACAAGG TGCTCGACCG CGCGGTCGGC
CTGCGCACCT TCCTCACGGC CGACGACCGG ATCGAGAAGG TTTCGGCCTT CATCGCCGAG
CACTTCAAGG AGAACGTGCT GCCCTTGGGC TACAAGGCCT TCGTGGTGGC GGTGAACCGC
GAGGCCTGTG CCAAGTACAA GAAGGCGCTG GACAAGCATC TGCCACCCGA GTGGAGCGCG
CCGGTCTACA CCGAGAACTC CGCCGATGTG GTGGATCGGC CGCTGGTGGC CGAGTTGCAG
CTCTCAGACG ACGCCGAAGA GCAGGTGCGC TTGCTGTTCA AGAAGCCCAC CGAGAACCCG
AAGATCCTGA TCGTCACTGA CAAGCTACTC ACGGGCTACG ACGCGCCGCC GCTTTACTGC
TTGTACCTCG ACAAGCCGAT GCGCGACCAT GTCCTGCTGC AGTCGATTGC ACGGGTGAAC
CGGCCTTATG TAGACGCGAA CGGTGTGCAG AAGCGCGTCG GCTTGGTGGT CGACTTTGTC
GGTGTGCTGC GCGAGCTAAA AAAGGCGCTG CAGTTCGATT CCAGCGACGT CAGCGGCGTG
ATTGAGGACC TCGACGTGCT GCTGCAGGAC TTCCTGCAAC GCATCGAGCA GGCCAAGAAG
GACTACCTGG AGTCAGACGC CGGCGGCACC CCCGACGAGC GGCTGGAGCG CCTGGTGTTC
GGACGCTTCC TGACGCCCGA GGCGCGTAAG ACCTTCTTCG AGAGCTACAA GGAGGTCGAG
GCGCTGTGGG AGATCCTCTC GCCCGACCCG GCGCTGCGTG ACCACATCGC GACCTACAAG
CAGCTCAGTC AGCTCTATGC GGCCGTGCGC AATGCTTACG CCGAAAAAGT CGGGTTCGTG
GCTGACCTGG CCTACAAGAC GCGGCGTCTG ATCGAGGAAA GCGCCGAGCA ACATGGTCTT
GGACGATTGA CCAAGACCGT GACCTTTGAT GTGGCGACCC TGAAGTCATT GCGCAGTGAG
GATGGGACCG ATGAGGGCAA AGTGTTCAAC CTTGTGCGCG GTCTGCAGCA CGAGATTGAC
GAGGACCCTG CCGCAGCACC GGTGCTGCAA CCGCTGAAAG ATCGTGCCGA GCGCATCCTG
AAGGATCTGG AAGAGCGCAA GACGACCGGT CTGGCGGCGA TGGACCAACT GGCGGCGTTG
GCTGCCGAGA AGGAAGCGGC CATGAAGGCG GCGCGCGACA GCGGCCTGTC TGCTCGTGCC
TTTGCAGTCG CCTGGGTACT GCGTGAGGAT GCGGCCGTCA AGGCAGCGGG CATCGACCCC
ATGACACTGG CCAAGGATGC CGAAGAGTTG CTCGGGCGTT TCCCGAATGC CTCGGTCAAC
GCCGACGAGC AGCGACGGCT CCGTGCGTCG CTCTACAAGC CCCTCCTCGC GCTGGCGCAG
GACGAGCGGG CACGGGTCGT CGATCTCGTT GTGCGTTTGC TGCTCACGGA GGGTGGCGAA
TGA
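
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs cleaning before its length or GC content can be compared with the Gene Information fields. The following is a minimal sketch; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and it is an assumption that the listed GC content refers to this gene rather than to the whole replicon.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two rows of the gene sequence above, used as a short demonstration.
sample = """
ATGAGTACGC TCAAGATCAG CGAAGCGGGC ACAGTGCAGT TCCCGATGGT GAAGCACGCG
GTAGAGATCG GCTGGACGGC AATTACGCCG GATGATGCAC GCGCGAAGCG AGGCGGTGAG
"""
seq = clean_sequence(sample)
print(len(seq))                    # 120
print(round(gc_fraction(seq), 2))  # GC fraction of the two sample rows
```

Pasting the full gene sequence into `sample` would allow the resulting length to be compared with the listed 2943 bp and the GC fraction with the listed 64%.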
 
Protein sequence
MSTLKISEAG TVQFPMVKHA VEIGWTAITP DDARAKRGGE AGAFFRDVLE AKLAAFNPWM 
SADAVRSVVE TLDALPATIE GNRELLAWLR GERQWYDETE KRHRPVTVID FEHVADNVFH
VTWEWKIKPP ARKGNRADVM FVVNGVPVVI VEHKNPKDGD AIERAIKQLR RYELETPELL
ATAQLFNVTH LLDYWYGVTW NANRRDMARW KQAPEETYRF AVQAFFEPTD FLRTLRHWIL
FYVQDGETRK SVLRQHQRRA CEAILNRCAD PTKTRGLIWH TQGSGKTFTL LTAARLILED
KARFANATVV LVVDRTELEG QLKGWVERLL GEMQSQDIAV RRANNKAELQ SLLDADFRGL
IISMIHKFEA IRKDSCLRDN VYVFIDEAHR SVAKDLGTYL MAAVPKATII GFTGTPIART
SQGEGTFKIF GREDEQGYLD KYSIKESIED ETTLPIKHVM APSEMTVPAE RLDKEFFALA
EGEGVTDVEE LNKVLDRAVG LRTFLTADDR IEKVSAFIAE HFKENVLPLG YKAFVVAVNR
EACAKYKKAL DKHLPPEWSA PVYTENSADV VDRPLVAELQ LSDDAEEQVR LLFKKPTENP
KILIVTDKLL TGYDAPPLYC LYLDKPMRDH VLLQSIARVN RPYVDANGVQ KRVGLVVDFV
GVLRELKKAL QFDSSDVSGV IEDLDVLLQD FLQRIEQAKK DYLESDAGGT PDERLERLVF
GRFLTPEARK TFFESYKEVE ALWEILSPDP ALRDHIATYK QLSQLYAAVR NAYAEKVGFV
ADLAYKTRRL IEESAEQHGL GRLTKTVTFD VATLKSLRSE DGTDEGKVFN LVRGLQHEID
EDPAAAPVLQ PLKDRAERIL KDLEERKTTG LAAMDQLAAL AAEKEAAMKA ARDSGLSARA
FAVAWVLRED AAVKAAGIDP MTLAKDAEEL LGRFPNASVN ADEQRRLRAS LYKPLLALAQ
DERARVVDLV VRLLLTEGGE
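
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is illustrative only and is not the database's own pipeline. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons; the function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code listed in TCAG order. NCBI translation table 11 uses
# the same codon-to-amino-acid assignments; only the start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGAGTACGC TCAAGATCAG CGAAGCGGGC"))  # MSTLKISEAG
```

The output matches the first block of the protein sequence. This gene begins with ATG, so the alternative start codons of table 11 (which are read as methionine at the initiation position) do not need special handling here.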