Gene Information |
Locus tag | Mfla_1175 |
Symbol | |
ID | 4001095 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacillus flagellatus KT |
Kingdom | Bacteria |
Replicon accession | NC_007947 |
Strand | - |
Start bp | 1218415 |
End bp | 1221357 |
Gene Length | 2943 bp |
Protein Length | 980 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637938076 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_545284 |
Protein GI | 91775528 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
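The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon; the record's numbers are consistent with both assumptions, but the source does not state them. The function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Mfla_1175.
length = gene_length(1218415, 1221357)
print(length)                  # 2943
print(protein_length(length))  # 980
```

Both results match the Gene Length (2943 bp) and Protein Length (980 aa) fields of the record.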
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.770188 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGTACGC TCAAGATCAG CGAAGCGGGC ACAGTGCAGT TCCCGATGGT GAAGCACGCG GTAGAGATCG GCTGGACGGC AATTACGCCG GATGATGCAC GCGCGAAGCG AGGCGGTGAG GCAGGAGCTT TCTTCCGCGA CGTACTGGAA GCCAAACTCG CCGCGTTCAA CCCCTGGATG TCCGCCGACG CCGTGCGCTC GGTGGTGGAA ACCCTGGACG CGCTGCCGGC CACCATCGAG GGCAACCGCG AGCTGCTGGC CTGGCTGCGC GGCGAACGCC AGTGGTACGA CGAGACCGAG AAGCGCCATC GACCAGTGAC GGTGATCGAC TTCGAGCACG TGGCGGATAA CGTCTTCCAT GTGACCTGGG AGTGGAAGAT CAAGCCGCCC GCGCGCAAGG GCAACCGGGC CGACGTGATG TTCGTCGTCA ACGGCGTGCC GGTGGTCATC GTCGAGCACA AGAACCCGAA GGACGGCGAC GCCATCGAGC GCGCCATCAA GCAGCTGCGC CGCTACGAGC TCGAAACGCC GGAGCTGCTG GCGACGGCCC AGTTGTTCAA CGTGACGCAC CTGCTCGATT ACTGGTACGG CGTGACCTGG AACGCCAACC GGCGCGACAT GGCGCGCTGG AAACAGGCGC CGGAGGAAAC CTACCGCTTT GCGGTGCAAG CCTTCTTCGA GCCGACCGAC TTCCTGCGCA CCCTGCGGCA CTGGATCTTG TTCTACGTGC AGGACGGCGA GACGCGCAAG TCGGTGCTGC GCCAGCACCA GCGGCGCGCC TGTGAGGCCA TCCTGAACCG CTGCGCCGAC CCGACGAAGA CACGTGGCCT CATCTGGCAC ACCCAGGGCT CGGGCAAGAC CTTCACCCTG CTGACCGCCG CTCGCCTGAT CCTGGAGGAC AAGGCGCGCT TCGCCAACGC AACGGTGGTG CTGGTGGTGG ACCGCACCGA GCTGGAAGGT CAGTTGAAGG GCTGGGTCGA GCGCCTGCTC GGCGAGATGC AGAGCCAGGA CATCGCGGTC CGGCGGGCCA ACAACAAGGC CGAACTTCAG TCCCTGCTGG ACGCCGACTT TCGCGGCCTG ATTATCTCGA TGATCCACAA GTTCGAGGCC ATCCGCAAAG ACAGCTGCCT GCGCGACAAC GTCTACGTGT TCATCGACGA AGCGCACCGG TCGGTCGCCA AGGACCTCGG CACCTATCTG ATGGCGGCCG TGCCGAAGGC CACCATCATC GGTTTCACCG GCACGCCGAT TGCGCGCACG TCGCAAGGCG AAGGTACGTT CAAGATCTTC GGCCGGGAGG ACGAACAGGG CTACCTGGAC AAGTACTCGA TCAAGGAGTC CATCGAGGAC GAGACCACCC TGCCGATCAA ACACGTGATG GCGCCCAGCG AGATGACGGT GCCGGCCGAA CGGCTGGACA AGGAGTTCTT CGCGCTGGCC GAGGGCGAGG GCGTGACCGA TGTCGAGGAA CTGAACAAGG TGCTCGACCG CGCGGTCGGC CTGCGCACCT TCCTCACGGC CGACGACCGG ATCGAGAAGG TTTCGGCCTT CATCGCCGAG CACTTCAAGG AGAACGTGCT GCCCTTGGGC TACAAGGCCT TCGTGGTGGC GGTGAACCGC GAGGCCTGTG CCAAGTACAA GAAGGCGCTG GACAAGCATC TGCCACCCGA GTGGAGCGCG CCGGTCTACA CCGAGAACTC CGCCGATGTG GTGGATCGGC CGCTGGTGGC CGAGTTGCAG CTCTCAGACG ACGCCGAAGA GCAGGTGCGC TTGCTGTTCA AGAAGCCCAC CGAGAACCCG 
AAGATCCTGA TCGTCACTGA CAAGCTACTC ACGGGCTACG ACGCGCCGCC GCTTTACTGC TTGTACCTCG ACAAGCCGAT GCGCGACCAT GTCCTGCTGC AGTCGATTGC ACGGGTGAAC CGGCCTTATG TAGACGCGAA CGGTGTGCAG AAGCGCGTCG GCTTGGTGGT CGACTTTGTC GGTGTGCTGC GCGAGCTAAA AAAGGCGCTG CAGTTCGATT CCAGCGACGT CAGCGGCGTG ATTGAGGACC TCGACGTGCT GCTGCAGGAC TTCCTGCAAC GCATCGAGCA GGCCAAGAAG GACTACCTGG AGTCAGACGC CGGCGGCACC CCCGACGAGC GGCTGGAGCG CCTGGTGTTC GGACGCTTCC TGACGCCCGA GGCGCGTAAG ACCTTCTTCG AGAGCTACAA GGAGGTCGAG GCGCTGTGGG AGATCCTCTC GCCCGACCCG GCGCTGCGTG ACCACATCGC GACCTACAAG CAGCTCAGTC AGCTCTATGC GGCCGTGCGC AATGCTTACG CCGAAAAAGT CGGGTTCGTG GCTGACCTGG CCTACAAGAC GCGGCGTCTG ATCGAGGAAA GCGCCGAGCA ACATGGTCTT GGACGATTGA CCAAGACCGT GACCTTTGAT GTGGCGACCC TGAAGTCATT GCGCAGTGAG GATGGGACCG ATGAGGGCAA AGTGTTCAAC CTTGTGCGCG GTCTGCAGCA CGAGATTGAC GAGGACCCTG CCGCAGCACC GGTGCTGCAA CCGCTGAAAG ATCGTGCCGA GCGCATCCTG AAGGATCTGG AAGAGCGCAA GACGACCGGT CTGGCGGCGA TGGACCAACT GGCGGCGTTG GCTGCCGAGA AGGAAGCGGC CATGAAGGCG GCGCGCGACA GCGGCCTGTC TGCTCGTGCC TTTGCAGTCG CCTGGGTACT GCGTGAGGAT GCGGCCGTCA AGGCAGCGGG CATCGACCCC ATGACACTGG CCAAGGATGC CGAAGAGTTG CTCGGGCGTT TCCCGAATGC CTCGGTCAAC GCCGACGAGC AGCGACGGCT CCGTGCGTCG CTCTACAAGC CCCTCCTCGC GCTGGCGCAG GACGAGCGGG CACGGGTCGT CGATCTCGTT GTGCGTTTGC TGCTCACGGA GGGTGGCGAA TGA
Protein sequence | MSTLKISEAG TVQFPMVKHA VEIGWTAITP DDARAKRGGE AGAFFRDVLE AKLAAFNPWM SADAVRSVVE TLDALPATIE GNRELLAWLR GERQWYDETE KRHRPVTVID FEHVADNVFH VTWEWKIKPP ARKGNRADVM FVVNGVPVVI VEHKNPKDGD AIERAIKQLR RYELETPELL ATAQLFNVTH LLDYWYGVTW NANRRDMARW KQAPEETYRF AVQAFFEPTD FLRTLRHWIL FYVQDGETRK SVLRQHQRRA CEAILNRCAD PTKTRGLIWH TQGSGKTFTL LTAARLILED KARFANATVV LVVDRTELEG QLKGWVERLL GEMQSQDIAV RRANNKAELQ SLLDADFRGL IISMIHKFEA IRKDSCLRDN VYVFIDEAHR SVAKDLGTYL MAAVPKATII GFTGTPIART SQGEGTFKIF GREDEQGYLD KYSIKESIED ETTLPIKHVM APSEMTVPAE RLDKEFFALA EGEGVTDVEE LNKVLDRAVG LRTFLTADDR IEKVSAFIAE HFKENVLPLG YKAFVVAVNR EACAKYKKAL DKHLPPEWSA PVYTENSADV VDRPLVAELQ LSDDAEEQVR LLFKKPTENP KILIVTDKLL TGYDAPPLYC LYLDKPMRDH VLLQSIARVN RPYVDANGVQ KRVGLVVDFV GVLRELKKAL QFDSSDVSGV IEDLDVLLQD FLQRIEQAKK DYLESDAGGT PDERLERLVF GRFLTPEARK TFFESYKEVE ALWEILSPDP ALRDHIATYK QLSQLYAAVR NAYAEKVGFV ADLAYKTRRL IEESAEQHGL GRLTKTVTFD VATLKSLRSE DGTDEGKVFN LVRGLQHEID EDPAAAPVLQ PLKDRAERIL KDLEERKTTG LAAMDQLAAL AAEKEAAMKA ARDSGLSARA FAVAWVLRED AAVKAAGIDP MTLAKDAEEL LGRFPNASVN ADEQRRLRAS LYKPLLALAQ DERARVVDLV VRLLLTEGGE
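The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is listed in coding orientation (it starts with ATG and ends with TGA even though the gene lies on the minus strand) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in its permitted start codons. The helper names `translate` and `gc_fraction` are hypothetical, and only the first 30 bases of the record are used rather than the full sequence.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 shares these
# codon-to-amino-acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used for 10-base grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as grouped in the record.
prefix = "ATGAGTACGC TCAAGATCAG CGAAGCGGGC"
print(translate(prefix))  # MSTLKISEAG
```

The translated prefix matches the first ten residues of the protein sequence. Applying `gc_fraction` to the complete gene sequence should give a value comparable to the GC content field, although that has not been computed here.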