Gene Mfla_1215 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMfla_1215 
Symbol 
ID4000172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacillus flagellatus KT 
KingdomBacteria 
Replicon accessionNC_007947 
Strand
Start bp1274381 
End bp1277353 
Gene Length2973 bp 
Protein Length990 aa 
Translation table11 
GC content54% 
IMG OID637938119 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_545324 
Protein GI91775568 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATCA CTGAAAGCCA GATTGAGCAG AATCTAATTA ACAAGCTAGC CGATCTCAAA 
TATACCTTGC GTCCAGATAT CCGAGACCGT GCGACCTTGG AAGCTAACTT CCGAACCAAG
TTCGAAGCGC TTAATCGCGT ACGCTTGACT GACAATGAAT TTCAACGCCT GCTTGACAGC
ATCATCACGC CCGACGTTTA TAGCACTGCT CAAACATTGC GCAACATCAA CAGCTTCGAG
CGCGACGATG GCACACCGCT GAACTACACT CTGGTCAACA TCCGGGACTG GTGCAAGAAC
GACTTCGAGG TTGTCAACCA GCTACGCATG AACACTGACA ACAGTCATCA CCGCTACGAC
GTGATGCTGC TGATCAATGG CGTGCCCGTG GTGCAAATTG AGCTGAAGAC GCTGGCCGTT
AGCCCACGCC GCGCCATGCA GCAGATCGTT GATTACAAGG CTGATCCCGG CAATGGCTAC
GGCAAAACGC TGCTGTGCTT CCTGCAGCTC TTCATTGTCA GCAACCGCAC GGATACCTGG
TACTTCGCCA ACAACAACTC ACGTCACTTC AGCTTCAACG CGGATGAGCG TTTTCTGCCG
GTTTACCAGT TCGCCAGCGA AGACAACAAG AAGATCACCC AGCTAGATAG CTTTGCCGAG
AAGTTCCTGG CCAAGTGCAC CTTGGGCCAG ATGATCAGCC GCTACATGGT ACTGGTGGCT
AGCGAGCAGA AGCTGCTCAT GATGCGGCCT TACCAAATCT ATGCCGTCAA AGCCATTGTG
GAGTGCATCC ACCAAAATTG CGGCAACGGC TACATCTGGC ATACCACCGG CTCAGGCAAG
ACGCTGACTT CCTTCAAAGC GTCCACCCTG CTCAAGGACA ACCCGGACAT CGACAAGTGT
CTGTTTGTGG TAGATCGCAA AGACCTGGAT CGGCAGACCC GCGAAGAATT CAACCGCTTC
CAGGAAGGCT GCGTCGAAGA GAACACCAAC ACCGAGACCC TGGTGCGCCG CCTGCTGTCG
GATGATTATG CCGACAAGGT CATCGTCACC ACCATTCAGA AGCTGGGTCT GGCGCTGGAT
GGCGCCAACA AGCGCAACTA CAAAGAACGG CTGGAACCGC TACGCAACCA ACGCATGGTG
TTCATCTTTG ATGAGTGCCA CCGCTCGCAA TTTGGTGACA ACCACAAAGC CATCAAGGAG
TTCTTCCCCA ACGCCCAGCT TTTTGGCTTC ACCGGCACAC CTATCTTCGA GAAAAACGCC
AGCTACCAGC AAATCGAAGG CCAACAGGCC AGCTATCGAA CCACCGACGA TTTGTTCCAG
CGCTGCCTGC ACCAGTACAC CATCACCCAC GCCATTGAAG ATCGCAACGT ACTGCGCTTC
CACGTGGACT ACTTCAAGCC TGAAGGGAAA AAACCGCCCA AGCCCGGCGA AGGCATCGCC
AAAGCCAAGG TCATCGAAAC CATTCTTGCC AAGCATGACA CCTCCACCAA TGGCCGCAAG
TTCAATGCCG TGTTGGCAAC CGCCAGCATC AATGACGCCA TCGAATACTT CGAGCTGTTC
GCGGAAATTC AGCAGCAAAA AGCCGAGCAA GATCCGGAGT TCCGCCCGCT GAACATTGCC
TGCGTATTTT CTCCGCCTGC AGAGGGCAAC AAAGACGTAC AGCAGATTCA GGAAGACCTG
CCGCAAGAGC AGGAAGATAA CAAAAAAGAC CCAGAGGCCA AAAAGGCAGC GCTTACACGC
ATCATCGCTG ATTACAATAC CCGCTTCGGT ACCAATCACC GCATCAGCGA TTTCGATCTG
TATTATCAGG ATGTGCAAAA GCGCATCAAG GATCAGCAGT ATCCCAACAG CGATCTTCCC
GCCGCGCAAA AGATCGACAT CACCATCGTG GTGGACATGC TACTCACCGG GTTTGACTCC
AAGTACCTCA ACACCCTGTA CGTGGATAAG AACCTCAAGC ATCACGGCTT GATCCAGGCA
TTTTCACGCA CCAATCGCGT ACTGAACGAC AGCAAGCCTT ACGGAAATAT TCTCGATTTT
CGCCAGCAGC AAAGCGCGGT GGAAGAGGCC ATTGCCCTGT TTTCCGGCGA ACGGATCGAC
AACCCCCGGG AAATCTGGCT GGTGGAATCC GCGGCAGAGG TTATTCGCAA ATACGAAGCC
GCTGTGGCGG GCATGTCGGA CTTCATGGCA GACAAGAACC TCGTTTGCGA ACCCGAAGCG
GTCTACAACC TCAAGGGCGA TACGGCGCGT ATCGAGTTCA TCAACCGTTT CAAGGAAGTG
CAACGGCTGA AAACCCAACT TGACCAGTAC ACCGATCTGG CACCCGAACA GAAAACCCGC
ATTGACACCA TCCTGCCACC CGACCAGTTG CAGAGCTTCC GCAGTACCTA CCTGGAAACG
GCCAAGCAGC TGAAAGAAAT TCAGAGCAAG GAAGGTGATC AAGCTCCGCC GGAAATACAA
CAACTGGACT TTGAGTTCGT GCTTTTTGCG TCGGCAGTGA TCGACTACGA CTACATCATG
GGCCTGATTT CACGCATGAC GCAGCAAGGC CCCTCGAAAC TGAAGATGAA CCGTGAGCAG
TTGATCAGCC TGATCCAGTC CGATGCCAAG TTTATTGACG AGCGCGAGGA TATTGCCGAG
TACATCCGCA GCCTGCCAGC CAATGAGGCG CTGGATGAAA AGCAAATCCG GGCGGGCTAC
AACCGCTTCA AGGATGAAAA GAAGGCAAGA GAGCTGACTG ACATTGCCAG CCGTCATGGG
CTGGAACCCG ATGCCTTGCA AGATTTTGTC GATGAAATCC TGCGTCGTTG CATTTTTGAT
GGCGAGCGCC TTTCCGAGCT AATGGCACCG TTGAACCTGG GCTGGAAAGC ACGCACCCAG
AAGGAGCTGG CACTGATGGA AGAGCTGGCA CCGCTGCTGC ACAAACTTGC TCAGGGGCGT
GAGATTTCCG GGCTCAAGGC CTACGAGGAA TAA
 
Protein sequence
MNITESQIEQ NLINKLADLK YTLRPDIRDR ATLEANFRTK FEALNRVRLT DNEFQRLLDS 
IITPDVYSTA QTLRNINSFE RDDGTPLNYT LVNIRDWCKN DFEVVNQLRM NTDNSHHRYD
VMLLINGVPV VQIELKTLAV SPRRAMQQIV DYKADPGNGY GKTLLCFLQL FIVSNRTDTW
YFANNNSRHF SFNADERFLP VYQFASEDNK KITQLDSFAE KFLAKCTLGQ MISRYMVLVA
SEQKLLMMRP YQIYAVKAIV ECIHQNCGNG YIWHTTGSGK TLTSFKASTL LKDNPDIDKC
LFVVDRKDLD RQTREEFNRF QEGCVEENTN TETLVRRLLS DDYADKVIVT TIQKLGLALD
GANKRNYKER LEPLRNQRMV FIFDECHRSQ FGDNHKAIKE FFPNAQLFGF TGTPIFEKNA
SYQQIEGQQA SYRTTDDLFQ RCLHQYTITH AIEDRNVLRF HVDYFKPEGK KPPKPGEGIA
KAKVIETILA KHDTSTNGRK FNAVLATASI NDAIEYFELF AEIQQQKAEQ DPEFRPLNIA
CVFSPPAEGN KDVQQIQEDL PQEQEDNKKD PEAKKAALTR IIADYNTRFG TNHRISDFDL
YYQDVQKRIK DQQYPNSDLP AAQKIDITIV VDMLLTGFDS KYLNTLYVDK NLKHHGLIQA
FSRTNRVLND SKPYGNILDF RQQQSAVEEA IALFSGERID NPREIWLVES AAEVIRKYEA
AVAGMSDFMA DKNLVCEPEA VYNLKGDTAR IEFINRFKEV QRLKTQLDQY TDLAPEQKTR
IDTILPPDQL QSFRSTYLET AKQLKEIQSK EGDQAPPEIQ QLDFEFVLFA SAVIDYDYIM
GLISRMTQQG PSKLKMNREQ LISLIQSDAK FIDEREDIAE YIRSLPANEA LDEKQIRAGY
NRFKDEKKAR ELTDIASRHG LEPDALQDFV DEILRRCIFD GERLSELMAP LNLGWKARTQ
KELALMEELA PLLHKLAQGR EISGLKAYEE