Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mfla_1215 |
Symbol | |
ID | 4000172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacillus flagellatus KT |
Kingdom | Bacteria |
Replicon accession | NC_007947 |
Strand | + |
Start bp | 1274381 |
End bp | 1277353 |
Gene Length | 2973 bp |
Protein Length | 990 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637938119 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_545324 |
Protein GI | 91775568 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATCA CTGAAAGCCA GATTGAGCAG AATCTAATTA ACAAGCTAGC CGATCTCAAA TATACCTTGC GTCCAGATAT CCGAGACCGT GCGACCTTGG AAGCTAACTT CCGAACCAAG TTCGAAGCGC TTAATCGCGT ACGCTTGACT GACAATGAAT TTCAACGCCT GCTTGACAGC ATCATCACGC CCGACGTTTA TAGCACTGCT CAAACATTGC GCAACATCAA CAGCTTCGAG CGCGACGATG GCACACCGCT GAACTACACT CTGGTCAACA TCCGGGACTG GTGCAAGAAC GACTTCGAGG TTGTCAACCA GCTACGCATG AACACTGACA ACAGTCATCA CCGCTACGAC GTGATGCTGC TGATCAATGG CGTGCCCGTG GTGCAAATTG AGCTGAAGAC GCTGGCCGTT AGCCCACGCC GCGCCATGCA GCAGATCGTT GATTACAAGG CTGATCCCGG CAATGGCTAC GGCAAAACGC TGCTGTGCTT CCTGCAGCTC TTCATTGTCA GCAACCGCAC GGATACCTGG TACTTCGCCA ACAACAACTC ACGTCACTTC AGCTTCAACG CGGATGAGCG TTTTCTGCCG GTTTACCAGT TCGCCAGCGA AGACAACAAG AAGATCACCC AGCTAGATAG CTTTGCCGAG AAGTTCCTGG CCAAGTGCAC CTTGGGCCAG ATGATCAGCC GCTACATGGT ACTGGTGGCT AGCGAGCAGA AGCTGCTCAT GATGCGGCCT TACCAAATCT ATGCCGTCAA AGCCATTGTG GAGTGCATCC ACCAAAATTG CGGCAACGGC TACATCTGGC ATACCACCGG CTCAGGCAAG ACGCTGACTT CCTTCAAAGC GTCCACCCTG CTCAAGGACA ACCCGGACAT CGACAAGTGT CTGTTTGTGG TAGATCGCAA AGACCTGGAT CGGCAGACCC GCGAAGAATT CAACCGCTTC CAGGAAGGCT GCGTCGAAGA GAACACCAAC ACCGAGACCC TGGTGCGCCG CCTGCTGTCG GATGATTATG CCGACAAGGT CATCGTCACC ACCATTCAGA AGCTGGGTCT GGCGCTGGAT GGCGCCAACA AGCGCAACTA CAAAGAACGG CTGGAACCGC TACGCAACCA ACGCATGGTG TTCATCTTTG ATGAGTGCCA CCGCTCGCAA TTTGGTGACA ACCACAAAGC CATCAAGGAG TTCTTCCCCA ACGCCCAGCT TTTTGGCTTC ACCGGCACAC CTATCTTCGA GAAAAACGCC AGCTACCAGC AAATCGAAGG CCAACAGGCC AGCTATCGAA CCACCGACGA TTTGTTCCAG CGCTGCCTGC ACCAGTACAC CATCACCCAC GCCATTGAAG ATCGCAACGT ACTGCGCTTC CACGTGGACT ACTTCAAGCC TGAAGGGAAA AAACCGCCCA AGCCCGGCGA AGGCATCGCC AAAGCCAAGG TCATCGAAAC CATTCTTGCC AAGCATGACA CCTCCACCAA TGGCCGCAAG TTCAATGCCG TGTTGGCAAC CGCCAGCATC AATGACGCCA TCGAATACTT CGAGCTGTTC GCGGAAATTC AGCAGCAAAA AGCCGAGCAA GATCCGGAGT TCCGCCCGCT GAACATTGCC TGCGTATTTT CTCCGCCTGC AGAGGGCAAC AAAGACGTAC AGCAGATTCA GGAAGACCTG CCGCAAGAGC AGGAAGATAA CAAAAAAGAC CCAGAGGCCA AAAAGGCAGC GCTTACACGC ATCATCGCTG ATTACAATAC CCGCTTCGGT ACCAATCACC GCATCAGCGA TTTCGATCTG TATTATCAGG ATGTGCAAAA GCGCATCAAG GATCAGCAGT ATCCCAACAG CGATCTTCCC GCCGCGCAAA AGATCGACAT CACCATCGTG GTGGACATGC TACTCACCGG GTTTGACTCC AAGTACCTCA ACACCCTGTA CGTGGATAAG AACCTCAAGC ATCACGGCTT GATCCAGGCA TTTTCACGCA CCAATCGCGT ACTGAACGAC AGCAAGCCTT ACGGAAATAT TCTCGATTTT CGCCAGCAGC AAAGCGCGGT GGAAGAGGCC ATTGCCCTGT TTTCCGGCGA ACGGATCGAC AACCCCCGGG AAATCTGGCT GGTGGAATCC GCGGCAGAGG TTATTCGCAA ATACGAAGCC GCTGTGGCGG GCATGTCGGA CTTCATGGCA GACAAGAACC TCGTTTGCGA ACCCGAAGCG GTCTACAACC TCAAGGGCGA TACGGCGCGT ATCGAGTTCA TCAACCGTTT CAAGGAAGTG CAACGGCTGA AAACCCAACT TGACCAGTAC ACCGATCTGG CACCCGAACA GAAAACCCGC ATTGACACCA TCCTGCCACC CGACCAGTTG CAGAGCTTCC GCAGTACCTA CCTGGAAACG GCCAAGCAGC TGAAAGAAAT TCAGAGCAAG GAAGGTGATC AAGCTCCGCC GGAAATACAA CAACTGGACT TTGAGTTCGT GCTTTTTGCG TCGGCAGTGA TCGACTACGA CTACATCATG GGCCTGATTT CACGCATGAC GCAGCAAGGC CCCTCGAAAC TGAAGATGAA CCGTGAGCAG TTGATCAGCC TGATCCAGTC CGATGCCAAG TTTATTGACG AGCGCGAGGA TATTGCCGAG TACATCCGCA GCCTGCCAGC CAATGAGGCG CTGGATGAAA AGCAAATCCG GGCGGGCTAC AACCGCTTCA AGGATGAAAA GAAGGCAAGA GAGCTGACTG ACATTGCCAG CCGTCATGGG CTGGAACCCG ATGCCTTGCA AGATTTTGTC GATGAAATCC TGCGTCGTTG CATTTTTGAT GGCGAGCGCC TTTCCGAGCT AATGGCACCG TTGAACCTGG GCTGGAAAGC ACGCACCCAG AAGGAGCTGG CACTGATGGA AGAGCTGGCA CCGCTGCTGC ACAAACTTGC TCAGGGGCGT GAGATTTCCG GGCTCAAGGC CTACGAGGAA TAA
|
Protein sequence | MNITESQIEQ NLINKLADLK YTLRPDIRDR ATLEANFRTK FEALNRVRLT DNEFQRLLDS IITPDVYSTA QTLRNINSFE RDDGTPLNYT LVNIRDWCKN DFEVVNQLRM NTDNSHHRYD VMLLINGVPV VQIELKTLAV SPRRAMQQIV DYKADPGNGY GKTLLCFLQL FIVSNRTDTW YFANNNSRHF SFNADERFLP VYQFASEDNK KITQLDSFAE KFLAKCTLGQ MISRYMVLVA SEQKLLMMRP YQIYAVKAIV ECIHQNCGNG YIWHTTGSGK TLTSFKASTL LKDNPDIDKC LFVVDRKDLD RQTREEFNRF QEGCVEENTN TETLVRRLLS DDYADKVIVT TIQKLGLALD GANKRNYKER LEPLRNQRMV FIFDECHRSQ FGDNHKAIKE FFPNAQLFGF TGTPIFEKNA SYQQIEGQQA SYRTTDDLFQ RCLHQYTITH AIEDRNVLRF HVDYFKPEGK KPPKPGEGIA KAKVIETILA KHDTSTNGRK FNAVLATASI NDAIEYFELF AEIQQQKAEQ DPEFRPLNIA CVFSPPAEGN KDVQQIQEDL PQEQEDNKKD PEAKKAALTR IIADYNTRFG TNHRISDFDL YYQDVQKRIK DQQYPNSDLP AAQKIDITIV VDMLLTGFDS KYLNTLYVDK NLKHHGLIQA FSRTNRVLND SKPYGNILDF RQQQSAVEEA IALFSGERID NPREIWLVES AAEVIRKYEA AVAGMSDFMA DKNLVCEPEA VYNLKGDTAR IEFINRFKEV QRLKTQLDQY TDLAPEQKTR IDTILPPDQL QSFRSTYLET AKQLKEIQSK EGDQAPPEIQ QLDFEFVLFA SAVIDYDYIM GLISRMTQQG PSKLKMNREQ LISLIQSDAK FIDEREDIAE YIRSLPANEA LDEKQIRAGY NRFKDEKKAR ELTDIASRHG LEPDALQDFV DEILRRCIFD GERLSELMAP LNLGWKARTQ KELALMEELA PLLHKLAQGR EISGLKAYEE
|
| |