Gene Information |
Locus tag | Moth_1670 |
Symbol | |
ID | 3831941 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1704445 |
End bp | 1707342 |
Gene Length | 2898 bp |
Protein Length | 965 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637829595 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_430515 |
Protein GI | 83590506 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
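The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1704445
end_bp = 1707342

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop codon.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2898 0 965
```

Under these assumptions the result agrees with the listed Gene Length (2898 bp) and Protein Length (965 aa).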
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000414451 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCCCTTAG GCAGTGAACG GGCCGCAGTA CAAAACCCAT TTATCCGCTA TGCTACTGAA GCAGGCTGGC AGTACATAAT CCCGCAGGAT TTAGAGCTCA GGCGGGGTGG TGTAGGTGGT TTAATTGATC GCCAGGTATT CATCAGCCAG CTCCAGAAGT TAAACCCCGG TGTGGTCGAC CACCTGCGTG CCGAAGAGGT ATTAAAGCGG CTGGAGCGGT TACCGCCTAC CATCGAAGGC AACCTTCAGG CCTGGGAATA CCTGAAGGGC TTGCGGACGG TATACGTTGA GACCGAGAAA AGGGAGCGGA ATATCCGCCT GCTGGACCTG GACAACTGGC GGGCCAACAT CTTTCAGGTT ACCGATGAAT TTACCTATAC CAATGGCACA TATACCATTC GTGCTGATAT CGTCTTTTTA ATTAATGGCA TCCCGGTACT GGTGGTAGAA ACGAAATCCA CCAGTGAGAT AGAAGGCATT GCCAAAGCCC TGGACCAGAT GCGCCGCTAC CACCGGGAGG CTCCGGAGCT CATGGCCCTG GAACAGGTCT ATGCCTTGAC CCACCTGGTC CATTTCTATT ATGGGACCAC CTGGAGCTTA AGCCGGAAAA ACCTTTTCAA CTGGCGGGAC GAAAGTACAG GTCAGGATTA CGAAACCATG GTAAAAAGCT TCATCCATCC CCAGCGGATT TTACGGGTCT TAACCGACTT TATCATGTTT ACCCGCAGGG ATGATGAATT AAGTAAAGTT ATCTTACGGC CGCACCAGAT GCGGGCCCTT GGCCGCGTCC TTAACCGCGC CCTGGACCCC ACCAAAAAGC GGGGCCTGGT CTGGCACACC CAGGGGTCGG GTAAAACTTA CACCATGATC ACCATTGCCA AAAAGCTGAT TGAAGACCCT CGTTTTAATA ATCCCACGGT GCTTATGCTG GTTGACCGCA ATGAGCTGGA ACAGCAACTC TTCAATAACC TGGCCGCCTG TGGCCTGGGT ATGGTCGAAG TAGCCCGGAG TAAACGCCAC TTGCAAGAAC TCCTGGCTTC CGACCGCCGG GGCCTTATTG TTAGCATGAT CCATAAATTT GACGACATCC CGGCTAAGCT CAACTGCCGC GACAATATCT TTGTCCTCGT AGATGAGGCC CACCGTACTA CCGGGGGCAA TTTGGGCAAT TACCTCATGG GGGCTCTCCC CAATGCTACC TATATCGGTT TTACCGGTAC ACCCATTGAC CGCACCGCCC ACGGTAAGGG TACCTTCAAA GTTTTTGGCT GCGACGACCA ACCTCAAGGT TACTTAGATA AATACAGCAT TGCGGAATCT ATCGCCGACG GTACGACCGT GCCCCTCCAT TACTCCCTGG CACCTAACGA TCTGCGCGTT GACCGGGAAA CCCTGGAAGA AGAATTTCTA AACCTGGCGG CGGCCGAAGG GGTCAGCGAT ATCGAGGAAC TGAATAAGGT CTTAGAGAAG GCCGTTACCC TAAGGAATAT GCTCAAAAAC CGGGAGCGGG TGGATCGCAT CGCTGCTTAT GTAGCCCGGC ATTACCGGGA ATATATCGAA CCGATGGGTT ATAAGGCTTT CCTGGTGGCT GTGGACCGTG AGGCTTGCAC CCTGTATAAA GAGGCCCTGG ATAAATACCT GCCGCCGGAG TATTCGCAGG TAGTGATCAG CAGGGGCCAC AATGACCAGC CGCATCTGGC GCGTTATTAT CTTTCCGAAG AAGAGGAAAA GCGGGTACGC AGGGCCTTCC GCCAACCTGG GGAATTGCCC AAGATACTCA TCGTCACAGA GAAACTCCTT 
ACCGGTTATG ATGCCCCGGT GCTTTACTGT ATGTATTTGG ATAAACCCAT GCGGGACCAT GTCCTCTTGC AGGCCATCGC CCGGGTGAAC CGTCCCTATG AAGATGAAAA TGGCCACCAA AAACCGGCTG GCTTTGTCCT CGATTTTGTC GGTATCTTTG CAAACCTGGA AAAGGCCCTG GCCTTTGACT CCAAAGACAT TGAAGGGACT ATCCAGGACC TGGATATCCT TAAAAATCGT TTTCAAGAAA TGATAACTGA AGGTAGAAGG AAATACCTGG CTATTATCGA AAACTATAAG CAAGATAAGG CGGCAGAAAA AGCCCTGGAA CACTTCCGGG AGCCCCAAGC GCGCCAGGAG TATTATGCTT TCTTCAAAGA GCTGGCCAAC CTGTTCGAGA TTATATCGCC AGATCCCTTT TTAAGGGAGT ACCTGGATGA CTACGACCAG TTGGCCCGGC TATATCGGCT CCTGCGTTCT GTTGAGCCGG GGGTCAATCT GGACCATGAG TTAATGCGTA AAACAGCCCG TCTGGTCCAG GAACATACGC TACCTGGTGC TATCCAGGAT GCAGTGACAG TTTACCAAAT AAATGAAGAA GCTCTGGAAA GAATAGCTGA GACCAGGGCG TCATACACCG TTGAGGTATT TAACCTCTTG AAAAGCCTCG AGAATACTGT TGCCGCGAAA AGACAGGAAG CACCTTACCT TTTATCCATT GGCGAGATGG CGGAAAGTAT AGCCCAGGCC TACCAGGAGC GGCAGCTAAG TACCCAGGAA GCATTGCAGC GTTTGCAAAG TTTAATTCAA GAATACCTTG TTTCGGAGCG GGAACGAGCC GAAAAGGATA TGGCTCCTGA AACCTATGCT GTTTACTGGT ATTTGAAGCG GCAAGGGATA AAGGGTGCTG AAAAGGTAGC CCTAAATATG GCCAGGGCTT TTATAAATTA TCCCTACTGG AAGAGCAGCG AAAGCCAGGA GCGGCAAATC GTATGGGAGT TTTATAAAAA TCTCAAGAGA GCAGGACTGA AGGCGGGTAA TAAATTGGGA GACATTGTAG ATCAGATTAT GAAACTGATA AAGGGGGCCA GGCGATGA
|
Protein sequence | MPLGSERAAV QNPFIRYATE AGWQYIIPQD LELRRGGVGG LIDRQVFISQ LQKLNPGVVD HLRAEEVLKR LERLPPTIEG NLQAWEYLKG LRTVYVETEK RERNIRLLDL DNWRANIFQV TDEFTYTNGT YTIRADIVFL INGIPVLVVE TKSTSEIEGI AKALDQMRRY HREAPELMAL EQVYALTHLV HFYYGTTWSL SRKNLFNWRD ESTGQDYETM VKSFIHPQRI LRVLTDFIMF TRRDDELSKV ILRPHQMRAL GRVLNRALDP TKKRGLVWHT QGSGKTYTMI TIAKKLIEDP RFNNPTVLML VDRNELEQQL FNNLAACGLG MVEVARSKRH LQELLASDRR GLIVSMIHKF DDIPAKLNCR DNIFVLVDEA HRTTGGNLGN YLMGALPNAT YIGFTGTPID RTAHGKGTFK VFGCDDQPQG YLDKYSIAES IADGTTVPLH YSLAPNDLRV DRETLEEEFL NLAAAEGVSD IEELNKVLEK AVTLRNMLKN RERVDRIAAY VARHYREYIE PMGYKAFLVA VDREACTLYK EALDKYLPPE YSQVVISRGH NDQPHLARYY LSEEEEKRVR RAFRQPGELP KILIVTEKLL TGYDAPVLYC MYLDKPMRDH VLLQAIARVN RPYEDENGHQ KPAGFVLDFV GIFANLEKAL AFDSKDIEGT IQDLDILKNR FQEMITEGRR KYLAIIENYK QDKAAEKALE HFREPQARQE YYAFFKELAN LFEIISPDPF LREYLDDYDQ LARLYRLLRS VEPGVNLDHE LMRKTARLVQ EHTLPGAIQD AVTVYQINEE ALERIAETRA SYTVEVFNLL KSLENTVAAK RQEAPYLLSI GEMAESIAQA YQERQLSTQE ALQRLQSLIQ EYLVSERERA EKDMAPETYA VYWYLKRQGI KGAEKVALNM ARAFINYPYW KSSESQERQI VWEFYKNLKR AGLKAGNKLG DIVDQIMKLI KGARR
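The relationship between the two sequence fields can be illustrated with a short translation sketch. It assumes the gene sequence is given in coding orientation (5' to 3' of the coding strand, even though the gene lies on the minus strand), which is consistent with it starting at ATG and ending at TGA. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, so the standard amino-acid string is used here. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database API.

```python
from itertools import product

# Codon order follows the NCBI convention: bases iterate as T, C, A, G.
BASES = "TCAG"
# Amino-acid assignments shared by translation tables 1 and 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base blocks in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGCCCTTAG GCAGTGAACG GGCCGCAGTA"))  # MPLGSERAAV
```

The printed residues match the first block of the protein sequence. Applying the same function to the last 18 bases of the gene sequence (AAGGGGGCCAGGCGATGA) yields KGARR, the final residues of the protein, with TGA read as the stop codon.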
|
| |