Gene Moth_1670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1670 
Symbol 
ID3831941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1704445 
End bp1707342 
Gene Length2898 bp 
Protein Length965 aa 
Translation table11 
GC content51% 
IMG OID637829595 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_430515 
Protein GI83590506 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000414451 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCCCTTAG GCAGTGAACG GGCCGCAGTA CAAAACCCAT TTATCCGCTA TGCTACTGAA 
GCAGGCTGGC AGTACATAAT CCCGCAGGAT TTAGAGCTCA GGCGGGGTGG TGTAGGTGGT
TTAATTGATC GCCAGGTATT CATCAGCCAG CTCCAGAAGT TAAACCCCGG TGTGGTCGAC
CACCTGCGTG CCGAAGAGGT ATTAAAGCGG CTGGAGCGGT TACCGCCTAC CATCGAAGGC
AACCTTCAGG CCTGGGAATA CCTGAAGGGC TTGCGGACGG TATACGTTGA GACCGAGAAA
AGGGAGCGGA ATATCCGCCT GCTGGACCTG GACAACTGGC GGGCCAACAT CTTTCAGGTT
ACCGATGAAT TTACCTATAC CAATGGCACA TATACCATTC GTGCTGATAT CGTCTTTTTA
ATTAATGGCA TCCCGGTACT GGTGGTAGAA ACGAAATCCA CCAGTGAGAT AGAAGGCATT
GCCAAAGCCC TGGACCAGAT GCGCCGCTAC CACCGGGAGG CTCCGGAGCT CATGGCCCTG
GAACAGGTCT ATGCCTTGAC CCACCTGGTC CATTTCTATT ATGGGACCAC CTGGAGCTTA
AGCCGGAAAA ACCTTTTCAA CTGGCGGGAC GAAAGTACAG GTCAGGATTA CGAAACCATG
GTAAAAAGCT TCATCCATCC CCAGCGGATT TTACGGGTCT TAACCGACTT TATCATGTTT
ACCCGCAGGG ATGATGAATT AAGTAAAGTT ATCTTACGGC CGCACCAGAT GCGGGCCCTT
GGCCGCGTCC TTAACCGCGC CCTGGACCCC ACCAAAAAGC GGGGCCTGGT CTGGCACACC
CAGGGGTCGG GTAAAACTTA CACCATGATC ACCATTGCCA AAAAGCTGAT TGAAGACCCT
CGTTTTAATA ATCCCACGGT GCTTATGCTG GTTGACCGCA ATGAGCTGGA ACAGCAACTC
TTCAATAACC TGGCCGCCTG TGGCCTGGGT ATGGTCGAAG TAGCCCGGAG TAAACGCCAC
TTGCAAGAAC TCCTGGCTTC CGACCGCCGG GGCCTTATTG TTAGCATGAT CCATAAATTT
GACGACATCC CGGCTAAGCT CAACTGCCGC GACAATATCT TTGTCCTCGT AGATGAGGCC
CACCGTACTA CCGGGGGCAA TTTGGGCAAT TACCTCATGG GGGCTCTCCC CAATGCTACC
TATATCGGTT TTACCGGTAC ACCCATTGAC CGCACCGCCC ACGGTAAGGG TACCTTCAAA
GTTTTTGGCT GCGACGACCA ACCTCAAGGT TACTTAGATA AATACAGCAT TGCGGAATCT
ATCGCCGACG GTACGACCGT GCCCCTCCAT TACTCCCTGG CACCTAACGA TCTGCGCGTT
GACCGGGAAA CCCTGGAAGA AGAATTTCTA AACCTGGCGG CGGCCGAAGG GGTCAGCGAT
ATCGAGGAAC TGAATAAGGT CTTAGAGAAG GCCGTTACCC TAAGGAATAT GCTCAAAAAC
CGGGAGCGGG TGGATCGCAT CGCTGCTTAT GTAGCCCGGC ATTACCGGGA ATATATCGAA
CCGATGGGTT ATAAGGCTTT CCTGGTGGCT GTGGACCGTG AGGCTTGCAC CCTGTATAAA
GAGGCCCTGG ATAAATACCT GCCGCCGGAG TATTCGCAGG TAGTGATCAG CAGGGGCCAC
AATGACCAGC CGCATCTGGC GCGTTATTAT CTTTCCGAAG AAGAGGAAAA GCGGGTACGC
AGGGCCTTCC GCCAACCTGG GGAATTGCCC AAGATACTCA TCGTCACAGA GAAACTCCTT
ACCGGTTATG ATGCCCCGGT GCTTTACTGT ATGTATTTGG ATAAACCCAT GCGGGACCAT
GTCCTCTTGC AGGCCATCGC CCGGGTGAAC CGTCCCTATG AAGATGAAAA TGGCCACCAA
AAACCGGCTG GCTTTGTCCT CGATTTTGTC GGTATCTTTG CAAACCTGGA AAAGGCCCTG
GCCTTTGACT CCAAAGACAT TGAAGGGACT ATCCAGGACC TGGATATCCT TAAAAATCGT
TTTCAAGAAA TGATAACTGA AGGTAGAAGG AAATACCTGG CTATTATCGA AAACTATAAG
CAAGATAAGG CGGCAGAAAA AGCCCTGGAA CACTTCCGGG AGCCCCAAGC GCGCCAGGAG
TATTATGCTT TCTTCAAAGA GCTGGCCAAC CTGTTCGAGA TTATATCGCC AGATCCCTTT
TTAAGGGAGT ACCTGGATGA CTACGACCAG TTGGCCCGGC TATATCGGCT CCTGCGTTCT
GTTGAGCCGG GGGTCAATCT GGACCATGAG TTAATGCGTA AAACAGCCCG TCTGGTCCAG
GAACATACGC TACCTGGTGC TATCCAGGAT GCAGTGACAG TTTACCAAAT AAATGAAGAA
GCTCTGGAAA GAATAGCTGA GACCAGGGCG TCATACACCG TTGAGGTATT TAACCTCTTG
AAAAGCCTCG AGAATACTGT TGCCGCGAAA AGACAGGAAG CACCTTACCT TTTATCCATT
GGCGAGATGG CGGAAAGTAT AGCCCAGGCC TACCAGGAGC GGCAGCTAAG TACCCAGGAA
GCATTGCAGC GTTTGCAAAG TTTAATTCAA GAATACCTTG TTTCGGAGCG GGAACGAGCC
GAAAAGGATA TGGCTCCTGA AACCTATGCT GTTTACTGGT ATTTGAAGCG GCAAGGGATA
AAGGGTGCTG AAAAGGTAGC CCTAAATATG GCCAGGGCTT TTATAAATTA TCCCTACTGG
AAGAGCAGCG AAAGCCAGGA GCGGCAAATC GTATGGGAGT TTTATAAAAA TCTCAAGAGA
GCAGGACTGA AGGCGGGTAA TAAATTGGGA GACATTGTAG ATCAGATTAT GAAACTGATA
AAGGGGGCCA GGCGATGA
 
Protein sequence
MPLGSERAAV QNPFIRYATE AGWQYIIPQD LELRRGGVGG LIDRQVFISQ LQKLNPGVVD 
HLRAEEVLKR LERLPPTIEG NLQAWEYLKG LRTVYVETEK RERNIRLLDL DNWRANIFQV
TDEFTYTNGT YTIRADIVFL INGIPVLVVE TKSTSEIEGI AKALDQMRRY HREAPELMAL
EQVYALTHLV HFYYGTTWSL SRKNLFNWRD ESTGQDYETM VKSFIHPQRI LRVLTDFIMF
TRRDDELSKV ILRPHQMRAL GRVLNRALDP TKKRGLVWHT QGSGKTYTMI TIAKKLIEDP
RFNNPTVLML VDRNELEQQL FNNLAACGLG MVEVARSKRH LQELLASDRR GLIVSMIHKF
DDIPAKLNCR DNIFVLVDEA HRTTGGNLGN YLMGALPNAT YIGFTGTPID RTAHGKGTFK
VFGCDDQPQG YLDKYSIAES IADGTTVPLH YSLAPNDLRV DRETLEEEFL NLAAAEGVSD
IEELNKVLEK AVTLRNMLKN RERVDRIAAY VARHYREYIE PMGYKAFLVA VDREACTLYK
EALDKYLPPE YSQVVISRGH NDQPHLARYY LSEEEEKRVR RAFRQPGELP KILIVTEKLL
TGYDAPVLYC MYLDKPMRDH VLLQAIARVN RPYEDENGHQ KPAGFVLDFV GIFANLEKAL
AFDSKDIEGT IQDLDILKNR FQEMITEGRR KYLAIIENYK QDKAAEKALE HFREPQARQE
YYAFFKELAN LFEIISPDPF LREYLDDYDQ LARLYRLLRS VEPGVNLDHE LMRKTARLVQ
EHTLPGAIQD AVTVYQINEE ALERIAETRA SYTVEVFNLL KSLENTVAAK RQEAPYLLSI
GEMAESIAQA YQERQLSTQE ALQRLQSLIQ EYLVSERERA EKDMAPETYA VYWYLKRQGI
KGAEKVALNM ARAFINYPYW KSSESQERQI VWEFYKNLKR AGLKAGNKLG DIVDQIMKLI
KGARR