Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1671 |
Symbol | |
ID | 3831942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1707326 |
End bp | 1708654 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637829596 |
Product | restriction modification system DNA specificity subunit |
Protein accession | YP_430516 |
Protein GI | 83590507 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.517259 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00037426 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGTAAGG AAGTTAATGA GGTGAAGGAA GGATATAAGG AGACGGAGAT TGGGGTGCTG CCGGAGGATT GGGAAGTGGT GAGGCTGGGG AAAGTTTTTG AAGAGGTAGA CAGACGTGTT AATAATGTGA AAAACGCTGC CAGCCTTCCA GTCTTGTCTT TAACAAAAAA CAATGGCATA ATCCCGCAAA CAGAACGTTT TAAAAAGCGA ATTGCAACAG ACGATTTGAG TAACTACAAG GTGGTCTACA AGAAAGAGTT AGTCTACAAC CCTTATGTGA TTTGGGAAGG AGCTATTCAT ATTCTCAATA GGTTAGAAGC TGGTCTTGTC AGCCCAGTAT ACCCAGTACT ATCGGTAAAT AAGAAGGTAG CTGATGCATA TTTCTTTGAC TTTTGGCTGA GAACACCCTC TGCGATAAAA GCCTACAGTC GTTATGCCTC TGGGGCCGTA AACCGTAGGC GGGCCATCCG CAAAACGGAT TTCAAGAACA TAGACGCACC TCTTCCTCCA CTGCATGAGC AGCGCAAGAT TGCCTATGTA CTTTCAACTA TCCAAAGGGC TATCCAACTT CAAGATAAGG TTATCGCCGC CACCCGGGAA CTGAAAAAGT CGCTCATGCG CCACCTCTTC ACCTATGGCC CGGTACCTGT TGACCAGATC GACCGCGTAC CCTTGAAGGA AACTGAAATC GGAATGGTGC CGGAGCATTG GGAAGTAGTC AGGTTAAGAG AAGTAGCTGA CTTCACAAAA AAGCCCCGAG GCCTTAATTA TTCCGGTAAC ATTCCTTTTA TTCCAATGGA GCTAATACCT ATTGGAAGAG TCAATATCCA AAAGTATATC ATTAAGCCTA GTTCCGAGAT TAGTAGTGGC GTTTATTGTG AACAAGGCGA TCTTTTACTA GCCAAGATTA CGCCATCATT TGAAAACTAT AAGCAAGGTA TTATTTCACA AATTCCAAAG CCTTTTGCAT TTGCTACAAC GGAAGTCTAT CCAATTAAGG CAAGAAAGGA TTTCTTAGAA ATCCTATATT TGTTTTACTA TCTTTTGATA CCACAAGTCA GGCAAGATAT AGCGGGTAAA ATGGAAGGCA CAACAGGAAG ACAGAGAATT TCTAAATCAG TAATCCAGAA TTACTTAATT CCTATCCCAC CCCTTTCTGA GCAACGCCAA ATTGCCCGCT TTCTTATTAC AGTGGATAAA AAAATCGAAG CCGAGGAATA TCGCAAATCC ACCCTCCAAT CCCTCTTCCA AACCATGCTT CACCTGCTTA TGACCGGCAA GGTGCGCGTC AAAGACCTGG AGGTGAAAGA AGATGCCCTT AGGCAGTGA
|
Protein sequence | MGKEVNEVKE GYKETEIGVL PEDWEVVRLG KVFEEVDRRV NNVKNAASLP VLSLTKNNGI IPQTERFKKR IATDDLSNYK VVYKKELVYN PYVIWEGAIH ILNRLEAGLV SPVYPVLSVN KKVADAYFFD FWLRTPSAIK AYSRYASGAV NRRRAIRKTD FKNIDAPLPP LHEQRKIAYV LSTIQRAIQL QDKVIAATRE LKKSLMRHLF TYGPVPVDQI DRVPLKETEI GMVPEHWEVV RLREVADFTK KPRGLNYSGN IPFIPMELIP IGRVNIQKYI IKPSSEISSG VYCEQGDLLL AKITPSFENY KQGIISQIPK PFAFATTEVY PIKARKDFLE ILYLFYYLLI PQVRQDIAGK MEGTTGRQRI SKSVIQNYLI PIPPLSEQRQ IARFLITVDK KIEAEEYRKS TLQSLFQTML HLLMTGKVRV KDLEVKEDAL RQ
|
| |