Gene Moth_1671 details


Gene Information

Locus tag: Moth_1671
Symbol:
ID: 3831942
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1707326
End bp: 1708654
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 44%
IMG OID: 637829596
Product: restriction modification system DNA specificity subunit
Protein accession: YP_430516
Protein GI: 83590507
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
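
The length fields in this record are consistent with the coordinates. The sketch below reproduces that arithmetic in Python. It assumes that Start bp and End bp are 1-based and inclusive (the page does not state the convention) and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1707326, 1708654)
print(length)                     # 1329
print(protein_length_aa(length))  # 442
```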


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.517259
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00037426
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGGTAAGG AAGTTAATGA GGTGAAGGAA GGATATAAGG AGACGGAGAT TGGGGTGCTG 
CCGGAGGATT GGGAAGTGGT GAGGCTGGGG AAAGTTTTTG AAGAGGTAGA CAGACGTGTT
AATAATGTGA AAAACGCTGC CAGCCTTCCA GTCTTGTCTT TAACAAAAAA CAATGGCATA
ATCCCGCAAA CAGAACGTTT TAAAAAGCGA ATTGCAACAG ACGATTTGAG TAACTACAAG
GTGGTCTACA AGAAAGAGTT AGTCTACAAC CCTTATGTGA TTTGGGAAGG AGCTATTCAT
ATTCTCAATA GGTTAGAAGC TGGTCTTGTC AGCCCAGTAT ACCCAGTACT ATCGGTAAAT
AAGAAGGTAG CTGATGCATA TTTCTTTGAC TTTTGGCTGA GAACACCCTC TGCGATAAAA
GCCTACAGTC GTTATGCCTC TGGGGCCGTA AACCGTAGGC GGGCCATCCG CAAAACGGAT
TTCAAGAACA TAGACGCACC TCTTCCTCCA CTGCATGAGC AGCGCAAGAT TGCCTATGTA
CTTTCAACTA TCCAAAGGGC TATCCAACTT CAAGATAAGG TTATCGCCGC CACCCGGGAA
CTGAAAAAGT CGCTCATGCG CCACCTCTTC ACCTATGGCC CGGTACCTGT TGACCAGATC
GACCGCGTAC CCTTGAAGGA AACTGAAATC GGAATGGTGC CGGAGCATTG GGAAGTAGTC
AGGTTAAGAG AAGTAGCTGA CTTCACAAAA AAGCCCCGAG GCCTTAATTA TTCCGGTAAC
ATTCCTTTTA TTCCAATGGA GCTAATACCT ATTGGAAGAG TCAATATCCA AAAGTATATC
ATTAAGCCTA GTTCCGAGAT TAGTAGTGGC GTTTATTGTG AACAAGGCGA TCTTTTACTA
GCCAAGATTA CGCCATCATT TGAAAACTAT AAGCAAGGTA TTATTTCACA AATTCCAAAG
CCTTTTGCAT TTGCTACAAC GGAAGTCTAT CCAATTAAGG CAAGAAAGGA TTTCTTAGAA
ATCCTATATT TGTTTTACTA TCTTTTGATA CCACAAGTCA GGCAAGATAT AGCGGGTAAA
ATGGAAGGCA CAACAGGAAG ACAGAGAATT TCTAAATCAG TAATCCAGAA TTACTTAATT
CCTATCCCAC CCCTTTCTGA GCAACGCCAA ATTGCCCGCT TTCTTATTAC AGTGGATAAA
AAAATCGAAG CCGAGGAATA TCGCAAATCC ACCCTCCAAT CCCTCTTCCA AACCATGCTT
CACCTGCTTA TGACCGGCAA GGTGCGCGTC AAAGACCTGG AGGTGAAAGA AGATGCCCTT
AGGCAGTGA
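
The GC content field in the gene record can be recomputed from a sequence laid out like the one above, in space-separated blocks of ten bases. The sketch below is one way to do that in Python. The function name is hypothetical, and the page does not say how its own figure was computed or rounded, so the result for the full gene may differ slightly from the reported value.

```python
def gc_content_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First 20 bases of the gene sequence: 7 of 20 are G or C.
print(gc_content_percent("ATGGGTAAGG AAGTTAATGA"))  # 35.0
```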
 
Protein sequence
MGKEVNEVKE GYKETEIGVL PEDWEVVRLG KVFEEVDRRV NNVKNAASLP VLSLTKNNGI 
IPQTERFKKR IATDDLSNYK VVYKKELVYN PYVIWEGAIH ILNRLEAGLV SPVYPVLSVN
KKVADAYFFD FWLRTPSAIK AYSRYASGAV NRRRAIRKTD FKNIDAPLPP LHEQRKIAYV
LSTIQRAIQL QDKVIAATRE LKKSLMRHLF TYGPVPVDQI DRVPLKETEI GMVPEHWEVV
RLREVADFTK KPRGLNYSGN IPFIPMELIP IGRVNIQKYI IKPSSEISSG VYCEQGDLLL
AKITPSFENY KQGIISQIPK PFAFATTEVY PIKARKDFLE ILYLFYYLLI PQVRQDIAGK
MEGTTGRQRI SKSVIQNYLI PIPPLSEQRQ IARFLITVDK KIEAEEYRKS TLQSLFQTML
HLLMTGKVRV KDLEVKEDAL RQ
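
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The Python sketch below illustrates this using the standard codon assignments, which translation table 11 shares for internal codons. It does not handle the alternative start codons that table 11 permits, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(sequence.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        # Codons with ambiguous bases are reported as X.
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 and last 9 bases of the gene sequence.
print(translate_cds("ATGGGTAAGG AAGTTAATGA GGTGAAGGAA"))  # MGKEVNEVKE
print(translate_cds("AGGCAGTGA"))                         # RQ
```

Passing the full gene sequence, spaces and line breaks included, should yield a string that can be compared with the protein sequence once its spaces are removed.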