Gene Moth_0026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0026 
Symbol 
ID3832119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp27244 
End bp28371 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content51% 
IMG OID637827958 
ProductDNA adenine methylase 
Protein accessionYP_428909 
Protein GI83588900 
COG category[L] Replication, recombination and repair 
COG ID[COG0338] Site-specific DNA methylase 
TIGRFAM ID[TIGR00571] DNA adenine methylase (dam) 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00842567 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000104087 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCAGAA ATCCAAGGTT AAGCCCCATT ATAAAATGGG CCGGCGGCAA GGAAAAGGAA 
CTTCGTTACA TCCTCCCCAA CCTCCCCAGC GGGGTCCGCA ATTACTATGA ACCCTTTGTC
GGCGGCGGGG CCGTCTTTTT TGCCGTTGAC CGGCCGGCCA TGTATATCAA TGATAAAGCG
CCGGAGCTAA TTACCCTTTA TAACATGATT CGCGAAAATA ACCGGGATTT TTTTCAAACC
CTGGAGGCCC TGGCCTGGAG CTGGGAACGC CTTACCATGG TTGTAAGGGA TAATGGCACC
ATTTTGGAAG ATATTTACAG AAAGTATGCC AGCGAGGTCT ATTCCGGTAC GCAATTGTCC
CGGGCAGTGG CAGAGCTTAT TAAAACGCGG GGGCAGGAGT TCAGGGAGAT TCTGGCACCT
TTTATTAACC CGGAAAACTT CTTCGGGGAA ATAAAAAGAA ACCTTCTCAA CAAGATAACC
AGGATGAAAA AAATTGCCGC CCGGAAGGGC GCCCTGGCCA GCAGGGATAT CCTGGCCAAC
CTGGAAGGCG CCTTTAAAAG TGCTTTCTAC ATGCACTGCC GCTACCTGTA CAACCATACC
GCTACCTTCG GCTTGGGCCC GGGAACCGCC ACCGCCCTCT TTTACTTTAT CCGGGAATAC
TGCTATGCCG CCATGTTTCG TTATAACAGT CGGGGCGAAT TTAACGTCCC TTACAGCGGT
ATTTCTTATA ACGATAAGGA TTTCGCCCGT AAGATAGCCT ACCTCAAATC CCGTGAAGTA
CGCGAACACC TGGGTAAGGC CAGGATTTTT TGCCTGGATT TCGAAGAGTT TCTCCGCCAG
ACTGCCCCCG GGCCGGAGGA TTTTATCTTC CTGGACCCGC CCTATGATAG TGATTTCAGC
GCCTACGCCG GGATGTCTTT CGGACCAGCC GATCAGGAGC GCCTGGCAGC CTATCTCGCT
GGCGCATGCC GGGCAAGGTT TATGCTGGTA ATTAAAAATA CGCCCTTTAT TTACGACCTC
TACCGGGATC ACGGCTTTAG AATCCGGGCC TTTAACAAAA GATACCAGGT TAGTTTTCAG
AACCGAAACG ATAAGTCCGC GAGGCATCTG ATGATTACCA ACTACTGA
 
Protein sequence
MIRNPRLSPI IKWAGGKEKE LRYILPNLPS GVRNYYEPFV GGGAVFFAVD RPAMYINDKA 
PELITLYNMI RENNRDFFQT LEALAWSWER LTMVVRDNGT ILEDIYRKYA SEVYSGTQLS
RAVAELIKTR GQEFREILAP FINPENFFGE IKRNLLNKIT RMKKIAARKG ALASRDILAN
LEGAFKSAFY MHCRYLYNHT ATFGLGPGTA TALFYFIREY CYAAMFRYNS RGEFNVPYSG
ISYNDKDFAR KIAYLKSREV REHLGKARIF CLDFEEFLRQ TAPGPEDFIF LDPPYDSDFS
AYAGMSFGPA DQERLAAYLA GACRARFMLV IKNTPFIYDL YRDHGFRIRA FNKRYQVSFQ
NRNDKSARHL MITNY