Gene Moth_1213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1213 
Symbol 
ID3832848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1252186 
End bp1253598 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content55% 
IMG OID637829148 
Productsite-specific DNA-methyltransferase (adenine-specific) 
Protein accessionYP_430070 
Protein GI83590061 
COG category[L] Replication, recombination and repair 
COG ID[COG0863] DNA modification methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCTAAAC GTCTGACAAC CCGCGGTTCG GCCTGCCTGG TTTGGGATAC AAAAACGGCA 
GGAGAAATCC ATCAGGGTAA ATATGCCTTA CAACCAAAAG AACTGGTTAT ACCAACGGCA
GGAAATGGCC TGGACCGGCA GCAATCCTTA TTATCAGAGT CAGAAGGCAA CCTCCTAATC
CAGGGCGACA ACCTCCAAGC CATGCAGGGG CTCCTGGACA GGGGATATGA AGGTAAAATC
CATCTTATTT ATATCGATCC ACCTTTTTTC AGCCAGGACA ATTACAGCCA CCGGGTTCCC
CTCGCCGGGA CGGCCGCCGG CCAGGAACGC CGGGTCATAG AACGGGCGGC CTACCGGGAC
ACCTGGAGGG GGGGAATTGA TGCCTATCTG GATATGCTGT ACCCCCGGCT CCAGCTGATG
AAAAGGCTAC TGGCGTCGAA TGGTAGCATT TACGTTCATC TGGATGCCAG TATCAGCCAC
TATAACTGGG TTCGTAATCA CGATACCCTG CTTTTCTATG TCAGGGACCC GGCCAGGTTC
ACCTTTAATA AAGAGTACCT TCCCTACCCG CCCGGCTACC GGCGACGCGG CAGCCGGGAA
ACAAAGGGGA AGGGTTATCC TCTGGATGAC GTCTGGAATG CCAATCCATT CGAATTTGAA
TTAAAAGGGG AAGAAAGCCT GGATTCCATC CAGATCAAAA GCTTTTCCCG GGAGAAGACT
GGCTTTGCCA CCCAAAAAAA CCTTAGCCTC CTCCGGCGGA TTATCAAGGC TTCTTCCAAC
CCCGGAGACC TGGTGGCTGA CTTTTTTTGC GGTTCCGGTA CCACTCTGGT GGCCGCCGAA
GCCCTGGACC GGAAATGGCT GGGTTGTGAA ATAGGGTGGA CCGGTCTTCA GGTAGCCCGC
AAGCGTCTGG TAGCGGCAGG AGCCGGCCCC TTCTTTATTG AGGTAGTGCA GCCGGCAAAC
CAGCCAGCCG CCGGTTTAGC GCCTGTCGTT CTCTATCAAA ACGACTCGGG AACCGTTGCC
CAGAGACTGC CCCGTTTGCT GGCGAAGGCG ACTAGAACCC CGGTGGATAA CGGCCTGGAA
GAGGTGACCA TCGACCTGAA AGGGTATCAC CTGCCGGCAA TTCCGGCCAA CCGGTTATCC
CGTAAAGGTA GGAGCGACCT GAAGACGGCC ACAAGAGCCA GTCGGGAGAA TTTCGCCCTT
ATTATCGATT ATTGGGCCGT GGACTGGGAT TACGACGGGC GTATTTTTAA GAGCAGTTGG
CAGGCGTGGC GGGGTTACAG CAAAAACGAA CCGCCAGTAC CGGTCCAGGC CCGGGCCATC
CTGGCAGCAA AAAAAGAAAG GACCATTGCC GTCCAGGTAG TTGATATTCT GGGAAATGAG
ATTCTAACTG TCATAGAAAC GGGTAAGCCC TGA
 
Protein sequence
MAKRLTTRGS ACLVWDTKTA GEIHQGKYAL QPKELVIPTA GNGLDRQQSL LSESEGNLLI 
QGDNLQAMQG LLDRGYEGKI HLIYIDPPFF SQDNYSHRVP LAGTAAGQER RVIERAAYRD
TWRGGIDAYL DMLYPRLQLM KRLLASNGSI YVHLDASISH YNWVRNHDTL LFYVRDPARF
TFNKEYLPYP PGYRRRGSRE TKGKGYPLDD VWNANPFEFE LKGEESLDSI QIKSFSREKT
GFATQKNLSL LRRIIKASSN PGDLVADFFC GSGTTLVAAE ALDRKWLGCE IGWTGLQVAR
KRLVAAGAGP FFIEVVQPAN QPAAGLAPVV LYQNDSGTVA QRLPRLLAKA TRTPVDNGLE
EVTIDLKGYH LPAIPANRLS RKGRSDLKTA TRASRENFAL IIDYWAVDWD YDGRIFKSSW
QAWRGYSKNE PPVPVQARAI LAAKKERTIA VQVVDILGNE ILTVIETGKP