Gene Athe_2549 details


Gene Information

Locus tag: Athe_2549
Symbol:
ID: 7409419
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2665219
End bp: 2667492
Gene Length: 2274 bp
Protein Length: 757 aa
Translation table: 11
GC content: 36%
IMG OID: 643716913
Product: 5-methyltetrahydropteroyltriglutamate--homocysteine S-methyltransferase
Protein accession: YP_002574390
Protein GI: 222530508
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0620] Methionine synthase II (cobalamin-independent)
TIGRFAM ID: [TIGR01371] 5-methyltetrahydropteroyltriglutamate--homocysteine S-methyltransferase
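
The two length fields in this record are consistent with the start and end coordinates if those coordinates are read as 1-based and inclusive. That convention is an assumption here; the page does not state it. A minimal sketch of the arithmetic, with illustrative function names that are not part of any database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(2665219, 2667492)
print(length, protein_length_aa(length))  # prints: 2274 757
```

Both values match the Gene Length and Protein Length fields in this record.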


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTTCGG TTGTTGGTTT TCCAAGAATA GGACAGAACA GAGAGCTAAA AAGATGGGTA 
GAAAGCTATC TTGACAAAAA GATTTCAAAA GAAGAGCTCA TCCAAAATTC AAAAGAACTG
AAAAAAACTC ACTGGCAAAT TCAAAAAGAG TATGGTGTTG ACCTGATACC ATCAAATGAT
TTTTCGTTCT ATGACACTTT TTTAGACCAT GCAATGTTAG TAGGCTCAAT ACCCAAGGAA
TACAAGGCGG TTTTCTCAGA TGATCTCGAG CTTTACTTCG CGCTGGCAAA GGGATATCAA
GACCAAAACA TTGATCTTAA AGCTTTGCCA ATGAAAAAGT GGTTCTTTAC AAACTACCAT
TATCTTGTGC CTGAAATCAC TGAAAACACC AAATTTGAAC TTTCATCAAC AAAACCTTTT
GATGAATTTG TCGAAGCACT TTCGATAGGA ATCAAGACAA AACCAGTAAT AATCGGTGCT
CTTACTTTTC TGAAGCTTTC TAAAAAATCA AATGTGGATA TATACGACAA ATCTTTTTGG
GAAAAGCTAC TTGATGTATA TATTCAAATA CTAAGAAGGT TTGAAGAGTT GGGTTGCGAG
TTTGTTCAGA TAGATGAGCC AATACTTGTC ACAGACTTAA GTACAAAAGA CGTAGAGCTT
TTTGAAAATT TTTACCGCAA TCTTCTTTCT CACAAAGGAA AGCTAAAAGT ACTTCTTCAG
ACTTATTTTG GAGACGTCAG AGACTGCTTT GAAAAGATAA TTTCTCTTGA CTTTGACGCA
ATCGGACTTG ACTTTATTGA TGGAAAATTC AATTTAGAGC TCATTAAAAA ATTTGGTTTC
CCACGGGAAA AGCTCCTTTT TGCCGGAGTT GTAAATGGCA GAAATGTGTT TAAAAATAAC
TACAAAGATA CACTTGAGCT TTTAAATACT CTCTCATCCT TTGTTGACAA GAAAAATATT
GTAATTTCAA CATCATGTTC TTTACTCTTT GTGCCATACT CTTTAAAATT TGAAATACAG
CTTGACAGCA ATAAAAAGAA ATTTTTAGCG TTTGCTGAGG AGAAGCTAAA AGAGCTGTCT
GAGCTAAAGC TTTTATTTTC TCAAGGAAAC TTTACTGCAA ACAGCATTTA TCTTCAAAAC
GCTCAGCTTT TCGAAGAGCT GAATAAAAAC AAACTATCAG ATGTCAGCAC AGCTGTAAAT
AATCTTACAG ACGATGACTT TGAAAGAAAA CCGTGTTTTG AAGAGAGAAT CAAGCTTCAA
AAAGAGGTTT TGAAGCTCCC ACAGCTTCCG ACAACAACAA TTGGATCTTT CCCGCAAACC
ACAGATGTAA GATCAGCGCG AAGCAAACTC AAAAAAGGTG AGATAACATT TGAAGAATAT
GAAAGCTTTA TAAAATCCAA AATTGAAAGA GTAATAAAGC TTCAGGAAGA AATCGGGCTT
GATGTGCTCG TCCATGGCGA ATATGAGAGA AATGACATGG TAGAGTTTTT TGGTGAAAAT
TTAGAAGGAT TTTTAATTAC TCAAAACGGC TGGGTTCAGT CATACGGTAC AAGATGTGTA
AAACCGCCAA TTATATTCTC AGACATAAAG AGAAAAAGAC CAATTACAGT TGAGTATATA
AAATATGCAC AGAGCCTAAC AAAAAAACCT GTTAAGGGAA TTTTGACAGG GCCTGTGACA
ATCCTCAACT GGTCGTTTGT GCGTGAAGAC ATACCATTAA AAAATGTTGC TTTTCAGCTT
GCTCTTGCAA TAAAAGAAGA GGTTCTCGAA CTTGAAAAAG AAGGTGTAAA GATTATCCAG
ATTGACGAGG CAGCTCTGAT TGAAAAGCTT CCGCTCAGGC GCTGCCAGCA CAGTGAGTAT
CTGTCTTGGG CAATAAAGGC ATTTAGGCTC TCCTGTTCAA AGGTAAAACC AGAAACACAA
ATCCACACTC ATATGTGTTA CAGCAACTTT GATGAACTTT TAGACGAAAT TGCAAAGATG
GATGTCGATG TTATAACTTT TGAAGCTGCA AAATCTGATT TTACTTTGCT TGATAGCATC
AACAAAAGCC AATTGAAAGC TGAAGTTGGT CCAGGAGTTT TTGACGTTCA CTCACCAAGG
ATTGTCTCAA AAGAAGAGAT GAAAAAGCTT ATTTTAAAGA TGATAGAAAA GGTTGGCAAA
GACAGACTTT GGATAAACCC TGACTGCGGT CTTAAGACCA GAAAGGAAGA AGAAATTTTG
CCAACATTAC AAAATATGGT TCTGGCAGCG TGGGAAGTTA GGAACAACTT GTAA
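
The sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any computation. The sketch below shows one way to do that and to compute a GC percentage like the one in the GC content field. How the page rounds its 36% figure is not stated, and the helper names are illustrative.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First display line of the gene sequence, copied from this record.
first_line = "ATGATTTCGG TTGTTGGTTT TCCAAGAATA GGACAGAACA GAGAGCTAAA AAGATGGGTA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 2274 bp block through `clean_sequence` and `gc_percent` gives a value to compare against the reported field.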
 
Protein sequence
MISVVGFPRI GQNRELKRWV ESYLDKKISK EELIQNSKEL KKTHWQIQKE YGVDLIPSND 
FSFYDTFLDH AMLVGSIPKE YKAVFSDDLE LYFALAKGYQ DQNIDLKALP MKKWFFTNYH
YLVPEITENT KFELSSTKPF DEFVEALSIG IKTKPVIIGA LTFLKLSKKS NVDIYDKSFW
EKLLDVYIQI LRRFEELGCE FVQIDEPILV TDLSTKDVEL FENFYRNLLS HKGKLKVLLQ
TYFGDVRDCF EKIISLDFDA IGLDFIDGKF NLELIKKFGF PREKLLFAGV VNGRNVFKNN
YKDTLELLNT LSSFVDKKNI VISTSCSLLF VPYSLKFEIQ LDSNKKKFLA FAEEKLKELS
ELKLLFSQGN FTANSIYLQN AQLFEELNKN KLSDVSTAVN NLTDDDFERK PCFEERIKLQ
KEVLKLPQLP TTTIGSFPQT TDVRSARSKL KKGEITFEEY ESFIKSKIER VIKLQEEIGL
DVLVHGEYER NDMVEFFGEN LEGFLITQNG WVQSYGTRCV KPPIIFSDIK RKRPITVEYI
KYAQSLTKKP VKGILTGPVT ILNWSFVRED IPLKNVAFQL ALAIKEEVLE LEKEGVKIIQ
IDEAALIEKL PLRRCQHSEY LSWAIKAFRL SCSKVKPETQ IHTHMCYSNF DELLDEIAKM
DVDVITFEAA KSDFTLLDSI NKSQLKAEVG PGVFDVHSPR IVSKEEMKKL ILKMIEKVGK
DRLWINPDCG LKTRKEEEIL PTLQNMVLAA WEVRNNL
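
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the NCBI amino-acid string for translation table 11, which assigns the same amino acids as the standard table and differs only in its permitted start codons. The function name is illustrative, and alternative start codons are not handled because this gene begins with ATG.

```python
from itertools import product

# NCBI translation table 11 amino acids, in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a cleaned, upper-case DNA sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record, spaces removed.
print(translate("ATGATTTCGGTTGTTGGTTTTCCAAGAATA"))  # prints: MISVVGFPRI
```

The result matches the first ten residues of the protein sequence above. The final codons of the gene, `AACTTGTAA`, translate to `NL` followed by a stop, which agrees with the last two residues of the protein.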