Gene Moth_0246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0246 
Symbol 
ID3833209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp249030 
End bp250163 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content62% 
IMG OID637828182 
Productpeptidase M23B 
Protein accessionYP_429124 
Protein GI83589115 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones56 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGTCGCA AAGCCACAGC GCTGCTCCTG ATGGCTGCCC TTGCCCTGGG CACCGTGCCG 
GCCCACGGCG CCAGTGTAGA TGACCTGCAG CGACAGCAGC AGCAACTGCA GCAGAACATC
CAGGAACAGC AGAAACTCCT GCAGCAGAAA AACGACGAAG GAGAGGCCCT GCTGCAGCAA
CTACAGCAGA TCGAGGAGGA TATCCGGCAG AAACAGGCCC AGATAGCCAG CCTGGACCAG
CAGCTGGCAG CGGCCCAGGG ACGGGTCCAG CAGGTCGCTG CCGAACTGCA GAAGGCTGAA
GCCGCCCAGG AGACGCGAAT GAGCATTCTC AGGTCCAGGC TCAAGGACAT CTACCAGGTG
GGGCGGGTAA ACTACCTGGA GGTGCTCTTG CAGTCCACCA GCCTGGAGGA TTTCCTGGTG
CGCCTGGAAC TCCTGACCAA GATAGCCCGG GGCGACATCA ACCTGATCGA CGAGATCAAG
GCGGAAAAGG CGAAGATCGC CGCCCAGAAG GCCGAGCTGG AGGCCGAGCG GGATCACATC
GCCCAGCTCC GGCGCCAGGC AGACAACGAG AGGGTGCAGC TCGCTTCCCG GCAGGAGAAC
CAGCGCCAGC TCCTGGCCCA GGTGGAGCAG GAGAAAAAAC GGGTGGCCGC GGCCCTGGAC
GAGATGGAAG CCACGGCCCG GCAGATAGCC GCCAAGATCC GGGCCGAGCA GGCTAAAAGC
AACCGCAAGC TTTCGCCCAG TGGGACGAAG GGCATGCTCT GGCCGCTGCC GGGGTACACC
CAGATCTCCT CACCCTTCGG GTGGCGCATC CATCCCCTTC TGAAAACCAA CCGCTTCCAC
GACGGCGTCG ACCTGCCGGC ACCTGCGGGA ACAGAGATAA TTGCTCCTCT GGATGGGCAG
GTTATTTCCA CCGGCTATCT GGGGGGATAC GGCAACCATA TCGTCATCGA CCACGGCGGC
GGGCTTTCCA CCATGTACGC TCACCTGTCG GCCATCCTGG TCCAGAATGG CCAGGAGGTT
AAAAAGGGCC AGGTGATCGG CCGCGTGGGA TCTACGGGTT GGAGTACGGG CCCGCACCTG
CACTTCATGG TCCTGCTTCA GGGCGAGCCA ACTAATCCCA TGAATTATTA CTAA
 
Protein sequence
MRRKATALLL MAALALGTVP AHGASVDDLQ RQQQQLQQNI QEQQKLLQQK NDEGEALLQQ 
LQQIEEDIRQ KQAQIASLDQ QLAAAQGRVQ QVAAELQKAE AAQETRMSIL RSRLKDIYQV
GRVNYLEVLL QSTSLEDFLV RLELLTKIAR GDINLIDEIK AEKAKIAAQK AELEAERDHI
AQLRRQADNE RVQLASRQEN QRQLLAQVEQ EKKRVAAALD EMEATARQIA AKIRAEQAKS
NRKLSPSGTK GMLWPLPGYT QISSPFGWRI HPLLKTNRFH DGVDLPAPAG TEIIAPLDGQ
VISTGYLGGY GNHIVIDHGG GLSTMYAHLS AILVQNGQEV KKGQVIGRVG STGWSTGPHL
HFMVLLQGEP TNPMNYY