Gene Moth_2422 details


Gene Information

Locus tag: Moth_2422
Symbol:
ID: 3832173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2544351
End bp: 2545778
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 55%
IMG OID: 637830341
Product: peptidase M23B
Protein accession: YP_431247
Protein GI: 83591238
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0739] Membrane proteins related to metalloendopeptidases
TIGRFAM ID:
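
The coordinate and length fields in this record are consistent with one another, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2544351, 2545778)
    print(length, protein_length(length))  # prints "1428 475"
```

Both values match the Gene Length and Protein Length fields listed for Moth_2422.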


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000638116
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGCCGC CAGATAAGCT ACCGCAGCTA GCGGCCAGCG TAGGGCGCTT CCTAAAAAAG 
CTGCCCTTAC GGTGGGCAGG AATAAGGGAT AATAAAGGAA AAAGAGTATT AGTAATAACT
GGTATTTTAG CCGGCGGCCT GCTCTTAGCA GCCTGGCACC AGCTTACAAC CCCGAATGCC
CTGGCTGTTT TCATCAACGG CCAGCAGGTG GCAATAGTAG CCACCCGGGA ACAGTTCAAC
CGGGCCTTAC AGGAGATCCT AAAAGAACGA GGGGCTGGCA GTTACCAGGG TGTGCGCTAT
ACCGATCAGG TAGATTTTAA AGCAGTCCGG GCCAATCCGC AGGAAGTTAT AGCGGAGGAA
CAATTAAAAA GCCTCCTGGC CGAAAGATTG CACCTGGTGG CGGCGGCAAC GGTCATTACC
ATTGACGGGC AGCCGCGGCT GGTTTTGAAA GACGATGCTA CCGCTGACGC TGTTCTGGCC
GCGTTTAAAC AGGCCTTTGA ACCACCGGCG GCTATAGGGC AGGTCCAGGA GGTCAAATTT
TTGGAACAAG TGGCTTACGA GCACCGCCAG GCCAGTCCGG AAGAGATTTT AAGCCCGGAA
GCGGCCCTTG CCAAGTTGAA GGGAACAGCA GCCAGCAGTA GCACCTATAC TGTTAAAGAA
GGGGATTCCC TCTGGTCCAT TGCCAGGGAA CATAACCTGC TGGTTGACGA CATTAAAAGG
GCCAACCCCG AGATCCAGGG AGAACGCCTG GATATCGGGC AGCAATTGAA ACTGACAACA
GTACAACCAT TGCTTCAGGT GATGGTTGTT TATAATCAGG ATATCAAGGA ACCGGTACCC
TTTGAAACCC GGGTAGAAAA AACGGCTGAC CTTTTACGAG GCCAGGAAAA GATAATTCAG
GAGGGCACAG AGGGCGAACA ACTGGTTACT TACCAAGTAG TGACCAGAAA TGGTGTGGTC
GTAGACAAAA AGATACTTCG GCAGCAGGTC CTTACGGAAC CGGTTGCCCG GGTGGTACAG
CAGGGTACGG GGGTAACAGC CCTGGCTTCC CGGAGCGGCG GTTCCGGGCT CCTGGCCTGG
CCCATCCGCG GGTATATCAC TTCCCCCTAT GGCTACCGGG GTAGCGAGTT TCACTCAGGC
CTGGATATTG CGGGCAGTAT CGGCGAACCG GTGGGAGCGG CTGCAGGCGG AGTAGTTGTT
AGCACCGGTT ACGACGGTGG CTACGGTCGG ATGGTAGTAA TCGACCATGG TGGCCTGGTT
ACCAGGTATG CCCATTTATC TGGTTACAAT GTAAGGCCGG GCCAGCGGGT ATCTCAAGGC
CAGATCATCG GTTATGTAGG GGTTAGCGGC CGTACCACAG GCCCCCACCT GCACTTTGAG
GTCCTGGTCG GAGGCAGCTT CCGCAACCCG GCCAGTTATC TGAGGTGA
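
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch of one way to do that and to recompute a GC fraction for comparison with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sketch assumes the pasted text contains only unambiguous bases and whitespace.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) DNA sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # Only the first line of the record is used here as a short demonstration;
    # paste the full gene sequence to compare against the GC content field.
    first_line = "ATGCCGCCGC CAGATAAGCT ACCGCAGCTA GCGGCCAGCG TAGGGCGCTT CCTAAAAAAG"
    print(len(clean_sequence(first_line)), round(gc_fraction(first_line), 3))
```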
 
Protein sequence
MPPPDKLPQL AASVGRFLKK LPLRWAGIRD NKGKRVLVIT GILAGGLLLA AWHQLTTPNA 
LAVFINGQQV AIVATREQFN RALQEILKER GAGSYQGVRY TDQVDFKAVR ANPQEVIAEE
QLKSLLAERL HLVAAATVIT IDGQPRLVLK DDATADAVLA AFKQAFEPPA AIGQVQEVKF
LEQVAYEHRQ ASPEEILSPE AALAKLKGTA ASSSTYTVKE GDSLWSIARE HNLLVDDIKR
ANPEIQGERL DIGQQLKLTT VQPLLQVMVV YNQDIKEPVP FETRVEKTAD LLRGQEKIIQ
EGTEGEQLVT YQVVTRNGVV VDKKILRQQV LTEPVARVVQ QGTGVTALAS RSGGSGLLAW
PIRGYITSPY GYRGSEFHSG LDIAGSIGEP VGAAAGGVVV STGYDGGYGR MVVIDHGGLV
TRYAHLSGYN VRPGQRVSQG QIIGYVGVSG RTTGPHLHFE VLVGGSFRNP ASYLR