Gene Moth_2292 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2292 
Symbol 
ID3831324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2405075 
End bp2406262 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content62% 
IMG OID637830212 
Productpeptidase M20D, amidohydrolase 
Protein accessionYP_431122 
Protein GI83591113 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones54 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.972059 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACGGCG TAAAAGAAAG GATTAGCCGG GCTATAGAGG AGATCAAGGA CCAGATTATC 
CAGGTGGCGG AAGCCATCTT TGATCACCCG GAGACGGGGA ACGAGGAGTA TTTTGCCGCC
GATCTCCTGA CCGGGATCCT GGCGGAAAAA GGTTTTAAAA TTACCAGGCC CCTCTGCCAG
TTACCGACGG CTTTTCGGGC CGAACTGGCC ACCGGGACCC CCGGGCCCCG GATAGGCCTG
CTGGCCGAGT ATGATGCCCT GCCGGAGCTG GGCCACGCCT GCGGCCATAA CCTCATCGCA
GCCGGCAGCC TGGGCGCCGC TTTAGGCCTG GCGGCCGCTG TCCGGGAATT GCGCGGGACT
ATTGTGGTCC TGGGAACCCC GGCCGAGGAG AGCCACGGCG CCAAGGTATT ACTTGCCAGG
GAGGGGGCAT TGGACGACCT GGATGTGGCC ATGATGTTTC ACCCCGGGGA CACCAATGCC
GTCGAGGTGA CTTCCCAAGC CCTGGAGGCC CTGGAGTTTA TTTTTGAAGG CCGGGCCGCC
CATGCCGCCT CCAGCCCGGA GGAAGGCATT AACGCTCTAG AGGCAGTTAT TCAGCTGTTT
AATAATATTC ATGCTTTGCG GCCTTATTTA AAGGACGAGG CCAGCATCCA CGGCATAATT
ACCGAGGGCG GGGTCTCGCC CAACATCATT CCGGAACGGG CGGTGGCCCG TTTTTACCTC
CGGGCCAGTA CCAGGGAAGC CCTGAACCGG GTGGCCCGGC GGGTGGAGGA CTGTGCCACA
GCGGCGGCCC TGGCCACCGG TACCCGTTTC TGGTACCACA ACTACGAACC CTCCTACGAA
GCCATGCTTG TCAACCGAAC CCTGGCAGGC GCCTGGCAGA GAAATTTGCA GGAACTGGGC
GTCACCGACC TGGCCCCGGC CTGCCGCAGC CGGGGTTCCC TGGATATGGG CAATGTCAGC
CGGGTAGTAC CGGCCATCCA CCCCTACCTG TCCCTCAATG CGGGGAAACT GGTACCCCAT
ACCCGAGAGT TTGCCCGGGC CGTCAGGGGT GAGGCCGGCC GGCGCCTGGT CATCCTGGCC
GCCAGGGCCC TGGCCTGGAC AGCGGTGGAT GTCATGCTGG ACCGGGAACT GCTGGCGCAG
ATCAAAGACG AATTCGCCGG CTGGCAGGCG GGTAACGTAG CTACTTGA
 
Protein sequence
MYGVKERISR AIEEIKDQII QVAEAIFDHP ETGNEEYFAA DLLTGILAEK GFKITRPLCQ 
LPTAFRAELA TGTPGPRIGL LAEYDALPEL GHACGHNLIA AGSLGAALGL AAAVRELRGT
IVVLGTPAEE SHGAKVLLAR EGALDDLDVA MMFHPGDTNA VEVTSQALEA LEFIFEGRAA
HAASSPEEGI NALEAVIQLF NNIHALRPYL KDEASIHGII TEGGVSPNII PERAVARFYL
RASTREALNR VARRVEDCAT AAALATGTRF WYHNYEPSYE AMLVNRTLAG AWQRNLQELG
VTDLAPACRS RGSLDMGNVS RVVPAIHPYL SLNAGKLVPH TREFARAVRG EAGRRLVILA
ARALAWTAVD VMLDRELLAQ IKDEFAGWQA GNVAT