Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2292 |
Symbol | |
ID | 3831324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2405075 |
End bp | 2406262 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637830212 |
Product | peptidase M20D, amidohydrolase |
Protein accession | YP_431122 |
Protein GI | 83591113 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 54 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.972059 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACGGCG TAAAAGAAAG GATTAGCCGG GCTATAGAGG AGATCAAGGA CCAGATTATC CAGGTGGCGG AAGCCATCTT TGATCACCCG GAGACGGGGA ACGAGGAGTA TTTTGCCGCC GATCTCCTGA CCGGGATCCT GGCGGAAAAA GGTTTTAAAA TTACCAGGCC CCTCTGCCAG TTACCGACGG CTTTTCGGGC CGAACTGGCC ACCGGGACCC CCGGGCCCCG GATAGGCCTG CTGGCCGAGT ATGATGCCCT GCCGGAGCTG GGCCACGCCT GCGGCCATAA CCTCATCGCA GCCGGCAGCC TGGGCGCCGC TTTAGGCCTG GCGGCCGCTG TCCGGGAATT GCGCGGGACT ATTGTGGTCC TGGGAACCCC GGCCGAGGAG AGCCACGGCG CCAAGGTATT ACTTGCCAGG GAGGGGGCAT TGGACGACCT GGATGTGGCC ATGATGTTTC ACCCCGGGGA CACCAATGCC GTCGAGGTGA CTTCCCAAGC CCTGGAGGCC CTGGAGTTTA TTTTTGAAGG CCGGGCCGCC CATGCCGCCT CCAGCCCGGA GGAAGGCATT AACGCTCTAG AGGCAGTTAT TCAGCTGTTT AATAATATTC ATGCTTTGCG GCCTTATTTA AAGGACGAGG CCAGCATCCA CGGCATAATT ACCGAGGGCG GGGTCTCGCC CAACATCATT CCGGAACGGG CGGTGGCCCG TTTTTACCTC CGGGCCAGTA CCAGGGAAGC CCTGAACCGG GTGGCCCGGC GGGTGGAGGA CTGTGCCACA GCGGCGGCCC TGGCCACCGG TACCCGTTTC TGGTACCACA ACTACGAACC CTCCTACGAA GCCATGCTTG TCAACCGAAC CCTGGCAGGC GCCTGGCAGA GAAATTTGCA GGAACTGGGC GTCACCGACC TGGCCCCGGC CTGCCGCAGC CGGGGTTCCC TGGATATGGG CAATGTCAGC CGGGTAGTAC CGGCCATCCA CCCCTACCTG TCCCTCAATG CGGGGAAACT GGTACCCCAT ACCCGAGAGT TTGCCCGGGC CGTCAGGGGT GAGGCCGGCC GGCGCCTGGT CATCCTGGCC GCCAGGGCCC TGGCCTGGAC AGCGGTGGAT GTCATGCTGG ACCGGGAACT GCTGGCGCAG ATCAAAGACG AATTCGCCGG CTGGCAGGCG GGTAACGTAG CTACTTGA
|
Protein sequence | MYGVKERISR AIEEIKDQII QVAEAIFDHP ETGNEEYFAA DLLTGILAEK GFKITRPLCQ LPTAFRAELA TGTPGPRIGL LAEYDALPEL GHACGHNLIA AGSLGAALGL AAAVRELRGT IVVLGTPAEE SHGAKVLLAR EGALDDLDVA MMFHPGDTNA VEVTSQALEA LEFIFEGRAA HAASSPEEGI NALEAVIQLF NNIHALRPYL KDEASIHGII TEGGVSPNII PERAVARFYL RASTREALNR VARRVEDCAT AAALATGTRF WYHNYEPSYE AMLVNRTLAG AWQRNLQELG VTDLAPACRS RGSLDMGNVS RVVPAIHPYL SLNAGKLVPH TREFARAVRG EAGRRLVILA ARALAWTAVD VMLDRELLAQ IKDEFAGWQA GNVAT
|
| |