Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1962 |
Symbol | |
ID | 3831144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2041446 |
End bp | 2042636 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637829893 |
Product | peptidase M20D, amidohydrolase |
Protein accession | YP_430803 |
Protein GI | 83590794 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.904473 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAAGCG CCGGCGATTT GCGTGCCGCC GCAGAGGCCC TCAAACCCCA GCTGGTGGCC TGGCGGCGGC GGCTGCACCA GTATCCGGAA CTGAGCTTTG AGGAAAGGGA AACGTCGGCG ATGGTGGCCG GGGTGTTACG CGAACTGGGG CTGCAGGTCA GGAGCGGCAT CGGCGGTACC GGGGTGGTGG GTGTCCTGGC CGGGGCCGGG GAAGGCCCCG GCGTCGCCCT GCGGGCCGAT ATGGACGCCC TGCCCCTCCA AGAAGATACC GGCGAGGAGT TCGCCTCCCG CTATCCTGGC CGGATGCACG GTTGCGGCCA CGATGCCCAT ATGACAATGG TCCTGGGGGC GGCGACCATT CTGGCCGAAC GGCGCCAGGA ACTCCCCGGG CCGGTAGTTT TCATCTTCCA ACCCGGAGAA GAACTGCCCC CGGGCGGCGC CAGCCGGATG CTGGCAGCCG GCGTCCTGGA CGACCCCCCG GTCAAGGCCG CCTTCGGCCT CCACGTCACC GCTTACCTGC CGGTGGGTAC GGTAGGTGTT CGCAGCGGGG CCATCATGGC TTCGGCTGAC AACTTTACGA TTAAGATCAA GGGGCGTACC AGCCACGGAG CCTCACCCCA CCTGGGGGCT GATGCCATCG TCGCCGCCGC CCAGGCCGTT CTGGCTTTAC AAACCATTAT TTCCCGGCAC CTGGACCCGG TGCAACCGGC GGTCCTGACC GTGGGGACCA TAAAAGGCGG GGAGAAGGAG AACATCGTTG CCGGTGAAGT AACCTTGACG GGTACCACCC GGGCCTTGAA TAACGTTATG CGGCAGCAAC TGGAAAAGGA CATGCGCCAG GTCCTGGCCG GGGTGGCGGC CGCCAGCGGT ACCGAGATTG ACCTCGATTA CCTGTGGGGT TACCCGCCTT TAGTCAACAA TGCCGGCCTG ACTGAACTTT TTCGCCGAGT TGCCGGGGAA ATCCTGGGGC CGGATAAAGT CCTGGAGCTG GCCAACCCAT CCATGGGGGC CGAGGATTTC GCCCGCTATG CGGAAAAAGT ACCGGCAGTA TACTTTAACC TGGGAGCGGC TATCCCCGGT GCGGAACCCC ACCCCTGGCA CCACCCGCGC TTTAACATTA ACGAGGATTG CCTGCCCATC GGCGCCGGGT TGCTGGCGGC GCTGGCTGTT CGGACCCTGG AGGATTTTTA G
|
Protein sequence | MVSAGDLRAA AEALKPQLVA WRRRLHQYPE LSFEERETSA MVAGVLRELG LQVRSGIGGT GVVGVLAGAG EGPGVALRAD MDALPLQEDT GEEFASRYPG RMHGCGHDAH MTMVLGAATI LAERRQELPG PVVFIFQPGE ELPPGGASRM LAAGVLDDPP VKAAFGLHVT AYLPVGTVGV RSGAIMASAD NFTIKIKGRT SHGASPHLGA DAIVAAAQAV LALQTIISRH LDPVQPAVLT VGTIKGGEKE NIVAGEVTLT GTTRALNNVM RQQLEKDMRQ VLAGVAAASG TEIDLDYLWG YPPLVNNAGL TELFRRVAGE ILGPDKVLEL ANPSMGAEDF ARYAEKVPAV YFNLGAAIPG AEPHPWHHPR FNINEDCLPI GAGLLAALAV RTLEDF
|
| |