Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1646 |
Symbol | |
ID | 3830934 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1681626 |
End bp | 1682831 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637829571 |
Product | peptidase |
Protein accession | YP_430491 |
Protein GI | 83590482 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases |
TIGRFAM ID | [TIGR03320] M20/DapE family protein YgeY [TIGR03526] putative selenium metabolism hydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.017723 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGATG CTGTCATTGC CCAGATTAAG TCCGAGGTGG CCAAATATCG GGGAGACATA ATCCGGTTCC TGAAGGACAT AGTCGCCATC CCCAGCCCCA ACGGGGATAT CAAGGCCGTA GCCGAGCGCA TCGGCCAGGA GATGAGAAAA CTGGGTTTCG ACGACGTCTT CCTGGACAGC ATGGGCAATA TCGTCGGCCG CATCGGCAGC GGCCCCAGGG TACTCCTGTA TGACAGCCAT ATTGACACCG TGGATATCGC CGATTCCGAT CAGTGGCAGT GGGACCCCTA CAAGGGCAAG GAAGAGAATG GTATTTTCTA CGGTCTGGGG GCCGGGGATG AGAAGAATTC CACTCCGGGG ATGGTTTACG GACTGAAGAT CATCAAGGAC CTGGGCCTGG CCGATGACTT TACCCTTTAC TATTTCGGTA ATATCGAGGA GATCTGCGAC GGAGTGGCGC CCAACTCCCT GGTGGTCACC GATAAGATCA AACCCGACTT TGTTGTTATC GGTGAGCCTA CCAAAATGAA CATCTACCGG GGTCACCGCG GCCGGGTGGA GATGAAGGTT ACCACCAAAG GCCGGACTTG CCACGCCAGC GCCCCGGAGC GCGGGGTCAA CGCCGTTTAC AAAATGGCGG AAATTATCAA GGGCATTAGC CAGATGGGCG CCGACTTCGT TGAGGACCCC TTCCTGGGAA AGGGGTCTAT AGCCGTCACC GACATCCACT GCAAAACGCC CTCCATCAAT GCCTTACCCG ACGAGTGCGT GATTTACATT GACCGCCGCC TGACCTTCGG TGAGACCCAG GAGATGGCCG TCGAGCAGGT GCGTAAAGTA GCCGAGCCCC ACGGCGGCAA GGTCGAGGTG CTGGAGTTTG ACGAGCCCAG CTATACCGGC TTTGTCTTCA AAGTCGACAA ATACTTCCCG GCCTGGGTCC TGCCTGAGGA TCACCTCCTG GTCAAGGCCG GCCTGGAAAC CTATCAACGG GTTTTCGGCC AGCCCACCGG GGTGGGGAAA TGGGTCTTCA GCACCAACGG TATTTACTGG ATGGGTAAAG CCGGCATTCC CGCCATCGGC TTTGGCCCCG GCGACGAGGT CTACGCCCAC AGCGTCCTCG ACCAGGTGCC CATCGAAGAC GTCGTCCGTT CCACCGAGTT CTACGCCTAC TTCCCCACGG TTTTAAGGGA AATGCTGGCC AGATAA
|
Protein sequence | MSDAVIAQIK SEVAKYRGDI IRFLKDIVAI PSPNGDIKAV AERIGQEMRK LGFDDVFLDS MGNIVGRIGS GPRVLLYDSH IDTVDIADSD QWQWDPYKGK EENGIFYGLG AGDEKNSTPG MVYGLKIIKD LGLADDFTLY YFGNIEEICD GVAPNSLVVT DKIKPDFVVI GEPTKMNIYR GHRGRVEMKV TTKGRTCHAS APERGVNAVY KMAEIIKGIS QMGADFVEDP FLGKGSIAVT DIHCKTPSIN ALPDECVIYI DRRLTFGETQ EMAVEQVRKV AEPHGGKVEV LEFDEPSYTG FVFKVDKYFP AWVLPEDHLL VKAGLETYQR VFGQPTGVGK WVFSTNGIYW MGKAGIPAIG FGPGDEVYAH SVLDQVPIED VVRSTEFYAY FPTVLREMLA R
|
| |