Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1308 |
Symbol | |
ID | 3831794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1351668 |
End bp | 1352792 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637829244 |
Product | homoserine O-acetyltransferase |
Protein accession | YP_430164 |
Protein GI | 83590155 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2021] Homoserine acetyltransferase |
TIGRFAM ID | [TIGR01392] homoserine O-acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.00994406 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.346812 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGGAG TCGGTATTGT AACAACCAGG TTTTATGAAT GGTCGCAATG CCTCCACCTG GAAAGCGGGG CCCAGCTGGG CTCCCTTACC ATAGCCTATG AGACCTATGG GGAACTGAAC GCGGCCGGAA ATAACGCCAT CCTGGTCCTC CACGCCCTTA CCGGTAATGC CCATATCGCC GGCCGAAATT TTCCGGACGA GAGGTATCCC GGCTGGTGGG ACCCCCTGGT AGGGCCGGGC CGGGCCCTGG ACACCAGGCG CTATTTCATT GTCTGTGCCA ACGTCCTGGG AAGCTGCTAT GGTACCACCG GGCCGGCCAG CATTAATCCA GCCACCGGCA AGCCCTACGG GATGGATTTT CCGGCCATCA CTATCCGCGA TATGGTACGG GCACAAAAAA TCCTCCTTGA CTATCTGGGG GTCAAGCGCC TGGTGGCGGC CATCGGTGGT TCCATGGGCG GGATGCAGGT CCTGGAGTGG GGTTTTCTTT ATCCTCAGAT GCTGGACGCC ATTATTCCCA TTGCCACCTG CGGCCGGACT ACTCCCATGC AGATTGCCTT TCACCACGTG CAGCGGGAAG CCATTTACGC CGACCCCGAC TGGCAGGGAG GCAATTATTA CGGCACTGCC GGGCCCCGGC GGGGACTGGC CCTGGCCCGG CAGATCGGGA TTATTACTTA TAAAAGCGAC CCCTCCTGGA ACATGAAATT TGGCCGCAAC CTGGTGGACC CCCGGAAATA CTTCCAACTG GAAGGGCAGT TCGAAGTAGA GAGCTACCTG GCCTACCAGG GGAGGAAGCT GGTAGATCGT TTCGACGCCA ACTCTTACCT GTACCTTACC AAAGCAGTAG ACCTCCACGA TGTGAGCCAG GGACGGGGAA GCTATAATGA AGTCTGGCGG GATTTCCCCT GCCCCTGCCT GGGTATAGGC ATATCAAGCG ATTTTCTTTT CCCTCCCTAT CAGGTGCAGG AGATTGTCCG GATGATTAAC GACGGCGGCG GCCATGCCCG TTACGCAGAG ATTGATTCCC CCTATGGCCA CGACGCCTTT TTAATCGAGT TTAACCAGCT GGCAGCCATT ATCCAGCCGT TTCTGAAAGA GTTGCGCCCG GACCTGGCCG CTTGA
|
Protein sequence | MDGVGIVTTR FYEWSQCLHL ESGAQLGSLT IAYETYGELN AAGNNAILVL HALTGNAHIA GRNFPDERYP GWWDPLVGPG RALDTRRYFI VCANVLGSCY GTTGPASINP ATGKPYGMDF PAITIRDMVR AQKILLDYLG VKRLVAAIGG SMGGMQVLEW GFLYPQMLDA IIPIATCGRT TPMQIAFHHV QREAIYADPD WQGGNYYGTA GPRRGLALAR QIGIITYKSD PSWNMKFGRN LVDPRKYFQL EGQFEVESYL AYQGRKLVDR FDANSYLYLT KAVDLHDVSQ GRGSYNEVWR DFPCPCLGIG ISSDFLFPPY QVQEIVRMIN DGGGHARYAE IDSPYGHDAF LIEFNQLAAI IQPFLKELRP DLAA
|
| |