Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_4066 |
Symbol | metX |
ID | 7873293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4465245 |
End bp | 4466381 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643700997 |
Product | homoserine O-acetyltransferase |
Protein accession | YP_002891020 |
Protein GI | 237654706 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2021] Homoserine acetyltransferase |
TIGRFAM ID | [TIGR01392] homoserine O-acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.685956 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAAAA TGATCGCACC CCAATCGGTC GGCGTGGTCG TGCCGCAGCG TGCCGAGTTC ACCACGCCGC TCGCGCTGCG CAGCGGCGGC ACGCTGAACA ACTACCACCT CGTCTACGAA ACCTACGGCA CGCTCAATGC GGACCGCAGC AACGCGGTGC TGGTGTGCCA TGCGCTGTCG GGCTCGCACC ACGTCGCCGG CACCTACGCC GACGCGCCGC ACAACGTCGG CTGGTGGGAC AACCTGATCG GGCCGGGCAA GCCGCTCGAC ACGCGCAAGT TCTTCGTGAT CGGGGTGAAC AATCTCGGCG GCTGCTACGG CTCCTCTGGC CCCAACCAGA TCAACCCCGC CACCGGCAAG CTGTGGGGGG CGGACTTCCC CTTCGTCACC GTGGAAGACT GGGTCGAATC GCAGGCGCGC CTGGCCGACC GGCTCGGCAT CGAGCGCTTC GCCGCGGTGG TCGGCGGCTC GCTCGGCGGC ATGCAGGCGA TGTCGTGGGC GCTGCAGTAC CCGGACCGCG TCGGCCACGT CGCGGTGATC GCCGCCGCGC CCAAGCTCAC CGCGCAGAAC ATCGCCTTCA ACGAGGTCGC CCGCCAGGCC ATCCTGAGCG ACCCCGAGTT CCACGGCGGC CACTACTATG CGCACGGCGT GGTGCCGACG CGTGGCCTCA AGCTGGCGCG CATGGTGGGT CACATCACCT ACCTGTCCGA CGACTCGATG GCGGAAAAAT TCGGCCGCAG CCTGCGCCAC GGCCGCAACA CCTACAGCTA CGACGTCGAA TTCGAGATCG AGTCCTACCT GCGCTACCAG GGCGACAAGT TCGCCGGCTA CTTCGACGCC AACACCTACC TGCTGACCAC CAAGGCGCTC GACTACTTCG ACCCCGCCTT CGAGTATGGC GGCCATCTAC CCGCGGCGCT TGCCCGCGCC AGCGCCGACT TTCTGGTGAT TTCCTTCACC ACCGACTGGC GCTTCTCGCC CGAGCGTTCG CGCGAGATCG TCTACGCGCT GCTGCACAAC AAGCGCAACG TCAGCTACGC CGAGATCGAC TGCCCGGCCG GCCACGACTC CTTCCTGCTC GACGAGACGC GCTACCACAA GCTGCTGTCG GCATGGTTCG ACCGCATCGA GGTTTAA
|
Protein sequence | MTKMIAPQSV GVVVPQRAEF TTPLALRSGG TLNNYHLVYE TYGTLNADRS NAVLVCHALS GSHHVAGTYA DAPHNVGWWD NLIGPGKPLD TRKFFVIGVN NLGGCYGSSG PNQINPATGK LWGADFPFVT VEDWVESQAR LADRLGIERF AAVVGGSLGG MQAMSWALQY PDRVGHVAVI AAAPKLTAQN IAFNEVARQA ILSDPEFHGG HYYAHGVVPT RGLKLARMVG HITYLSDDSM AEKFGRSLRH GRNTYSYDVE FEIESYLRYQ GDKFAGYFDA NTYLLTTKAL DYFDPAFEYG GHLPAALARA SADFLVISFT TDWRFSPERS REIVYALLHN KRNVSYAEID CPAGHDSFLL DETRYHKLLS AWFDRIEV
|
| |