Gene Tmz1t_4066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_4066 
SymbolmetX 
ID7873293 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4465245 
End bp4466381 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content67% 
IMG OID643700997 
Producthomoserine O-acetyltransferase 
Protein accessionYP_002891020 
Protein GI237654706 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.685956 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAAAA TGATCGCACC CCAATCGGTC GGCGTGGTCG TGCCGCAGCG TGCCGAGTTC 
ACCACGCCGC TCGCGCTGCG CAGCGGCGGC ACGCTGAACA ACTACCACCT CGTCTACGAA
ACCTACGGCA CGCTCAATGC GGACCGCAGC AACGCGGTGC TGGTGTGCCA TGCGCTGTCG
GGCTCGCACC ACGTCGCCGG CACCTACGCC GACGCGCCGC ACAACGTCGG CTGGTGGGAC
AACCTGATCG GGCCGGGCAA GCCGCTCGAC ACGCGCAAGT TCTTCGTGAT CGGGGTGAAC
AATCTCGGCG GCTGCTACGG CTCCTCTGGC CCCAACCAGA TCAACCCCGC CACCGGCAAG
CTGTGGGGGG CGGACTTCCC CTTCGTCACC GTGGAAGACT GGGTCGAATC GCAGGCGCGC
CTGGCCGACC GGCTCGGCAT CGAGCGCTTC GCCGCGGTGG TCGGCGGCTC GCTCGGCGGC
ATGCAGGCGA TGTCGTGGGC GCTGCAGTAC CCGGACCGCG TCGGCCACGT CGCGGTGATC
GCCGCCGCGC CCAAGCTCAC CGCGCAGAAC ATCGCCTTCA ACGAGGTCGC CCGCCAGGCC
ATCCTGAGCG ACCCCGAGTT CCACGGCGGC CACTACTATG CGCACGGCGT GGTGCCGACG
CGTGGCCTCA AGCTGGCGCG CATGGTGGGT CACATCACCT ACCTGTCCGA CGACTCGATG
GCGGAAAAAT TCGGCCGCAG CCTGCGCCAC GGCCGCAACA CCTACAGCTA CGACGTCGAA
TTCGAGATCG AGTCCTACCT GCGCTACCAG GGCGACAAGT TCGCCGGCTA CTTCGACGCC
AACACCTACC TGCTGACCAC CAAGGCGCTC GACTACTTCG ACCCCGCCTT CGAGTATGGC
GGCCATCTAC CCGCGGCGCT TGCCCGCGCC AGCGCCGACT TTCTGGTGAT TTCCTTCACC
ACCGACTGGC GCTTCTCGCC CGAGCGTTCG CGCGAGATCG TCTACGCGCT GCTGCACAAC
AAGCGCAACG TCAGCTACGC CGAGATCGAC TGCCCGGCCG GCCACGACTC CTTCCTGCTC
GACGAGACGC GCTACCACAA GCTGCTGTCG GCATGGTTCG ACCGCATCGA GGTTTAA
 
Protein sequence
MTKMIAPQSV GVVVPQRAEF TTPLALRSGG TLNNYHLVYE TYGTLNADRS NAVLVCHALS 
GSHHVAGTYA DAPHNVGWWD NLIGPGKPLD TRKFFVIGVN NLGGCYGSSG PNQINPATGK
LWGADFPFVT VEDWVESQAR LADRLGIERF AAVVGGSLGG MQAMSWALQY PDRVGHVAVI
AAAPKLTAQN IAFNEVARQA ILSDPEFHGG HYYAHGVVPT RGLKLARMVG HITYLSDDSM
AEKFGRSLRH GRNTYSYDVE FEIESYLRYQ GDKFAGYFDA NTYLLTTKAL DYFDPAFEYG
GHLPAALARA SADFLVISFT TDWRFSPERS REIVYALLHN KRNVSYAEID CPAGHDSFLL
DETRYHKLLS AWFDRIEV