Gene Mkms_1224 details


Gene Information

Locus tag: Mkms_1224
Symbol: metX
ID: 4614420
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 1309131
End bp: 1310252
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 72%
IMG OID: 639790899
Product: homoserine O-acetyltransferase
Protein accession: YP_937226
Protein GI: 119867274
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2021] Homoserine acetyltransferase
TIGRFAM ID: [TIGR01392] homoserine O-acetyltransferase
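
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1309131
end_bp = 1310252
gene_length_bp = 1122
protein_length_aa = 373

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS length counts the terminal stop codon,
# which does not contribute an amino acid.
codons = span // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)
print(expected_protein_length == protein_length_aa)
```

Under those assumptions both comparisons hold for this record: the span is 1122 bp, which is 374 codons, or 373 residues plus one stop codon.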


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAAGAGCC CCGCCGTGCC CGCCCTCGAC CTGCCCGCCG AGGGTGAGAC CGGCGTGGTC 
GACATCGGCC CGCTGACCCT GGAGAGCGGC GCGGTCATCG ACGACGTGTC GATCGCCGTC
CAGCGCTGGG GTGAGCTCTC CCCCAACCGC GACAACGTCG TGATGGTGCT GCATGCGCTC
ACCGGTGACT CGCACGTCAC CGGACCGGCC GGCCCCGACC ATCCCACCCC GGGCTGGTGG
GACGGCGTCG CCGGGCCGGG AGCCCCGATC GACACCGACC GCTGGTGCGC GGTGTCGACG
AACGTCCTCG GCGGCTGCCG TGGGTCGACC GGCCCGTCGT CGATCGCTCC CGACGGCCGG
CCGTACGGTT CGCGGTTCCC CGCGGTGACG ATCCGCGACC AGGTCACCGC GGACCTCGCC
GCGCTCGAGG CGCTGGGCAT CACCGAGGTC GCCGCGGTGG TGGGCGGATC CATGGGCGGC
GCGCGTGCGC TGGAGTGGAT CGTCGGCCAT CCGGCCACCG TGCGTTCGGC GCTGATCCTC
GCCGTCGGCG CCCGCGCCAC CGCCGACCAG ATCGGCACGC AGAGCACCCA GGTCGCCGCG
ATCAAGGCCG ATCCCGACTG GTGCGGCGGC GACTACCACG ACACCGGTCG CGTGCCGTCC
ACCGGTCTGG CGATCGCCCG CCGCTTCGCC CACCTGACCT ACCGCGGTGA AGTCGAACTC
GACGACCGGT TCGGCAACCA CGCCCAGGGT GACGAGAGCC CGACCGACGG CGGCCGGTAC
GCGGTGCAGA GTTATCTGGA GTACCAGGGC GCCAAGCTGG TCGAGCGGTT CGACGCAGGC
ACCTACGTCA CGCTGACCGA CGCGTTGTCG AGCCACGACG TGGGTCGCGG CCGCGGAGGC
GTGCGCGCTG CGCTGCAGGG TTGCCGGGTG CCCACGATCG TCGGCGGCGT CACCTCCGAC
CGGCTCTACC CCCTGCGGCT GCAGCAGGAG TTGGCCGAAC TGCTGCCCGG CTGTACCGGT
CTGGACGTGG TCGATTCGGT CTACGGCCAC GACGGGTTCC TGGTCGAGAC GGAGGCCGTC
GGCAAGCTCA TCCGGCGCAC ACTGGAGTTG GCGGAGCGGT GA
 
Protein sequence
MKSPAVPALD LPAEGETGVV DIGPLTLESG AVIDDVSIAV QRWGELSPNR DNVVMVLHAL 
TGDSHVTGPA GPDHPTPGWW DGVAGPGAPI DTDRWCAVST NVLGGCRGST GPSSIAPDGR
PYGSRFPAVT IRDQVTADLA ALEALGITEV AAVVGGSMGG ARALEWIVGH PATVRSALIL
AVGARATADQ IGTQSTQVAA IKADPDWCGG DYHDTGRVPS TGLAIARRFA HLTYRGEVEL
DDRFGNHAQG DESPTDGGRY AVQSYLEYQG AKLVERFDAG TYVTLTDALS SHDVGRGRGG
VRAALQGCRV PTIVGGVTSD RLYPLRLQQE LAELLPGCTG LDVVDSVYGH DGFLVETEAV
GKLIRRTLEL AER
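
The gene sequence begins with TTG while the protein begins with M. That is consistent with translation table 11, where TTG can serve as an initiation codon and is then read as methionine. The sketch below translates the first 30 bases of the gene sequence to illustrate this. The helper name `translate_cds` and the `START_CODONS` set are hypothetical, and the set lists only three common bacterial initiators rather than every start codon the table permits.

```python
# Amino acid assignments in TCAG order; table 11 uses the same
# assignments as the standard code and differs only in its start codons.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a partial list of initiators, enough for this example.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading an initiator codon as M."""
    dna = dna.replace(" ", "").upper()
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence shown above.
first_30 = "TTGAAGAGCC CCGCCGTGCC CGCCCTCGAC"
print(translate_cds(first_30))  # prints MKSPAVPALD
```

The result matches the first ten residues of the protein sequence listed in the record.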