Gene Mmcs_1207 details


Gene Information

Locus tag: Mmcs_1207
Symbol: metX
ID: 4110044
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 1304822
End bp: 1305943
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 72%
IMG OID: 638030328
Product: homoserine O-acetyltransferase
Protein accession: YP_638375
Protein GI: 108798178
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2021] Homoserine acetyltransferase
TIGRFAM ID: [TIGR01392] homoserine O-acetyltransferase
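
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 1304822
END_BP = 1305943
GENE_LENGTH_BP = 1122
PROTEIN_LENGTH_AA = 373


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the record is internally consistent: 1305943 - 1304822 + 1 = 1122 bp, and 1122 / 3 - 1 = 373 aa.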


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAGAGCC CCGCCGTGCC CGCCCTCGAC CTGCCCGCCG AGGGTGAGAC CGGCGTGGTC 
GACATCGGCC CGCTGACCCT GGAGAGCGGC GCGGTCATCG ACGACGTGTC GATCGCCGTC
CAGCGCTGGG GTGAGCTCTC CCCCAACCGC GACAACGTCG TGATGGTGCT GCATGCGCTC
ACCGGTGACT CGCACGTCAC CGGACCGGCC GGCCCCGACC ATCCCACCCC GGGCTGGTGG
GACGGCGTCG CCGGGCCGGG AGCCCCGATC GACACCGACC GCTGGTGCGC GGTGTCGACG
AACGTCCTCG GCGGCTGCCG TGGGTCGACC GGCCCGTCGT CGATCGCTCC CGACGGCCGG
CCGTACGGTT CGCGGTTCCC CGCGGTGACG ATCCGCGACC AGGTCACCGC GGACCTCGCC
GCGCTCGAGG CGCTGGGCAT CACCGAGGTC GCCGCGGTGG TGGGCGGATC CATGGGCGGC
GCGCGTGCGC TGGAGTGGAT CGTCGGCCAT CCGGCCACCG TGCGTTCGGC GCTGATCCTC
GCCGTCGGCG CCCGCGCCAC CGCCGACCAG ATCGGCACGC AGAGCACCCA GGTCGCCGCG
ATCAAGGCCG ATCCCGACTG GTGCGGCGGC GACTACCACG ACACCGGTCG CGTGCCGTCC
ACCGGTCTGG CGATCGCCCG CCGCTTCGCC CACCTGACCT ACCGCGGTGA AGTCGAACTC
GACGACCGGT TCGGCAACCA CGCCCAGGGT GACGAGAGCC CGACCGACGG CGGCCGGTAC
GCGGTGCAGA GTTATCTGGA GTACCAGGGC GCCAAGCTGG TCGAGCGGTT CGACGCAGGC
ACCTACGTCA CGCTGACCGA CGCGTTGTCG AGCCACGACG TGGGTCGCGG CCGCGGAGGC
GTGCGCGCTG CGCTGCAGGG TTGCCGGGTG CCCACGATCG TCGGCGGCGT CACCTCCGAC
CGGCTCTACC CCCTGCGGCT GCAGCAGGAG TTGGCCGAAC TGCTGCCCGG CTGTACCGGT
CTGGACGTGG TCGATTCGGT CTACGGCCAC GACGGGTTCC TGGTCGAGAC GGAGGCCGTC
GGCAAGCTCA TCCGGCGCAC ACTGGAGTTG GCGGAGCGGT GA
 
Protein sequence
MKSPAVPALD LPAEGETGVV DIGPLTLESG AVIDDVSIAV QRWGELSPNR DNVVMVLHAL 
TGDSHVTGPA GPDHPTPGWW DGVAGPGAPI DTDRWCAVST NVLGGCRGST GPSSIAPDGR
PYGSRFPAVT IRDQVTADLA ALEALGITEV AAVVGGSMGG ARALEWIVGH PATVRSALIL
AVGARATADQ IGTQSTQVAA IKADPDWCGG DYHDTGRVPS TGLAIARRFA HLTYRGEVEL
DDRFGNHAQG DESPTDGGRY AVQSYLEYQG AKLVERFDAG TYVTLTDALS SHDVGRGRGG
VRAALQGCRV PTIVGGVTSD RLYPLRLQQE LAELLPGCTG LDVVDSVYGH DGFLVETEAV
GKLIRRTLEL AER
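
The gene sequence begins with TTG while the protein begins with M, which is what one would expect if TTG serves as a start codon under translation table 11. The sketch below illustrates how the two sequences relate and how a GC percentage can be computed. It is not the database's own pipeline: the helper names are hypothetical, the start-codon set is deliberately limited to ATG, GTG and TTG as a simplification, and the amino-acid assignments are those of the standard code, which table 11 shares.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (standard amino-acid assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplifying assumption: only these codons are treated as start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


def translate(seq: str) -> str:
    """Translate an in-frame sequence, stopping at the first stop codon."""
    bases = clean(seq)
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(bases), 3):
        codon = bases[i:i + 3]
        # A start codon in first position is read as methionine.
        aa = "M" if i == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("TTGAAGAGCC CCGCCGTGCC CGCCCTCGAC"))  # MKSPAVPALD
```

The printed result matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field in the record.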