Gene Mmcs_1779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_1779 
Symbol 
ID4110613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp1918611 
End bp1919756 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content67% 
IMG OID638030899 
Productthiolase 
Protein accessionYP_638944 
Protein GI108798747 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.187266 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGATG TCGCGATCAT CGGTGTGGGG CTGCATCCGT TCGGCCGGTT CGAGGGTAAG 
TCGGCGATGC AGATGGGCGT CGACGCGATC TTCGCCGCGG TCGACGACGC CGGCGTCGCG
TGGTCGGATG TGCAGTTCGC CACCGGCGGC AGTTGGACGG TGGCCAACCC CGACGCCATC
GTCGGCATGG TCGGGCTCTC GGGTATTCCG TTCACCAATG TCTTCAACGC CTGCGCCACC
GCAGCGAGTG CGCTGAAGGC CTGCGCCGAC GGAATCCGAT TGGGCGACTA CGACATCGGC
ATCGCGATCG GCCTGGACAA GCATCCGCGC GGCGCCTTCA CCGAGGATCC CGCACTGGTG
GGCATGCCGT CCTGGTACGC GGAGAACGGC CAGTACCTGA CCACGCAGTT CTTCGGTATG
AAGGCCAATC GCTATCTGCA CGATCACCAG ATCTCCCACG CCACGCTCGC CAAGGTGGCC
GCCAAGAACT TCCGCAACGG GGCGCTCAAC CCGAATGCGT TCCGGCGCAA GCCGATGACC
GAGGAGCAGA TCCTCGACTC GACGATGCTG AACTATCCGC TCACGCAGTA CATGTTCTGC
GCGCCCGACG AAGGGGCCGC CGCGGTGGTG ATGTGCCGCG CCGACCTGGC CCACCGCTAC
ACCTCGAAAC CGGTGTACCT GCGCGCGGTG GAGGTCCGCA CCCGGCAGTA CGGCGCGTAC
GAGGTCAATA CCACGTTCGC GCCCGTCGAC GAGGACGTCG CGCCGACGGT GTACGCGGCC
AGGTCGGCGT TCGAGAAGGC CGGCATCGCG CCGACCGACG TCGACGTCGT CCAGTTGCAG
GACACCGACG CCGGCGCGGA GATCATCCAC ATGGCCGAAT GCGGATTCTG CGCCGACGGC
GATCAGGAGA AGCTGCTGGC CGACGGCGCC ACCGAGATCG GCGGCCCACT GCCGATCAAC
ACCGACGGTG GCCTGATCGC CAACGGCGAG CCGATCGGCG CATCGGGCCT GCGCCAGATC
CACGAGCTGG TCCGGCAATT GCGGGGCGAG GCCGGAGACC GACAGGTACC CGGTGAGCCA
CGGGTCGGGT TCGGGCAGCT CTACGGTGCG CCCGGTACCG CCGCGGCCAT GATCGTGTCC
ACCTGA
 
Protein sequence
MNDVAIIGVG LHPFGRFEGK SAMQMGVDAI FAAVDDAGVA WSDVQFATGG SWTVANPDAI 
VGMVGLSGIP FTNVFNACAT AASALKACAD GIRLGDYDIG IAIGLDKHPR GAFTEDPALV
GMPSWYAENG QYLTTQFFGM KANRYLHDHQ ISHATLAKVA AKNFRNGALN PNAFRRKPMT
EEQILDSTML NYPLTQYMFC APDEGAAAVV MCRADLAHRY TSKPVYLRAV EVRTRQYGAY
EVNTTFAPVD EDVAPTVYAA RSAFEKAGIA PTDVDVVQLQ DTDAGAEIIH MAECGFCADG
DQEKLLADGA TEIGGPLPIN TDGGLIANGE PIGASGLRQI HELVRQLRGE AGDRQVPGEP
RVGFGQLYGA PGTAAAMIVS T