Gene Mkms_3733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3733 
Symbol 
ID4611668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3951953 
End bp3953095 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content67% 
IMG OID639793414 
Productthiolase 
Protein accessionYP_939717 
Protein GI119869765 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.454317 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.374229 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGACG TAGCCATCAT CGGCGTCGGC CTGCATCCCT TCGGCCGGTT CGACAAGACC 
GCGATGGAGA TGGGCGCCGA GGCGATCCAG TTCGCGCTGG CCGACGCGAA ACTGGAGTGG
AAGGACATTC AGTTCGGCTT CGGCGGCAGC TACGAGGTGT CCAACCCCGA TGCCGTCACC
CGGCTGGTCG GCCTCACCGG AATCACGTTC ACCAACGTGT TCAACGCCTG CGCCACGGCG
GCCAGCGCGA TCCAGCAGAC CGCTGACACG ATCCGCCTGG GCAAGTACGA CATCGGCATC
GCCATCGGAC TCGACAAACA CCCCCGCGGT GCGTTCACCG ACGATCCGGC CAAACTCGCC
CTGCCACAGT GGTATGCCGA GAACGGTCAG TTCGTCACCA CGAAGTTCTT CGGCATGAAG
GCCAATCACT ACCTCCACAA GCACGGCATC TCCGAGGAGA CGCTGGCGCG GGTGGCCAAC
AAGAACTTCC GCAACGGCGA ACGGAATCCG AATGCGTTCC GCCGCAAGGA GATCTCGGTC
GACGAGATCA TGGCGTCACC GGTTCTGAAC TACCCGTTGC GGCAGTACAT GTTCTGCGCA
CCCGACGAGG GCGCCGCCGC GGTGATCATG TGCCGCGGTG ACATCGCCCA CCGCTACACC
GACAAGCCGG TATTCCTGCG CGCCAGCGAG ATCCGCACAC GGACGTTCGG CGCCTACGAG
GTGCACGCCA CCTCGGCGCC GCTGGACGAG GACGCCTCCC CCACCGTCTA CGCCGCGCGC
GCCGCCTACG AGATCGCCGG AATCGGACCC GAGGATGTCG ACATCGCCCA GCTGCAGGAC
ACCGACGCCG GCGCGGAGGT CATCCACATG GCCGAGACCG GATTGTGCGC CGACGGTGAG
CAGGAGAAGT TGCTCGCCGA CGGGGCCACC GAGATCGGTG GCAGCATCCC GGTCAACACC
GACGGCGGCC TGATCGCCAA CGGGGAGCCC ATCGGCGCAT CCGGTCTGCG CCAGGTGCAC
GAACTCGTCC GCCAACTGCG CGGGGAGGCC GGCGACCGCC AGGTACCCGG CGAGCCGAAA
GTCGGTCTGG CACAGGTGTA CGGCGCGCCC GGCACCGCCT CGGCGACCAT CCTGACGGTC
TGA
 
Protein sequence
MNDVAIIGVG LHPFGRFDKT AMEMGAEAIQ FALADAKLEW KDIQFGFGGS YEVSNPDAVT 
RLVGLTGITF TNVFNACATA ASAIQQTADT IRLGKYDIGI AIGLDKHPRG AFTDDPAKLA
LPQWYAENGQ FVTTKFFGMK ANHYLHKHGI SEETLARVAN KNFRNGERNP NAFRRKEISV
DEIMASPVLN YPLRQYMFCA PDEGAAAVIM CRGDIAHRYT DKPVFLRASE IRTRTFGAYE
VHATSAPLDE DASPTVYAAR AAYEIAGIGP EDVDIAQLQD TDAGAEVIHM AETGLCADGE
QEKLLADGAT EIGGSIPVNT DGGLIANGEP IGASGLRQVH ELVRQLRGEA GDRQVPGEPK
VGLAQVYGAP GTASATILTV