Gene Mkms_2367 details


Gene Information

Locus tag: Mkms_2367
Symbol:
ID: 4613190
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 2485101
End bp: 2488427
Gene Length: 3327 bp
Protein Length: 1108 aa
Translation table: 11
GC content: 71%
IMG OID: 639792036
Product: transglutaminase domain-containing protein
Protein accession: YP_938355
Protein GI: 119868403
COG category: [E] Amino acid transport and metabolism; [S] Function unknown
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases; [COG4196] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
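
The coordinates and lengths in the record above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2485101, 2488427)
    print(length)                  # 3327
    print(protein_length(length))  # 1108
```

Under these assumptions the values agree with the Gene Length (3327 bp) and Protein Length (1108 aa) fields listed above.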


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.021822
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTATCA AGGTGGCGCT GGAGCATCGC ACCAGCTACA CGTTCGACCG GCTGGTGGAA 
GTGCACCCGC ACGTCATCCG GCTGCGTCCG GCGCCGCACT CCCGAACGCC CATCGAGGCG
TACTCGCTGA CCATCGAGCC CGATGACCAC TTCGTCAACT GGCAGCAGGA CGCCTTCGGC
AATTTCCTTG CCCGCGTAGT GTTTCCGACG CGCACCCGGC AGCTCACGAT CACCGTCGGC
CTGATCGCCG ACCTGAAGGT GATCAACCCG TTCGACTTCT TCATCGAGGA GTACGCCGAG
CGGATCGGAT TCGCCTACCC CAAGGCGCTG GCCGAGGATC TCAAACCGTA TTTGAAACCG
GTCGACGAGT TCGGCGACGG ATCCGGGCCC GGAGACCTCG TGCAGGCGTG GGTGAAGAAC
TTCACCGTCG CGCCCGGCAC CAGCACCATC GAGTTCCTGG TCGCCCTCAA CCGCGCGATC
AACGCCGACG TCGGCTACAG CGTCCGCCTC GAACCCGGGG TGCACACCCC CGATCACACG
CTGCGCGTCG GCATCGGCTC GTGCCGCGAC TCGGCCTGGC TGTTGGTGTC GATCCTGCGG
CAACTCGGGC TGGCGGCCCG GTTCGTGTCC GGCTACCTCG TCCAGCTCAC CTCCGATGTG
CAAGCGCTCG ACGGCCCGTC CGGACCGGCC GCCGACTTCA CCGATCTGCA CGCGTGGACG
GAGGTCTACA TCCCCGGCGC CGGGTGGATC GGCCTGGACC CCACCTCGGG GCTCTTCGCC
GGTGAGGGAC ACATCCCGCT CTCGGCGACC CCGCACCCCG ACTCGGCCGC ACCGATCACC
GGTGCCACCG AACCGTGTGA GAGCACACTG GACTTCGCCA ACCTCGTCAC CCGTGTCCAC
GAGGATCCGC GCGTCACGCT GCCCTACACC GAGACCGCGT GGGACGCCGT CTGCGCGCTC
GGCGCCCGGG TCGACGACCG GCTGACCAAG GGCGATGTCC GTCTCACCGT CGGCGGGGAG
CCGACATTCG TGTCGGTCGA CAACCGCACC GACCCCGAAT GGACCACGGC GGCCGACGGT
CCACACAAGC GGGAGCGGGC CTCGGCGCTG GCCGCCGGGC TCAAGACGGT GTGGGCGCCG
CAGGGGTTGG TGCAGCGCGG CCAGGGCAAG TGGTATCCCG GAGAACCGTT GCCGCGCTGG
CAGATCGGCC TGTACTGGCG CGCCGACGGG GCACCGCTGT GGACCGACGC GTCGCTGCTG
GCCGATCCGT GGGGCGCCGA GACGCACACC GTGGACGACG ACGCCGCGCG GCGACTGCTC
GCCGCCGTCG CGGCCGGCCT CGGGCTCCCC GGCACACAGG TCCGGCCGGC GTTCGAGGAT
CCGCTGAGCC GGTTGGCTGC CGCGGCCCGC CGCCCCGCCG GGGAGCCCGT CGCCCCCGAC
GACGATCTCG CCGACGACAC CGCCCGGGCC CGCGCGGACC TGCTGGCCCG GCTCGACACC
AGCGTCACCG AACCGGCCGC GTTCGTGCTG CCGCTGCACC GCCGCGACGA CGACTCGGCA
TGGGCCAGCG CCGACTGGCG GCTGCGCCGG GGCCGCATCG TGCTCCTGGA CGGCGATTCA
CCCGCCGGGC TGCGGCTGCC GCTCGACGCG ATCAGCTGGC AGCCACCGCG ACCCGGCTAT
CCCGCCGACC CGCTGGCCCG GCGCGGTGCG CTCGCCTCCG TTGCGGACGC TGCCGAGCAG
ACCGACGCCG AGGTCGAGGA TGCCGACTCG CTGCCGACGA CGGCGATGGT CACCGAGGTG
CGAGACCGGG TGCTCTACGT ATTCCTGCCG CCCACAGAGG AACTCGAACA TTTCGTCGAC
CTCGTCGGCC GGATCGAGGC CGCCGCGTCG GCGATCGGCT GCCCGGTGGT CATCGAGGGC
TACCCGCCGC CCGCCGACCC GCGCCTGACG TCGGTGACGA TCACACCCGA CCCGGGGGTC
ATCGAGGTCA ACGTGGCCCC GACCGCCAGC TTCGCCGAAC AGCGGACCCA GCTCGAGACG
CTCTACGAAC AGGCGCGGCT GGCCCGCCTG ACCACCGAAC AGTTCGACGT CGACGGCACC
CACGGCGGCA CCGGCGGGGG CAACCACATC ACGCTCGGCG GTCCGACGCC CGCGGACTCA
CCGCTGTTGC GCCGCCCCGA CCTGCTGGTG TCGATGCTCA CTTACTGGCA GCGCCACCCC
GCGCTGTCCT ATCTGTTCGC CGGCCGGTTC ATCGGCACCA CGTCGCAGGC GCCGCGGGTC
GACGAAGGCC GGTCCGAGGC GCTCTACGAA CTCGAGATCG CGTTCGCCGA GATCGCCCGG
CTCTGTGGTG GCTCCGGGCC CAAGCGAGCC AGCGCGTGGG TCACCGACAG GGCTCTTCGC
CACCTCCTCA CCGACATCAC CGGCAACACC CACCGCGCCG AGTTCTGCAT CGACAAGCTC
TACAGCCCCG ACAGCGCCAG GGGGCGGCTC GGCCTGTTGG AGCTGCGGGG CTTCGAGATG
CCGCCACACT TCCAGATGGC GATGGTGCAG TCGCTGCTCG TGCGCGCCCT GGTGGCATGG
TTCTGGGACG AGCCGCTTCG TGCGCCGCTG ATCCGCCACG GGGCCAATCT GCACGGCCGA
TACCTGTTGC CGCACTTCCT CATCCACGAC ATCGCCGATG TCGCGGCCGA CCTGCGCGCC
CACGACATCC ACTTCGACAC CAGCTGGCTG GACCCGTTCA CCGAGTTCCG CTTCCCCCGC
ATCGGCGCTG CCGTGTTCGA CGGTGTGGAG ATCGAACTGC GCGGCGCCAT CGAGCCGTGG
AACGTCCTCG GTGAACAGGC CACCGCGGGT GGCACGGCCC GGTACGTCGA CTCGTCGGTC
GAACGGTTGC AGGTGCGGTT GATCGGCGCC GACCGGCACC GCTACCTCGT CACCTGCAAC
GGCCACCCCA TCCCGATGCT GGCCACCGAC AACCCCGACA TCCAGGTCGG CGGGGTGCGG
TACCGCGCCT GGCAGCCGCC CAGTGCACTT CACCCGACGA TCACCGTCGA CGGGCCGCTG
CGCTTCGAAC TCGTCGACAC CGCCGGTGGC GTGTCCCGCG GTGGTTGCAC CTACCACGTC
GCCCATCCCG GCGGCCGGTC CTACGACACC CCGCCCGTCA ACGCGGTCGA AGCCGAATCG
CGGCGCGGAC GCCGCTTCGA GGCCACCGGA TTCACCCCCG GACGCATCGA CCTGTCCGAC
CTCCGGGAGA AGCAGGCCAG GCAGTCCACC GACCTGGGCG CGCCGGGCAT CCTCGATCTG
CGTCGGGTGC GTACCGTTCT GCGGTGA
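
The gene sequence is printed in blocks of 10 bases separated by spaces. As a rough sketch of how the GC content field could be recomputed from it, the snippet below strips the whitespace and counts G and C bases. The helper names are hypothetical, and the result for the full sequence is not asserted here; it can be compared against the 71% reported in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_content(blocked: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) nucleotide sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First row of the gene sequence above, used only as a short illustration.
    first_row = "ATGGGTATCA AGGTGGCGCT GGAGCATCGC ACCAGCTACA CGTTCGACCG GCTGGTGGAA"
    print(len(clean_sequence(first_row)))        # 60
    print(round(gc_content(first_row) * 100))    # GC percentage of this row only
```

To check the whole gene, paste all rows of the sequence into one string and pass it to `gc_content`.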
 
Protein sequence
MGIKVALEHR TSYTFDRLVE VHPHVIRLRP APHSRTPIEA YSLTIEPDDH FVNWQQDAFG 
NFLARVVFPT RTRQLTITVG LIADLKVINP FDFFIEEYAE RIGFAYPKAL AEDLKPYLKP
VDEFGDGSGP GDLVQAWVKN FTVAPGTSTI EFLVALNRAI NADVGYSVRL EPGVHTPDHT
LRVGIGSCRD SAWLLVSILR QLGLAARFVS GYLVQLTSDV QALDGPSGPA ADFTDLHAWT
EVYIPGAGWI GLDPTSGLFA GEGHIPLSAT PHPDSAAPIT GATEPCESTL DFANLVTRVH
EDPRVTLPYT ETAWDAVCAL GARVDDRLTK GDVRLTVGGE PTFVSVDNRT DPEWTTAADG
PHKRERASAL AAGLKTVWAP QGLVQRGQGK WYPGEPLPRW QIGLYWRADG APLWTDASLL
ADPWGAETHT VDDDAARRLL AAVAAGLGLP GTQVRPAFED PLSRLAAAAR RPAGEPVAPD
DDLADDTARA RADLLARLDT SVTEPAAFVL PLHRRDDDSA WASADWRLRR GRIVLLDGDS
PAGLRLPLDA ISWQPPRPGY PADPLARRGA LASVADAAEQ TDAEVEDADS LPTTAMVTEV
RDRVLYVFLP PTEELEHFVD LVGRIEAAAS AIGCPVVIEG YPPPADPRLT SVTITPDPGV
IEVNVAPTAS FAEQRTQLET LYEQARLARL TTEQFDVDGT HGGTGGGNHI TLGGPTPADS
PLLRRPDLLV SMLTYWQRHP ALSYLFAGRF IGTTSQAPRV DEGRSEALYE LEIAFAEIAR
LCGGSGPKRA SAWVTDRALR HLLTDITGNT HRAEFCIDKL YSPDSARGRL GLLELRGFEM
PPHFQMAMVQ SLLVRALVAW FWDEPLRAPL IRHGANLHGR YLLPHFLIHD IADVAADLRA
HDIHFDTSWL DPFTEFRFPR IGAAVFDGVE IELRGAIEPW NVLGEQATAG GTARYVDSSV
ERLQVRLIGA DRHRYLVTCN GHPIPMLATD NPDIQVGGVR YRAWQPPSAL HPTITVDGPL
RFELVDTAGG VSRGGCTYHV AHPGGRSYDT PPVNAVEAES RRGRRFEATG FTPGRIDLSD
LREKQARQST DLGAPGILDL RRVRTVLR
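
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below illustrates that relationship using only the Python standard library. It assumes the codon-to-amino-acid assignments of NCBI table 11 (which match the standard code; the tables differ in permitted start codons, not handled here), and the function name is illustrative.

```python
from itertools import product

# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Assumed to match NCBI translation table 11; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(blocked.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGGGTATCA AGGTGGCGCT GGAGCATCGC"))  # MGIKVALEHR
    # Last row of the gene sequence, ending in the TGA stop codon.
    print(translate("CGTCGGGTGC GTACCGTTCT GCGGTGA"))     # RRVRTVLR
```

Both outputs match the first and last residues of the protein sequence listed above.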