Gene Information |
Locus tag | Mkms_2367 |
Symbol | |
ID | 4613190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | + |
Start bp | 2485101 |
End bp | 2488427 |
Gene Length | 3327 bp |
Protein Length | 1108 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639792036 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_938355 |
Protein GI | 119868403 |
COG category | [E] Amino acid transport and metabolism; [S] Function unknown |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases; [COG4196] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
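The coordinates and lengths in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the source record. It assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The constants are copied from the table; the function names are hypothetical.

```python
# Values copied from the Gene Information table for Mkms_2367.
START_BP = 2485101
END_BP = 2488427
GENE_LENGTH_BP = 3327
PROTEIN_LENGTH_AA = 1108


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 3327
print(expected_protein_length(GENE_LENGTH_BP))  # 1108
```

Under these assumptions both results match the Gene Length and Protein Length rows, so the listed coordinates, gene length, and protein length agree with one another.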
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.021822 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTATCA AGGTGGCGCT GGAGCATCGC ACCAGCTACA CGTTCGACCG GCTGGTGGAA GTGCACCCGC ACGTCATCCG GCTGCGTCCG GCGCCGCACT CCCGAACGCC CATCGAGGCG TACTCGCTGA CCATCGAGCC CGATGACCAC TTCGTCAACT GGCAGCAGGA CGCCTTCGGC AATTTCCTTG CCCGCGTAGT GTTTCCGACG CGCACCCGGC AGCTCACGAT CACCGTCGGC CTGATCGCCG ACCTGAAGGT GATCAACCCG TTCGACTTCT TCATCGAGGA GTACGCCGAG CGGATCGGAT TCGCCTACCC CAAGGCGCTG GCCGAGGATC TCAAACCGTA TTTGAAACCG GTCGACGAGT TCGGCGACGG ATCCGGGCCC GGAGACCTCG TGCAGGCGTG GGTGAAGAAC TTCACCGTCG CGCCCGGCAC CAGCACCATC GAGTTCCTGG TCGCCCTCAA CCGCGCGATC AACGCCGACG TCGGCTACAG CGTCCGCCTC GAACCCGGGG TGCACACCCC CGATCACACG CTGCGCGTCG GCATCGGCTC GTGCCGCGAC TCGGCCTGGC TGTTGGTGTC GATCCTGCGG CAACTCGGGC TGGCGGCCCG GTTCGTGTCC GGCTACCTCG TCCAGCTCAC CTCCGATGTG CAAGCGCTCG ACGGCCCGTC CGGACCGGCC GCCGACTTCA CCGATCTGCA CGCGTGGACG GAGGTCTACA TCCCCGGCGC CGGGTGGATC GGCCTGGACC CCACCTCGGG GCTCTTCGCC GGTGAGGGAC ACATCCCGCT CTCGGCGACC CCGCACCCCG ACTCGGCCGC ACCGATCACC GGTGCCACCG AACCGTGTGA GAGCACACTG GACTTCGCCA ACCTCGTCAC CCGTGTCCAC GAGGATCCGC GCGTCACGCT GCCCTACACC GAGACCGCGT GGGACGCCGT CTGCGCGCTC GGCGCCCGGG TCGACGACCG GCTGACCAAG GGCGATGTCC GTCTCACCGT CGGCGGGGAG CCGACATTCG TGTCGGTCGA CAACCGCACC GACCCCGAAT GGACCACGGC GGCCGACGGT CCACACAAGC GGGAGCGGGC CTCGGCGCTG GCCGCCGGGC TCAAGACGGT GTGGGCGCCG CAGGGGTTGG TGCAGCGCGG CCAGGGCAAG TGGTATCCCG GAGAACCGTT GCCGCGCTGG CAGATCGGCC TGTACTGGCG CGCCGACGGG GCACCGCTGT GGACCGACGC GTCGCTGCTG GCCGATCCGT GGGGCGCCGA GACGCACACC GTGGACGACG ACGCCGCGCG GCGACTGCTC GCCGCCGTCG CGGCCGGCCT CGGGCTCCCC GGCACACAGG TCCGGCCGGC GTTCGAGGAT CCGCTGAGCC GGTTGGCTGC CGCGGCCCGC CGCCCCGCCG GGGAGCCCGT CGCCCCCGAC GACGATCTCG CCGACGACAC CGCCCGGGCC CGCGCGGACC TGCTGGCCCG GCTCGACACC AGCGTCACCG AACCGGCCGC GTTCGTGCTG CCGCTGCACC GCCGCGACGA CGACTCGGCA TGGGCCAGCG CCGACTGGCG GCTGCGCCGG GGCCGCATCG TGCTCCTGGA CGGCGATTCA CCCGCCGGGC TGCGGCTGCC GCTCGACGCG ATCAGCTGGC AGCCACCGCG ACCCGGCTAT CCCGCCGACC CGCTGGCCCG GCGCGGTGCG CTCGCCTCCG TTGCGGACGC TGCCGAGCAG ACCGACGCCG AGGTCGAGGA TGCCGACTCG CTGCCGACGA CGGCGATGGT CACCGAGGTG 
CGAGACCGGG TGCTCTACGT ATTCCTGCCG CCCACAGAGG AACTCGAACA TTTCGTCGAC CTCGTCGGCC GGATCGAGGC CGCCGCGTCG GCGATCGGCT GCCCGGTGGT CATCGAGGGC TACCCGCCGC CCGCCGACCC GCGCCTGACG TCGGTGACGA TCACACCCGA CCCGGGGGTC ATCGAGGTCA ACGTGGCCCC GACCGCCAGC TTCGCCGAAC AGCGGACCCA GCTCGAGACG CTCTACGAAC AGGCGCGGCT GGCCCGCCTG ACCACCGAAC AGTTCGACGT CGACGGCACC CACGGCGGCA CCGGCGGGGG CAACCACATC ACGCTCGGCG GTCCGACGCC CGCGGACTCA CCGCTGTTGC GCCGCCCCGA CCTGCTGGTG TCGATGCTCA CTTACTGGCA GCGCCACCCC GCGCTGTCCT ATCTGTTCGC CGGCCGGTTC ATCGGCACCA CGTCGCAGGC GCCGCGGGTC GACGAAGGCC GGTCCGAGGC GCTCTACGAA CTCGAGATCG CGTTCGCCGA GATCGCCCGG CTCTGTGGTG GCTCCGGGCC CAAGCGAGCC AGCGCGTGGG TCACCGACAG GGCTCTTCGC CACCTCCTCA CCGACATCAC CGGCAACACC CACCGCGCCG AGTTCTGCAT CGACAAGCTC TACAGCCCCG ACAGCGCCAG GGGGCGGCTC GGCCTGTTGG AGCTGCGGGG CTTCGAGATG CCGCCACACT TCCAGATGGC GATGGTGCAG TCGCTGCTCG TGCGCGCCCT GGTGGCATGG TTCTGGGACG AGCCGCTTCG TGCGCCGCTG ATCCGCCACG GGGCCAATCT GCACGGCCGA TACCTGTTGC CGCACTTCCT CATCCACGAC ATCGCCGATG TCGCGGCCGA CCTGCGCGCC CACGACATCC ACTTCGACAC CAGCTGGCTG GACCCGTTCA CCGAGTTCCG CTTCCCCCGC ATCGGCGCTG CCGTGTTCGA CGGTGTGGAG ATCGAACTGC GCGGCGCCAT CGAGCCGTGG AACGTCCTCG GTGAACAGGC CACCGCGGGT GGCACGGCCC GGTACGTCGA CTCGTCGGTC GAACGGTTGC AGGTGCGGTT GATCGGCGCC GACCGGCACC GCTACCTCGT CACCTGCAAC GGCCACCCCA TCCCGATGCT GGCCACCGAC AACCCCGACA TCCAGGTCGG CGGGGTGCGG TACCGCGCCT GGCAGCCGCC CAGTGCACTT CACCCGACGA TCACCGTCGA CGGGCCGCTG CGCTTCGAAC TCGTCGACAC CGCCGGTGGC GTGTCCCGCG GTGGTTGCAC CTACCACGTC GCCCATCCCG GCGGCCGGTC CTACGACACC CCGCCCGTCA ACGCGGTCGA AGCCGAATCG CGGCGCGGAC GCCGCTTCGA GGCCACCGGA TTCACCCCCG GACGCATCGA CCTGTCCGAC CTCCGGGAGA AGCAGGCCAG GCAGTCCACC GACCTGGGCG CGCCGGGCAT CCTCGATCTG CGTCGGGTGC GTACCGTTCT GCGGTGA
|
Protein sequence | MGIKVALEHR TSYTFDRLVE VHPHVIRLRP APHSRTPIEA YSLTIEPDDH FVNWQQDAFG NFLARVVFPT RTRQLTITVG LIADLKVINP FDFFIEEYAE RIGFAYPKAL AEDLKPYLKP VDEFGDGSGP GDLVQAWVKN FTVAPGTSTI EFLVALNRAI NADVGYSVRL EPGVHTPDHT LRVGIGSCRD SAWLLVSILR QLGLAARFVS GYLVQLTSDV QALDGPSGPA ADFTDLHAWT EVYIPGAGWI GLDPTSGLFA GEGHIPLSAT PHPDSAAPIT GATEPCESTL DFANLVTRVH EDPRVTLPYT ETAWDAVCAL GARVDDRLTK GDVRLTVGGE PTFVSVDNRT DPEWTTAADG PHKRERASAL AAGLKTVWAP QGLVQRGQGK WYPGEPLPRW QIGLYWRADG APLWTDASLL ADPWGAETHT VDDDAARRLL AAVAAGLGLP GTQVRPAFED PLSRLAAAAR RPAGEPVAPD DDLADDTARA RADLLARLDT SVTEPAAFVL PLHRRDDDSA WASADWRLRR GRIVLLDGDS PAGLRLPLDA ISWQPPRPGY PADPLARRGA LASVADAAEQ TDAEVEDADS LPTTAMVTEV RDRVLYVFLP PTEELEHFVD LVGRIEAAAS AIGCPVVIEG YPPPADPRLT SVTITPDPGV IEVNVAPTAS FAEQRTQLET LYEQARLARL TTEQFDVDGT HGGTGGGNHI TLGGPTPADS PLLRRPDLLV SMLTYWQRHP ALSYLFAGRF IGTTSQAPRV DEGRSEALYE LEIAFAEIAR LCGGSGPKRA SAWVTDRALR HLLTDITGNT HRAEFCIDKL YSPDSARGRL GLLELRGFEM PPHFQMAMVQ SLLVRALVAW FWDEPLRAPL IRHGANLHGR YLLPHFLIHD IADVAADLRA HDIHFDTSWL DPFTEFRFPR IGAAVFDGVE IELRGAIEPW NVLGEQATAG GTARYVDSSV ERLQVRLIGA DRHRYLVTCN GHPIPMLATD NPDIQVGGVR YRAWQPPSAL HPTITVDGPL RFELVDTAGG VSRGGCTYHV AHPGGRSYDT PPVNAVEAES RRGRRFEATG FTPGRIDLSD LREKQARQST DLGAPGILDL RRVRTVLR