Gene Mkms_3547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3547 
Symbol 
ID4611477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3739755 
End bp3741788 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content69% 
IMG OID639793223 
Productglycoside hydrolase 15-related 
Protein accessionYP_939531 
Protein GI119869579 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.269998 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGCGG GCATGGTTCT GGAACACACC GAGCCCACCG ACGGAGCCGC GACGATCGGG 
CAGCCGGCCT ATCTGCCCGA CACTCCGTTG ACGGTGACGG CGCCGGTCCC CTACGCGCCG
ACCGGTGGGC TGCGGAACCC GTTCCCGCCC ATCGCCGACT ACGGCTTCCT GTCCGACTGC
GAGAACACGT GCCTGATCTC CTCGGCCGGC TCCGTCGAGT GGCTGTGCGT GCCCCGCCCG
GATTCGCCGA GCGTGTTCGG CGCGATCCTC GACCGCGGTG CAGGTCACTT CCGGCTCGGC
CCGTACGGCG TGACGGTGCC CGCGGCGCGG CGTTACCTGC CGGGCAGCCT GATCCTCGAG
ACCACGTGGC AGACCCACAC CGGCTGGCTG ATCGTGCGCG ACACCCTGGT GATGGGTCCC
TGGCACGACC TCGAGGCGCG GTCGCGCACC CACCGGCGCA CGCCGATGGA CTGGGATGCC
GAGCACATCC TGTTGCGCAC CGTGCGGTGT GTCAGCGGCA CCGTCGAGCT GGTGATGAAC
TGTGAGCCGT CGTTCGACTA CCACCGGGTG AGCGCCGAGT GGGAGTACTC CGGCCCGGCC
TACGGTGAGG CGATCGCGCG GGCCAACCGC AACGCCGACT CCCATCCGAC ACTGCGGCTC
ACCACGAACC TGCGGATCGG GTTGGAGGGC CGCGAGGCCA GGGCCCGCAC CCGGCTCAAG
GAGGGCGACA ACGTCTTCGT GGCGCTGTCC TGGTCGAAGC ATCCGGCGCC GCAGAACTAC
GAAGAGGCCG CCGACAAGAT GTGGCAGACC AGCGAGGCGT GGCGGCAGTG GATCAACGTC
GGCGACTTCC CCGACCACCC GTGGCGGGCG TACCTGCAGC GCAGCGCGCT CACCCTCAAG
GGCCTGACCT ACTCCCCGAC CGGCGCGCTG TTGGCCGCGA GCACCACGTC GTTGCCGGAA
ACACCTCAGG GCGAACGCAA TTGGGACTAC CGCTACGCGT GGGTGCGGGA TTCGACGTTC
GCGCTGTGGG GTCTCTACAC ACTGGGCCTG GACCGCGAGG CCGACGACTT CTTCGCGTTC
ATCGCCGACG TGTCCGGCGC CAACAACGGG GAGCGCCACC CGCTGCAGGT GATGTACGGC
GTCGGGGGTG AGCGCAGCCT GGTCGAGGAG GAACTGCACC ACCTGTCGGG GTACGACGGC
GCCCGCCCGG TGCGGATCGG CAACGGTGCC TACAACCAGA TGCAGCACGA CATCTGGGGC
ACCATGCTCG ATTCGGTCTA CCTGCACACC AAGTCGCGTG AGCAGATCCC CGAGGCGTTG
TGGCCGGTGC TCAAGCACCA GGTCGAGGAG GCCATCAAGC ACTGGAAGGA ACCCGACCGC
GGCATCTGGG AGGTCCGCGG CGAACCGCAG CACTTCACCA GTTCGAAGGT GATGTGCTGG
GTGGCGCTCG ACCGTGGCGC GAAGCTCGCC GAACTCGAGG GCGAGAAGAG CTACGCCCAG
GAGTGGCGCA CCATCGCCGA GCAGATCAAG GCCGACATCC TCGCCAACGG CGTCGACTCG
CGGGGCGTGT TCACCCAGCG TTACGGCGAC GACGCGCTGG ACGCCTCCCT GCTGCTGGTG
CCGCTGGTCC GGTTCCTGCC GCCGGACGAC CCGCGGGTGC GGGCCACGGT GCTGGCGATC
GCCGACGAGC TGACCGAGGA GGGTCTGGTC CTGCGCTACC GCGTCGAGGA GACCGACGAC
GGGTTGGCCG GCGAGGAGGG CACGTTCACG ATCTGCTCGT TCTGGCTGGT GTCGGCGCTC
GTGGAGATCG GTGAGATCAG CCGTGCCAAG CACCTGTGTG AACGGTTGTT GTCGTTCTCC
AGTCCGCTGC ACCTCTACGC CGAGGAAATC GAACCCCGCA CCGGCCGCCA CCTGGGCAAC
TTCCCGCAGG CGTTCACCCA CCTGGCGTTG ATCAACGCGG TCGTGCACGT CATCCGCGCC
GAGGAGGAAG CCGACAGCTC GGGGGTCTTC GTCCCGGCCA ACGCGCCGTC GTAA
 
Protein sequence
MMAGMVLEHT EPTDGAATIG QPAYLPDTPL TVTAPVPYAP TGGLRNPFPP IADYGFLSDC 
ENTCLISSAG SVEWLCVPRP DSPSVFGAIL DRGAGHFRLG PYGVTVPAAR RYLPGSLILE
TTWQTHTGWL IVRDTLVMGP WHDLEARSRT HRRTPMDWDA EHILLRTVRC VSGTVELVMN
CEPSFDYHRV SAEWEYSGPA YGEAIARANR NADSHPTLRL TTNLRIGLEG REARARTRLK
EGDNVFVALS WSKHPAPQNY EEAADKMWQT SEAWRQWINV GDFPDHPWRA YLQRSALTLK
GLTYSPTGAL LAASTTSLPE TPQGERNWDY RYAWVRDSTF ALWGLYTLGL DREADDFFAF
IADVSGANNG ERHPLQVMYG VGGERSLVEE ELHHLSGYDG ARPVRIGNGA YNQMQHDIWG
TMLDSVYLHT KSREQIPEAL WPVLKHQVEE AIKHWKEPDR GIWEVRGEPQ HFTSSKVMCW
VALDRGAKLA ELEGEKSYAQ EWRTIAEQIK ADILANGVDS RGVFTQRYGD DALDASLLLV
PLVRFLPPDD PRVRATVLAI ADELTEEGLV LRYRVEETDD GLAGEEGTFT ICSFWLVSAL
VEIGEISRAK HLCERLLSFS SPLHLYAEEI EPRTGRHLGN FPQAFTHLAL INAVVHVIRA
EEEADSSGVF VPANAPS