Gene Mkms_5475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_5475 
Symbol 
ID4613159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp5712215 
End bp5713555 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content70% 
IMG OID639795169 
Producthypothetical protein 
Protein accessionYP_941450 
Protein GI119871498 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3408] Glycogen debranching enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.509766 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.162072 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACACG ACCCCAGCTT CGCGCCCACG CAACTCGCGG CCCGCGCCGC GTACCTCCTG 
CGCGGTAACG ACCTGGGCGC GATGACCACC GCGGCGCCCC TGCTGTACCC GCACATGTGG
AGCTGGGATG CCGCGTTCGT GGCGATCGGG CTGGCGCCGT TGAGCGTCGA GCGTGCGGTC
GTCGAACTCG ACACGTTGCT CTCCGCACAG TGGCGCAACG GGATGATCCC GCACATCGTG
TTCGCCAACG GGGTCGACGG ATACTTCCCG GGCCCCGCCC GCTGGGCGAC GTCCGCCCTG
GCGGCCGACT CCCCGCGCAC CCGCCACACC TCCGGGATCA CCCAGCCGCC CGTGCACGCG
ATCGCCGTAC AGCGCATCCT CGACCACGCC CGCACCAGGG GCCGGTCGAC CCGCCAGGTG
GCCGAGGCCT TCCTCGACCG GCGCTGGGGG GATCTGGTGC GCTGGCACCG CTGGCTGGCC
GAATGCCGCG ACCAGAACGG CCGCGGCCGC ATCACGCTCT ATCACGGGTG GGAGTCCGGC
ATGGACAACT CCCCGCGGTG GGATGCCGCC TACGCCAACG TGATTCCGGG TGCGGTGCCG
GAATATCAGC GCGAGGACAA CAAGATCAAC ACCGACGCCA CCCAGCGGCC GTCCGATCAC
GAGTACGACC GCTACCTGTG GTTGCTCGAG GAGATGAAAT CCGCCCGCTA CGACGATCAT
CTGCTGCCGA AGGTGATGAG TTTCGCCGTC GAGGACGTGT TCGTCTCGGC GATCTTCTCG
GTGGCCTGTC AGGTGCTCGC CGAGATCGGG GAGGACTACA AACGCCCCAA CGCCGACGTG
CGTGACCTGT ACGCGTGGGC CGAGCGGTTC CGCGCCGGCG TCATCGAGAC CACCGACCAA
CGCACCGGCG CGGCAAGGGA TTTCGACGTC CGCACGGAGA AGTGGGTGGC CACCGAGACC
GTCGCGCAGT TCGCCCCGCT GTTGTGCGGC GGCCTGCCGC ACCACCGGGA GCGGGCGCTG
CTGCGCCTGC TGGAGGGGCC GCGGTTCTGC GGGCATCCCG ACCTCAGATA CGCGTGCATC
CCCTCGACGT CGCCGGTGTC ACGCGACTTC CGGCCGCGGG AGTACTGGCG CGGCCCGGTC
TGGCCGGTGA TGACGTGGCT GTTCGCCTGG TGCTTCGCCC GGCGCGGGTG GGCCGAACGG
GCCAGGGTGC TACGGCACGA GGGACTGCGC CAGGCCAGCG ACGGCACCTT CGCCGAGTAC
TACGAACCGT TCACCGGCGA ACCGTTGGGC AGCATGCAGC AGTCGTGGAC CGCCGCGGCG
GTACTGGACT GGCTGGGCTA G
 
Protein sequence
MPHDPSFAPT QLAARAAYLL RGNDLGAMTT AAPLLYPHMW SWDAAFVAIG LAPLSVERAV 
VELDTLLSAQ WRNGMIPHIV FANGVDGYFP GPARWATSAL AADSPRTRHT SGITQPPVHA
IAVQRILDHA RTRGRSTRQV AEAFLDRRWG DLVRWHRWLA ECRDQNGRGR ITLYHGWESG
MDNSPRWDAA YANVIPGAVP EYQREDNKIN TDATQRPSDH EYDRYLWLLE EMKSARYDDH
LLPKVMSFAV EDVFVSAIFS VACQVLAEIG EDYKRPNADV RDLYAWAERF RAGVIETTDQ
RTGAARDFDV RTEKWVATET VAQFAPLLCG GLPHHRERAL LRLLEGPRFC GHPDLRYACI
PSTSPVSRDF RPREYWRGPV WPVMTWLFAW CFARRGWAER ARVLRHEGLR QASDGTFAEY
YEPFTGEPLG SMQQSWTAAA VLDWLG