Gene Mkms_1116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_1116 
Symbol 
ID4614494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp1201726 
End bp1203003 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content68% 
IMG OID639790792 
Producthypothetical protein 
Protein accessionYP_937119 
Protein GI119867167 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACT ACTGGCTGAA CGTGGCCCTG GTATTCGCGC TCATACTGGT CAACGGGCTG 
CTGGCGGGAA GCGAAGCAGC GTTCATCTCC CTGAGAGAGG GTCAGCTGCG CGAGCTGGAA
CATCGCGGCG GCCGACGGGA TCTCACCGTC GTCGGGTTGG CCCGAGAGCC GAACCGCTAC
CTCGCCACCA TCCAACTGGG CATCACCCTG GCCGGATTCT TCGCCTCGGC CACCGCGGCG
GTCACCCTGG CGGAGCCGCT GGCCCCGCTG CTGGGCTTCC TGGGCGCCGG TGCACAGACG
GCGGTCAGCA TCGCGGTGGT GACGGTGCTG GTGGCCGGTG TGACCCTCGT GTTCGGGGAG
CTCGCGCCCA AGAGGCTGGC GATGCAGTAC GCCCGGCGGT GGGCACTCGT CGTGGCCTCA
CCGTTGAGTG CCATGTCGGC CGTCGCCGCA CCGATCGCGT GGGTTCTCGG CAGGGCCACC
GACCTCGTCG TGCGGATTCT CGGGGGAGAT CCCGCCGTCG GGCAGGAAGA GCTCACCATC
GAGGAGTTCG GGCAACTGAT CACCGGTCTC GGCGGCCTGA CCGCCGAACA ACGCACGATC
CTGTCCGGTG CGCTGGAGAT CCACGAGCGT TCACTGCGCG AAGTCATCGT CCCCCGGACG
GCGGTCTTCC GGCTGAACGG TGAGCTGTCG CTGCAGCGGG CTCGCACGGA CCTCGCGGCG
TCCGGCCACA CCAGGGCGCC GGTCGTGCGA TCCGGAGAAC TGGACGACGC CATCGGTGTG
GTGCACCTGC GCGACCTGCT GGGTGACGAC GGCACCGTCG CCGAAGTCAC CCGACCGGTG
CTCAGACTGC CGGACAGTCT GCGCGTCACC ATCGCGCTGC GCCAACTGCT CGCCGCGCAC
GAGCATCTGG CGCTCGTCGT CGGCGAGCAC GGCGGCGTCG ACGGCATCGT CACCCTCGAG
GATCTGCTCG AGGAGATCGT CGGCGAGATC TACGACGAGG CCGACGAGGA CATCCGAACC
GCCGAAGCAC TCCCGGACGG CAGTCGAATT CTGCCGGGCA CCTTCCCGAT TCACGATCTG
CCCGACATCG GGATCGAGTT CTCCGACGCA CCTCCCGGCG ACTACACCAC GATCGCCGGA
CTCGTGCTGT CCCTGCTGGG GCGGATTCCG ACGGTTCCCG GAGATCGCGT CGACCTTCCG
CCTTGCCGTG TCCAGGTCAC AGGCGTCGGC CGCCATGCGA TCACCGAGGT GCGCATTCTG
CCTCGAGATC GGCGATGA
 
Protein sequence
MSDYWLNVAL VFALILVNGL LAGSEAAFIS LREGQLRELE HRGGRRDLTV VGLAREPNRY 
LATIQLGITL AGFFASATAA VTLAEPLAPL LGFLGAGAQT AVSIAVVTVL VAGVTLVFGE
LAPKRLAMQY ARRWALVVAS PLSAMSAVAA PIAWVLGRAT DLVVRILGGD PAVGQEELTI
EEFGQLITGL GGLTAEQRTI LSGALEIHER SLREVIVPRT AVFRLNGELS LQRARTDLAA
SGHTRAPVVR SGELDDAIGV VHLRDLLGDD GTVAEVTRPV LRLPDSLRVT IALRQLLAAH
EHLALVVGEH GGVDGIVTLE DLLEEIVGEI YDEADEDIRT AEALPDGSRI LPGTFPIHDL
PDIGIEFSDA PPGDYTTIAG LVLSLLGRIP TVPGDRVDLP PCRVQVTGVG RHAITEVRIL
PRDRR