Gene Mkms_2356 details

Gene Information

Locus tag: Mkms_2356
Symbol:
ID: 4613358
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 2469762
End bp: 2472680
Gene Length: 2919 bp
Protein Length: 972 aa
Translation table: 11
GC content: 69%
IMG OID: 639792025
Product: hypothetical protein
Protein accession: YP_938344
Protein GI: 119868392
COG category: [S] Function unknown
COG ID: [COG3349] Uncharacterized conserved protein
TIGRFAM ID:
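
The length fields in this record agree with its coordinates if Start bp and End bp are read as 1-based, inclusive positions. The page does not state that convention, so it is an assumption here. A minimal Python sketch of the arithmetic follows; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2469762, 2472680)
print(length, protein_length(length))  # prints: 2919 972
```

Both values match the Gene Length (2919 bp) and Protein Length (972 aa) fields listed for Mkms_2356.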


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.416517
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGAAC CCATCCCCAC ACCGGCCTCC TGCCGACGGC GCCGCACGGT GGCGGTGCTC 
GGCGGTGGCA TCGCGGGCCT GACCGCAGCC CATGAACTCG CCGACCGCGG ATTCGACGTC
ACCGTGTACG AACCGCGCCA TGACGAACGT ATCGGGTTGG GCCCCGAACC GCCGTCCTGC
TACCCACCGG TGAAATTGGG CGGGCTGGCC GCCTCGCAGT ACTCGACGGT GGGCACTCAC
GACGGCAGCA ACGCGGAACT GAGACCGTTT CCCGGCCGGC GGGGGGCGCC GCGCAAACCC
GGTCGGGCCG TCGCCGGTGA ACATGGGTTC CGCTTCTTCC CGGCGTATTA CCTGCACATC
TGGGACCTGT TCCAGCGCAT CCCGGTCTAT GAACGAATCG AACTGCCCGG CAGCGACATC
CGCTTCCTGC CGACCTCGCG CACCGTGCTC GACAACGTCC GCCGGGTCGT CACGCAGGGC
ACCACCGTGG AAGGCAAACC CTCGCTGGTG TTTCCGCGCG AAGCACCCCG CAGCCTCGCC
GAATTCCTGG GCACGGTGAA CCAACTGAGG GAGCTGGGCT TCACCCAGGC CGACGTGAGC
ACGTTCGTGA GCAGGTTGCT GCGGTACCTG GTGACCAGCC CGCACCGGCG AGCCAGGGAA
CTGCAGAACC TCTCGGCCTA CGACTTCTTC GTCGGCCGCG ACAGCGCGAC CGGGGTCCCA
CGGTTCTCCT ACACCCCGCA GTTCGACACC GTGCTGCGCG AGATGCCCAA GGTGCTCGCG
GCTTTCGACT CGAACTGGGG CGACGCCCGC ACCAACCTCA CCACCTACCT GCAACTGCAG
TTGCAGATGG ATCGCCGCGA CAACAAGGCC GACGGGGTGC TCAACGGCCC CACCACCGAG
TCGTGGTTCG ACCACTGGTA CCGCCACCTG ACCGCTCTCG GTGTGCGTTT CGTCCGCGCG
GTGGCCAACC GGATCGACGC ACTGCCCGTC GATCCCAGCC TGCCACCGCA CCGCCGAGCG
CGGGTCCAGA TCACACTGGC CGACGGGACC CGACTGACGC CCGACTACGC CGTCGTCGCC
GTCGACGCCC CGGAGGCCGA GCGGGTCACC GCTCCCCTGC GCACCGCGGC CTGTGGTGGA
ACCGTCTCGG AACTGGAGGG TTTCACCACC TCCGTCCCAC CGGCGACCGG GCCCCTCGAG
CCCGCCGCGA CCCGCACCGC GGCCCGGCGC AATCCCTATG CACTGGCCGA GATGGGCCGG
GTGCCCTGGG ACCGGTTCCA AACCCTCGGC GGCATCCAGT ACTTCTTCGA CACCGAATTC
CAACTGCTCC GCGGGCACAT GTACTACTCG GGAACCGAAT GGGCGCTGTC GTCGATCAAC
CAGCACGGCA TGTGGGAGCG GCGACCGATG CTGGCCCAAG ACGGACATGT CTCCGTATTG
TCCGTCGACA TCGGCGATTT CAACGCCCCG TCGCGGCGCC TCGTCGACGC CGACGGTCAC
GGCAAGGCCG CCCGCGACTG CACCGCCGAC GAGATCGCCG CCGAGGTGTG GCGCCAGATC
GTCACGGCGC TCACCAACAA CGTCGACAGA CCCCCGGAAT CGCTGCTGCC GACACCGGCC
TGGTACGCGC TGGACCGCGG GTTGATCATG GCGGACGGAC CCGGACAGGG CACGGGCCCG
CCGGTGCTCA ACGAAACCCC TTATCTGGTA CCGATCATCG GGGACTGGCC GAACCGGCCC
GGTGGCGATC CGTGGAATCC GCACGGCACG TCCTACGTCG GTGTGCCGAC CGAGGAGGCC
TGGCGCGAAG ACCTCGAACT GCGCAACGTC TGGCAGGCCC GCCACGGCGG CTATCAGGTG
CACAACAATT CGGTCGTGTT CGCCGGCACC TGGGCCAAGA CGTTCACCCG CATGACGTCG
ATGGAAGCGG CGTGTGAATC CGGACGGCAT GCGGTCAACG CGATCCTCGA CCACTACATC
TGGGTCGAAT CGGGCGGTCG TGACCGGCGC GAGAAGACGA CGCTCAAGTG GCGGTTCCCC
TACGGTTTCC TCGACCAGGG CCAGTCGACC CCGATCCGGA TGCCGACCCC GGCCGGTGAC
TACTGCTACG TCTTCGACAT CGAGAACCGA GAGCCGTCCG ACACGCGCGC CCTGCGGGTC
CTCGATTCAC GGTTCTCCGA GCGGTCGCTG CCCCATCCCT TGGACACGCT TGTCCCACCC
ACAGGAGGAA TTCCGATGAC CATGCCGCCG TTCGGGCCGT TCGACGCGAA CCAGCAACTA
CTCGCCTTCC TGCAGGCGTG GCGACAACTC CTCGAGCAGT GGACGGCCCT GCTGTCCGGG
GCGGGTACCC CGTTCCCGAT GCCGGCCCCG CCGGGGACCG CGAACGCCCC TGCGCCCGCG
CCGGCCGACT ACTCCCAGCA GTTGTTCGGC CAGCTCCAGG CGTGGCGTCG CTACCTGGAG
CACGCCGCGG GTGCGACACC GCAGACCGCT TCACAGGCCC CGGCACAGCA GTCGGGTGGC
GGCCAGGAGT CGGCCGCGTC GGGGTCGTCG TCCGGACCCA CGGAACCGCC GGAGTACTTT
CCGCAGCTCG ACGACTACTG GGGTTCGGCG GATCCCGCCA CCCAGTGGGG CAAACGGCAG
CAGAAGGTCA TCCGGCCGCC CGACGACGAC TGGGGCACGG TGGGTATCCC GTTTCGCGAT
CTGGCCGGTG GACTGGACGT CGGTCGGCCG CCGCAGGGCC CCACGCCCCC GCCCGACCGG
TGGAGCGGTC AGACCGGCCC GCTCGAGTCA CCCGCCGCGC GTACCGTGCG CTCGGCGTTC
CGCGACGTCA CCACGGGAGC GGATCCCGCC GCTGCGCGGC AGGTCGCCCC GAAGTCGCTC
TATCGCGACG TCATGGGCGA AATCCGCGGC GGGGACTGA
 
Protein sequence
MTEPIPTPAS CRRRRTVAVL GGGIAGLTAA HELADRGFDV TVYEPRHDER IGLGPEPPSC 
YPPVKLGGLA ASQYSTVGTH DGSNAELRPF PGRRGAPRKP GRAVAGEHGF RFFPAYYLHI
WDLFQRIPVY ERIELPGSDI RFLPTSRTVL DNVRRVVTQG TTVEGKPSLV FPREAPRSLA
EFLGTVNQLR ELGFTQADVS TFVSRLLRYL VTSPHRRARE LQNLSAYDFF VGRDSATGVP
RFSYTPQFDT VLREMPKVLA AFDSNWGDAR TNLTTYLQLQ LQMDRRDNKA DGVLNGPTTE
SWFDHWYRHL TALGVRFVRA VANRIDALPV DPSLPPHRRA RVQITLADGT RLTPDYAVVA
VDAPEAERVT APLRTAACGG TVSELEGFTT SVPPATGPLE PAATRTAARR NPYALAEMGR
VPWDRFQTLG GIQYFFDTEF QLLRGHMYYS GTEWALSSIN QHGMWERRPM LAQDGHVSVL
SVDIGDFNAP SRRLVDADGH GKAARDCTAD EIAAEVWRQI VTALTNNVDR PPESLLPTPA
WYALDRGLIM ADGPGQGTGP PVLNETPYLV PIIGDWPNRP GGDPWNPHGT SYVGVPTEEA
WREDLELRNV WQARHGGYQV HNNSVVFAGT WAKTFTRMTS MEAACESGRH AVNAILDHYI
WVESGGRDRR EKTTLKWRFP YGFLDQGQST PIRMPTPAGD YCYVFDIENR EPSDTRALRV
LDSRFSERSL PHPLDTLVPP TGGIPMTMPP FGPFDANQQL LAFLQAWRQL LEQWTALLSG
AGTPFPMPAP PGTANAPAPA PADYSQQLFG QLQAWRRYLE HAAGATPQTA SQAPAQQSGG
GQESAASGSS SGPTEPPEYF PQLDDYWGSA DPATQWGKRQ QKVIRPPDDD WGTVGIPFRD
LAGGLDVGRP PQGPTPPPDR WSGQTGPLES PAARTVRSAF RDVTTGADPA AARQVAPKSL
YRDVMGEIRG GD
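
The protein sequence can be checked against the gene sequence by translating it with translation table 11, as listed in the record. The sketch below is a plain-Python illustration, not the database's own pipeline. It assumes the NCBI table 11 codon assignments (identical to the standard code for amino acids) and that table's initiator codons, under which a leading GTG is read as M. The names `translate` and `gc_fraction` are hypothetical helpers.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (first base slowest, third base fastest).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Initiator codons of table 11 (assumption: taken from the NCBI table definition).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS up to the first stop; an initiator first codon becomes M.

    Raises KeyError for codons with non-ACGT characters.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_line = "GTGACCGAAC CCATCCCCAC ACCGGCCTCC TGCCGACGGC GCCGCACGGT GGCGGTGCTC"
print(translate(first_line))  # prints: MTEPIPTPASCRRRRTVAVL
```

The output matches the first 20 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` allows the same comparison against the complete protein and the GC content field.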