Gene Mkms_1941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_1941 
Symbol 
ID4613687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp2057884 
End bp2059092 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content68% 
IMG OID639791605 
Producthypothetical protein 
Protein accessionYP_937930 
Protein GI119867978 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTTCG GCGACGGTCC CGACGACACG GCGCTCGATG CGGCGTGGGC GGCGTTCTGC 
GACCGGTTGA AAGCCGCGGG CGCGCAGGCG TTCAAGGATC ACAACGCCAC CTCGGGCGCA
CAGCGGGTCG ACGCGTTGCG TTTCCTCACC CAGAACCTGG GTCAGGCCTT CGACCTGGCG
CTCGAAACCG CCGACACCCG GTATCCGATC GTGCACGCCT TCTGCACCCC GCTGCGCAAA
CTGGGCGGTG ACAGTGCGGA CTTCACCTAC CACCAGGCCT GGATCGACGG GACACACACC
TACCGCCTCA CCGGGAACCG GGGCGGCGCA CCGTTTTTCA ACATCACCGT GCAGGGTCCG
CGGGGTTCGG GTCCCGGCGT CCTGCACGAG CCATTCGGTG ACGTCCCGGA GGTCAACCTG
TCCGGCTCCC AGCTGGCGAC GGCCGCCGGC GGCGACTTCG AGCTCTACAT CGGTGGACCC
GAGCGCGGAC CGAACTGGCT GCCGACGACA CCGGGTTCGC GAAAACTGTT CATCCGTCAG
GGTTTCGACC GGTGGGACGA CCGGCCGGCC GAACTGCGCA TCGAACGCGT CGACATGGCG
GCCCCGCGGC CACTGCCCAC ACCCGCCGAG ATGGTGGCCG CCATCGATTG GGCCGGTGAC
TTCGTCGAAG GGGTGATGCG CGACTGGCCG GACTACCCGT TCACCTACGG CGGCGTCGAC
GCCGCGCACC CCAACCGGTT TCCCGCCGTC GACTCCGACA CCGGTGACGA CAAGAGGGGC
CGCGCGGCGG CGAACATGTT CTGGGAACTC GGCGCCGACG AAGCGCTGAT CATCGAGTTC
GACGCGCACG AGGGCCTGTG GATGCTCACC AACATGGGCG TGTTCTTCAA CAGCATGGAC
TACCTGTACC GGCCCGTCTC CTACACCCCG AGCCGCACGG TGACCGACGG TGACGGGCGG
ATCCGCATCG TGCTGGCCCA CGACGATCCG GGCTGTCACA ACTGGCTCGA CACCCAGGGA
TTCAGCCGCG GCAACGTCAC CTACCGGCAC ATGCTGGCCG GAAAGCCCGC CGTGCTGCAC
ACCAGGCTGG TGGCCCGGTC CGACCTCGCC GACGCGCTAC CGTCGGACAC CGCCACCGTC
ACCGGCGAGC AACGCGTCGC CCAGATGTGG GCCCGGTTCA ACGGGATCCG ACGACGCCAC
CGGATGTGA
 
Protein sequence
MAFGDGPDDT ALDAAWAAFC DRLKAAGAQA FKDHNATSGA QRVDALRFLT QNLGQAFDLA 
LETADTRYPI VHAFCTPLRK LGGDSADFTY HQAWIDGTHT YRLTGNRGGA PFFNITVQGP
RGSGPGVLHE PFGDVPEVNL SGSQLATAAG GDFELYIGGP ERGPNWLPTT PGSRKLFIRQ
GFDRWDDRPA ELRIERVDMA APRPLPTPAE MVAAIDWAGD FVEGVMRDWP DYPFTYGGVD
AAHPNRFPAV DSDTGDDKRG RAAANMFWEL GADEALIIEF DAHEGLWMLT NMGVFFNSMD
YLYRPVSYTP SRTVTDGDGR IRIVLAHDDP GCHNWLDTQG FSRGNVTYRH MLAGKPAVLH
TRLVARSDLA DALPSDTATV TGEQRVAQMW ARFNGIRRRH RM