Gene Mkms_4043 details

Gene Information

Locus tag: Mkms_4043
Symbol:
ID: 4611983
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 4263889
End bp: 4267320
Gene Length: 3432 bp
Protein Length: 1143 aa
Translation table: 11
GC content: 71%
IMG OID: 639793727
Product: RecB family-like nuclease
Protein accession: YP_940025
Protein GI: 119870073
COG category: [R] General function prediction only
COG ID: [COG2251] Predicted nuclease (RecB family)
TIGRFAM ID: [TIGR03491] RecB family nuclease, putative, TM0106 family
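
The coordinate and length fields above can be cross-checked with a short Python sketch. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4263889
end_bp = 4267320

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (3432 bp) and Protein Length (1143 aa).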


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTCGTCG AATCCGTTGG GGCGCACGAC GAGGTCATCT ACAGCGCTTC GGATCTGGCC 
GCCGCCGCGC GCTGCGAGTA CGCGCTGCTG CGTTCATTCG ACGCCCGGCT GGGATGGGGT
CCGTCCGTGG CCACCGAGGA CGAGCTGCTG GCCCGCACCG CCGACCTGGG TGACGAACAC
GAGAAGCGCC ACCTCGACAC ACTCAGGACC GACAGCGACC ACAACGTCAC CGTCATCGGC
AGGCCGCCCT ACACCGTGGC CGGGCTGACC GCCGCCGCCG AGGCCACCAC CCGCGCCGTC
GCCCGCCGCG CCCCGGTGAT CTACCAGGCC GCCATGTTCG ACGGCCGGTT CGCCGGTTTC
GCCGACTTCC TGATCCTCGA GGGCGACCGC TACCTGCTGC GCGACACCAA GCTGGCGCGT
TCGGTGAAGG TCGAGGCGCT GCTGCAGCTG GCCGCGTACG CCGAGGCGCT GACCGCGGCG
GGGGTGCCGG TCGCCGACGA GGTGGAGCTC GTGCTCGGCG ACGGGGCCAC CGCCCGCTAC
CCGCTCGAGG AACTGCTGCC GGTGTACCGG CCACGGCGGG CCGCGCTGCA GCGGCTGCTC
GACGACCATC TCGCCGGCGG GACGCCGGTG CGCTGGGAGG ACGAGACCGT CCGGGCGTGT
TTCCGCTGTC CGGAGTGCAG CATCGAGGTG CGCGCACAGG ACGACCTGCT GCTCGTCGCG
GGCATGCGGG TCAGCCAGCG CGCCCGGTTC CACGAGGCGG GGATCACCAC CGTGGCCGAA
CTCGCCGCGC ACCAGGGTCC GGTGCCGGAA CTGCCGGCGC GCACCGTCAC CGCGTTGAGC
GCCCAGGCCC GGCTGCAGAC CGCGCCGCGC GTGGACGGCA AACCGCCCTA TGAGGTCGCC
GATCCGCAGC CGCTGATGCT GCTGCCCGAA CCCGACAAGG GTGACCTCTT CTTCGATTTC
GAGGGCGACC CGTTGTGGAC CGACGACGGC CACGAGTGGG GCCTGGAATA CCTGTTCGGT
GTGCTCGACA CCGCCGACGG GTTCCACCCG CTGTGGGCGC ATGACCGGCC GCAGGAACGG
AAGGCGCTCG AGGACTTCCT CGAGTTGGTG CGCAAGAGGC GCAAACGCCA CCCGAACATG
CACATCTACC ACTACGCCGC ATACGAGAAG ACCGCGCTGC TGCGCCTCGC CGGGCGCTAC
GGCGTCGGTG AGGACGCGGT CGACGACCTG TTGCGCAGCG GGGTGTTGGT CGACCTGTAC
CCGTTGGTGC GCAAGAGTTT TCGCATCGGC ACCGAGAACT ACAGCCTCAA GTCGCTGGAG
CCGCTGTACA TGGGTGGGCA GCTGCGCACC GGGGACGTGA CCACGGCCGC GGCGTCGATC
ACCGAGTACG CCCGCTACTG CGAGCTGCGC GCGGAGGGCC GTGACGACGA CGCCGCCGTC
GTCCTCAAGG ACATCGAGGA CTACAACCGC TACGACTGCA CCTCGACGCG CAAACTTCGC
GACTGGCTGG TGTGCCGGGC CATCGACTGC GAGGTGCCGC CGCGGGGGCC GCAGCCGGTC
CGCGACGGCG CCGAGCCCGA ACCCGTTGAC GCCCTGGACC GCACGCTGTG CCGTTACGCG
GGCGACGAAC TGGAGGGCCG CACCCCGGAG CAGTCGGCGG TCGCGATGGT CGCCGCCGCC
CGCGGCTACC ACCGTCGCGA GGACAAGCCG TTCTGGTGGT CGCATTTCGA CCGGCTCAAC
AACCCGGTCG ACGAATGGGC CGACAGCACC GACGTCTTCC TCGTCGACCC CGGCGGCGCG
CAGGTCGAGG TGGATTGGCA CACGCCGCCG CGGGCCCGCA AACCGCAGCG CCGGATCCGG
TTGAGCGGCG TGATGGCCGC CGGCGGGCTC AGCCGCGACA TGTACGCCCT CTATGAGCCG
CCCGCCCCGG CCGGGCTCGG CGACGATCCG GACCGCCGCG CGGCGGGCAG CGTCACCGTC
GTCGAGTGCA ACGCCCCTGA CGTCCCCACC GAAGTGGTCG TCGTCGAGCG CACCCCACGC
GACGGCGGGG TGTTCGACCA GTTGCCCTTC GCGCTGGCGC CGGGGCAGCC GATCCGCACG
ACCGCGCTGC GCGAATCGAT CGAGGCGACG GCCGCCGACC TGGCCGAGCG CCTGCCCCGG
CTACCCGAAA CCGCGGTCAC CGACATCCTG CTGCGCCGCC CGCCGCGGAC CCGCAGCGGC
GCCCCACTGC CCACCGGGCC CGACGTCTCG GCCGCGATCA CCGCCGCGCT GCTCGACCTC
GACTCGTCCT ACCTGGCCGT GCACGGCCCA CCCGGCACCG GCAAGACGTT CACCGCGGCG
GCCATCATCG CGACGCTGGT CAACACGCAC GGCTGGCGGA TCGGGGTGGT GGCGCAGTCA
CATGCGGTGG TGGAGAACCT CTTCCGCGGG GTCATCGACG CGGGGGTGGA CGCGGCGCGG
GTGGCCAAGA AACGGGGGCA CGGCGACGAC GCCAACTGGA CCGTACTCGA GGAGGCGGGG
TTCCCCGGCT TCGTGGCGGA CCACGACGGT TGCGTGATCG GCGGCACCGC ATGGGATTTC
GCGAACGGCA ACCGGATTCC GCGCGGCTGC CTCGACCTGC TCGTCATCGA GGAGGCCGGG
CAGTTCTGCC TGGCCAACAC GATCGCTGTG GCGCCGGCGG CGGCGAATCT GCTGCTGCTC
GGTGACCCGC AACAGCTACC CCAGGTCAGC CAGGGCACGC ACCCCGAACC GGTCGACGCC
TCCGCGCTGG GGTGGCTGGT CGACGGTGCG CACACGCTGC CCGCCGAACG CGGCTACTTC
CTCGACGTGT CCTGGCGGAT GCATCCGGCG GTGTGCGCGG CGGTGTCCCG GCTGTCCTAT
GACGGGCGTC TGCAATCGAA CGACGCGGTG ACCACCGCGC GCACGCTCGA GGGCTGGTCA
CCCGGGGTGC ACGAGGTGAC CGTGCCCCAC GACGGCAACG CCACCGAAAG CCCGGAGGAG
GCCGACGCGA TCGTCACGCG GATCGGCAGG CTGCTCGGTT CGGTGTGGAC CGACGAGAAC
GGTTCCCGAC CGCTCGCACA GGACGACGTG ATGGTGGTGA CGCCGTACAA CGCCCAGGTG
GTGCTGCTGC GCCAACGCCT CGACGCGGCA AGGCTGACCG ACGTGCGGGT GGGCACAGTC
GACAAATTCC AGGGACAGCA GGCGCCGGTG GTGTTCATCT CCATGGTGGC GTCCTCGATC
GACGACGTGC CGCGGGGAAT CTCGTTCCTG CTCAACCGGA ATCGGCTCAA CGTGGCGATC
AGCCGCGCGA AGTACGCCGC CGTGATCGTG CGCTCGGAGG CCCTCACCGA GTACCTGCCT
TCCACGCCGA AGGGACTGGT CGAACTCGGC GCGTTCCTGT CGTTGAGTCA GTCGCCGTCG
CGGCGTGAAT GA
 
Protein sequence
MFVESVGAHD EVIYSASDLA AAARCEYALL RSFDARLGWG PSVATEDELL ARTADLGDEH 
EKRHLDTLRT DSDHNVTVIG RPPYTVAGLT AAAEATTRAV ARRAPVIYQA AMFDGRFAGF
ADFLILEGDR YLLRDTKLAR SVKVEALLQL AAYAEALTAA GVPVADEVEL VLGDGATARY
PLEELLPVYR PRRAALQRLL DDHLAGGTPV RWEDETVRAC FRCPECSIEV RAQDDLLLVA
GMRVSQRARF HEAGITTVAE LAAHQGPVPE LPARTVTALS AQARLQTAPR VDGKPPYEVA
DPQPLMLLPE PDKGDLFFDF EGDPLWTDDG HEWGLEYLFG VLDTADGFHP LWAHDRPQER
KALEDFLELV RKRRKRHPNM HIYHYAAYEK TALLRLAGRY GVGEDAVDDL LRSGVLVDLY
PLVRKSFRIG TENYSLKSLE PLYMGGQLRT GDVTTAAASI TEYARYCELR AEGRDDDAAV
VLKDIEDYNR YDCTSTRKLR DWLVCRAIDC EVPPRGPQPV RDGAEPEPVD ALDRTLCRYA
GDELEGRTPE QSAVAMVAAA RGYHRREDKP FWWSHFDRLN NPVDEWADST DVFLVDPGGA
QVEVDWHTPP RARKPQRRIR LSGVMAAGGL SRDMYALYEP PAPAGLGDDP DRRAAGSVTV
VECNAPDVPT EVVVVERTPR DGGVFDQLPF ALAPGQPIRT TALRESIEAT AADLAERLPR
LPETAVTDIL LRRPPRTRSG APLPTGPDVS AAITAALLDL DSSYLAVHGP PGTGKTFTAA
AIIATLVNTH GWRIGVVAQS HAVVENLFRG VIDAGVDAAR VAKKRGHGDD ANWTVLEEAG
FPGFVADHDG CVIGGTAWDF ANGNRIPRGC LDLLVIEEAG QFCLANTIAV APAAANLLLL
GDPQQLPQVS QGTHPEPVDA SALGWLVDGA HTLPAERGYF LDVSWRMHPA VCAAVSRLSY
DGRLQSNDAV TTARTLEGWS PGVHEVTVPH DGNATESPEE ADAIVTRIGR LLGSVWTDEN
GSRPLAQDDV MVVTPYNAQV VLLRQRLDAA RLTDVRVGTV DKFQGQQAPV VFISMVASSI
DDVPRGISFL LNRNRLNVAI SRAKYAAVIV RSEALTEYLP STPKGLVELG AFLSLSQSPS
RRE
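
The relationship between the Gene sequence and the Protein sequence above can be illustrated with a minimal Python sketch. It is not the database's own method. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares, and it assumes that the first codon (here GTG) is a valid start codon and is therefore read as methionine. The names `CODON_TABLE` and `translate_cds` are hypothetical helpers introduced for this example.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout of the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if i == 0:
            # Assumption: the first codon is a start codon, read as Met.
            aa = "M"
        elif aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, with the page's 10-base spacing kept.
first_line = (
    "GTGTTCGTCG AATCCGTTGG GGCGCACGAC "
    "GAGGTCATCT ACAGCGCTTC GGATCTGGCC"
)
print(translate_cds(first_line))  # MFVESVGAHDEVIYSASDLA
```

The output matches the first 20 residues of the listed protein sequence. The last four codons of the gene, CGG CGT GAA TGA, likewise correspond to the final residues RRE followed by a stop codon.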