Gene Mkms_1785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_1785 
Symbol 
ID4613893 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp1892183 
End bp1894564 
Gene Length2382 bp 
Protein Length793 aa 
Translation table11 
GC content67% 
IMG OID639791451 
ProductABC transporter related 
Protein accessionYP_937776 
Protein GI119867824 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.806156 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGGAA AGACGGGTAA GCACGCCGCC GACACCCACG ACGTCATCCG CGTGGTCGGC 
GCGCGGGAGA ACAACCTCAA GAACATCGAC GTCGAACTGC CGAAACGGCG GCTGACCGTG
TTCACCGGGG TGTCGGGATC GGGTAAGAGC TCGTTGGTGT TCGGCACCAT CGCCGCGGAA
TCCCAGCGTC TGATCAACGA GACGTACAGT GCGTTCCTGC AGGGCTTCAT GCCGTCGATG
TCGCGGCCGG ACGTGGACGT CCTCGAAGGG CTGACGACGG CGATCATCGT CGACCAGGAG
CGGATGGGCG CCAACCCGCG CTCGACGGTC GGCACGGCGA CCGACGCCCA TGCCATGCTG
CGGATCCTCT TCAGCCGCCT CGGTGAGCCG CACATCGGTT CACCGCAGGC GTTCTCGTTC
AACGTCGCCT CCGTCAGCGG GGCCGGCGCG GTGACGTTCG ACAAGGGCGG CAGGACCGTC
AAGGAGAGGC GCGAGTTCTC GATCACCGGC GGGATGTGTC CACGGTGCGA GGGCCGCGGG
TCGGTGTCCG ACATCGACCT CACCGCGCTC TACGACGACT CCAAATCCCT CAACGAGGGC
GCGCTGAGCA TCCCCGGCTA CAGCATGGAG GGTTGGTACG GCCGAATCTT CCGTGGCTGT
GGGTATTTCG ACCCCGACAA ACCGATCCGC AAGTACACCA AGAAGGAACT CAACGACCTC
CTGTACCGCG AGGCCACGAA GATCAAGGTC GACGGGGTCA ACCTCACCTA CGCCGGGCTG
ATCCCCACGA TCCAGAAATC GTTCCTGTCC AAGGACGTCG ATGCGATGCA ACCCCACATC
CGCGCATTCG TCGAACGGGC GGTGACGTTC GCGACCTGCC CCGAGTGCGA GGGCACGCGC
CTGACCGATC AGGCGCGGTC GTCGAAGATC AAGGGCTGCA GCATCGCCGA CGTGTGCGCG
ATGCAGATCA GCGACCTCGC CGAGTGGATC CGCGGACTCG ACGAAGCCTC CGTCCGTCCC
CTGCTGGACG GTTTGGGCCA CCTTCTGGAT TCGTTCACCG AGATCGGCCT GGGCTACCTC
TCGCTGGACC GTCCCGCGGG CACGCTGTCC GGGGGAGAAG CTCAGCGCAC GAAGATGATC
CGCCACCTCG GCTCGTCACT CACCGACGTC ACCTACGTCT TCGACGAGCC CACCATCGGC
CTGCATCCCC ACGACATCGA GCGGATGAAC ACGCTGCTGC TGCGACTGCG GGACAAGGGC
AACACGGTGC TCGTCGTCGA GCACAAACCC GAGACGATCG TCATCGCCGA CCGCGTCGTG
GACCTCGGAC CCGGTGCGGG TACCGGCGGC GGCGAGGTGG TCTTCGAGGG CACCGTCGCC
CAGCTCCGTC GCAGCGGCAC GCTCACCGGA CGTCACCTCG ACGACCGGGC GGCCCTGAAG
AAGTCTGTGC GCCAAGCACA AGGCGCCCTG GAGATCCGCG GTGCCACGAC GAACAACCTG
CGCGACGTCG ACGTCGACAT CCCGCTCGGT GTCCTCACGG TGCTCACCGG GGTCGCGGGG
TCGGGGAAGA GCTCGCTCAT CGACGGTTCG GTGGCCGGCC GCGACGAGGT CGTCTCGATC
GATCAGGGCG CGATCCGCGG TTCCCGGCGA AGCAACCCCG CCACCTATAC CGGCCTGCTC
GACCCCATCC GCAAGGCGTT CGCCAAAGCC AACGGCGTGA AGCCGGCGCT GTTCAGTTCC
AACTCCGAAG GCGCCTGCCC GGCCTGCAAG GGCGCCGGTG TCATCTACAC CGAACTCGGC
GTCATGGCGA CCGTGGAATC ACCGTGCGAG GAATGCGAGG GACGACGGTT CCAGGCCTCG
GTCCTCGAGT ACACGCTCGG CGGCCGGAAC ATCGCCGACG TGCTCGAGAT GTCGGTGGCG
GACGCGCTCG GCTTCTTCGC GGACGGCGAG GCCGCGACCC CGGCCGCGCA CAAGGTGCTC
GACCGTCTCG CCGATGTGGG GCTCGGATAC CTCAGCCTCG GTCAGCCGCT CACCACACTC
TCCGGGGGCG AACGGCAGCG CCTCAAGCTG GCCACCCGGC TGGGGGACAC CGGTGCCGAC
AAGAAGGACG TCTACGTACT CGACGAGCCG ACCTCGGGTC TGCACCTCGC CGACGTCGAG
CAGCTGCTCG CCCTGCTCGA CCGGCTGGTC GACTCCGGCA AGACGGTCAT CGTGATCGAG
CACCACCAGG CCGTGATGGC GCACGCGGAC TGGATCATCG ACCTCGGTCC CGGCGCCGGC
CACGACGGGG GCCGGATCGT CTTCGAGGGA CCTCCGGCAG ACCTCGTGGC CAGCCGGGCG
ACCCTCACCG GTGAGCATCT CGCCGACTAC GTCGGCGGCT GA
 
Protein sequence
MAGKTGKHAA DTHDVIRVVG ARENNLKNID VELPKRRLTV FTGVSGSGKS SLVFGTIAAE 
SQRLINETYS AFLQGFMPSM SRPDVDVLEG LTTAIIVDQE RMGANPRSTV GTATDAHAML
RILFSRLGEP HIGSPQAFSF NVASVSGAGA VTFDKGGRTV KERREFSITG GMCPRCEGRG
SVSDIDLTAL YDDSKSLNEG ALSIPGYSME GWYGRIFRGC GYFDPDKPIR KYTKKELNDL
LYREATKIKV DGVNLTYAGL IPTIQKSFLS KDVDAMQPHI RAFVERAVTF ATCPECEGTR
LTDQARSSKI KGCSIADVCA MQISDLAEWI RGLDEASVRP LLDGLGHLLD SFTEIGLGYL
SLDRPAGTLS GGEAQRTKMI RHLGSSLTDV TYVFDEPTIG LHPHDIERMN TLLLRLRDKG
NTVLVVEHKP ETIVIADRVV DLGPGAGTGG GEVVFEGTVA QLRRSGTLTG RHLDDRAALK
KSVRQAQGAL EIRGATTNNL RDVDVDIPLG VLTVLTGVAG SGKSSLIDGS VAGRDEVVSI
DQGAIRGSRR SNPATYTGLL DPIRKAFAKA NGVKPALFSS NSEGACPACK GAGVIYTELG
VMATVESPCE ECEGRRFQAS VLEYTLGGRN IADVLEMSVA DALGFFADGE AATPAAHKVL
DRLADVGLGY LSLGQPLTTL SGGERQRLKL ATRLGDTGAD KKDVYVLDEP TSGLHLADVE
QLLALLDRLV DSGKTVIVIE HHQAVMAHAD WIIDLGPGAG HDGGRIVFEG PPADLVASRA
TLTGEHLADY VGG