Gene Mkms_3944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3944 
Symbol 
ID4611880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4152463 
End bp4154016 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content72% 
IMG OID639793624 
ProductDNA-O6-methylguanine--protein-cysteine S-methyltransferase / DNA-3-methyladenine glycosylase II / transcriptional regulator Ada 
Protein accessionYP_939926 
Protein GI119869974 
COG category[F] Nucleotide transport and metabolism
[L] Replication, recombination and repair 
COG ID[COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[COG2169] Adenosine deaminase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.445653 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00805865 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGCACG ACCATGTCGG ATTTCCGCCG GTGGTGTCGG TGGCGGCGTG TAAGTCTGAC 
AGCATGTACG ACGATTTCGA CCGCTGCTAC CGGGCCGTGC AGTCCAAGGA CGCGCGGTTC
GACGGTTGGT TCGTCACGGC GGTGCTGACG ACGCGGATCT ACTGCCGCCC AAGCTGTCCC
GTCCGGCCGC CGTTCGCCCG CAACGTGCGC TTCTATCCGA CCGCCGCGGC CGCTCAGGCG
GCGGGATTCC GCGCCTGTAA GCGGTGCAGG CCCGACGCGT CGCCCGGTTC TCCCGAGTGG
AACGTCCGCG GCGATGTGGC CGCCAGGGCG ATGCGCCTGA TCGCCGACGG CACGGTCGAC
CGGGACGGTG TCACGGGTCT GGCCGGCCGG CTCGGCTACA CCACGCGGCA GTTGCAGCGC
ATCCTGCAGG CCGAGGTGGG GGCGAATCCG CTGGCGTTGG CCCGTGCGCA GCGGGCACAG
ACCGCACGCG TGCTGATCGA GACCACCGAC CTGCCGTTCT CCGATGTGGC GTTCGCCGCG
GGGTTCTCGA GCATCCGGCA GTTCAACGAC ACGGTGCGCG CCACCTCCGC GTGCACCCCG
ACCGCGATGC GGGAGCGGGC GCGACGCCGC TTCGGGGCGG CCACCGCCGG CGCGGGGTCA
CTGTCACTGC GCCTGCCGGT GCGTAGGCCG TTCGCCTACG AAGGGGTGTT CGGGCACCTG
GCGGCCAGCG CCGTACCGGG TGTCGAGGAG TTCCGCGACG GGGCCTTCCG CCGCACGCTG
CGGCTTTCGA GGGGCCACGG CATCGTCGGC CTCACCCCCC GCGACGGTCA CGTCGACTGC
GTGCTGCACC TCGAGGACCT TCGGGACCTG TCCAGCGCCA TCGCGCGGTG CCGGCGCCTG
CTGGACCTCG ACGCCGACCC GGAGGCCGTC GTCGACGTAC TCGGCGCCGA CCCGGACCTC
ACCGCGTTGG TGACGAAGGC GCCCGGGCAG CGCATCCCGC GCACTGTCGA CGAGGCGGAA
CTGGCCGTGC GGGTGGTTCT GGGCCAACAG GTGTCCCTGA AGGCCGCCCG CACGCACGCC
GCGCGGCTCG TCACCCACTA CGGTCGCCCG ATCAGCGATC CACACGGTGG CCTGACCCAC
GTGTTTCCCA CCGTTGAGGA ACTCGCCGAC ATCGCTGCGC CCCATCTGGC GGTGCCGCGC
AGCCGGCAGG CCACCGTCCG CTCGCTCATC GCGGCGCTGG CGTCGGGCGA CGTGCGGCTC
GATCCCGGAT GTGACTGGAA CGAGGCACGG GCACAACTCA CCGTACTGCC CGGCATCGGC
ACATGGACTG CGGAGGTGAT CGCGATGCGC GGACTCGGCG ATCCCGACGC CTTCCCCGTC
ACCGATCTGG GCGTGCTCAC CGCCGCTCGC CACCTCGGCC TGGCCGAGGA TGCCCGGGCC
CTTGCAGCGC ACGGCGCCCG GTGGCGTCCG TGGCGGGCCT ACGCGACGCA GCACCTGTGG
ACGGCGCTCG ATCATCCGGT CAACGACTGG CCCCCGAAGG AGATCCGACA GTGA
 
Protein sequence
MAHDHVGFPP VVSVAACKSD SMYDDFDRCY RAVQSKDARF DGWFVTAVLT TRIYCRPSCP 
VRPPFARNVR FYPTAAAAQA AGFRACKRCR PDASPGSPEW NVRGDVAARA MRLIADGTVD
RDGVTGLAGR LGYTTRQLQR ILQAEVGANP LALARAQRAQ TARVLIETTD LPFSDVAFAA
GFSSIRQFND TVRATSACTP TAMRERARRR FGAATAGAGS LSLRLPVRRP FAYEGVFGHL
AASAVPGVEE FRDGAFRRTL RLSRGHGIVG LTPRDGHVDC VLHLEDLRDL SSAIARCRRL
LDLDADPEAV VDVLGADPDL TALVTKAPGQ RIPRTVDEAE LAVRVVLGQQ VSLKAARTHA
ARLVTHYGRP ISDPHGGLTH VFPTVEELAD IAAPHLAVPR SRQATVRSLI AALASGDVRL
DPGCDWNEAR AQLTVLPGIG TWTAEVIAMR GLGDPDAFPV TDLGVLTAAR HLGLAEDARA
LAAHGARWRP WRAYATQHLW TALDHPVNDW PPKEIRQ