Gene Mkms_5796 details


Gene Information

Locus tag: Mkms_5796
Symbol:
ID: 4610505
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008704
Strand:
Start bp: 5103
End bp: 8033
Gene Length: 2931 bp
Protein Length: 976 aa
Translation table: 11
GC content: 67%
IMG OID: 639789451
Product: helicase domain-containing protein
Protein accession: YP_935786
Protein GI: 119855183
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG4646] DNA methylase
TIGRFAM ID:
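
The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends with a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 5103
END_BP = 8033
STOP_CODON_BP = 3  # the final codon of a CDS encodes no residue


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (cds_length_bp - STOP_CODON_BP) // 3


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 2931
print(protein_length_aa(length))  # 976
```

Both results match the Gene Length (2931 bp) and Protein Length (976 aa) fields.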


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.00607211
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 2.23463e-19
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGATCCGC GCTTTGAGCG AAACGTCGCT GCGCTGGAGG CGGTGCAGCC ACCCTGGCTG 
AGCCGCGAAG ACATTCGCGT TGAACTGGGC TCACCGTGGA TCACCGCCGG CGACGTCGCC
GACTTCTGCG CTGAGGTGTT TGGGGCGCGG GCCGGTGTCG ATCATGTGGC GCCGCTGGCA
GCGTGGGAGG TGAGTGCGCG TGGCCAGATC TCGCCGGAGG CGCGGATCGC CTACTGCACC
GACCGCATGG ATGCGATCGA CCTGCTGCAG ATCGGACTCA ACGGCGCCGC TCCGGTCGTG
TGGGATGAGT TCTACGACCA GCAGACGCAC ACGCGGCGCA AGGTCCGCAA CGCCGATGCC
ACCGAGGCCG CCGAACTGAA GCTCGCCGCG ATCCAACAGC GGTTCTCGCT GTGGGTGTGG
GAGAACGCCG ACCGGGAACG GCGCATCGTC GAGCAGTACA ACCAGACCAT GAACGCGCAC
GTGCTGCGCA ATCATGACGG GTCGCATCTG ACGTTCCCAG GCCTGGCCGA CGGCATCGCG
TTATGGCCGT GGCAACGTGA CTTCGTTGAC CGTGCGGTTT CCACGCCCGC GGTTTTCTGT
GCGCACGAAG TCGGGCTCGG AAAGACGCTC ACGGCAATCA CGTTGGCAAT GACGTTGCGG
CAGTTCGGAT TGGCGAACCG TCCGGCGTTG ATCGTTCCCC TGCATCTGAT CGAGCAGGCA
ACCCGCCAGT GCTACCAGGC GTGGCCGGCG GGCCGGTTCC TGATTGTGAC GCGTGAGGAT
TTACACGGTG ATGCACGGCG CCGGTTCGTG GCGCGCTGCG CGACGGGGGA TTGGGATCTG
GTGATCATGA CCCACGAAAC GTTTTCCTCG CTGCCGGTCC CCGGCAATGC GGAGCGTGAT
TGGCTAGAAG ACCAGCTCGG CGAACTCGAA AATTATGCCC GCACTGAGGG TTACACCGGC
AAGCGGATCG CCGCCGCGGT GCGGTCCCTG CAGGGCCGAC TCGAGAAGCT GCGCGCGTCG
GTCAACGACC CCAAGGCGGT CACGTTCAAG AGCCTGGGAA TCGATTATCT GATCGTCGAC
GAGGCGGACA AGTTCCGCCG GCTGCCGGTG ACCACCCGCG CAGACGGGTT TAGCCTCGGC
TCGTCGAAAC GGGCCCTGGA CCTGTTTCTC AAGGTGTCGC TGCTGCGGCG GGCCAACCCG
GATCGGCCGC ATGCCTGCCT TTTGACGGGA ACGCCGTTCA CCAACACGCT CGCCGAAGGG
TTTGTGTGGC AGAGCATGCT TGCCCCCGAG CAGCTGGCGC GCACGGGGTT GGGGCACTTC
GACGCGTGGG CGGCCCAGTT CGTGCGGTAC AAGGTGCTGA TCGAGACCAG CCCAGACGGG
TCGGGGTTCC GGTCGCGGCG CCGGCCGGGC ACCATCCAAA ACGTGCCCGA GCTGCGCACG
ATGCTCTCGG AGTTCATGTC GATGGTGCGG GCCGACAGCG TAGGTCTGCC GCGACCGCAG
GTGCAGAACC ATACCCATCT GAGCGAACCG ACCGACGCGC AGCGCGAATT CATGGCACAC
CTGGTGACGC GGGCCGACGC GCTGCGGGCC CGGATGCCCT CGGCCGAGTC GGACAACATG
CTGCTGATCT GCGGTGACGG GCGCAAGGTG GCGCTGGACC CGAACCTGGT CGGAATCCGC
GAGGAAGCGC CCAAGCTCGA TGGCGTGGCC GCGGCGGTAG CCGACATCTA CCACCGCACC
CGGCATCTGA CGTACTCCGG ATCGACAACG CCCGGGGCGT TCCAACTGGT GATGTGCGAC
ATGGGCACAC CGAAGAAGGG CGACGCACAA AGCTACGGGC GGATCCGGGC GGGACTGATC
GCGCGCGGGG TGCCGGCCGA GCAGATCCGA TTCGTGCACG AAGCGACCAC GGCCAAGGCC
CGAGAGGCGC TATTCGCAGC ATGCCGGGAC GGCCGGGTTG CGGTGCTGCT GGGCTCGACG
CCCAAGGTCG GCATCGGGAC CAACGTGCAG AACAGGCTGC ACTCGCTACA CCACGTGGAC
CCGACGTGGA CGGCCGCGGC GTGGGAGCAG CGCAACGGGC GCATCCAGCG CAACGGCAAC
CAGCACGCTA CGGCCGAAAT CCATTCGCAC GTAGCGCGGG GAACGTTCGA CGCGTTCATG
TTCGGCACTG TGGAGCGCAA AGCCCGAGGT TTCGCGCAGC TGTACCGGAT GGACGGCCAG
GCCCGCGAAA TCGAGGACAT CGGTGACGAG GTACTGACGT TCGGTGAACT CAAGGCCGCC
GCGGCGGGCA ACGATCTCTT GCTGCGGCAG CATGAGCTGG AAAGCCGGGT CCGGGCGCTG
CGGTTGGCGC ACGTGACCGT CCAGCAGAAC GTGCGGACGC TGCTGCATCA GGCCGCGGCC
GCGGACACCG CCGCCGAGGC CGCGGCAGCA CGCGTCCAGC GGTTGCAGGC CTTCGCCGAG
CACCGCGACG GCATGCGGGA GATGGACATG ACCAGGGTCG CCGCCGACGC CTGCACGGTC
CGGGATCCGG CGGCCTACCG CTCGCGGTAC CGAGCAGAGT GCGGCGATCA CCGGGTATCG
GTGCGCGTTG TGGATACCGA CCCTGGACAG CGTTTGGAGC TGGCCTTCGA CTACCGCGTG
CTGTGGGCCG AACCGCTACC TGGCAAGGTG CGCCGTCGCG GCGCCGAGGC GGTCAAAGCC
TGGGCGGAGG CGATGGTGGC AGCGTGGGTC GCAGGCGTCG ATCGCGAGAT CGTTGCGACG
CAAAGCCGCG TCGAGGAATC CCGGCGGCGC GCCCAGGACG CCCGCACCGC CGCGGCGGCC
ACCAACACCG GAGAGCCGGC CGATCTGCTC GCAGCCCGCG CCGAGTTGGT CGAGGTCAAT
AGGGCCATCG ACGACGCGCT GAAAGGGGAA AGCCGGCCCG CCGCAGCGTA G
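
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC percentage; the helper names are hypothetical, and the example runs on the first printed row only, not on the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C, ignoring whitespace."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence above (60 bases in six blocks).
first_row = "ATGGATCCGC GCTTTGAGCG AAACGTCGCT GCGCTGGAGG CGGTGCAGCC ACCCTGGCTG"

print(len(clean_sequence(first_row)))      # 60
print(round(100 * gc_fraction(first_row)))  # 67
```

Passing the full sequence text to `gc_fraction` would give the figure to compare with the GC content field.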
 
Protein sequence
MDPRFERNVA ALEAVQPPWL SREDIRVELG SPWITAGDVA DFCAEVFGAR AGVDHVAPLA 
AWEVSARGQI SPEARIAYCT DRMDAIDLLQ IGLNGAAPVV WDEFYDQQTH TRRKVRNADA
TEAAELKLAA IQQRFSLWVW ENADRERRIV EQYNQTMNAH VLRNHDGSHL TFPGLADGIA
LWPWQRDFVD RAVSTPAVFC AHEVGLGKTL TAITLAMTLR QFGLANRPAL IVPLHLIEQA
TRQCYQAWPA GRFLIVTRED LHGDARRRFV ARCATGDWDL VIMTHETFSS LPVPGNAERD
WLEDQLGELE NYARTEGYTG KRIAAAVRSL QGRLEKLRAS VNDPKAVTFK SLGIDYLIVD
EADKFRRLPV TTRADGFSLG SSKRALDLFL KVSLLRRANP DRPHACLLTG TPFTNTLAEG
FVWQSMLAPE QLARTGLGHF DAWAAQFVRY KVLIETSPDG SGFRSRRRPG TIQNVPELRT
MLSEFMSMVR ADSVGLPRPQ VQNHTHLSEP TDAQREFMAH LVTRADALRA RMPSAESDNM
LLICGDGRKV ALDPNLVGIR EEAPKLDGVA AAVADIYHRT RHLTYSGSTT PGAFQLVMCD
MGTPKKGDAQ SYGRIRAGLI ARGVPAEQIR FVHEATTAKA REALFAACRD GRVAVLLGST
PKVGIGTNVQ NRLHSLHHVD PTWTAAAWEQ RNGRIQRNGN QHATAEIHSH VARGTFDAFM
FGTVERKARG FAQLYRMDGQ AREIEDIGDE VLTFGELKAA AAGNDLLLRQ HELESRVRAL
RLAHVTVQQN VRTLLHQAAA ADTAAEAAAA RVQRLQAFAE HRDGMREMDM TRVAADACTV
RDPAAYRSRY RAECGDHRVS VRVVDTDPGQ RLELAFDYRV LWAEPLPGKV RRRGAEAVKA
WAEAMVAAWV AGVDREIVAT QSRVEESRRR AQDARTAAAA TNTGEPADLL AARAELVEVN
RAIDDALKGE SRPAAA
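
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below is a minimal illustration, not the database's own pipeline. It uses the standard codon assignments, which translation table 11 shares for elongation codons; table 11 differs in the set of permitted start codons, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and the final row of the gene sequence above.
print(translate("ATGGATCCGC GCTTTGAGCG AAACGTCGCT"))  # MDPRFERNVA
print(translate("AGGGCCATCG ACGACGCGCT GAAAGGGGAA AGCCGGCCCG CCGCAGCGTA G"))  # RAIDDALKGESRPAAA
```

The two outputs match the first block and the last 16 residues of the protein sequence, and the final codon TAG is the stop codon.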