Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_5796 |
Symbol | |
ID | 4610505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008704 |
Strand | - |
Start bp | 5103 |
End bp | 8033 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639789451 |
Product | helicase domain-containing protein |
Protein accession | YP_935786 |
Protein GI | 119855183 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG4646] DNA methylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.00607211 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 2.23463e-19 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATCCGC GCTTTGAGCG AAACGTCGCT GCGCTGGAGG CGGTGCAGCC ACCCTGGCTG AGCCGCGAAG ACATTCGCGT TGAACTGGGC TCACCGTGGA TCACCGCCGG CGACGTCGCC GACTTCTGCG CTGAGGTGTT TGGGGCGCGG GCCGGTGTCG ATCATGTGGC GCCGCTGGCA GCGTGGGAGG TGAGTGCGCG TGGCCAGATC TCGCCGGAGG CGCGGATCGC CTACTGCACC GACCGCATGG ATGCGATCGA CCTGCTGCAG ATCGGACTCA ACGGCGCCGC TCCGGTCGTG TGGGATGAGT TCTACGACCA GCAGACGCAC ACGCGGCGCA AGGTCCGCAA CGCCGATGCC ACCGAGGCCG CCGAACTGAA GCTCGCCGCG ATCCAACAGC GGTTCTCGCT GTGGGTGTGG GAGAACGCCG ACCGGGAACG GCGCATCGTC GAGCAGTACA ACCAGACCAT GAACGCGCAC GTGCTGCGCA ATCATGACGG GTCGCATCTG ACGTTCCCAG GCCTGGCCGA CGGCATCGCG TTATGGCCGT GGCAACGTGA CTTCGTTGAC CGTGCGGTTT CCACGCCCGC GGTTTTCTGT GCGCACGAAG TCGGGCTCGG AAAGACGCTC ACGGCAATCA CGTTGGCAAT GACGTTGCGG CAGTTCGGAT TGGCGAACCG TCCGGCGTTG ATCGTTCCCC TGCATCTGAT CGAGCAGGCA ACCCGCCAGT GCTACCAGGC GTGGCCGGCG GGCCGGTTCC TGATTGTGAC GCGTGAGGAT TTACACGGTG ATGCACGGCG CCGGTTCGTG GCGCGCTGCG CGACGGGGGA TTGGGATCTG GTGATCATGA CCCACGAAAC GTTTTCCTCG CTGCCGGTCC CCGGCAATGC GGAGCGTGAT TGGCTAGAAG ACCAGCTCGG CGAACTCGAA AATTATGCCC GCACTGAGGG TTACACCGGC AAGCGGATCG CCGCCGCGGT GCGGTCCCTG CAGGGCCGAC TCGAGAAGCT GCGCGCGTCG GTCAACGACC CCAAGGCGGT CACGTTCAAG AGCCTGGGAA TCGATTATCT GATCGTCGAC GAGGCGGACA AGTTCCGCCG GCTGCCGGTG ACCACCCGCG CAGACGGGTT TAGCCTCGGC TCGTCGAAAC GGGCCCTGGA CCTGTTTCTC AAGGTGTCGC TGCTGCGGCG GGCCAACCCG GATCGGCCGC ATGCCTGCCT TTTGACGGGA ACGCCGTTCA CCAACACGCT CGCCGAAGGG TTTGTGTGGC AGAGCATGCT TGCCCCCGAG CAGCTGGCGC GCACGGGGTT GGGGCACTTC GACGCGTGGG CGGCCCAGTT CGTGCGGTAC AAGGTGCTGA TCGAGACCAG CCCAGACGGG TCGGGGTTCC GGTCGCGGCG CCGGCCGGGC ACCATCCAAA ACGTGCCCGA GCTGCGCACG ATGCTCTCGG AGTTCATGTC GATGGTGCGG GCCGACAGCG TAGGTCTGCC GCGACCGCAG GTGCAGAACC ATACCCATCT GAGCGAACCG ACCGACGCGC AGCGCGAATT CATGGCACAC CTGGTGACGC GGGCCGACGC GCTGCGGGCC CGGATGCCCT CGGCCGAGTC GGACAACATG CTGCTGATCT GCGGTGACGG GCGCAAGGTG GCGCTGGACC CGAACCTGGT CGGAATCCGC GAGGAAGCGC CCAAGCTCGA TGGCGTGGCC GCGGCGGTAG CCGACATCTA CCACCGCACC CGGCATCTGA CGTACTCCGG ATCGACAACG CCCGGGGCGT TCCAACTGGT GATGTGCGAC ATGGGCACAC CGAAGAAGGG CGACGCACAA AGCTACGGGC GGATCCGGGC GGGACTGATC GCGCGCGGGG TGCCGGCCGA GCAGATCCGA TTCGTGCACG AAGCGACCAC GGCCAAGGCC CGAGAGGCGC TATTCGCAGC ATGCCGGGAC GGCCGGGTTG CGGTGCTGCT GGGCTCGACG CCCAAGGTCG GCATCGGGAC CAACGTGCAG AACAGGCTGC ACTCGCTACA CCACGTGGAC CCGACGTGGA CGGCCGCGGC GTGGGAGCAG CGCAACGGGC GCATCCAGCG CAACGGCAAC CAGCACGCTA CGGCCGAAAT CCATTCGCAC GTAGCGCGGG GAACGTTCGA CGCGTTCATG TTCGGCACTG TGGAGCGCAA AGCCCGAGGT TTCGCGCAGC TGTACCGGAT GGACGGCCAG GCCCGCGAAA TCGAGGACAT CGGTGACGAG GTACTGACGT TCGGTGAACT CAAGGCCGCC GCGGCGGGCA ACGATCTCTT GCTGCGGCAG CATGAGCTGG AAAGCCGGGT CCGGGCGCTG CGGTTGGCGC ACGTGACCGT CCAGCAGAAC GTGCGGACGC TGCTGCATCA GGCCGCGGCC GCGGACACCG CCGCCGAGGC CGCGGCAGCA CGCGTCCAGC GGTTGCAGGC CTTCGCCGAG CACCGCGACG GCATGCGGGA GATGGACATG ACCAGGGTCG CCGCCGACGC CTGCACGGTC CGGGATCCGG CGGCCTACCG CTCGCGGTAC CGAGCAGAGT GCGGCGATCA CCGGGTATCG GTGCGCGTTG TGGATACCGA CCCTGGACAG CGTTTGGAGC TGGCCTTCGA CTACCGCGTG CTGTGGGCCG AACCGCTACC TGGCAAGGTG CGCCGTCGCG GCGCCGAGGC GGTCAAAGCC TGGGCGGAGG CGATGGTGGC AGCGTGGGTC GCAGGCGTCG ATCGCGAGAT CGTTGCGACG CAAAGCCGCG TCGAGGAATC CCGGCGGCGC GCCCAGGACG CCCGCACCGC CGCGGCGGCC ACCAACACCG GAGAGCCGGC CGATCTGCTC GCAGCCCGCG CCGAGTTGGT CGAGGTCAAT AGGGCCATCG ACGACGCGCT GAAAGGGGAA AGCCGGCCCG CCGCAGCGTA G
|
Protein sequence | MDPRFERNVA ALEAVQPPWL SREDIRVELG SPWITAGDVA DFCAEVFGAR AGVDHVAPLA AWEVSARGQI SPEARIAYCT DRMDAIDLLQ IGLNGAAPVV WDEFYDQQTH TRRKVRNADA TEAAELKLAA IQQRFSLWVW ENADRERRIV EQYNQTMNAH VLRNHDGSHL TFPGLADGIA LWPWQRDFVD RAVSTPAVFC AHEVGLGKTL TAITLAMTLR QFGLANRPAL IVPLHLIEQA TRQCYQAWPA GRFLIVTRED LHGDARRRFV ARCATGDWDL VIMTHETFSS LPVPGNAERD WLEDQLGELE NYARTEGYTG KRIAAAVRSL QGRLEKLRAS VNDPKAVTFK SLGIDYLIVD EADKFRRLPV TTRADGFSLG SSKRALDLFL KVSLLRRANP DRPHACLLTG TPFTNTLAEG FVWQSMLAPE QLARTGLGHF DAWAAQFVRY KVLIETSPDG SGFRSRRRPG TIQNVPELRT MLSEFMSMVR ADSVGLPRPQ VQNHTHLSEP TDAQREFMAH LVTRADALRA RMPSAESDNM LLICGDGRKV ALDPNLVGIR EEAPKLDGVA AAVADIYHRT RHLTYSGSTT PGAFQLVMCD MGTPKKGDAQ SYGRIRAGLI ARGVPAEQIR FVHEATTAKA REALFAACRD GRVAVLLGST PKVGIGTNVQ NRLHSLHHVD PTWTAAAWEQ RNGRIQRNGN QHATAEIHSH VARGTFDAFM FGTVERKARG FAQLYRMDGQ AREIEDIGDE VLTFGELKAA AAGNDLLLRQ HELESRVRAL RLAHVTVQQN VRTLLHQAAA ADTAAEAAAA RVQRLQAFAE HRDGMREMDM TRVAADACTV RDPAAYRSRY RAECGDHRVS VRVVDTDPGQ RLELAFDYRV LWAEPLPGKV RRRGAEAVKA WAEAMVAAWV AGVDREIVAT QSRVEESRRR AQDARTAAAA TNTGEPADLL AARAELVEVN RAIDDALKGE SRPAAA
|
| |