Gene Information |
Locus tag | Mkms_0828 |
Symbol | |
ID | 4614848 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | + |
Start bp | 901087 |
End bp | 905106 |
Gene Length | 4020 bp |
Protein Length | 1339 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639790504 |
Product | hypothetical protein |
Protein accession | YP_936834 |
Protein GI | 119866882 |
COG category | |
COG ID | |
TIGRFAM ID | |
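The coordinate and length fields above are mutually consistent, and the check can be sketched as follows. This is a minimal illustration, not part of the record: the helper names `gene_length` and `protein_length` are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the CDS is assumed to end with a stop codon that is not counted in the protein length (the gene sequence listed in this record ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    final codon is a stop codon that contributes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp, End bp.
length_bp = gene_length(901087, 905106)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values in this record, the computed lengths agree with the Gene Length (4020 bp) and Protein Length (1339 aa) fields.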
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.215826 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGGCGGT CGCGTCGCGG CTTCGGTCCG GATTTCGGTT GGCTCGAGCA GTTCGACGTC GATGGTCCGT TTTTATCGTT GCCGGTGGTG AAAGAGTTTT GGGCCAGTGG TATTGATCGG TTAAGCGATG CCGATGACCG GTTGGTGCGG TTCAAGCAGG GATTCACGGC GTGGCTGCGT GCTTATGACC AACAATCGCT GGAGAAGCGG GATCACTACG CGGCGACGGC GCGGGCCTGG GTCGATACGG TGCTCGACGA GCTCGCCGGT TGGGACGGTC TTCGCGTCGG TGCCGACGAG TTGCCTGCGG AGTTCGAGAT CCATTCGCCG GGTGAGCAGG TGCGGATTCG TGCGGATGGA GGGTTGCGGG GCCAGGAGAG CGATGAGGTC GCGGCGCTGC TGCGAGTGGT CTCGCCCACT GAGGATCTGC GCGGACCTGG GTTGGATGGG TGGGCGGCGA CCGAAATCGA CCGGATGGCG GCGCTTTTGC GGCAAGCGGG GGTCCCGATC GGTCTGGTCA CGGATGGTCG ATGGTGGGGG ATCGTGTGGG CCGAGGAAGG CACGGCCACC GGTTCGGGCA TCGTTGACGC GGTCACCTGG GGTGAAGAAC CGCTGCTGCG CGATGCGTTC CTGACGCTGA TCGATCAGCA GCTACTACGG GCGAAGAATG CCGATCACCG GCTGGCGCGG CTGTTGCAGC GCAGCGAACT CGAGGCCGAG GAGATCACCG AGGCCCTCGG CACTCAGGTT CGCAAGTCGG TCGAGCTTCT GGTGCAGGCG TTCTCCGAGA CACGACTGCT TGCAGCCGAG AATGGTGAGC CTGACCCGCT GACGGAGAAG CCCGACGACG TGTATCAGGC GGCGGTCACC GTGATGATGC GGGTGGTGTT CCTGCTGTTC GCGGAAGAGC GGGGAATGCT GCCGACCGAG CGGTTGTATT GGGATTCCTA CGCTATCCGT GAGCTTCTCG ACGATCTCAA GGGCCGGGCG TTGGCGCACG GCGAAGAAAG TCTCGACGAA ACTCACGATG TTTGGCATCG GCTGCTCGCG GTCAGTGATG CGTTGTACTT CGGAGTGAAT TACGACGAGA TGCGCATGCC TGCCTACGGT GGCTCGCTGT TGGACCCGGC TCGGTTCCCG TGGCTTACGG CGACCGATCA GCATGGTCTG CGGGTGCAGG TATCCGATCG GGTGATGCTG CACGTGCTCG AGTCCGTACA GGAGGCCAAG GTTCGGGGCG AGGCGCGACG GATTTCGTTC CGTGACATTG ACGTCGAACA GATCGGGTAC ATCTACGAAG GCCTGCTGGG TTACACGTGT GCCACAGTCG CCGACGATGT TGTGCTTGGA TTGGTCGGCA GGGAGGGTGA AGAGCCGGAG ATCACGCTCA GCCAGCTGAA CCAGCTGCAC GGAAGCGCCG GCGCTACAAA GGCTTTCGTT GACAAGTTGA TCGAGTGGGT GAAGAAGGTC CAGCCCGCCG CGAGTTTGAA GACGGCTGCG CAGTTGGTGA AGCTAACCGA TGCAAAGGTC GATGGGTCGG AACTGCGGCG AATCCTCACG CCGGTAGCCG GGCATGATTC CGAGTTGCTC ACCGATCTGA TCCGCTGGGG CAATCTCATC CGTCGCGATC TGCGAGGTAT TCCGCTGGTC GTTCCGCCGG GTGGCTTGGT GGTGATAGAA ACCCCGTCGC GGCGCAACGC TGGCGCGCAC TACACACCGC GACCGCTTGC GGAGGAAGTC GTCAGATACG CACTCGAACC GGTGGTGTAC GAGCCGGGGC CTTTGCAGAC CAACAACATC 
GACGAGTGGA AACTGAAGTC CAGCACGGCC ATTCTCGATC TCAAGGTCGC CGACATCGCG GCTGGCTCGG GGGCGTTCCT CGTCGCGGCG GCCCGGTTCC TCGCCAAGCG AGTCACCGAA GCGTGGACCA AGGAAGGGAT GCTCAACGAG GCTGAGCGTG CCGACCCGCT GATCGCGGAG GAGCGCGCGA TCCGCGAGGT CGTGGCCCGG TGTTTGTACG GCGCCGACAT CAACCCGATG GCCGTCGAGA TGTGCAAGCT GTCGCTGTGG TTGGTGTCGC TGGACAAGAC AAAGCCATTC TCATTCGTCG ACGACAAGAT CCTCTGCGGC AACTCGCTAC TGGGGGTGAC GACACTGGAT CAGCTTCGGC ACCTCCACAT CGCCCCGGAC AGGAAGCGAA AATTCGTACA GCCCTTCGTC GACGTTGATG CCGTGCTCGC CGAAGCGACA CGCTTGAGAC GAGAGCTGGC GTCGCCCGTC GATGAGGACG ATCCGCAGCG CTCGACAGCG GGCAAACTGC GGCTTCTCCG ACGAGCCGAG GAAGTCACCG CACAGCTGCG GGTGATGGCG GACGGGATCA TCGCAGCCGG CCTTGCCTTG GGTGGAAGAT CAGGCCCCCA GCTTGAGGAC GCCTACAAAT CGCTGGAATG GTCGCTGGCG GAAGCGTTCC CCAGCGACGG GTCGACCAGC AATCGGTCCG CCCTCGACGC GATCATCGCG AAGGGACTCG CCCCGACAGT TGACATCGAT TACGAACGGT GGCAACCCTT GCATTGGGTC ATTGAAGTTC CTGACGTGAT GGAACGTGGC GGCTTCGACG CCGTCATCGG CAATCCTCCG TTTCTCGGTG GAAAGAAGCT CGGCGGTGCG GTTGGGCCGA ACGTCCGAGA CTGGCTAGTC AATGTTCTTG CTGATGGAGC AAAGGGGGTT GCCGACCTCA TCGCATACTT CTTCCTTCGC GCTCATTCAC TTGTTCGCAA TGACGGGCTT CTCGGGCTGA TCGCGACTAA TACCGTCGCT CAAGGAGATA CCCGAGAGGT TGGTCTCGAC CAAATGGTCA ATCGCGGATT CACGATCATT CGGGCGATTA GAAGTCGACC ATGGCCGGCT CGGGGTGCAA GTCTAGACTT TGCCGCTGTC TGGGGACTTG CTGGAGGCGT ACAAAGCGAC CTGTCGCCGG TCAGTGACGG GTCTCCGGTA AAGCGAATTA CGTCTCTGCT TGAGCCGGGA GGGGTTGTTG AAGGCCACCC CTTGAGACTG GCCGAAAATG GCGGCTTCGG CTTCGTCGGT TGCTATATGA ACGGGACTGG CTTTGAGGTC CCGCCCGACG ACGCTGAGGC GATGATCAGA AGATCTCGTG ACAACGCCGA CGTCCTCTGT CCTCTACTTG GTGGCGACGA TGTTAATCGC GTGCCTTCGC TGACGGCGCC CATGTGGGTT GTCGACTTCT ACAACTTCGA ACTTGAGGAA GCGATGAAGT ACGCTGAGCC GTTCGCCTGG GTCGCGGAAC GGGTTCGTCC ACATCGCGAG AAGCTTGTGC AGAAGCCCAA GTTGGTGGCG CGATGGTGGC GGTACGAGAG AGATGCCAAG GCGATGCGCG AGGCAGTCGC ACGGTTAGAC GAAGTGCTAG TGCTCGCCCT CGTTAGCAAA TTCGTTATGC CAGTCCGTGT GCCCACGGGT CAAGTTTTTG TTCACTCCTT GGGAATTTTT GCCACAGATT CATTTGCTGA GCAGGCTATC CTGTCTTCCG TCATCCATCA GCTTTGGGCA ATCACTTACG GCTCGACTCT TGAAACTCGA GTTCGCTACA 
CGCCCACCGA TGTGATTGAA ACATTCCCGC GACCGAAGGT TACTGCCTCG CTTGAGCGTA TCGGCCGCGC GTTGGAGAGT GAGCGGCGCG AGATCATGCT GCGGCGCAAT CTTGGTCTGA CGAAGCTGTA CAACATTGTG CACGACGCTG AAGCCGCCGA CACTATGGAC AAGGACGCCG CACGATTGCG CGCGCTACAC GTTGAGCTTG ATGAGGCGGT CGTTGCTGCG TACGGATGGT CAGATATTCG GCTAGATCAC GGCTTCCATA CGTATCGGCA GGTGGAGGGG TTCTCGGTCT CTCCAGCAGC GCGGGTCGAA ATTTTGGATC GGCTATTGGA AGAGAACCAC CGCCGAGCGC AACTTGAGGG ACGAAGTGTG CCACAGAAGC AAGGGAAGTT GTTCTCATGA
Protein sequence | MRRSRRGFGP DFGWLEQFDV DGPFLSLPVV KEFWASGIDR LSDADDRLVR FKQGFTAWLR AYDQQSLEKR DHYAATARAW VDTVLDELAG WDGLRVGADE LPAEFEIHSP GEQVRIRADG GLRGQESDEV AALLRVVSPT EDLRGPGLDG WAATEIDRMA ALLRQAGVPI GLVTDGRWWG IVWAEEGTAT GSGIVDAVTW GEEPLLRDAF LTLIDQQLLR AKNADHRLAR LLQRSELEAE EITEALGTQV RKSVELLVQA FSETRLLAAE NGEPDPLTEK PDDVYQAAVT VMMRVVFLLF AEERGMLPTE RLYWDSYAIR ELLDDLKGRA LAHGEESLDE THDVWHRLLA VSDALYFGVN YDEMRMPAYG GSLLDPARFP WLTATDQHGL RVQVSDRVML HVLESVQEAK VRGEARRISF RDIDVEQIGY IYEGLLGYTC ATVADDVVLG LVGREGEEPE ITLSQLNQLH GSAGATKAFV DKLIEWVKKV QPAASLKTAA QLVKLTDAKV DGSELRRILT PVAGHDSELL TDLIRWGNLI RRDLRGIPLV VPPGGLVVIE TPSRRNAGAH YTPRPLAEEV VRYALEPVVY EPGPLQTNNI DEWKLKSSTA ILDLKVADIA AGSGAFLVAA ARFLAKRVTE AWTKEGMLNE AERADPLIAE ERAIREVVAR CLYGADINPM AVEMCKLSLW LVSLDKTKPF SFVDDKILCG NSLLGVTTLD QLRHLHIAPD RKRKFVQPFV DVDAVLAEAT RLRRELASPV DEDDPQRSTA GKLRLLRRAE EVTAQLRVMA DGIIAAGLAL GGRSGPQLED AYKSLEWSLA EAFPSDGSTS NRSALDAIIA KGLAPTVDID YERWQPLHWV IEVPDVMERG GFDAVIGNPP FLGGKKLGGA VGPNVRDWLV NVLADGAKGV ADLIAYFFLR AHSLVRNDGL LGLIATNTVA QGDTREVGLD QMVNRGFTII RAIRSRPWPA RGASLDFAAV WGLAGGVQSD LSPVSDGSPV KRITSLLEPG GVVEGHPLRL AENGGFGFVG CYMNGTGFEV PPDDAEAMIR RSRDNADVLC PLLGGDDVNR VPSLTAPMWV VDFYNFELEE AMKYAEPFAW VAERVRPHRE KLVQKPKLVA RWWRYERDAK AMREAVARLD EVLVLALVSK FVMPVRVPTG QVFVHSLGIF ATDSFAEQAI LSSVIHQLWA ITYGSTLETR VRYTPTDVIE TFPRPKVTAS LERIGRALES ERREIMLRRN LGLTKLYNIV HDAEAADTMD KDAARLRALH VELDEAVVAA YGWSDIRLDH GFHTYRQVEG FSVSPAARVE ILDRLLEENH RRAQLEGRSV PQKQGKLFS