Gene Information |
Locus tag | Mkms_1433 |
Symbol | |
ID | 4614264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | + |
Start bp | 1537080 |
End bp | 1540091 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639791108 |
Product | hypothetical protein |
Protein accession | YP_937435 |
Protein GI | 119867483 |
COG category | [S] Function unknown |
COG ID | [COG1615] Uncharacterized conserved protein |
TIGRFAM ID | |
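The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The record does not state either convention; they are simply the ones under which the listed numbers agree. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the record for Mkms_1433.
length_bp = gene_length_bp(1537080, 1540091)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the computed lengths match the listed gene length (3012 bp) and protein length (1003 aa).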
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.207138 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGTATGC GGCCCCCCGC GAGGATGCCG AAGCTGACGC GACGAAGCCG GGTGCTGATC GGTGTCGCCC TCGCCGCGGT GGTGCTGCTG TTGATCGGGC CACGGTTCAT CGACACCTAT GTGAACTGGC TGTGGTTCGG TGAACTCGGC TACCGGTCGG TGTTCACCAC GGTGCTGTTG ACCCGCGTCG TCGTGTTCCT GGTCGTCTCG CTGCTGATCG GCGCGATCGT CTTCGCCGGC CTTGCGTTGG CCTACCGGAC GCGGCCGGTG TTCGTCCCGA CCGCCGGCCC CAACGATCCG ATCGCCCGCT ACCGCACGAC GGTGATGGCG CGGCTGCGCC TGTTCGGCTT CGGCGTCCCG GCGTTCATCG GCATCCTGTC CGGCATCGTC GCGCAGAGCT ACTGGGTGCG CATCCAGCTG TTCCTGCACG GTGGTGAATT CGGGGTCACC GACCCGCAAT TCGGCCTCGA CCTCGGGTTC TACGCCTTCG ACCTGCCGTT CTACCGGCTG GTGCTGAGCT ATCTGTTCGT CGCGACGTTC CTCGCCTTCA TCGCGAACCT GTTGGGCCAC TACCTGTTCG GCGGGATCCG GCTGACCGGG CGCAACGGGG CGCTGACCCG CTCGGCGCGG ATCCAGCTCG TCACCCTGGT CGGCATCCTG ATCCTGCTCA AGGCGTTCGC CTACTGGCTC GACCGCTACG AGCTGCTCAG CCACACCCGC GGCGGCAAGC CGTTCACCGG CGCCGGCTAC ACCGACATCA ACGCCGTGCT GCCCGCCAAG CTGATCCTGC TGGCGATCGC GGTGATCTGC GCGGTCGCGG TGTTCTCCGC GATCGTGCTG CGCGACCTGC GCATCCCCGC CATCGGTGTG GTGCTGCTGC TGCTCTCGTC GCTGGTCGTC GGCGCCGGCT GGCCGCTGGT GGTCGAGCAG TTCAGCGTCA AGCCCAACGC CGCGCAGAAG GAAAGCGAAT ACATTTCGCG CAGCATCGCC GCGACCAGGC AGGCCTACGG TCTGACCGAC GAGGTGGTCA CCTACCGCGA CTACCCGGGC AACGCGCCGG CCACCGCCCA GCAGGTGGCC GCCGACCGGT CCACCACGTC GAACATCCGC GTGCTCGACC CGAACATCGT CAGCCCGGCG TTCACCCAGT TCCAGCAGGG CAAGAACTTC TACTTCTTCC CCGAACAGCT GGCGATGGAC CGCTACCGCG ACGCCGACGG CAACCTGCGC GACTACGTGG TCGCCGCCCG CGAGCTCAAC CCGGACCGGT TGATCGACAA CCAGCGCGAC TGGATCAACC GACACACCGT CTACACCCAC GGCAACGGGT TCATCGCCTC ACCGGCGAAC ACGGTCCGGG GGATCGCCAA CGACCCCAAC CAGAACGGCG GCTACCCGGA GTTCCTCGCG AGCGTGGTGG GCGCCAACGG TGCCGTCGTC TCGCCCGGGC CGGCGCCGCT GGACCAGCCG CGCATCTACT TCGGCCCGGT CATCGCCAAC ACCGCGTCGG ATTACGCGAT CGTCGGGGAG AACGGCACCC CGCGCGAATA CGACTACGAG AACAACGTCG AGACCCGCAA CTACACCTAC ACCGGCTCCG GCGGTGTGCC GATCGGCAAC TGGCTGACGC GAAGCCTGTT CGCGGCGAAG TTCGCCGAGC GCAACTTCCT GTTCTCCAAC GTCATCGGTG AGAACAGCAA GATCCTGTTC AACCGTGACC CCGCCGACCG GGTCGAGGCC GTCGCGCCGT GGCTGACCAC CGACACCACG GTGTACCCGG CGATCGTCAA CAAGAAGATC 
GTGTGGATCG TCGACGGCTA CACGACGCTC GACAACTACC CCTATTCGGA GTTGACGTCG CTGTCGTCGG CGACCGCGGA CTCCAACGAG GTGGCCGTCA ACCGGTTGGC GCTCAACAAG CAGGTGTCCT ACATCCGCAA CTCGGTGAAG GCCACCGTCG ACGCCTACGA CGGCACGGTG ACGCTGTACG CCCAGGACGA GACCGACCCG GTGCTGCAGG CGTGGATGAA GGTGTTCCCC GACACCATCA AGCCCAAGAG CGAGATCAGC CCCGAGTTGC AGCAGCACCT GCGCTATCCC GAGGACCTGT TCAAGGTGCA GCGCGCGCTG CTGGCCAAGT ATCACGTCGA CGACCCGGTG ACGTTCTTCT CGACGTCGGA CTTCTGGGAC GTCCCGCTGG ACCCGAACCC CACGGCCAGC AGCTTCCAGC CGCCGTACTA CATCGTGGCC AAGGACCTCG CGGAGAACAA CAACTCGGCG GCGTTCCAGC TGACCAGTGC GATGAACCGG TTCCGTCGTG ACTTCCTCGC CGCGTACATG AGCGCGAGTT CGGATCCGGA GACCTACGGA AAGATCACGG TGCTGACCAT CCCGGGTCAG GTCAACGGTC CCAAGCTGGC GTTCAACGCG ATCAGCACCG ACACCGCGGT CAGCCAGGAT CTCGGCGTCA TCGGCCGCGA CAATCAGAAC CGGATCCGGT GGGGCAACCT GTTGACCCTG CCGGTGGGCC CGGGCGGTCT GCTGTACGTG GCGCCGGTGT ACGCCTCACC GGGTACCAGC GATGCGGCGT CGACCTACCC GCGCCTGATC CGTGTGGCGA TGTTCTACAA CGATCAGGTC GGATACGGCC CGACGGTGCG CGACGCGCTG ACCGACCTGT TCGGTGCGGG TGCCGACGCC ACCGCCACCG GACCCGCGCC GGCCAACCTG CCCGACGGTC AGCCGGCGGC CCAGCCGCCG AACGGTCAGC AGCCCGCGGC ACAGACCCCC GGCAACCAGG CGGGTCGGGC GCCGACGCCC CCTCCGGCCG CCATCCCGTC GGGGCCGTCG GGTCCCCAGC AGTTATCCGA GGCGAAAGCC GCGGCGCTGC AGGAGGTCCA GGAGGCGATG AGCGGTCTGC AGGACGCGCA GCGCAGCGGC AACTTCGCCG AATACGGTGA GGCGCTGCAG CGTCTGGACG ACGCGATGAA CAGGTACTCC GAGGCGCGCT GA
|
Protein sequence | MGMRPPARMP KLTRRSRVLI GVALAAVVLL LIGPRFIDTY VNWLWFGELG YRSVFTTVLL TRVVVFLVVS LLIGAIVFAG LALAYRTRPV FVPTAGPNDP IARYRTTVMA RLRLFGFGVP AFIGILSGIV AQSYWVRIQL FLHGGEFGVT DPQFGLDLGF YAFDLPFYRL VLSYLFVATF LAFIANLLGH YLFGGIRLTG RNGALTRSAR IQLVTLVGIL ILLKAFAYWL DRYELLSHTR GGKPFTGAGY TDINAVLPAK LILLAIAVIC AVAVFSAIVL RDLRIPAIGV VLLLLSSLVV GAGWPLVVEQ FSVKPNAAQK ESEYISRSIA ATRQAYGLTD EVVTYRDYPG NAPATAQQVA ADRSTTSNIR VLDPNIVSPA FTQFQQGKNF YFFPEQLAMD RYRDADGNLR DYVVAARELN PDRLIDNQRD WINRHTVYTH GNGFIASPAN TVRGIANDPN QNGGYPEFLA SVVGANGAVV SPGPAPLDQP RIYFGPVIAN TASDYAIVGE NGTPREYDYE NNVETRNYTY TGSGGVPIGN WLTRSLFAAK FAERNFLFSN VIGENSKILF NRDPADRVEA VAPWLTTDTT VYPAIVNKKI VWIVDGYTTL DNYPYSELTS LSSATADSNE VAVNRLALNK QVSYIRNSVK ATVDAYDGTV TLYAQDETDP VLQAWMKVFP DTIKPKSEIS PELQQHLRYP EDLFKVQRAL LAKYHVDDPV TFFSTSDFWD VPLDPNPTAS SFQPPYYIVA KDLAENNNSA AFQLTSAMNR FRRDFLAAYM SASSDPETYG KITVLTIPGQ VNGPKLAFNA ISTDTAVSQD LGVIGRDNQN RIRWGNLLTL PVGPGGLLYV APVYASPGTS DAASTYPRLI RVAMFYNDQV GYGPTVRDAL TDLFGAGADA TATGPAPANL PDGQPAAQPP NGQQPAAQTP GNQAGRAPTP PPAAIPSGPS GPQQLSEAKA AALQEVQEAM SGLQDAQRSG NFAEYGEALQ RLDDAMNRYS EAR
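The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). That table assigns the same amino acids as the standard code but allows alternative start codons such as GTG, which is read as methionine when it initiates translation. This is why the gene begins with GTG while the protein begins with M. The following is a minimal, dependency-free sketch of that translation. The `translate` helper and its handling of whitespace are assumptions made for illustration, not the method used by the source database.

```python
from itertools import product

# Amino acids for the 64 codons in TCAG order (identical in NCBI tables 1 and 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a forward-strand CDS, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    A permitted start codon in the first position is translated as M.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
prefix = translate("GTGGGTATGC GGCCCCCCGC GAGGATGCCG")
print(prefix)
```

Applied to the first 30 bases of the gene, the sketch reproduces the first ten residues listed in the protein sequence.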