Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_5820 |
Symbol | |
ID | 4610529 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008704 |
Strand | - |
Start bp | 30136 |
End bp | 33330 |
Gene Length | 3195 bp |
Protein Length | 1064 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639789475 |
Product | hypothetical protein |
Protein accession | YP_935810 |
Protein GI | 119855207 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 45 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.175347 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGGGC CGGCGCCGAG CATTGATCGC CATGTCGTCA TCATCGCCGC GGAAGGTTCG TTCACTGCGG CGGGAGACCA GCTCACCGGC GCGGTCGATA CCGTCGACAA ACTCGAGAAG CTCATCGAGT GGGCGCATCA GCGAGGGGGG CTCCAGCCCG TACCGGCTGG TGAGGAAGAG CATGAGCCGG CCCGCGTATG GGTGGTCGGG GCCGCATGCG ACCTACTGGC GGGAACGCCC CTCGCGCAGG CTGCCGACGA CAACGAAGCC GAGCGCATTG GGCAAGTCCT CGCCCCGCTG GTCACACGCG GGTGGGAACT GCGCGGCAGA CCGACCTCAG CGCTCCTGCT CACCCGCGGC CAGGGTGCGC AGCGGATCTA CGTTGAGATC CTCGCCGAGC GCCAACCCTG GCTCGCTGCC GGGCATGCGG CGGTTGTCGG GAATCCGGCA GAACTCGGCC GCCGGTTACG CGCGTGGTAC GCCGCAGTGG GAACGCTGCC GGCCCGCAGC GGCGCAGCCT CGGCAGCCGT CCTCCACGAC CACATCATGC GCGCCCGGAC AGGCCGTCGC GGCGCTGTCG TGTCGACCCC TGGCGTCCTG CCAGCCTGGG TGCAGCCCGA TATGCGGATC CAACCAGCCT GGTGTGTCTC CGTGGACGAG GCGGAGCGTG AGTTCGAGCG CTCGGACGAA CTGGTGTGTT TGACACAGCT GTGCCCGCAA CTGGCCTCCG CTGGAATGCT GACCCTCGGC TACGGCCAGC CACAGGTGCT CGACGGCCAG GCCGCGGCCG CCGCCGCGGC CGAATCCAAG CGGCCATTCG GCCTGTGGCG CGCGACTCTG CCCGCCGCCG AGAAGTCCAA CCTTCCCGCG ATGCTGCCGC TGCCGCATCC CCAGATGCGG CCCGACCAGC CCACCCAGGC GTGGCTGACC ACCGAGGACC TCGACGGTCT GGCTAAAGAC ACCCGTGACG GCGGCGCCGG CCTGAGCGCC GAGCAGCTCG CCATCGACGA GGCGATCGTC TGGCCGCAGC AGGGCCGGGT CCTAGAAGTG TGGGCGACGC GGCTGAGGGA GGCCCGAGAA ACGTTCCGTG ACGATTCGGT ACTGCAGTCG CTGGTCGACG TCGCCGCGGC CGACTACCTC ACCGCGCTGG CCGCCCCGGA CACCTGGCGC GAGGACGCCT GGCGCCACCA CTTCCAGCCC GCGTGGGCCG CGGCGATCGC CGCCCACATC CGGTTCCGCG GCCGGCGAGC AGCGATGCGG CTGTCCCGTG AATACCGCAG CTGGCCGGTC TGGGCGCACG ACGCCGCCAT GATCTACACC CCGGGCCGAG ACGACACCAC CGGCGAGCCA ATCGACCTGT CGGACACCCA CACTCGACTC GGGCGGCTCG TGGTCTCGCA TCGCTGCGCA CTCACCGACC AAACCGTGCT CGCCGTCCTA CTGGCCGAGT CCACCCTTGA GGTGGCCGAC GCATTCATCA CAGCGCTTGG CATCACCGCC CACCAAGGCA GCGAAGCACG GCCGACTCGC CACAGCCTCG ACGTCGCGGA CGAAGGCGGC GCCGCGGTCA CCGGCGAACC GACATCAATG CCGCAGCCCA CACCAACCGG CGGCGACGAC GAGCAGCCAG CGCAGCCGAC CGGCGGCGAC AAACCCACCG GCCCGGCATC GCAACCACGG GTCTCGGCAA CCCGCACACA CGCCGCCAGC GGCGGCGCAC CGGCCGCTGT CCTCCATACC GACGGTCTCT GGCTTCCCGA CGGCACGCAC ATCGAACTCG ACGAGCCGAT CCTGCACGTC GGACAAGTCG CCGAACTGGC CTACATCCAC CGCATCGGAT ACCAGCTCAC CCCGAAATAC ACAGAAGCCG CCCAAATTTG GGTCACCGCC GACGCCTGCG CAGCCTTCGG CATCGACGTA GAAGCCATCA GCCGGCGCGA CCGGGCCAAG TCGCTGCGCC AGCTCACCGA GGGCATCGAC TTCGTGGTCC TAGCGGTCAA CGACGGCTGG AGCTTGGGCG GGGCAGCCGA AGATCCGACC ACCCAACGCC TCGGCACATG GACACGGGTG TACCGCGACG ACAAACGCGG CGTCATGGTC GCCCTGATCC CCGGAATGGG CGCCGGACAC GAAGAGATGC CCATCTTGGC CGACGACCCC ACCCCAGCGC AGATCGCGCG GCGCCTGCAG CTCCTCGCAG ACGCACTGCA CTTCCCCTGG AAAATCAACG CCGGCGTGAC CGCCGTGGAC TTGATGCTGC AGACCCGCAC CAAAAAGTGG TCGCCCCAGG AGTGGAAAGA AGTCGTGTTC GCGCCCTCGA CGACCAGCCC ACCATTTGGC ATCGGCGACG TCGAATCCGA CTTCGACTGG TCGCGACCGC CGACTGCCGA AGAAAGCCAG CGTCGCTATC TGCACGCCTA CGACCGCGGC GGGTCCTATG TCGCAGGCAT CGCCGGTCTC GAACTGCCCA TCGGCGATCC AGTCCACTAT CCCGAAGGCA CGCAGTTCGA CGCCAAGACA CCCGGCTACT GGCTAGCTGA AATCCCTGAG GCCTCCGACT GGCGCATGCC ATATGTGCTC AATCCCAGAG GAATTCAATT CACCGAACCC AAATGGGTCA CGACACCGAC CCTGGAACGT GCCTTCGCGC TCGGTTACAA CCCAGCGATC CTCGAAGCGT GGACCTGGCC GCAACACGGC CGCGTTCTGC TCGGATGGTA CGAGCGATTC CGCGACGCAA GTGGTGCCCT CGATACCGAC GATCTCGACG CTCAGGCGGC ACGCAACCAG GCCAAGATCA TTCGCACCCA CGGCATCGGC ATCATCGGCT CCGACGAACA CCTCAAGGGC AAGACCGGGT ACAGCCCCGA GCGGCGACTG CATGTGCTGG CCAAAGCCAA GGCCAACATC GTCTACCGGC TACAGCAGAT CGGCGAGCGC ACCGATCAAT GGCCGGTGGC CGTGGCCACC GACACCGTGC TGTACGCCTC TGACGACCCA GACCCCGTGA CGGCATGGCC CGGCGGACCT GACTCATTCG GCCGCGGCTT CGGCCAGTAC AAGCCCGAAG GATCGGCACT ACTCGCCGAC CATCTCGACT TCCTCAACGG ACGTGACTAT CGCGGCAAGC GAGAGCTGAC GCCGGTTGGG CAGTGGCGAC GCCAGGTGCT CGACAAAGAC GACAGGAGCC ACTGA
|
Protein sequence | MSGPAPSIDR HVVIIAAEGS FTAAGDQLTG AVDTVDKLEK LIEWAHQRGG LQPVPAGEEE HEPARVWVVG AACDLLAGTP LAQAADDNEA ERIGQVLAPL VTRGWELRGR PTSALLLTRG QGAQRIYVEI LAERQPWLAA GHAAVVGNPA ELGRRLRAWY AAVGTLPARS GAASAAVLHD HIMRARTGRR GAVVSTPGVL PAWVQPDMRI QPAWCVSVDE AEREFERSDE LVCLTQLCPQ LASAGMLTLG YGQPQVLDGQ AAAAAAAESK RPFGLWRATL PAAEKSNLPA MLPLPHPQMR PDQPTQAWLT TEDLDGLAKD TRDGGAGLSA EQLAIDEAIV WPQQGRVLEV WATRLREARE TFRDDSVLQS LVDVAAADYL TALAAPDTWR EDAWRHHFQP AWAAAIAAHI RFRGRRAAMR LSREYRSWPV WAHDAAMIYT PGRDDTTGEP IDLSDTHTRL GRLVVSHRCA LTDQTVLAVL LAESTLEVAD AFITALGITA HQGSEARPTR HSLDVADEGG AAVTGEPTSM PQPTPTGGDD EQPAQPTGGD KPTGPASQPR VSATRTHAAS GGAPAAVLHT DGLWLPDGTH IELDEPILHV GQVAELAYIH RIGYQLTPKY TEAAQIWVTA DACAAFGIDV EAISRRDRAK SLRQLTEGID FVVLAVNDGW SLGGAAEDPT TQRLGTWTRV YRDDKRGVMV ALIPGMGAGH EEMPILADDP TPAQIARRLQ LLADALHFPW KINAGVTAVD LMLQTRTKKW SPQEWKEVVF APSTTSPPFG IGDVESDFDW SRPPTAEESQ RRYLHAYDRG GSYVAGIAGL ELPIGDPVHY PEGTQFDAKT PGYWLAEIPE ASDWRMPYVL NPRGIQFTEP KWVTTPTLER AFALGYNPAI LEAWTWPQHG RVLLGWYERF RDASGALDTD DLDAQAARNQ AKIIRTHGIG IIGSDEHLKG KTGYSPERRL HVLAKAKANI VYRLQQIGER TDQWPVAVAT DTVLYASDDP DPVTAWPGGP DSFGRGFGQY KPEGSALLAD HLDFLNGRDY RGKRELTPVG QWRRQVLDKD DRSH
|
| |