Gene Information |
Locus tag | Mkms_1914 |
Symbol | |
ID | 4613660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | - |
Start bp | 2031695 |
End bp | 2033260 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639791578 |
Product | glycoside hydrolase family protein |
Protein accession | YP_937903 |
Protein GI | 119867951 |
COG category | [S] Function unknown |
COG ID | [COG1543] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
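The coordinate and length fields above are mutually consistent. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon. Both assumptions are inferred from the listed values, not stated by the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length_bp = gene_length(2031695, 2033260)
print(length_bp, protein_length(length_bp))
```

With the record's Start bp and End bp this gives 1566 bp and 521 aa, matching the Gene Length and Protein Length rows.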
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0322006 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGACG ACGCTCGGGT ACCGGGGCTG TTCACCCTCG TCCTGCACAC GCATCTGCCG TGGCTGGCGC ACCACGGCCG CTGGCCGGTC GGCGAGGAGT GGCTCTACCA GTCGTGGTCG GCGTCGTACC TGCCGCTGAT GCGCGTGCTG CGCAGGCTGG CCGGGGAGGG CCGCGACCAT CTGCTCACCC TCGGGATGAC ACCGGTGGTG ACCGCTCAGC TCGACGACCC CTACTGCCTG ACCGGGATGC ACAGCTGGCT GGCGAACTGG CAGCTGCGGG CACTGGAGGC CGCCACCCTG CGCGCCTCCT CGGACACCAC ACCGGCGTGC ACACCGGAAG CGTTGCGCGC CTTCGGTGTT CGCGAGCAGG GCGAAGCGGA GCTGGCGCTC GAGGAGTTCG CCACACTGTG GCGCCACGGC GGCAGTCCGC TGCTGCGTGA GCTCGTCGAC GCGGGCACCG TCGAACTGCT GGGCGGGCCG CTGGCGCATC CGTTCCAGCC GTTGCTCAAC CCCCGACTGC GGGAGTTCGC GCTGCGCGAA GGGCTCGCCG ACGCCGGACA GCGCTTCGCG CACACCCCGC GCGGCATCTG GGCCCCGGAG TGCGCGTACG CCCCGGGCAT GGAGGCCGAC TATGCCGCGG CGGGCGTCGG CCACTTCATG GTCGACGGCC CGTCGCTGCA CGGCGACACC GCGCTAGGCC GCCCCGTCGG CCACTCCGGC GTCGTCGCGT TCGGTCGCGA CCTGCAGGTC AGCTACCGCG TGTGGTCGCC CAAGTCCGGC TATCCCGGCC ACGCCGCCTA CCGCGACTTC CACACCTACG ACCACGTCAC CGGGTTGAAG CCGGCGCGGG TCACCGGGCG CAACGTGCCG TCGTCGGCCA AAGCCCCATA CGAACCGGAC CGCGCCGACG CCGCCATCGA CGCCCACGTC GCCGACTTCG TGCAGGTGGT GCGGCGGCGG CTGACGGACG AGAGCGAGCG GATCGGCCGC CCGGCGCACG TGGTCGCCGC CTTCGACACC GAACTGTTCG GCCACTGGTG GTACGAGGGG CCGGAGTGGC TGGCCCGCGT ACTGCGGGCG CTGCCGGAAG CCGGTGTGCG GGTGGGCACG CTCAGCGATG CCGTCGACGG CGGATTCGTC GGCGCCCCAG TCGATCTGCC GCCCAGTTCG TGGGGTTCGG GTAAGGACTG GCAGGTCTGG GCCGGAGACC AGGTGACCGA CTTCGTCCGA CTCAACGCCG AGGTCGTCGA CACCGCGCTC AGCACCGTCG ACAAGGCGCT CACCCAGCGC GCGTCGGTGG GCAGCCCGAC ACCGCGGGAC ACCGTCGCCG ACCAGATCCT GCGCGAGACC CTGCTGACCG TCTCGAGCGA CTGGCCGTTC ATGGTGAGCA AGGACTCCGC GGCCGACTAC GCCCGCTACC GCGCCCACCT GCACGCCCAC GCGACCCGCG AGATCGCCGA CGCACTCGCG GCCGGCCGGC GGGAGCAGGC CCAGCGCCTC GCCGACGGCT GGAACCGCGC CGACGGCCTG TTCGGCGCCC TCGACGCCCG CCGGTTGCCG CGATGA
|
Protein sequence | MSDDARVPGL FTLVLHTHLP WLAHHGRWPV GEEWLYQSWS ASYLPLMRVL RRLAGEGRDH LLTLGMTPVV TAQLDDPYCL TGMHSWLANW QLRALEAATL RASSDTTPAC TPEALRAFGV REQGEAELAL EEFATLWRHG GSPLLRELVD AGTVELLGGP LAHPFQPLLN PRLREFALRE GLADAGQRFA HTPRGIWAPE CAYAPGMEAD YAAAGVGHFM VDGPSLHGDT ALGRPVGHSG VVAFGRDLQV SYRVWSPKSG YPGHAAYRDF HTYDHVTGLK PARVTGRNVP SSAKAPYEPD RADAAIDAHV ADFVQVVRRR LTDESERIGR PAHVVAAFDT ELFGHWWYEG PEWLARVLRA LPEAGVRVGT LSDAVDGGFV GAPVDLPPSS WGSGKDWQVW AGDQVTDFVR LNAEVVDTAL STVDKALTQR ASVGSPTPRD TVADQILRET LLTVSSDWPF MVSKDSAADY ARYRAHLHAH ATREIADALA AGRREQAQRL ADGWNRADGL FGALDARRLP R
|
| |
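The GC content and protein sequence listed in this record can be rechecked from the gene sequence. The sketch below is one possible way to do that in plain Python, not the database's own method. It assumes the sequence is given 5' to 3' in the coding orientation, as displayed, and it builds the codon table from the NCBI compact notation for translation table 11 (same amino acid assignments as the standard code, with additional start codons). The function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (NCBI compact notation, table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiator codons allowed by table 11; an initiator is read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and normalize case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS with table 11, stopping at the first stop codon."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(s), 3):
        codon = s[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons such as GTG encode Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence plus its stop codon, for illustration.
fragment = "GTGAGCGACG ACGCTCGGGT ACCGGGGCTG TGA"
print(translate_cds(fragment))
```

The fragment translates to MSDDARVPGL, the first ten residues of the listed protein. Passing the full gene sequence to `gc_content` and `translate_cds` allows comparison against the GC content and Protein sequence rows.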