Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_2600 |
Symbol | |
ID | 4615796 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | + |
Start bp | 2734124 |
End bp | 2735551 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639792268 |
Product | extracellular solute-binding protein |
Protein accession | YP_938587 |
Protein GI | 119868635 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCGAG ATCGGTTCGC GCAGCAACGT CAGCTGTCGC GCCGGAACAT GTTGGCCGCC ATGGGAATCG CCGGAGCGGC GGCCGCGAGC CTGCCGGTGC TCTCGGCCTG CGGCGTCGGC GGCAAGACCA GCGCCCCGAA CGGCGCCTCG GAGGTGAGCG GCGGATTCGA CTGGCGCAAG GCGGCCGGGT CGACGATCAA CATCCTGCAG ACCCCGCACC CGTACCAGCA GAGCTACCAG CCGCTGCTCA AGGAGTTCAC CGAGCTCACC GGGATCAACG TCAACGTCGA TCTCGTGCCG GAGGCGGACT ACTTCACCAA GCTCAACACC GAACTGGCGG GCGGCACCGG CAAGCACGAT GCGTTCATGC TGGGTGCCTA CTTCATCTGG CAGTACGGTC CGCCCGGTTG GATCGAGGAT CTCAACCCGT GGCTGCAGAA CGCCTCGGCG ACCAACGCCG AGTACGACTT CGAGGACATC TTCGAGGGTC TGCGCACCTC CACGCGGTGG GACTTCACAT TGGGCAACCC ATTGGGCACC GGCGGTCAGT GGGCGATCCC GTGGGGGTTC GAGAACAACG TCGTCGCCTA CAACAAGGCC TATTTCGACC GGCGGGGCAT CAGGAAACTG CCCGACAACT TCGACGATTT CATCCAGCTG GCCGTGGACC TGACCGACCG CTCGGAGAAC CGGTACGGCA TCGCCACCCG CGGATCGAAG TCGTGGGCCA CGATCCACCC GGGCTTCATG ACGCAGTACG TCCGCGAAGG CGCCGTCGAC TACACGTTCG ACGGCCGCGA TCTGGTCGCC GAGATGGACA GCGACAAGGC CGTCGACTTC ACCGAGAAGT GGATCCGGAT GCAGCACGAG GCCGGCCCCA CCTCGTGGAC CACCTACGAC TACCCGAACG CCACCGGTGA TCTCGGTGAC GGCAAGGCGA TGATGGTCTA CGACGCCGAC AGTGCGACGT ATCCGAAGAA CAAGCCAGGC GCGAGCGCGC AGGCGGGGAA CCTCGGCTGG TATCCGGGTC CGGCCGGCCC CGACGGCAAC TACAAGACCA ACCTGTGGAC CTGGACATGG GCGATGAACG CCAACTCCCG CAACAAACTG CCGGCCTGGC TGTTCATCCA ATGGGCCACC GGCAAGGAGT CGATGAACAA AGCCGTCGAG GGCGGCATCT ACGCAGATCC GGTGCGGCAG TCGGTGTTCG ACACGACGTT CAAGCGGATC GCCGCCGATC AGCACGGCTA CCTCGAGACC TTCGAGACGG TGATCCCCAC CTCCAAGATC CAGTTCACCC CGCAGAAGAA GTTCTTCGAC ACCACCAAGG ACTGGGCCGT TGCGCTGCAG GACATCTACG GCGGGGACGA CGCCGCGTCC CGGCTGCGCA GCCTGGCCAA GACCAACACC TCCAAGGTCA ACCTCTAG
|
Protein sequence | MSRDRFAQQR QLSRRNMLAA MGIAGAAAAS LPVLSACGVG GKTSAPNGAS EVSGGFDWRK AAGSTINILQ TPHPYQQSYQ PLLKEFTELT GINVNVDLVP EADYFTKLNT ELAGGTGKHD AFMLGAYFIW QYGPPGWIED LNPWLQNASA TNAEYDFEDI FEGLRTSTRW DFTLGNPLGT GGQWAIPWGF ENNVVAYNKA YFDRRGIRKL PDNFDDFIQL AVDLTDRSEN RYGIATRGSK SWATIHPGFM TQYVREGAVD YTFDGRDLVA EMDSDKAVDF TEKWIRMQHE AGPTSWTTYD YPNATGDLGD GKAMMVYDAD SATYPKNKPG ASAQAGNLGW YPGPAGPDGN YKTNLWTWTW AMNANSRNKL PAWLFIQWAT GKESMNKAVE GGIYADPVRQ SVFDTTFKRI AADQHGYLET FETVIPTSKI QFTPQKKFFD TTKDWAVALQ DIYGGDDAAS RLRSLAKTNT SKVNL
|
| |