Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_1870 |
Symbol | |
ID | 4613797 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | + |
Start bp | 1983523 |
End bp | 1985085 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639791535 |
Product | extracellular solute-binding protein |
Protein accession | YP_937860 |
Protein GI | 119867908 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATGTA CGCCCGCCGT TGTGACGCTG GCCGTGACCA CCCTGGTGCT GGCCGCCACG GGCTGTTCCG GTTCCAGCAA CAGCCCGACG GACACCGCGG CGTTGATGTC GTCGGTCCGA ACCCCGCTGA TGTCGGACCC GCCGCCCCTG GACCCGGACG TCTTCTATCA GCCCGAGGGT CTGCTGATCA TGACGTCGGC CTATCAGGGT CTGCTCCGGT ATGCGCCGGA GAGCACGGAG GTGGAGGGCC TGCTGGCGAC GGAATGGACT GTCTCCGAGG ACGGTCTGAC CTACACGTTC ACCCTGCGGG ACGGCGTGAA GTTCTCCGAT GGCACACCCT TCGACTCGGC CGCGGCCAAG GCCAGTTTCC AGCGGCGGAT CGACATGGCC GCGGGGCCGT CCTACATGCT CGCCGACGTC CGTGACATGC AGACGCCCGA TCCGCAGACG TTCGTCGTGA CGCTCACCAA ACCGGTCGCC CCCTTCCTGG ACTACCTGGC CTCGCCCTAC GGGCCACTGA TGACGAGCCC GGCGGCGATC GCCGAACACG CCGTCGGTGA CGACCGCGGC GCGGGCTGGT TGGCCGCGCA CACCGCAGGC ACCGGCCCGT ACGAGCTCAC CGCCGCGATC CCGGCGAACC GCTACACCCT CACCGCCAAC GAGCACTACT GGGGCGAGGC GCCGCAGATC ACCACCGTCG AACTGCCCGT GGTGGCGGCG ACCGCGGTAC AGCGGATGCA ACTGGAGAAC GGTCAGCTCG ACATGATCCT GCACGGGCTG TCCAAAGGCG ACTACGAGGC GTTGGGCGCA GGCCCCGACA CCGAGGTCCG GCAGGAGACG GCGCTGGTGA AGGCGCTGGT GATGGTCGAT CCGGACTCCG AGGTCTTCGG TCCTCCCGCG GCCAGGGCGG CCCTGAGCGC CGCGCTCGAC CAAACCACGC TCACGACAAC GGTGTTCGGC GACCAGGGCA GTCCGTCGAC TCAGTTCTAT CCCAGCGGGA TGCTGCCCGA CGGCGCGGTC CCCGACACGC ACGACTTCGA CCCGGCGAAG CTGGCCGAGA CGGGACGCTC GGGCGGCGAT GTGGAGATCG GTTACCCCAC CGGGGACAGC AGCCTGCAGA ACCTGGCGAA CCAGATGCAG GTGATCCTGC AGCAGGCCGG CCTGACGGCG ACGGTGCGGG ACTTTCCGCT CGCGCAGTTC TTCGCGCTCG GCGAGAACCC CGGTCAGCGC CCCGATCTGC TGCTCGCCTC GTTCAATCCC GATGCGGCGC ATCCCGATAC GTGGTCGCGG ATCTACCAGT ACACCGATGC GCCGGTGAAC CTGCAGGGCT GTTCGGTGCC GGCGGCGGAC GCACTGCTCG ACGCAGGCAG TGCCGAACCG GATCCGGCGA AGTCGCGGGC GCTTTACGTC GAGGCAGCCA AGGAGTACCG GGATTCGCTG TGCTGGATCA ATCTCGCCGA CCTGCACAAC ACGATCGCCG CACGCAAGGG CTATTCGGGG TGGAGCAGCC AGCCGGCCTG GATGTGGGAC ACCGACTTCT CGACGCTGGC CTACCGGGAC TGA
|
Protein sequence | MRCTPAVVTL AVTTLVLAAT GCSGSSNSPT DTAALMSSVR TPLMSDPPPL DPDVFYQPEG LLIMTSAYQG LLRYAPESTE VEGLLATEWT VSEDGLTYTF TLRDGVKFSD GTPFDSAAAK ASFQRRIDMA AGPSYMLADV RDMQTPDPQT FVVTLTKPVA PFLDYLASPY GPLMTSPAAI AEHAVGDDRG AGWLAAHTAG TGPYELTAAI PANRYTLTAN EHYWGEAPQI TTVELPVVAA TAVQRMQLEN GQLDMILHGL SKGDYEALGA GPDTEVRQET ALVKALVMVD PDSEVFGPPA ARAALSAALD QTTLTTTVFG DQGSPSTQFY PSGMLPDGAV PDTHDFDPAK LAETGRSGGD VEIGYPTGDS SLQNLANQMQ VILQQAGLTA TVRDFPLAQF FALGENPGQR PDLLLASFNP DAAHPDTWSR IYQYTDAPVN LQGCSVPAAD ALLDAGSAEP DPAKSRALYV EAAKEYRDSL CWINLADLHN TIAARKGYSG WSSQPAWMWD TDFSTLAYRD
|
| |