Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_4005 |
Symbol | |
ID | 4611945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | + |
Start bp | 4222327 |
End bp | 4223994 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639793689 |
Product | extracellular solute-binding protein |
Protein accession | YP_939987 |
Protein GI | 119870035 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.326685 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTTGC GACGCCTGAT CTCGGCCGCC CTCGTTGCCA CCCTCACCCT CGCCGCGTGT TCCAGCGGTG ACGAGGAGAC CCCGTCGGCC GGCGGTAGCG CGGAGGTGGG CGCCACCAAC GACGTCAATC CTCAGGACGT GTCCAACCTG CGCCAGGGCG GCAACCTGCG CTTGGCGCTG ACCGCATTCC CGGCGAACTT CAACAGCCTG CACATCGACG GCAACGTCGC CGATGCGGCC GGGATGCTCA GGGCCACGAT GCCGCGTGCG TTCCGGATCG CCGCGGACGG TTCGGCGACG CTGAACACCG ACTACTTCAC CGGCGCCGAG ATCACCGGCA CCGACCCGCA GGTCGTCACC TGGACGATCA ACCCGAAGGC GGTGTGGAGC GACGGCGCCC CGATCACCTG GGAGGACATC GCCGCCCAGG TGAACGCCAC CAGCGGTAAG CAGGAGGGGT TCGCCTTCGC CAGCCCCAAC GGCTCCGACC GCGTCGCGTC GGTGACCCGC GGGGCCGACG ACCGGCAGGC CGTCATGACC TTCGCCAAGC CCTACGCGGA ATGGCGCGGC ATGCTCTCGG GCAACACCAT GCTGCTGCCC AAGAGCATGA CCGCGACGCC CGAGGCGTTC AACCGCGGCC AACTCGACGC GCCGGGCCCG TCGGCGGGTC CGTTCATCGT GTCGAACCTG GACCGTACGG CCCAGCGGAT CACGCTGACC CGCAACCCGA AATGGTGGGG TGAGCCGCCA CTTCTGGACA GCATCACCTA CTCGGTGCTC GACGACGCCG CCCGAATCCC CGCGCTGCAG AACAACGCGC TGGACGCCAC CGGTCTGGCG ACCCTCGACG AGTTGACGAT CGCGCGGCGC ACCAACGGTG TGGCGATCCG GCGGGCGCCG GGCAACAGCT GGTACCACTT CACGTTCAAC GGGGCACCGG GGTCGATCCT GGCCGACAAG GCGCTGCGGC AGGCGATCGC GAAAGGCATT GACCGCCAGA CCATTGCGGC GGTCACCCAG CGCGGCCTCG CCGACGACCC GGTGCCGCTC AACAACCACA TCTTCGTCGC GGGCCAGCAG GGCTACCAGG ACAACAGCGG TGTGGTGGCG TTCGACCCCG AGAAGGCCAA ACAGGAACTC GATGCGCTCG GATGGCGGCT CAACGGGCAG TTCCGGGAGA AGGACGGCCG CCAGCTGGTG ATCCGCGACG TGCTGTTCGA CGCGCTGAGC ACCCGTCAGT TCGGCCAGAT CGCGCAGAAC AACCTCGCCC AGATCGGTGT GAAACTCGAA CTGGACGCCA AGGGTGCGGC CGGCTTCTTC ACCGACTACA TCAACACCGG CGACTTCGAC ATCGCGCAGT TCTCGTGGGT GGGTGACGCG TTCCCGCTCT CGGGTCTGAC GCAGATCTAC GCCTCCAACG GGGAGAGCAA CTTCGGCAAG ATCGGCAGCC CGCAGATCGA CGCGAAGATC GAGGAGGCGC TGGAGGAACT GGATCCGGCG AAGGCTCAGC AGAAGGCCAA CGAGGTCGAC AAACTGCTGT GGGACGAGGT GTTCAGCCTG CCGTTGACGC AGTCGCCGGG CAACGTGGCG GTGCGTGCCA ACCTCGCCAA CTTCGGGGCG TTCGGCCTCG CGGACGCCGA CTACTCGAAG ATCGGCTTCG TGAAGTAG
|
Protein sequence | MTLRRLISAA LVATLTLAAC SSGDEETPSA GGSAEVGATN DVNPQDVSNL RQGGNLRLAL TAFPANFNSL HIDGNVADAA GMLRATMPRA FRIAADGSAT LNTDYFTGAE ITGTDPQVVT WTINPKAVWS DGAPITWEDI AAQVNATSGK QEGFAFASPN GSDRVASVTR GADDRQAVMT FAKPYAEWRG MLSGNTMLLP KSMTATPEAF NRGQLDAPGP SAGPFIVSNL DRTAQRITLT RNPKWWGEPP LLDSITYSVL DDAARIPALQ NNALDATGLA TLDELTIARR TNGVAIRRAP GNSWYHFTFN GAPGSILADK ALRQAIAKGI DRQTIAAVTQ RGLADDPVPL NNHIFVAGQQ GYQDNSGVVA FDPEKAKQEL DALGWRLNGQ FREKDGRQLV IRDVLFDALS TRQFGQIAQN NLAQIGVKLE LDAKGAAGFF TDYINTGDFD IAQFSWVGDA FPLSGLTQIY ASNGESNFGK IGSPQIDAKI EEALEELDPA KAQQKANEVD KLLWDEVFSL PLTQSPGNVA VRANLANFGA FGLADADYSK IGFVK
|
| |