Gene Mkms_1870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_1870 
Symbol 
ID4613797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp1983523 
End bp1985085 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content68% 
IMG OID639791535 
Productextracellular solute-binding protein 
Protein accessionYP_937860 
Protein GI119867908 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATGTA CGCCCGCCGT TGTGACGCTG GCCGTGACCA CCCTGGTGCT GGCCGCCACG 
GGCTGTTCCG GTTCCAGCAA CAGCCCGACG GACACCGCGG CGTTGATGTC GTCGGTCCGA
ACCCCGCTGA TGTCGGACCC GCCGCCCCTG GACCCGGACG TCTTCTATCA GCCCGAGGGT
CTGCTGATCA TGACGTCGGC CTATCAGGGT CTGCTCCGGT ATGCGCCGGA GAGCACGGAG
GTGGAGGGCC TGCTGGCGAC GGAATGGACT GTCTCCGAGG ACGGTCTGAC CTACACGTTC
ACCCTGCGGG ACGGCGTGAA GTTCTCCGAT GGCACACCCT TCGACTCGGC CGCGGCCAAG
GCCAGTTTCC AGCGGCGGAT CGACATGGCC GCGGGGCCGT CCTACATGCT CGCCGACGTC
CGTGACATGC AGACGCCCGA TCCGCAGACG TTCGTCGTGA CGCTCACCAA ACCGGTCGCC
CCCTTCCTGG ACTACCTGGC CTCGCCCTAC GGGCCACTGA TGACGAGCCC GGCGGCGATC
GCCGAACACG CCGTCGGTGA CGACCGCGGC GCGGGCTGGT TGGCCGCGCA CACCGCAGGC
ACCGGCCCGT ACGAGCTCAC CGCCGCGATC CCGGCGAACC GCTACACCCT CACCGCCAAC
GAGCACTACT GGGGCGAGGC GCCGCAGATC ACCACCGTCG AACTGCCCGT GGTGGCGGCG
ACCGCGGTAC AGCGGATGCA ACTGGAGAAC GGTCAGCTCG ACATGATCCT GCACGGGCTG
TCCAAAGGCG ACTACGAGGC GTTGGGCGCA GGCCCCGACA CCGAGGTCCG GCAGGAGACG
GCGCTGGTGA AGGCGCTGGT GATGGTCGAT CCGGACTCCG AGGTCTTCGG TCCTCCCGCG
GCCAGGGCGG CCCTGAGCGC CGCGCTCGAC CAAACCACGC TCACGACAAC GGTGTTCGGC
GACCAGGGCA GTCCGTCGAC TCAGTTCTAT CCCAGCGGGA TGCTGCCCGA CGGCGCGGTC
CCCGACACGC ACGACTTCGA CCCGGCGAAG CTGGCCGAGA CGGGACGCTC GGGCGGCGAT
GTGGAGATCG GTTACCCCAC CGGGGACAGC AGCCTGCAGA ACCTGGCGAA CCAGATGCAG
GTGATCCTGC AGCAGGCCGG CCTGACGGCG ACGGTGCGGG ACTTTCCGCT CGCGCAGTTC
TTCGCGCTCG GCGAGAACCC CGGTCAGCGC CCCGATCTGC TGCTCGCCTC GTTCAATCCC
GATGCGGCGC ATCCCGATAC GTGGTCGCGG ATCTACCAGT ACACCGATGC GCCGGTGAAC
CTGCAGGGCT GTTCGGTGCC GGCGGCGGAC GCACTGCTCG ACGCAGGCAG TGCCGAACCG
GATCCGGCGA AGTCGCGGGC GCTTTACGTC GAGGCAGCCA AGGAGTACCG GGATTCGCTG
TGCTGGATCA ATCTCGCCGA CCTGCACAAC ACGATCGCCG CACGCAAGGG CTATTCGGGG
TGGAGCAGCC AGCCGGCCTG GATGTGGGAC ACCGACTTCT CGACGCTGGC CTACCGGGAC
TGA
 
Protein sequence
MRCTPAVVTL AVTTLVLAAT GCSGSSNSPT DTAALMSSVR TPLMSDPPPL DPDVFYQPEG 
LLIMTSAYQG LLRYAPESTE VEGLLATEWT VSEDGLTYTF TLRDGVKFSD GTPFDSAAAK
ASFQRRIDMA AGPSYMLADV RDMQTPDPQT FVVTLTKPVA PFLDYLASPY GPLMTSPAAI
AEHAVGDDRG AGWLAAHTAG TGPYELTAAI PANRYTLTAN EHYWGEAPQI TTVELPVVAA
TAVQRMQLEN GQLDMILHGL SKGDYEALGA GPDTEVRQET ALVKALVMVD PDSEVFGPPA
ARAALSAALD QTTLTTTVFG DQGSPSTQFY PSGMLPDGAV PDTHDFDPAK LAETGRSGGD
VEIGYPTGDS SLQNLANQMQ VILQQAGLTA TVRDFPLAQF FALGENPGQR PDLLLASFNP
DAAHPDTWSR IYQYTDAPVN LQGCSVPAAD ALLDAGSAEP DPAKSRALYV EAAKEYRDSL
CWINLADLHN TIAARKGYSG WSSQPAWMWD TDFSTLAYRD