Gene Mkms_4005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4005 
Symbol 
ID4611945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4222327 
End bp4223994 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content67% 
IMG OID639793689 
Productextracellular solute-binding protein 
Protein accessionYP_939987 
Protein GI119870035 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.326685 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTTGC GACGCCTGAT CTCGGCCGCC CTCGTTGCCA CCCTCACCCT CGCCGCGTGT 
TCCAGCGGTG ACGAGGAGAC CCCGTCGGCC GGCGGTAGCG CGGAGGTGGG CGCCACCAAC
GACGTCAATC CTCAGGACGT GTCCAACCTG CGCCAGGGCG GCAACCTGCG CTTGGCGCTG
ACCGCATTCC CGGCGAACTT CAACAGCCTG CACATCGACG GCAACGTCGC CGATGCGGCC
GGGATGCTCA GGGCCACGAT GCCGCGTGCG TTCCGGATCG CCGCGGACGG TTCGGCGACG
CTGAACACCG ACTACTTCAC CGGCGCCGAG ATCACCGGCA CCGACCCGCA GGTCGTCACC
TGGACGATCA ACCCGAAGGC GGTGTGGAGC GACGGCGCCC CGATCACCTG GGAGGACATC
GCCGCCCAGG TGAACGCCAC CAGCGGTAAG CAGGAGGGGT TCGCCTTCGC CAGCCCCAAC
GGCTCCGACC GCGTCGCGTC GGTGACCCGC GGGGCCGACG ACCGGCAGGC CGTCATGACC
TTCGCCAAGC CCTACGCGGA ATGGCGCGGC ATGCTCTCGG GCAACACCAT GCTGCTGCCC
AAGAGCATGA CCGCGACGCC CGAGGCGTTC AACCGCGGCC AACTCGACGC GCCGGGCCCG
TCGGCGGGTC CGTTCATCGT GTCGAACCTG GACCGTACGG CCCAGCGGAT CACGCTGACC
CGCAACCCGA AATGGTGGGG TGAGCCGCCA CTTCTGGACA GCATCACCTA CTCGGTGCTC
GACGACGCCG CCCGAATCCC CGCGCTGCAG AACAACGCGC TGGACGCCAC CGGTCTGGCG
ACCCTCGACG AGTTGACGAT CGCGCGGCGC ACCAACGGTG TGGCGATCCG GCGGGCGCCG
GGCAACAGCT GGTACCACTT CACGTTCAAC GGGGCACCGG GGTCGATCCT GGCCGACAAG
GCGCTGCGGC AGGCGATCGC GAAAGGCATT GACCGCCAGA CCATTGCGGC GGTCACCCAG
CGCGGCCTCG CCGACGACCC GGTGCCGCTC AACAACCACA TCTTCGTCGC GGGCCAGCAG
GGCTACCAGG ACAACAGCGG TGTGGTGGCG TTCGACCCCG AGAAGGCCAA ACAGGAACTC
GATGCGCTCG GATGGCGGCT CAACGGGCAG TTCCGGGAGA AGGACGGCCG CCAGCTGGTG
ATCCGCGACG TGCTGTTCGA CGCGCTGAGC ACCCGTCAGT TCGGCCAGAT CGCGCAGAAC
AACCTCGCCC AGATCGGTGT GAAACTCGAA CTGGACGCCA AGGGTGCGGC CGGCTTCTTC
ACCGACTACA TCAACACCGG CGACTTCGAC ATCGCGCAGT TCTCGTGGGT GGGTGACGCG
TTCCCGCTCT CGGGTCTGAC GCAGATCTAC GCCTCCAACG GGGAGAGCAA CTTCGGCAAG
ATCGGCAGCC CGCAGATCGA CGCGAAGATC GAGGAGGCGC TGGAGGAACT GGATCCGGCG
AAGGCTCAGC AGAAGGCCAA CGAGGTCGAC AAACTGCTGT GGGACGAGGT GTTCAGCCTG
CCGTTGACGC AGTCGCCGGG CAACGTGGCG GTGCGTGCCA ACCTCGCCAA CTTCGGGGCG
TTCGGCCTCG CGGACGCCGA CTACTCGAAG ATCGGCTTCG TGAAGTAG
 
Protein sequence
MTLRRLISAA LVATLTLAAC SSGDEETPSA GGSAEVGATN DVNPQDVSNL RQGGNLRLAL 
TAFPANFNSL HIDGNVADAA GMLRATMPRA FRIAADGSAT LNTDYFTGAE ITGTDPQVVT
WTINPKAVWS DGAPITWEDI AAQVNATSGK QEGFAFASPN GSDRVASVTR GADDRQAVMT
FAKPYAEWRG MLSGNTMLLP KSMTATPEAF NRGQLDAPGP SAGPFIVSNL DRTAQRITLT
RNPKWWGEPP LLDSITYSVL DDAARIPALQ NNALDATGLA TLDELTIARR TNGVAIRRAP
GNSWYHFTFN GAPGSILADK ALRQAIAKGI DRQTIAAVTQ RGLADDPVPL NNHIFVAGQQ
GYQDNSGVVA FDPEKAKQEL DALGWRLNGQ FREKDGRQLV IRDVLFDALS TRQFGQIAQN
NLAQIGVKLE LDAKGAAGFF TDYINTGDFD IAQFSWVGDA FPLSGLTQIY ASNGESNFGK
IGSPQIDAKI EEALEELDPA KAQQKANEVD KLLWDEVFSL PLTQSPGNVA VRANLANFGA
FGLADADYSK IGFVK