Gene Mkms_4114 details

Gene Information

Locus tag: Mkms_4114
Symbol:
ID: 4612054
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 4347404
End bp: 4349278
Gene Length: 1875 bp
Protein Length: 624 aa
Translation table: 11
GC content: 72%
IMG OID: 639793798
Product: extracellular solute-binding protein
Protein accession: YP_940096
Protein GI: 119870144
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
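
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the record itself does not state either convention. The function names are hypothetical and exist only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(4347404, 4349278)
print(length_bp)                  # 1875
print(protein_length(length_bp))  # 624
```

Both values agree with the Gene Length and Protein Length fields of this record.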


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCGACCC GTCCCCGCCG CCACCGCGTG ACGTTGGGGG CGTTGGTCTC TGCCGTCGCA 
CTGCTGCTCG GTGGGTGCAC CGTGAGCCCG CCGCCGGCAC CGCAGAGCAC CGAGACGCCG
CAGACCACCC CGCCGCCGGC GCCCAAGGCC ACCCAGATCA TCGTCGCGAT CGATTCGATC
GGTCCGGGGT TCAACCCGCA TCTGCTGTCG GATCAGTCGC CGGTCAACGC GGCGATCGCC
GCCCTGGTGC TGCCGAGTTC GTTCCGGCCG ATCCCGGATC CGAACACGCC GACCGGATCG
CGGTGGGAGC TGGACCCGAC GCTGCTGGAA TCGGCCGAGG TGACCAACGA GAACCCGTTC
ACCGTCACCT ACAACATCCG GCCCGAGGCG CAGTGGACCG ACAACGCGCC GATCGGCGCC
GACGACTACT GGTATCTGTG GCGTCAGATG GTCAGCCAGC CCGGCGTCGT GGACCCCGCC
GGCTACGACC TGATCACCGG GGTGCAGTCC GTCGAGGGCG CCAAGCAGGT CGTCGTCACG
TTCTCCCAGC CCTACCCCGC GTGGCGCGAG CTGTTCAACA ACATCCTGCC CGCGCACATC
GTCAAGGACG TGCCCGGCGG TTTCGGAGCG GGTCTGGCCG AGGCGATGCC GGTGACGGGC
GGACAGTTCC GGGTGGAGAG CATCGATCCG CAGCGTGACG AGATCCTGCT GGCCCGCAAC
GACCGGTACT GGAGTGCACC CGCCAAACCC GACCTCGTGC TGTTCCGCCG CGGCGGTGCC
CCGGCGGCGC TGGCGGACTC GATCCGCAAC GGTGACACCC AGGTCGCCCA GGTGCACGGC
GGCGCAGCGA CCTTCGCGCA ACTGAGCGCG ATCCCGGATG TGCGCACGGC CCGCATCGTG
ACGCCGCGGG TCATGCAGGT GACGCTGCGC GGTCAGCTGC CCAAACTCGC CGATCCGTCG
GTTCGCCGGG CGATCCTCGG CCTGCTCGAC GTCGACCTGC TGGCGTCGGT GGGTGCCGGC
GACGACAACA CCGTCACCCT GGCCCAGGCG CAGGTGCGCT CACCGTCCGA TCCCGGGTAT
GTGCCGACGG CCCCACCCGC GCTGGGCACC GAGGCGGCCC TCGAACTACT GGCCGAGGCG
GGCTATCAGA TGGCGCCGGT GGGGGATGGC GCACCGGGGG CTCCCGCACC GGACCACAGT
CGCGGACGCC TCTCGAAAGA CGGTGCCCCG CTCACCCTGG TGCTCGGGGT GGCCGCCAAC
GATCCGACGT CGGTCGCCGT GGCGAACACC GCCGCCGACC AACTCCGCAG TGTCGGCATC
GAGGCGTCGG TGCTGGCGCT GGATCCGGTG AAGCTCTACG GGGAGGCGCT GGCCGAGAAC
CGGGTCGACG CCGTCGTGGG GTGGCACCAG GCCGGGGGAG ACCTGGCCAC CGCGCTGGCC
TCCCGGTACG GCTGTCCCGC GCTGGAGGCG ACGCCCGTGC AGACCGCGCC TGCGGCGCCG
CCCGGATCGT CGCGGCCGAC CACGACCGCG GCTCCGGCGA CGACGGCTCC CGCCACCCCG
ACGGCCACCC CCACCACCAC ACCCAGCCCC GCACCGGAAT CCGGCGCGCT GGTGCAGGCG
CCGAGCAACA TCACCGGAGT GTGTGACCGC AGCATCCAGC CGAAGATCGA CGCGGCCCTC
GACGGCAGCG AACCGATCGC CGAGGTGATC GACGCCGTCG AGCCACGGCT GTGGAACATG
TGGACGGTGT TGCCGATCCT GCAGGACACC ACGATCGTCG CGGCGGGGCC GAGCGTGCGC
AACGTGAGCC TCACGGGGGC CGTGCCGGTC GGCATCGTCG GCGATGCGGG CAGTTGGGTG
AAAACCCCGC AGTGA
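
A GC content figure such as the one reported for this gene is conventionally the percentage of G and C bases in the sequence. The sketch below is one minimal way to compute it from a sequence printed in space-separated blocks of 10; `gc_content` is a hypothetical helper, and the database's exact method (for example its rounding) is not documented here. It is shown on the first block of the gene sequence only, not on the full gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G/C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100 * gc / len(bases)


# First 10-base block of the gene sequence above.
print(gc_content("GTGCCGACCC"))  # 80.0
```

Passing the full gene sequence, with its spaces and line breaks left in, would give the value for the whole gene.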
 
Protein sequence
MPTRPRRHRV TLGALVSAVA LLLGGCTVSP PPAPQSTETP QTTPPPAPKA TQIIVAIDSI 
GPGFNPHLLS DQSPVNAAIA ALVLPSSFRP IPDPNTPTGS RWELDPTLLE SAEVTNENPF
TVTYNIRPEA QWTDNAPIGA DDYWYLWRQM VSQPGVVDPA GYDLITGVQS VEGAKQVVVT
FSQPYPAWRE LFNNILPAHI VKDVPGGFGA GLAEAMPVTG GQFRVESIDP QRDEILLARN
DRYWSAPAKP DLVLFRRGGA PAALADSIRN GDTQVAQVHG GAATFAQLSA IPDVRTARIV
TPRVMQVTLR GQLPKLADPS VRRAILGLLD VDLLASVGAG DDNTVTLAQA QVRSPSDPGY
VPTAPPALGT EAALELLAEA GYQMAPVGDG APGAPAPDHS RGRLSKDGAP LTLVLGVAAN
DPTSVAVANT AADQLRSVGI EASVLALDPV KLYGEALAEN RVDAVVGWHQ AGGDLATALA
SRYGCPALEA TPVQTAPAAP PGSSRPTTTA APATTAPATP TATPTTTPSP APESGALVQA
PSNITGVCDR SIQPKIDAAL DGSEPIAEVI DAVEPRLWNM WTVLPILQDT TIVAAGPSVR
NVSLTGAVPV GIVGDAGSWV KTPQ