Gene Mkms_0114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_0114 
Symbol 
ID4615520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp128014 
End bp129039 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content67% 
IMG OID639789791 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_936123 
Protein GI119866171 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4521] ABC-type taurine transport system, periplasmic component 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family
[TIGR01729] taurine ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.104516 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACTCA AAGCCCTTGC CGCCGTGGCG GTCGCGATGC TGGCGGTGGC CGGCTGCTCG 
GTCGACACGT CCGGTCAGGA TGCCGGGAAG CAGACGATTC GCATTGCCTA CCAGAGCTTC
CCGAGCGGCG ACCTGATCGT GAAGAACAAC CGCTGGCTCG AGGATGCGCT GCCGGACTAC
AACATCAAGT GGACGAAGTT CGACTCCGGC GCCGACGTCA ACACCGCGTT CATCGCCAAG
GAGGTCGATT TCGGCGCGCT GGGTTCGAGC CCCGTGGCCC GTGGCCTGTC GGCCCCGCTG
AACATCCCGT ACAAGGTGGC GTTCGTGCTC GACGTCGCAG GCGACAACGA GGCGCTGGTG
GCGCGCAACG GAAGCGGCGT CAACACGATC GCCGATCTGC GGGGCAAGCG CGTCGGCACG
CCGTTCGCCT CGACCGCGCA CTACAGCCTG CTCGCCGCGC TGGACCAGAA CGGGTTGTCG
CCCAACGATG TTCAGCTCGT GGACCTGCAG CCGCAGGCCA TCCTGGCGGC GTTCGACCGC
GGTGACATCG ACGCCGGGTA CTCGTGGTTG CCGACCCTGG ATCAGCTTCG CCGCAACGGC
AAGGACCTCA TCACCAGCCG ACAGCTGGCC CGCGACGGTA AGCCCACGCT CGACCTGGCC
GTGGTGGCCG ACGAGTTCGC CGAAGCCCAT CCGGACGTGG TCGACATCTG GCGTCAGCAG
GAGGCCCGCG CACTCACCGT CATCAAGGAC GACCCCGACG CCGCCGCCAA GGCCATCGCC
GCCGAAATCG GGTTGACGCC CGAGGAGGTC GCCGGACAGC TCACCCAGGG CGTGTACCTG
ACACCCGCGG AAGTGGCCTC GCCGGAGTGG CTGGGCTCCG AGGGTGCGCC GGGCAACATC
GCGGTCAACC TGGAGAGCGC GTCGCAGTTC CTCGCCGAGC AGAAGCAGAT CCCGGCCGCC
GCACCGTTGA AGACCTTTCA GGATGCGATC TACACCAAGG GTCTACCCGG TGCCATCACC
CAGTGA
 
Protein sequence
MRLKALAAVA VAMLAVAGCS VDTSGQDAGK QTIRIAYQSF PSGDLIVKNN RWLEDALPDY 
NIKWTKFDSG ADVNTAFIAK EVDFGALGSS PVARGLSAPL NIPYKVAFVL DVAGDNEALV
ARNGSGVNTI ADLRGKRVGT PFASTAHYSL LAALDQNGLS PNDVQLVDLQ PQAILAAFDR
GDIDAGYSWL PTLDQLRRNG KDLITSRQLA RDGKPTLDLA VVADEFAEAH PDVVDIWRQQ
EARALTVIKD DPDAAAKAIA AEIGLTPEEV AGQLTQGVYL TPAEVASPEW LGSEGAPGNI
AVNLESASQF LAEQKQIPAA APLKTFQDAI YTKGLPGAIT Q