Gene Mjls_0095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_0095 
Symbol 
ID4875841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp103925 
End bp104950 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content67% 
IMG OID640137409 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_001068399 
Protein GI126432708 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4521] ABC-type taurine transport system, periplasmic component 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family
[TIGR01729] taurine ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.106306 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACTCA AAGCCCTTGC CGCCGTGGCG GTCACGATGC TGGCGGTGGC CGGCTGCTCG 
GTCGACACGT CCGGTCAGGA TGCCGGGAAG CAGACGATTC GCATTGCCTA CCAGAGCTTC
CCGAGCGGCG ACCTGATCGT GAAGAACAAC CGCTGGCTCG AGGATGCGCT GCCGGACTAC
AACATCAAGT GGACGAAGTT CGACTCCGGC GCCGACGTCA ACACCGCGTT CATCGCCAAG
GAGGTCGATT TCGGCGCGCT GGGTTCGAGC CCCGTGGCCC GTGGCCTGTC GGCCCCGCTG
AACATCCCGT ACAAGGTGGC GTTCGTGCTC GACGTCGCAG GCGACAACGA GGCGTTGGTG
GCGCGCAACG GAAGCGGCGT CAACACGATC GCCGATCTGC GGGGTAAGCG CGTCGGCACG
CCGTTCGCCT CGACCGCGCA CTACAGCCTG CTCGCCGCGC TGGACCAGAA CGGGTTGTCG
CCCAACGATG TTCAGCTCGT GGACCTGCAG CCGCAGGCCA TCCTGGCGGC GTTCGACCGC
GGTGACATCG ACGCCGGGTA CTCGTGGTTG CCGACCCTGG ATCAGCTTCG CCGCAACGGC
AAGGACCTGA TCACCAGCCG ACAGCTGGCC CGCGACGGTA AGCCCACGCT CGACCTGGCC
GTGGTGGCCG ACGAGTTCGC CGAAGCCCAT CCGGACGTGG TCGACATCTG GCGTCAGCAG
GAGGCCCGCG CACTGACCGT CATCAAGGAC GACCCCGACG CCGCCGCCAA GGCCATCGCC
GCCGAAATCG GGTTGACGCC CGAGGAGGTC GCCGGACAGC TCACCCAGGG CGTGTACCTG
ACACCCGCGG AAGTGGCCTC GCCGGAGTGG CTGGGCTCCG AGGGTGCGCC GGGCAACATC
GCGGTCAACC TGGAGAGCGC GTCGCAGTTC CTCGCCGAGC AGAAGCAGAT CCCGGCCGCC
GCACCGTTGA AGACCTTCCA GGATGCGATC TACACCAAGG GTCTACCCGG TGCCATCACC
CAGTGA
 
Protein sequence
MRLKALAAVA VTMLAVAGCS VDTSGQDAGK QTIRIAYQSF PSGDLIVKNN RWLEDALPDY 
NIKWTKFDSG ADVNTAFIAK EVDFGALGSS PVARGLSAPL NIPYKVAFVL DVAGDNEALV
ARNGSGVNTI ADLRGKRVGT PFASTAHYSL LAALDQNGLS PNDVQLVDLQ PQAILAAFDR
GDIDAGYSWL PTLDQLRRNG KDLITSRQLA RDGKPTLDLA VVADEFAEAH PDVVDIWRQQ
EARALTVIKD DPDAAAKAIA AEIGLTPEEV AGQLTQGVYL TPAEVASPEW LGSEGAPGNI
AVNLESASQF LAEQKQIPAA APLKTFQDAI YTKGLPGAIT Q