Gene Mjls_5348 details


Gene Information

Locus tag: Mjls_5348
Symbol:
ID: 4881045
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 5599842
End bp: 5600786
Gene Length: 945 bp
Protein Length: 314 aa
Translation table: 11
GC content: 65%
IMG OID: 640142662
Product: substrate-binding region of ABC-type glycine betaine transport system
Protein accession: YP_001073602
Protein GI: 126437911
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein)
TIGRFAM ID:
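
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp coordinates are 1-based and inclusive and that the last codon is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5599842
end_bp = 5600786

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 945
print(protein_length)  # 314
```

Both results agree with the Gene Length and Protein Length fields listed in the record.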


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.684846
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAGCA CGCGCAGACG CACCCGATGG GCGGCCCTGC TGGTCTCTGT GACGTTGGCA 
CTGACCGCGT GCGGCAGTTC GAACCCGCTC GGCGGCGGTG AGATCTCGGG CGACCTCAAG
ACGGTCACCG TCGGGTCGGC CGACTTCACC GAGTCCAAGA TCATCGCCGA GATCTACGCT
CAGGCATTGG AGGCCAACGG TTTCGACATC ACCAGGCAGT TCGGCATCGG CAGCCGCGAG
ACGTACATCC TTGCCGTACA GGACCACTCG ATCGATCTGA TCCCCGAATA CACCGGGAAC
CTGTTGCAGT ACTTCGACGA AGAGACCGAC GTGACGACCC CGGACGCGGT GATCCTCGGT
CTGCTGCGGG CGCTGCCCGG TGACCTGTCG ATCCTGTATC CCTCACCGGC CGAGGACAAG
GACACCGTCG CGGTGTCGGC CGAGACGGCG CAGCGGTGGA ATCTCAAGAC GATCGGCGAT
CTGGCGGCCC ACTCACCGGA GGTGAAGTTC GGCGGGCCGT CGGAGTTCCT CAACCGCACC
GAGGGCCTGC CGGGGCTCAA GGAGAAGTAC GGTCTGGACA TCGCACCGTC GAACTTCGTG
GCGATCAGCG ACGGCGGCGG GCCCGCGACG GTCCGTGCAC TGACCGACGG AACGGTCACC
GCGGCGAACA TCTTCAGCAC CTCACCGGCC ATCAAGCAGA ACAACCTGGT GGTGCTGGAG
GATCCGAAGA ACAACTTCCT CGCGGCCAAC GTCGTGCCGC TGGTCGCCTC GCAGAAGAAG
TCCGACGAGC TCAAATCGGT GCTCGACGCG GTCAGCGCGA AACTGACGAC CGAGGGCCTC
ATCGAACTGA ACGCCGCAGT CGAAGGCAAC CGCGGCGTCG ATCCCGACGA GGCCGCGCAG
AAGTGGGTGG CCGACAACGG GTTCGACAAA CCCCTGACGA AGTAG
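
The gene sequence is printed in blocks of ten bases, so it has to be stripped of whitespace before any calculation. The sketch below shows one way a GC percentage such as the GC content field could be computed from text in this layout. The helper names `clean_sequence` and `gc_content` are hypothetical, and the rounding used by the source database is not known.

```python
def clean_sequence(text):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_content(text):
    """Return the percentage of G and C bases in a sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence above, used only as a short example.
print(gc_content("ATGGCGAGCA CGCGCAGACG"))  # 70.0
```

Passing the full gene sequence text to `gc_content` gives the value for the whole gene.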
 
Protein sequence
MASTRRRTRW AALLVSVTLA LTACGSSNPL GGGEISGDLK TVTVGSADFT ESKIIAEIYA 
QALEANGFDI TRQFGIGSRE TYILAVQDHS IDLIPEYTGN LLQYFDEETD VTTPDAVILG
LLRALPGDLS ILYPSPAEDK DTVAVSAETA QRWNLKTIGD LAAHSPEVKF GGPSEFLNRT
EGLPGLKEKY GLDIAPSNFV AISDGGGPAT VRALTDGTVT AANIFSTSPA IKQNNLVVLE
DPKNNFLAAN VVPLVASQKK SDELKSVLDA VSAKLTTEGL IELNAAVEGN RGVDPDEAAQ
KWVADNGFDK PLTK
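
The protein sequence corresponds to the gene sequence read codon by codon. The following is a minimal sketch of that translation, assuming the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table built in the conventional TCAG order (assignments shared by
# the standard code and translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGCGAGCA CGCGCAGACG CACCCGATGG"))  # MASTRRRTRW
```

The output matches the first ten residues of the protein sequence listed above.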