Gene Mjls_4737 details


Gene Information

Locus tag: Mjls_4737
Symbol:
ID: 4880436
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 4985310
End bp: 4986677
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 63%
IMG OID: 640142042
Product: extracellular solute-binding protein
Protein accession: YP_001072993
Protein GI: 126437302
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
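
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and introduced for illustration only.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(4985310, 4986677)
print(length, protein_length_aa(length))
```

With the values from this record, the computed gene length is 1368 bp and the protein length is 455 aa, matching the listed fields.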


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAAGCGC AGACAATAGC GCGCCGGCTG GCGGGAGGTC TGGCCGCTGC CGGGCTGGTG 
TTCACGTCCG GCTGTTCCGG GGCCGGCAGC CTCGGATCGT CCGACAACGA GGTGACGATC
GCCCTGGTGT CCAACTCGCA GATGACCGAC GCGCAGCAGC TGTCTTCGGA GTTCGAGAAG
AAGAACCCGG GCACGAAGCT CAAGTTCATC ACGCTGTCCG AGAACCAGGC GCGCGCCAAG
ATCACCATGT CGGCCGCGAT GGGCGGCAGT GAGTTCGACG TCGTGATGAT CAGTAACTTC
GAGACCCCGC AGTGGGCCAA AGACGGCTGG CTGGTGAATC TCTCGGATTA CGCGGAGAAC
ACCCCAGGAT ATGACCAGGA CGACTTCATC TCGTCGCTGC GGGAATCGTT GTCGTACGAG
GGAGACATGT ACGCGGTTCC CTTCTACGGC GAATCGTCGT TTCTGATGTA CCGCAAGGAC
CTGTTCGAGC AGGCGGGCAT CAAGGTCGAC CAGAACCCGG ACTATCAGCC CACCTGGCCT
GAAGTCGCCC AGTGGGCCGA GACGCTCAAG ACCGACGACC GCGCCGGCAT CTGTCTGCGG
GGAAAGCCCG GCTGGGGTGA GGTACTCGCA CCGCTGGACA CCGTCATCAA CACCTTCGGC
GGGCGCTGGT TCGACGAGCA GTGGAACGCC CAACTCGACA GCCCCGAGGT CAAGAAGGCC
GTCAACTTCT ACGTCGACAC GGTCAAGAAC TTCGGCGAAC TGGGTGCGGC GTCAACAGGA
TTCCAGGAGT GCGCGAACCT GTTCGGCCAG GGTCAGACCG CGATGTGGTA CGACGCGACG
TCGGCGGTCT CGGTGCTCGA GGACCCCAAG GAGTATCCCG ACCTGGTCGG CAAGATCGGA
TACCTGCCCG CCCCGATCCT CGTGAAGCCC AACTCGGGCT GGCTCTACAC CTGGGCGCTG
GGCATCCCCA AGGCTGCCAA GAATCCAGAC GGCGCATGGG AGTTCATCTC GTGGATGACC
AGTAAGGATT ACATGAAACT GGTCGGGGAG AGGCTCGGCT GGGCGCGTGT CCCACCGGGC
AGCCGGACGT CGACCTACAC CGACCTGCCC GAGTACGAGG CCATCTCGAA GTCCTATGGG
CCGCTGACGC TGAAGTCGAT CGAGAGCGCG ACCCCGAATC AGCCGACGGT GCAACCAGTT
CCGTACACCG GCATCCAGTT CGTCGGCATC CCGGAGTTCC AGGATCTCGG GACCCGGGTG
AGCCAGCAGA TCAGCGCGGC GATCGCCGGA CAGAAGTCGG TGGACGACGC GCTCGCCCAG
GCACAGGAAT ACGCCGAGGT CGTCGGCCGC ACGTATCAGG AGAAGTGA
 
Protein sequence
MKAQTIARRL AGGLAAAGLV FTSGCSGAGS LGSSDNEVTI ALVSNSQMTD AQQLSSEFEK 
KNPGTKLKFI TLSENQARAK ITMSAAMGGS EFDVVMISNF ETPQWAKDGW LVNLSDYAEN
TPGYDQDDFI SSLRESLSYE GDMYAVPFYG ESSFLMYRKD LFEQAGIKVD QNPDYQPTWP
EVAQWAETLK TDDRAGICLR GKPGWGEVLA PLDTVINTFG GRWFDEQWNA QLDSPEVKKA
VNFYVDTVKN FGELGAASTG FQECANLFGQ GQTAMWYDAT SAVSVLEDPK EYPDLVGKIG
YLPAPILVKP NSGWLYTWAL GIPKAAKNPD GAWEFISWMT SKDYMKLVGE RLGWARVPPG
SRTSTYTDLP EYEAISKSYG PLTLKSIESA TPNQPTVQPV PYTGIQFVGI PEFQDLGTRV
SQQISAAIAG QKSVDDALAQ AQEYAEVVGR TYQEK