Gene Mjls_1034 details


Gene Information

Locus tag: Mjls_1034
Symbol:
ID: 4876775
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 1121170
End bp: 1122780
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 68%
IMG OID: 640138348
Product: major facilitator transporter
Protein accession: YP_001069333
Protein GI: 126433642
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
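
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length; the function names are illustrative only and are not part of any database API.

```python
# Values copied from the record above.
start_bp = 1121170
end_bp = 1122780


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of residues, assuming one terminal stop codon is not translated."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(start_bp, end_bp)
print(length)                  # 1611
print(protein_length(length))  # 536
```

Both results agree with the Gene Length (1611 bp) and Protein Length (536 aa) fields listed in the record.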


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.283059
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGACA CCCTCCCGCG CACCGACCAG GATATCGACG CCGAGATCGC CGCTCTGTCG 
AAGCGGAAAC GGATCTGGCT GCTGGTCATC GCCAGCGTCG ACGTGCTGAT GGTCATCTCG
TCGATGGTGG CGCTCAACGC GGCGCTGCCC GACATCGCGC TGCAGACCTC CGCGACACAG
TCCCAGTTGA CCTGGATCGT CGACGGTTAC ACGCTGGCGT TGGCCTGCCT GCTGCTGCCG
GCGGGCGCCC TCGGCGACCG CTACGGCCGG CGGGGTGCGC TCCTGGTGGG CCTCGCGATC
TTCGCGGTGG CCTCGCTGGC CCCGGTGCTG TTCGACAGCC CGATGCAGAT CATCATCGCG
CGGGCCGTCG CCGGCGTCGG CGCGGCGCTC ATCATGCCCG CCACCCTCTC GCTGCTCACC
GCCGCGTTCC CGAAGTCCGA GCGCAACAAG GCCGTCGGCA TCTGGGCCGG CGTGGCGGGG
TCGGGCGCGA TCTTCGGCTT CCTCGGTACC GGGCTCCTGC TGAACTACTT CTCGTGGCAG
TCGATCTTCT ACATGTTCGC CGGCGGGGCA CTGCTGATGT TCGTGGCGAC CTGCACCATC
GGCTCTTCCC GCGACGAGAC CGCCACCCCC ATCGACTGGG TGGGCGCCGC GCTGATCGGC
ACCGCGATCG CGGTGTTCGT GCTGGGGGTG GTCGAGGCGC CGGTACGCGG GTGGACCGAC
CCGGCAGTGC TCGGTTGTCT GGGCGCCGGG GTGGTGCTGG CCGGGTTGTT CGCCGTGGTC
CAGCTGCGCC GTGCGCATCC ACTGCTCGAT GTCCGGTTGT TCCGACGGCC GGATTTCGCC
ACTGGCGCCG CAGGCATCAC ATTCCTGTTC ATCGCGAACT TCGGGTTCTT CTACGTCGCG
ATGCAGTTCA TGCAGCTGGT CATGGGCTAC AGCGCGCTGG AGACCGCATT CGCCTTGTCG
CCGTTGGCGT TCCCGGTGCT GATACTCGGC GGCACACTGC CTCTGTATCT GCCGAAGGTG
GGTCTGCGCT TCGCGGTCAC CGTTGGCCTT CTCCTGCTTG CCACGGGCCT GTTCCTCATG
CGTTTCCTGG AGGCCGACGC GACCTTCCTC GACCTCATGT GGCCAATGCT GCTCGCCGCA
TCGGGCATCG GACTGTGCAC GGCGCCGACG ACTTCGGCGA TCATGAACGC CGTGCCTGAC
GAGAAGCAGG GCGTCGCCTC GGCGGTCAAC GACGCCACCC GCGAGGTCGG TGCCGCCGTC
GGCATCGCAG TGGCGGGATC GGTCCTGGCC GCCGTGTACC AGAGCGCGCT GGCCCCGAAC
CTCGGCGCTC TGCCCGAGCA GATCCGCGAC GCCGCAACCG ATTCGCTGGC CCACGCGCTG
GCGATCTCCG AACAGATGGG TCCGCAGGGC GAACAGTTGG CCGACTTCGC TCGAGACGCG
TTCATGCAGG CCGCCGACCA GGCGTTGTTC GCACTCTCGG CGCTTCTGGT GGTCGGGGCG
GTCTTCGTGG CGATCTGGTC TCCCGGACGA GACGGACGAC AGTGGGCCGC GATCCGGCGG
CGGCGAGGAG CAGACGAGAA CCGGTCGGCA CCTGCGGAGG TTGCGCCGTA G
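
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation such as GC content. The following is a minimal sketch, assuming the input contains only whitespace and the letters A, C, G, T; `clean_sequence` and `gc_fraction` are hypothetical helper names introduced here for illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence shown above.
first_line = "ATGGTCGACA CCCTCCCGCG CACCGACCAG GATATCGACG CCGAGATCGC CGCTCTGTCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Applying `gc_fraction` to the full joined sequence gives a value that can be compared with the GC content field of the record.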
 
Protein sequence
MVDTLPRTDQ DIDAEIAALS KRKRIWLLVI ASVDVLMVIS SMVALNAALP DIALQTSATQ 
SQLTWIVDGY TLALACLLLP AGALGDRYGR RGALLVGLAI FAVASLAPVL FDSPMQIIIA
RAVAGVGAAL IMPATLSLLT AAFPKSERNK AVGIWAGVAG SGAIFGFLGT GLLLNYFSWQ
SIFYMFAGGA LLMFVATCTI GSSRDETATP IDWVGAALIG TAIAVFVLGV VEAPVRGWTD
PAVLGCLGAG VVLAGLFAVV QLRRAHPLLD VRLFRRPDFA TGAAGITFLF IANFGFFYVA
MQFMQLVMGY SALETAFALS PLAFPVLILG GTLPLYLPKV GLRFAVTVGL LLLATGLFLM
RFLEADATFL DLMWPMLLAA SGIGLCTAPT TSAIMNAVPD EKQGVASAVN DATREVGAAV
GIAVAGSVLA AVYQSALAPN LGALPEQIRD AATDSLAHAL AISEQMGPQG EQLADFARDA
FMQAADQALF ALSALLVVGA VFVAIWSPGR DGRQWAAIRR RRGADENRSA PAEVAP
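
The protein sequence can be compared against a translation of the gene sequence. The sketch below builds a codon table from the commonly used TCAG ordering. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in which start codons they allow, which this sketch ignores) and that translation stops at the first stop codon. `translate` is a hypothetical helper, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a cleaned DNA string codon by codon up to the first stop."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence above, whitespace removed.
print(translate("ATGGTCGACACCCTCCCGCGCACCGACCAG"))  # MVDTLPRTDQ
print(translate("GTTGCGCCGTAG"))                    # VAP
```

The two outputs match the first ten residues (MVDTLPRTDQ) and the last three residues (VAP) of the protein sequence listed above.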