Gene Mkms_1024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_1024 
Symbol 
ID4614688 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp1117456 
End bp1119066 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content68% 
IMG OID639790701 
Productmajor facilitator superfamily transporter 
Protein accessionYP_937028 
Protein GI119867076 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.836546 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.65546 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGACA CCCTCCCGCG CACCGACCAG GATATCGACG CCGAGATCGC CGCTCTGTCG 
AAGCGGAAAC GGATCTGGCT GCTGGTCATC GCCAGTGTCG ACGTGCTGAT GGTCATCTCG
TCGATGGTGG CGCTCAACGC GGCGCTGCCC GACATCGCGC TGCAGACCTC CGCGACACAG
TCCCAGTTGA CCTGGATCGT CGACGGTTAC ACGCTGGCGC TGGCCTGCCT GCTGCTGCCG
GCGGGCGCCA TCGGCGACCG CTACGGTCGG CGGGGTGCGC TCCTGGTGGG CCTCGCGATC
TTCGCGGTGG CCTCGCTGGC CCCGGTGCTG TTCGACAGCC CGATGCAGAT CATCATCGCG
CGGGCCGTCG CCGGCGTCGG CGCGGCGCTC ATCATGCCCG CCACCCTCTC GCTGCTCACC
GCCGCGTTCC CGAAGTCCGA GCGCAACAAG GCCGTCGGCA TCTGGGCCGG CGTGGCGGGG
TCGGGCGCGA TCTTCGGCTT CCTCGGTACC GGGCTCCTGC TGAACTACTT CTCGTGGCAG
TCGATCTTCT ACATGTTCGC CGGCGGGGCA CTGCTGATGT TCGTGGCGAC CTGCACCATC
GGCTCTTCCC GCGACGAGAC CGCCACCCCC ATCGACTGGG TGGGCGCCGC GCTGATCGGC
ACCGCGATCG CGGTGTTCGT GCTGGGGGTG GTCGAGGCGC CGGTACGCGG GTGGACCGAC
CCGGCAGTGC TCGGTTGTCT GGGCGCCGGG GTGGTGCTGG CCGGGTTGTT CGCCGTGGTC
CAGCTGCGCC GTGCGCATCC ACTGCTCGAT GTCCGGTTGT TCCGACGGCC GGATTTCGCC
ACTGGCGCCG CAGGCATCAC ATTCCTGTTC ATCGCGAACT TCGGGTTCTT CTACGTCGCG
ATGCAGTTCA TGCAGCTGGT CATGGGCTAC AGCGCGCTGG AGACCGCATT CGCCTTGTCG
CCGTTGGCGT TCCCGGTGCT GATACTCGGC GGCACACTGC CTCTGTATCT GCCGAAGGTG
GGTCTGCGCT TCGCGGTCAC CGTTGGCCTT CTCCTGCTTG CCACGGGCCT GTTCCTCATG
CGTTTCCTGG AGGCCGACGC GACCTTCCTC GACCTCATGT GGCCAATGCT GCTCGCCGCA
TCGGGCATCG GACTGTGCAC GGCGCCGACG ACTTCGGCGA TCATGAACGC CGTGCCTGAC
GAGAAGCAGG GCGTCGCCTC GGCGGTCAAC GACGCCACCC GCGAGGTCGG TGCCGCCGTC
GGCATCGCAG TGGCGGGATC GGTCCTGGCC GCCGTGTACC AGAGCGCGCT GGCCCCGAAG
CTCGGCGCTC TGCCCGAGCA GATCCGCGAC GCCGCAACCG ATTCGCTGGC CCACGCGCTG
GCGATCTCCG AACAGATGGG TCCGCAGGGC GAACAGTTGG CCGACTTCGC TCGAGACGCG
TTCATGCAGG CCGCCGACCA GGCGTTGTTC GCACTCTCGG CGCTTCTGGT GGTCGGGGCG
GTCTTCGTGG CGATCTGGTC TCCCGGACGA GACGGACGAC AGTGGGCCGC GATCCGGCGG
CGGCGAGGAG CAGACGAGAA CCGGTCGGCA CCCGCGGAGG TTGCGCCGTA G
 
Protein sequence
MVDTLPRTDQ DIDAEIAALS KRKRIWLLVI ASVDVLMVIS SMVALNAALP DIALQTSATQ 
SQLTWIVDGY TLALACLLLP AGAIGDRYGR RGALLVGLAI FAVASLAPVL FDSPMQIIIA
RAVAGVGAAL IMPATLSLLT AAFPKSERNK AVGIWAGVAG SGAIFGFLGT GLLLNYFSWQ
SIFYMFAGGA LLMFVATCTI GSSRDETATP IDWVGAALIG TAIAVFVLGV VEAPVRGWTD
PAVLGCLGAG VVLAGLFAVV QLRRAHPLLD VRLFRRPDFA TGAAGITFLF IANFGFFYVA
MQFMQLVMGY SALETAFALS PLAFPVLILG GTLPLYLPKV GLRFAVTVGL LLLATGLFLM
RFLEADATFL DLMWPMLLAA SGIGLCTAPT TSAIMNAVPD EKQGVASAVN DATREVGAAV
GIAVAGSVLA AVYQSALAPK LGALPEQIRD AATDSLAHAL AISEQMGPQG EQLADFARDA
FMQAADQALF ALSALLVVGA VFVAIWSPGR DGRQWAAIRR RRGADENRSA PAEVAP