Gene Mjls_4599 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_4599 
Symbol 
ID4880298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp4826724 
End bp4827929 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content70% 
IMG OID640141902 
Productmajor facilitator transporter 
Protein accessionYP_001072855 
Protein GI126437164 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0252819 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTCGA ATCGCACGCA GGCGCCCGCC CGATCGCACC GATTGCGCTG GGTCCTGCTG 
GCGTTGTGCG TCACCGAGAT CACCAGTTGG GGGGTGCTGT ACTACGCCTT CACCGTGCTC
TCCGACCAGA TCACCGACGA GACGGGATGG TCGGCCCCCG CGGTGACCGC AGCGTTCTCG
GCGGGACTGG TGACCTCGGC GTTCGTCGGT ATCCCGGTCG GCAGGTGGCT GGACCGGGTT
GGTCCACGCT GGATCATGAC GGCCGGTTCG GCCCTGGGTT GCGTTTCCGT GGTCGCAGTG
GTGGCCGCAT CGTCGTACGC CTGGTTCGTC GCGGCGTGGG TGCTCGCCGG TGTGGCGATG
AGCGCGGTCT TCTATGCACC GGCGTTCGCC GCCCTCACCC GGTTCTTCGG CGCCGACGCC
GTGCGGGCAC TGACGGTCCT GACGTTGGTG GCCGGCTTCG CCAGCACCGT GTTCGCACCG
CTGACCGCCG CGTTGTCGGC GCAGTTGAGT TGGCGCAGCG CCTATCTGGT GTTGGCCGTG
GTGATGGCGG TCATCACGAT CCCCGCGCAC TTCTTCGGCC TGCGGCGACC CTGGCCGGCC
GTGGTGACCG CTCACGCGGT GGAGTCCCCG ACGCGCACGG CGCGCAGTGG TGCGTTCATC
GCGTTGGTGG CGGTGTTCGC CCTCGGTGGG GTGGCGTCGT ATGCGGTGAT CGTCAACCTG
GTGCCCCTGA TGGCCGAGCG CGGCATCAGC ACCGGCGCCG CGGCGGTCGC GCTGGGGCTC
GGCGGCGCCG GGCAGGTTCT GGGCCGCCTG GGTTATCAGA CACTGGTGCG CCGCGTCGGT
GTCGTGACGC GGACGGTGGT GATCATGGCC GGTATCGCGG CCACGACCGC GCTCCTGGGG
GTCTTCACCG GCTACGCCGC CCTGCTGGCG GTCGCGATCG GCGCGGGCGT GATGCGCGGA
ATCATGACGC TGCTTCAGGC GACAGCGGTC ACCGAGCGTT GGGGCGCAAC CCATTACGGC
CATCTCAGCG GGATCCTCAA CGCACCGGTG ATGATCGCCA CCGCCATCGG GCCCTTCGTC
GGTGCGGCCC TGGCGAGCAT CCTCGGCGGT TACGCGGCGA TGTTCCTCGC GCTCGGTGCG
GTCGCAGCCG TCGCCGCGGT CACGGCTTTG GCGACCTCGA CTCATTCCCG GCGCCGGCGA
GACTGA
 
Protein sequence
MSSNRTQAPA RSHRLRWVLL ALCVTEITSW GVLYYAFTVL SDQITDETGW SAPAVTAAFS 
AGLVTSAFVG IPVGRWLDRV GPRWIMTAGS ALGCVSVVAV VAASSYAWFV AAWVLAGVAM
SAVFYAPAFA ALTRFFGADA VRALTVLTLV AGFASTVFAP LTAALSAQLS WRSAYLVLAV
VMAVITIPAH FFGLRRPWPA VVTAHAVESP TRTARSGAFI ALVAVFALGG VASYAVIVNL
VPLMAERGIS TGAAAVALGL GGAGQVLGRL GYQTLVRRVG VVTRTVVIMA GIAATTALLG
VFTGYAALLA VAIGAGVMRG IMTLLQATAV TERWGATHYG HLSGILNAPV MIATAIGPFV
GAALASILGG YAAMFLALGA VAAVAAVTAL ATSTHSRRRR D