Gene Mkms_4815 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4815 
Symbol 
ID4616230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp5045303 
End bp5046514 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content70% 
IMG OID639794506 
Productmajor facilitator superfamily transporter 
Protein accessionYP_940795 
Protein GI119870843 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.651796 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGCGT TCGACCGTTC GACCGGCGAC GATACTCAGC GCTGGGCCTA CCCATTGCTG 
CTCGTCCTGA GCGGGGTCGC GCTCGGCGTC TCCGGCCTAC CCGCGCCGCT CTACGGCATG
TACGAGGCGA ACTGGCATCT GTCCCCGCTG GCCACCACGA TCATTTTCGC GGTGTACGCG
ATCGCCGCAC TCGGCGCGGT GCTGGTGTCC GGCCGGATCT CCGACGTGGT CGGCCGCAAG
CCGGTGCTGA TCGCCGCCCT CGCGGCGCTG GTGATCGGAC TCGGGGTGTT CCTCGTCGCC
GACAACATGG CCCTGCTGCT GCTCGCACGG GCCATCCACG GCGCCGCCGT CGGCTCGATC
GTCGTCGCCG GGGCCGCCGC ACTGCTCGAC CTGCGACCCC ACCACGGGGT GCGCTCCGGG
CAACTCAGCG GCGTGAGCTT CAACATCGGG ATGACCGTCG CAATCCTCGG ATCGGCGTTG
CTGGCTCAGT ACGCACCGCA TCCGCTGCGC ACGCCCTACG CCGTTGTCGC GGTGCTGTGC
CTCATCGTCG GCGCCGGTCT CCTGGCACTG CGCGAACCAC ATATCGCACG CACGCGAGGC
CGCATCCGCG TCGCAAAACC CGCGGTGCCA CAGGAGATCA GAGGTGACTT CTGGTTCTCC
GCGCTGGGCG CCATGGCGTC CTGGTCGGTG CTGGGCGTGT TGCTGTCGCT CTACCCGTCG
CTGGCGGCGC GGCAGACGCA TATCGACAAC CTGGTCTTCG GCGGCGCGGT CGTCGGCACC
ACGGCGTTCG CCGCCGCGCT CGCCCAACTT GCATCAACGC GGGTGCCGGC CCGTCGCGCC
GCGATCGTCG GCGACATCGG CATGGCGCTC TCACTGCTGC TCACGATCCC GGTGCTGCTG
AGCCATCAGT GGCCGCTGGT GTTCGTGGCG GCAGCGCTGC TCGGTGCGAC ATTCGGGCTC
GGGTTCGGCG GCTCGTTGCG CCACCTGTCC GACGTGGTCC CCGCCGGCAG GCGCGGTGAG
ACCATGTCGG CGTTCTACCT GGCGGCGTAC ACGGCGATGG CGGTCCCGAC GCTGCTCGCC
GGGTGGGCGG CGACCACCTG GGAACTCGTG GTGGTGTTCC CGTGGTTCGC CGTCGCGGTG
GCCACTGCCT GCGTGGGTGC GGCCGTCGCG GGTATGCGTA GCGCCAGAGC CGCTGCCGCC
GAAGCCTACT GA
 
Protein sequence
MVAFDRSTGD DTQRWAYPLL LVLSGVALGV SGLPAPLYGM YEANWHLSPL ATTIIFAVYA 
IAALGAVLVS GRISDVVGRK PVLIAALAAL VIGLGVFLVA DNMALLLLAR AIHGAAVGSI
VVAGAAALLD LRPHHGVRSG QLSGVSFNIG MTVAILGSAL LAQYAPHPLR TPYAVVAVLC
LIVGAGLLAL REPHIARTRG RIRVAKPAVP QEIRGDFWFS ALGAMASWSV LGVLLSLYPS
LAARQTHIDN LVFGGAVVGT TAFAAALAQL ASTRVPARRA AIVGDIGMAL SLLLTIPVLL
SHQWPLVFVA AALLGATFGL GFGGSLRHLS DVVPAGRRGE TMSAFYLAAY TAMAVPTLLA
GWAATTWELV VVFPWFAVAV ATACVGAAVA GMRSARAAAA EAY