Gene Mkms_3910 details


Gene Information

Locus tag: Mkms_3910
Symbol:
ID: 4611845
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 4117866
End bp: 4119191
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 68%
IMG OID: 639793589
Product: general substrate transporter
Protein accession: YP_939892
Protein GI: 119869940
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
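
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. A minimal Python sketch of that consistency check follows. It assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4117866
END_BP = 4119191


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1326 441
```

Both results agree with the Gene Length (1326 bp) and Protein Length (441 aa) values listed in the record.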


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGAGTC AGACATCCGC GCCTGCACCC GCCGAGACGG CGGCCATCCG CAAAGCGGTC 
GCAGGCGCGT CGATCGGTAA CGCCGTCGAA TGGTTCGACT TCGCCATCTA CGGCTTCCTC
GCCACTTTCA TCGCCGCCAA GTTCTTCCCG GCGGGCAACG ACACGGCCGC CCTGCTGAAC
ACGTTCGCGA TCTTCGCCGC GGCCTTCTTC ATGCGGCCCC TCGGCGGTTT CGTCTTCGGC
CCGCTCGGCG ACCGGATCGG ACGTCAGCGG GTGCTCGCGG TGGTCATCCT GCTGATGTCG
GCGGCCACGC TCGGCATCGG CCTGCTGCCG ACCTACGCCA GCATCGGCGT CGCCGCACCG
CTGCTGCTGC TGTTCCTGCG GTGCCTGCAG GGGTTCTCCG CGGGCGGTGA ATACGGTGGC
GGCGCAGTCT ATCTCGCCGA GTACGCCCGT GACCGCAACC GCGGGTTGAC GGTCACGTTC
ATCGCCTGGT CCGGTGTCGT CGGCTTCCTG CTCGGCTCGG TGACGGTGAC GGTGCTGCAG
GCGCTGTTGC CCACCGCGGC GATGGACAGC TACGGCTGGC GGATCCCGTT CCTGATCGCC
GCACCGCTGG GCCTGGTCGG CCTCTACATC CGGTTACGCC TCGACGACAC GCCCGAGTTC
GCGGCGCTGT CGGAGACCGA CCGGGTGGCG TCGTCGCCGC TGCGTGAGGC GGCCACCACC
GCGTGGCGGC CGATCCTGCA GGTGATCGGC CTGTTCATCG TGTTCAATGT CGGCTACTAC
GTGGTGTTCA CGTTCCTGCC GACCTACTTC ATCAAGACCC TCGAATTCGG CAAGACGGCG
GCGTTCGTCT CGATGTCGCT GGCCAGTCTG GTCGCCCTGG TGCTGATCCT GCCGCTGGCC
GCGCTGTCGG ACCGCATCGG CAGGCGTCCG CTGCTGATCG GCGGGACGGT GGCGTTCGTG
ATCCTCGCGT ATCCGCTGTT CCTGCTGCTG AATTCCGGTT CGCTCGCCGC GGCGATCACC
GCGCACTGCG TGCTGGCGGC CATCGAGTCG ATCTACGTGT CGACCGCGGT CACCGCCGGC
GTCGAACTGT TCGCCACCCG GGTGCGCTAC AGCGGGTTCT CCATCGGCTA CAACGTGTCG
GTCGCGGCGT TCGGCGGGAC GACGCCCTAC GTCGTCACGT GGCTGACCGC GACGACCGAG
AACAACCTCG CCCCGGCGCT GTACCTGATC GTCGCGGCCG TCGTATCGCT GGCGACGCTG
CTCACCCTGC GGGAGTCCGC CGGCCGGCCG CTGGCCGCGA CAGTGGCCGC TGACGTGCGA
CGATGA
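
The GC content field of this record is a percentage computed over a nucleotide sequence like the one above. The sketch below shows one way such a value could be computed from the space-separated 10-base blocks. It is an illustration under that assumption, not the database's own method, and the helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First 10-base block of the gene sequence: 4 G + 1 C out of 10 bases.
print(gc_content("ATGGAGAGTC"))  # 50.0
```

Passing the full gene sequence (all lines joined) to `gc_content` would give the gene-wide percentage.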
 
Protein sequence
MESQTSAPAP AETAAIRKAV AGASIGNAVE WFDFAIYGFL ATFIAAKFFP AGNDTAALLN 
TFAIFAAAFF MRPLGGFVFG PLGDRIGRQR VLAVVILLMS AATLGIGLLP TYASIGVAAP
LLLLFLRCLQ GFSAGGEYGG GAVYLAEYAR DRNRGLTVTF IAWSGVVGFL LGSVTVTVLQ
ALLPTAAMDS YGWRIPFLIA APLGLVGLYI RLRLDDTPEF AALSETDRVA SSPLREAATT
AWRPILQVIG LFIVFNVGYY VVFTFLPTYF IKTLEFGKTA AFVSMSLASL VALVLILPLA
ALSDRIGRRP LLIGGTVAFV ILAYPLFLLL NSGSLAAAIT AHCVLAAIES IYVSTAVTAG
VELFATRVRY SGFSIGYNVS VAAFGGTTPY VVTWLTATTE NNLAPALYLI VAAVVSLATL
LTLRESAGRP LAATVAADVR R
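
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal pure-Python sketch of that translation follows. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The function and variable names are illustrative assumptions.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGGAGAGTC AGACATCCGC GCCTGCACCC"))  # MESQTSAPAP
```

The last five codons of the gene, `GACGTGCGACGATGA`, translate the same way to `DVRR` followed by the stop codon, matching the end of the protein sequence.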