Gene Mkms_0235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_0235 
Symbol 
ID4615464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp254842 
End bp256443 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content70% 
IMG OID639789910 
Productprotein of unknown function DUF894, DitE 
Protein accessionYP_936242 
Protein GI119866290 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.981168 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACAC CGCAAACACG ACCCGTGGTG TCCTCATGGG CGCCGTTCTC CTCTCCTGTC 
TACCGCGCGC TGTGGATCGC GCAGTTCGTC TCGAACCTCG GCACGTGGAT GCAGACGGTG
GGTGCACAGT GGATGCTGGT CGGCGACCCC GGTGCCGCGG TACTCGTGCC GCTGGTGCAG
ACCGCGACGA CGCTGCCGAT CATGCTGCTG GCGTTGCCGT CGGGTGTGCT CGCCGATCTG
ATCGACCGGC GCCGGCTTCT GATCGCCACC CAGGCCGCGA TGGCGTCGGG GGTGGCGGCG
CTGGCGATGC TGACCGGCTT CGGCCTCGCG ACACCCACCG TTCTGCTGCT TCTGCTCTTC
CTGATCGGGT GTGGGCAGGC CCTGACAACG CCCGCGTGGC AGGCCATTCA ACCGGAACTC
GTCCCGCGCG AACAGATTCC GGCCGCGGCG GCGTTGGCCA GCATGAGCGT GAACGGCGCC
CGGGCGATCG GCCCCGCGGT CGCCGGCGTC CTGGTGTCCC TGTCGGGGCC CACCACGGTG
TTCGCGCTCA ATGCGTTCTC GTTCATCGGC ATCGTCATGG TGCTGCTCTG GTGGCGGCGG
CCCGTGGAGG AGGCGACGAT GCCTCTCGAG CGGCCGATAT CCGCGCTGAG CGCCGGCCGG
CGGTACATCC GCAGCTCACC GGTGATCCGG CGGATCCTGT TGCGGACCGT GCTGTTCACC
GCACCCGCCA GCGCGCTGTG GGGCCTGCTC GCGGTGATCG CCGCCAACCA GCTGAACCTG
TCGTCGTCGG GGTACGGGCT GCTGCTCGGC GCGCTGGGTG TCGGGGCGGT GCTGGGTGCG
GTGGTGTTGT CGCGGCTGCA TGCGCGCTTC GGCCAGAACC AGCTGATGGT GATGGGTGCG
GTCGGTTTCG CCGGTGCCAC CGTGGTACTC GCGACGGTGC ACGTGCTGGC CGCGGTGCTC
GCCGCGCTGG TGGTGGGCGG GGTGTCGTGG CTGCTCACGA TGTCGACACT CAACGCCTCG
ATGCAGCTGA GCCTGCCCGC CTGGGTGCGG GCGCGCGGAC TGTCGGTCTA CCAGTTGGTC
TTCACCGGAA GTCAGGCGAT CGGCTCGTTG GTCTGGGGTG TGGTCGCGGG CGCGACGAGC
GGGGTGACGG CGTTGCTGAT CAGCGCTGCC CTGCTGATCG TGTGCGGGGT GTCGGTCGCG
TGGTGGCCGC TGCACCCGGC CACCGGCACG CTCGACGTGA CGCCGTCGGC GCACTGGGGT
GAGCCGGCGC TGGTGTTCGA GCCCGATCCG CAGGACGGGC CGGTGGTGGT GCTGCAGTCC
TACGTCGTGG CGCCGAAGGA CGAGGCGGGT TTCCTGGCGC TCATGCAGCG GGTCCGGCGG
TCTCGGCAGC GGACCGGCGC GATGGAGTGG GGGATCTTCC GCAGCGGCGA GTCCGCCGAC
ACCTTCGTGG AACTCTTCCT CGTCCGGTCG TGGGACGAAC ATCTGCGCCA GCATCTGGTG
CGCCAGACCG CCCTCGATCT GGCCCTCGAG CGTGAGATCG AGGGCTATGT CCACGGCGAG
TCGACGCTGC GGCATTTCAT CGCGGTGCGG AACGGGCGTT GA
 
Protein sequence
MATPQTRPVV SSWAPFSSPV YRALWIAQFV SNLGTWMQTV GAQWMLVGDP GAAVLVPLVQ 
TATTLPIMLL ALPSGVLADL IDRRRLLIAT QAAMASGVAA LAMLTGFGLA TPTVLLLLLF
LIGCGQALTT PAWQAIQPEL VPREQIPAAA ALASMSVNGA RAIGPAVAGV LVSLSGPTTV
FALNAFSFIG IVMVLLWWRR PVEEATMPLE RPISALSAGR RYIRSSPVIR RILLRTVLFT
APASALWGLL AVIAANQLNL SSSGYGLLLG ALGVGAVLGA VVLSRLHARF GQNQLMVMGA
VGFAGATVVL ATVHVLAAVL AALVVGGVSW LLTMSTLNAS MQLSLPAWVR ARGLSVYQLV
FTGSQAIGSL VWGVVAGATS GVTALLISAA LLIVCGVSVA WWPLHPATGT LDVTPSAHWG
EPALVFEPDP QDGPVVVLQS YVVAPKDEAG FLALMQRVRR SRQRTGAMEW GIFRSGESAD
TFVELFLVRS WDEHLRQHLV RQTALDLALE REIEGYVHGE STLRHFIAVR NGR