Gene Mmcs_0225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_0225 
Symbol 
ID4109071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp247502 
End bp249103 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content70% 
IMG OID638029350 
Productprotein of unknown function DUF894, DitE 
Protein accessionYP_637402 
Protein GI108797205 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.383838 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCACAC CGCAAACACG ACCCGTGGTG TCCTCATGGG CGCCGTTCTC CTCTCCTGTC 
TACCGCGCGC TGTGGATCGC GCAGTTCGTC TCGAACCTCG GCACGTGGAT GCAGACGGTG
GGTGCACAGT GGATGCTGGT CGGCGACCCC GGTGCCGCGG TACTCGTGCC GCTGGTGCAG
ACCGCGACGA CGCTGCCGAT CATGCTGCTG GCGTTGCCGT CGGGTGTGCT CGCCGATCTG
ATCGACCGGC GCCGGCTTCT GATCGCCACC CAGGCCGCGA TGGCGTCGGG GGTGGCGGCG
CTGGCGATGC TGACCGGCTT CGGCCTCGCG ACACCCACCG TTCTGCTGCT TCTGCTCTTC
CTGATCGGGT GTGGGCAGGC CCTGACAACG CCCGCGTGGC AGGCCATTCA ACCGGAACTC
GTCCCGCGCG AACAGATTCC GGCCGCGGCG GCGTTGGCCA GCATGAGCGT GAACGGCGCC
CGGGCGATCG GCCCCGCGGT CGCCGGCGTC CTGGTGTCCC TGTCGGGGCC CACCACGGTG
TTCGCGCTCA ATGCGTTCTC GTTCATCGGC ATCGTCATGG TGCTGCTCTG GTGGCGGCGG
CCCGTGGAGG AGGCGACGAT GCCTCTCGAG CGGCCGATAT CCGCGCTGAG CGCCGGCCGG
CGGTACATCC GCAGCTCACC GGTGATCCGG CGGATCCTGT TGCGGACCGT GCTGTTCACC
GCACCCGCCA GCGCGCTGTG GGGCCTGCTC GCGGTGATCG CCGCCAACCA GCTGAACCTG
TCGTCGTCGG GGTACGGGCT GCTGCTCGGC GCGCTGGGTG TCGGGGCGGT GCTGGGTGCG
GTGGTGTTGT CGCGGCTGCA TGCGCGCTTC GGCCAGAACC AGCTGATGGT GATGGGTGCG
GTCGGTTTCG CCGGTGCCAC CGTGGTACTC GCGACGGTGC ACGTGCTGGC CGCGGTGCTC
GCCGCGCTGG TGGTGGGCGG GGTGTCGTGG CTGCTCACGA TGTCGACACT CAACGCCTCG
ATGCAGCTGA GCCTGCCCGC CTGGGTGCGG GCGCGCGGAC TGTCGGTCTA CCAGTTGGTC
TTCACCGGAA GTCAGGCGAT CGGCTCGTTG GTCTGGGGTG TGGTCGCGGG CGCGACGAGC
GGGGTGACGG CGTTGCTGAT CAGCGCTGCC CTGCTGATCG TGTGCGGGGT GTCGGTCGCG
TGGTGGCCGC TGCACCCGGC CACCGGCACG CTCGACGTGA CGCCGTCGGC GCACTGGGGT
GAGCCGGCGC TGGTGTTCGA GCCCGATCCG CAGGACGGGC CGGTGGTGGT GCTGCAGTCC
TACGTCGTGG CGCCGAAGGA CGAGGCGGGT TTCCTGGCGC TCATGCAGCG GGTCCGGCGG
TCTCGGCAGC GGACCGGCGC GATGGAGTGG GGGATCTTCC GCAGCGGCGA GTCCGCCGAC
ACCTTCGTGG AACTCTTCCT CGTCCGGTCG TGGGACGAAC ATCTGCGCCA GCATCTGGTG
CGCCAGACCG CCCTCGATCT GGCCCTCGAG CGTGAGATCG AGGGCTATGT CCACGGCGAG
TCGACGCTGC GGCATTTCAT CGCGGTGCGG AACGGGCGTT GA
 
Protein sequence
MATPQTRPVV SSWAPFSSPV YRALWIAQFV SNLGTWMQTV GAQWMLVGDP GAAVLVPLVQ 
TATTLPIMLL ALPSGVLADL IDRRRLLIAT QAAMASGVAA LAMLTGFGLA TPTVLLLLLF
LIGCGQALTT PAWQAIQPEL VPREQIPAAA ALASMSVNGA RAIGPAVAGV LVSLSGPTTV
FALNAFSFIG IVMVLLWWRR PVEEATMPLE RPISALSAGR RYIRSSPVIR RILLRTVLFT
APASALWGLL AVIAANQLNL SSSGYGLLLG ALGVGAVLGA VVLSRLHARF GQNQLMVMGA
VGFAGATVVL ATVHVLAAVL AALVVGGVSW LLTMSTLNAS MQLSLPAWVR ARGLSVYQLV
FTGSQAIGSL VWGVVAGATS GVTALLISAA LLIVCGVSVA WWPLHPATGT LDVTPSAHWG
EPALVFEPDP QDGPVVVLQS YVVAPKDEAG FLALMQRVRR SRQRTGAMEW GIFRSGESAD
TFVELFLVRS WDEHLRQHLV RQTALDLALE REIEGYVHGE STLRHFIAVR NGR