Gene Mjls_0215 details

Gene Information

Locus tag: Mjls_0215
Symbol:
ID: 4875961
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 229090
End bp: 230691
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 70%
IMG OID: 640137529
Product: protein of unknown function DUF894, DitE
Protein accession: YP_001068519
Protein GI: 126432828
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
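
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the protein length excludes the stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 229090
END_BP = 230691
GENE_LENGTH_BP = 1602
PROTEIN_LENGTH_AA = 533


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one for the stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 230691 - 229090 + 1 gives 1602 bp, and 1602 / 3 - 1 gives 533 aa, which agrees with the listed Gene Length and Protein Length.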


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.155573
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.878116
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCACAC CGCAAACACG ACCCGTGGTG TCCTCGTGGG CGCCGTTCTC GTCTCCTGTC 
TACCGCGCGC TGTGGATCGC GCAGTTCGTC TCGAACCTCG GCACGTGGAT GCAGACGGTG
GGTGCACAGT GGATGCTGGT CGGCGACCCC GGTGCCGCGG TACTCGTGCC GCTGGTGCAG
ACCGCGACGA CGCTGCCGAT CATGCTGCTG GCGTTGCCGT CGGGTGTGCT CGCCGATCTG
ATCGACCGGC GCCGGCTGCT GATCGCCACC CAGGCCGCGA TGGCGTCGGG GGTGGCGGCG
CTGGCGATGC TGACCGGCTT CGGCCTCGCG ACACCCACCG TGCTGCTGCT TCTGCTCTTC
CTGATCGGGT GTGGGCAGGC CCTGACGACG CCGGCGTGGC AGGCCATTCA ACCGGAACTC
GTCCCGCGCG AACAGATTCC GGCCGCGGCG GCGTTGGCCA GCATGAGCGT GAACGGCGCC
CGGGCGATCG GCCCGGCGGT CGCCGGCGTC CTGGTGTCGC TGTCGGGGCC CACCACGGTG
TTCGCGCTCA ATGCGTTCTC GTTCATCGGC ATCGTCATGG TGCTGCTCTG GTGGCGGCGG
CCCGTGGAGG AGGCGACGAT GCCTCTCGAG CGGCCGATAT CGGCGCTGAG CGCCGGCCGG
CGGTACATCC GCAGCTCACC GGTGATCCGG CGGATCCTGT TGCGGACCGT GCTGTTCACC
GCACCCGCCA GCGCGCTGTG GGGCCTGCTC GCGGTGATCG CCGCCAACCA GCTGAACCTG
TCGTCGTCGG GGTACGGGCT GCTGCTCGGC GCGCTGGGTG TCGGGGCGGT GCTGGGTGCG
GTGGTGTTGT CGCGGCTGCA TGCCCGCTTC GGCCAGAACC AGCTGATGGT GATGGGTGCG
GTCGGTTTCG CCGGTGCCAC CGTGGTACTC GCGACGGTGC ACGTGCTGGC CGCGGTGCTC
GCCGCGCTGG TGGTGGGCGG GGTGGCGTGG CTGCTCACGA TGTCGACCCT CAACGCCTCG
ATGCAGCTGA GCCTGCCCGC CTGGGTGCGG GCGCGCGGAC TGTCGGTCTA CCAGTTGGTC
TTCACCGGAA GTCAGGCGAT CGGCTCGTTG GTCTGGGGTG TGGTCGCGGG CGCGACGAGC
GGGGTGACGG CGTTGCTGAT CAGCGCTGCC CTGCTGATCG TGTGCGGGGT GTCGGTCGCA
TGGTGGCCGC TGCACCCGGC CACCGGCACG CTCGACGTGA CGCCGTCGGC GCACTGGGGT
GAGCCGGCGC TGGTGTTCGA GCCCGATCCG CAGGACGGAC CGGTGGTGGT GCTGCAGTCC
TACGTCGTGG CGCCGGAGGA CGAGGCGGGT TTCCTGGCGC TCATGCAGCG GGTCCGGCGG
TCTCGGCAGC GGACCGGCGC GATGGAGTGG GGGATCTTCC GCAGCGGCGA GTCCGCCGAC
ACCTTCGTGG AACTCTTCCT CGTCCGGTCA TGGGACGAAC ATCTGCGCCA GCATCTGGTG
CGCCAGACCG CCCTCGATCT GGCCCTCGAG CGTGAGATCG AGGGCTACGT CCACGGCGAG
TCGACGCTGC GGCATTTCAT CGCGGTGCGG AACGGGCGTT GA
 
Protein sequence
MATPQTRPVV SSWAPFSSPV YRALWIAQFV SNLGTWMQTV GAQWMLVGDP GAAVLVPLVQ 
TATTLPIMLL ALPSGVLADL IDRRRLLIAT QAAMASGVAA LAMLTGFGLA TPTVLLLLLF
LIGCGQALTT PAWQAIQPEL VPREQIPAAA ALASMSVNGA RAIGPAVAGV LVSLSGPTTV
FALNAFSFIG IVMVLLWWRR PVEEATMPLE RPISALSAGR RYIRSSPVIR RILLRTVLFT
APASALWGLL AVIAANQLNL SSSGYGLLLG ALGVGAVLGA VVLSRLHARF GQNQLMVMGA
VGFAGATVVL ATVHVLAAVL AALVVGGVAW LLTMSTLNAS MQLSLPAWVR ARGLSVYQLV
FTGSQAIGSL VWGVVAGATS GVTALLISAA LLIVCGVSVA WWPLHPATGT LDVTPSAHWG
EPALVFEPDP QDGPVVVLQS YVVAPEDEAG FLALMQRVRR SRQRTGAMEW GIFRSGESAD
TFVELFLVRS WDEHLRQHLV RQTALDLALE REIEGYVHGE STLRHFIAVR NGR
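
The gene and protein sequences above are related by translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in its permitted start codons. The following is a minimal sketch of how the two sequences could be compared and how a GC fraction could be computed. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical; the whitespace stripping assumes the 10-base block layout used in this record, and only short excerpts of the sequence are shown.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
# Table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 12 bases of the gene sequence in this record.
first_bases = "ATGGCCACAC CGCAAACACG ACCCGTGGTG"
last_bases = "AACGGGCGTT GA"

print(translate(first_bases))  # MATPQTRPVV
print(translate(last_bases))   # NGR
```

The two excerpts translate to the first ten and last three residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the complete protein and the reported GC content to be checked in the same way.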