Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_0235 |
Symbol | |
ID | 4615464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | + |
Start bp | 254842 |
End bp | 256443 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639789910 |
Product | protein of unknown function DUF894, DitE |
Protein accession | YP_936242 |
Protein GI | 119866290 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.981168 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACAC CGCAAACACG ACCCGTGGTG TCCTCATGGG CGCCGTTCTC CTCTCCTGTC TACCGCGCGC TGTGGATCGC GCAGTTCGTC TCGAACCTCG GCACGTGGAT GCAGACGGTG GGTGCACAGT GGATGCTGGT CGGCGACCCC GGTGCCGCGG TACTCGTGCC GCTGGTGCAG ACCGCGACGA CGCTGCCGAT CATGCTGCTG GCGTTGCCGT CGGGTGTGCT CGCCGATCTG ATCGACCGGC GCCGGCTTCT GATCGCCACC CAGGCCGCGA TGGCGTCGGG GGTGGCGGCG CTGGCGATGC TGACCGGCTT CGGCCTCGCG ACACCCACCG TTCTGCTGCT TCTGCTCTTC CTGATCGGGT GTGGGCAGGC CCTGACAACG CCCGCGTGGC AGGCCATTCA ACCGGAACTC GTCCCGCGCG AACAGATTCC GGCCGCGGCG GCGTTGGCCA GCATGAGCGT GAACGGCGCC CGGGCGATCG GCCCCGCGGT CGCCGGCGTC CTGGTGTCCC TGTCGGGGCC CACCACGGTG TTCGCGCTCA ATGCGTTCTC GTTCATCGGC ATCGTCATGG TGCTGCTCTG GTGGCGGCGG CCCGTGGAGG AGGCGACGAT GCCTCTCGAG CGGCCGATAT CCGCGCTGAG CGCCGGCCGG CGGTACATCC GCAGCTCACC GGTGATCCGG CGGATCCTGT TGCGGACCGT GCTGTTCACC GCACCCGCCA GCGCGCTGTG GGGCCTGCTC GCGGTGATCG CCGCCAACCA GCTGAACCTG TCGTCGTCGG GGTACGGGCT GCTGCTCGGC GCGCTGGGTG TCGGGGCGGT GCTGGGTGCG GTGGTGTTGT CGCGGCTGCA TGCGCGCTTC GGCCAGAACC AGCTGATGGT GATGGGTGCG GTCGGTTTCG CCGGTGCCAC CGTGGTACTC GCGACGGTGC ACGTGCTGGC CGCGGTGCTC GCCGCGCTGG TGGTGGGCGG GGTGTCGTGG CTGCTCACGA TGTCGACACT CAACGCCTCG ATGCAGCTGA GCCTGCCCGC CTGGGTGCGG GCGCGCGGAC TGTCGGTCTA CCAGTTGGTC TTCACCGGAA GTCAGGCGAT CGGCTCGTTG GTCTGGGGTG TGGTCGCGGG CGCGACGAGC GGGGTGACGG CGTTGCTGAT CAGCGCTGCC CTGCTGATCG TGTGCGGGGT GTCGGTCGCG TGGTGGCCGC TGCACCCGGC CACCGGCACG CTCGACGTGA CGCCGTCGGC GCACTGGGGT GAGCCGGCGC TGGTGTTCGA GCCCGATCCG CAGGACGGGC CGGTGGTGGT GCTGCAGTCC TACGTCGTGG CGCCGAAGGA CGAGGCGGGT TTCCTGGCGC TCATGCAGCG GGTCCGGCGG TCTCGGCAGC GGACCGGCGC GATGGAGTGG GGGATCTTCC GCAGCGGCGA GTCCGCCGAC ACCTTCGTGG AACTCTTCCT CGTCCGGTCG TGGGACGAAC ATCTGCGCCA GCATCTGGTG CGCCAGACCG CCCTCGATCT GGCCCTCGAG CGTGAGATCG AGGGCTATGT CCACGGCGAG TCGACGCTGC GGCATTTCAT CGCGGTGCGG AACGGGCGTT GA
|
Protein sequence | MATPQTRPVV SSWAPFSSPV YRALWIAQFV SNLGTWMQTV GAQWMLVGDP GAAVLVPLVQ TATTLPIMLL ALPSGVLADL IDRRRLLIAT QAAMASGVAA LAMLTGFGLA TPTVLLLLLF LIGCGQALTT PAWQAIQPEL VPREQIPAAA ALASMSVNGA RAIGPAVAGV LVSLSGPTTV FALNAFSFIG IVMVLLWWRR PVEEATMPLE RPISALSAGR RYIRSSPVIR RILLRTVLFT APASALWGLL AVIAANQLNL SSSGYGLLLG ALGVGAVLGA VVLSRLHARF GQNQLMVMGA VGFAGATVVL ATVHVLAAVL AALVVGGVSW LLTMSTLNAS MQLSLPAWVR ARGLSVYQLV FTGSQAIGSL VWGVVAGATS GVTALLISAA LLIVCGVSVA WWPLHPATGT LDVTPSAHWG EPALVFEPDP QDGPVVVLQS YVVAPKDEAG FLALMQRVRR SRQRTGAMEW GIFRSGESAD TFVELFLVRS WDEHLRQHLV RQTALDLALE REIEGYVHGE STLRHFIAVR NGR
|
| |