Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_3789 |
Symbol | |
ID | 8430799 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 3965090 |
End bp | 3966280 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 645036016 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003193119 |
Protein GI | 258516897 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.206252 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGAGCAAA AAGAATCTCT TTGGACCAAA GATTTCATTT TAATCTGTCT GGCCAATATG ATTATGTTTA TCAGCTTCTA TCTGCTTTTA CCTACATTAC CGGTTTTTGT TATTGATGTA TTAAAAGGGG ATAAGAGCAA GGTTGGCTAT ATCATTGGTA TTCTGTCTCT AACAGCAGTC TTGGTGCGGC CAGTTTCCGG CTATATGCTG GATACCCTGA GCCGCAAAAA AGTTTTGCTT GTGGCCCTGC TGGCTTTTAT CCTTTCTATG GCAGCTTATA ATTTTGTCAC CGGTTTAACA CTCTTATTAG TTTTAAGGGC CCTACATGGT TTTTCCTGGG GTTTTACTAC CACCGGGGCC GGAACCATCG CCGCTGATGT GGTACCGCCG ACAAGAAGGG GCGAAGGAAT GGGTTATTTC GGGCTGTCCA ACACCTTCTC TATGGCGATA GGACCCAGCC TGGGTTTATT CATCATCAAT AAAGCCGGCT TTACTTCGTT ATTTAATGCC TGTGTGCTTA CCGCCCTGCT AGGCCTATTG TTTGTTCTTC CCATATCTTA TAAAGAACAG ATTACTTCAA AAGATAAAAG TATTATGAGC CTGAATAGCT TTTTCGAGGC CAAAGTTTTT TCACTGTCAG CCATGATTTT TTTTATTGCC GTAGTCTATG GAGGTATTGT TTCCTTTATT ACTATTTACG GAAAGGACCT GGGAATAAAA AACGCCGGCA CCTTTTTTCT GGTATACGCA CTCACATTAC TATTGGTAAG ACCGATAGCC GGTAAAACCT TCGACAAAAA CGGCCCGCTG AAAATCATGG CCCTGGGCTT TATCTCCATA TCCATGGCCT TCGTTCTTCT TTTTATAGCC AAAGGAAACA CGCTTTTTCT TTTATCAGCC GTCAGTATGG GTATAGGTTT TGGCATAGTC CACCCCACAG CAATGGCTAT GGCTATTAAC CGGGTAAAAC CTTACCGCAG GGGTGCCGCC AACGCTACTA TTATGAGCGC CTTTGACTTA GGGATAGGTT TAGGGTCAAT TTTTCTAGGC ATTCTTTCCG ATCAAACAGG CATGTCCTAC ATGTATCTAA CCTGTAGTCT GATAATTCTA ATACCGCTTG TCATGTTTTA TTTGATAGAT GCCCGGGAAT TCATAAAAAA ACGGGAAACA AAACACCACT CCCATAAATA G
|
Protein sequence | MEQKESLWTK DFILICLANM IMFISFYLLL PTLPVFVIDV LKGDKSKVGY IIGILSLTAV LVRPVSGYML DTLSRKKVLL VALLAFILSM AAYNFVTGLT LLLVLRALHG FSWGFTTTGA GTIAADVVPP TRRGEGMGYF GLSNTFSMAI GPSLGLFIIN KAGFTSLFNA CVLTALLGLL FVLPISYKEQ ITSKDKSIMS LNSFFEAKVF SLSAMIFFIA VVYGGIVSFI TIYGKDLGIK NAGTFFLVYA LTLLLVRPIA GKTFDKNGPL KIMALGFISI SMAFVLLFIA KGNTLFLLSA VSMGIGFGIV HPTAMAMAIN RVKPYRRGAA NATIMSAFDL GIGLGSIFLG ILSDQTGMSY MYLTCSLIIL IPLVMFYLID AREFIKKRET KHHSHK
|
| |