Gene Dtox_3789 details

Gene Information

Locus tag: Dtox_3789
Symbol:
ID: 8430799
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 3965090
End bp: 3966280
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 42%
IMG OID: 645036016
Product: major facilitator superfamily MFS_1
Protein accession: YP_003193119
Protein GI: 258516897
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
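
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (the record does not state this) and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3965090
end_bp = 3966280
gene_length_bp = 1191
protein_length_aa = 396

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp % 3, expected_protein_aa)
```

Under these assumptions the span equals the reported gene length, is a whole number of codons, and gives the reported protein length.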


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.206252
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGAGCAAA AAGAATCTCT TTGGACCAAA GATTTCATTT TAATCTGTCT GGCCAATATG 
ATTATGTTTA TCAGCTTCTA TCTGCTTTTA CCTACATTAC CGGTTTTTGT TATTGATGTA
TTAAAAGGGG ATAAGAGCAA GGTTGGCTAT ATCATTGGTA TTCTGTCTCT AACAGCAGTC
TTGGTGCGGC CAGTTTCCGG CTATATGCTG GATACCCTGA GCCGCAAAAA AGTTTTGCTT
GTGGCCCTGC TGGCTTTTAT CCTTTCTATG GCAGCTTATA ATTTTGTCAC CGGTTTAACA
CTCTTATTAG TTTTAAGGGC CCTACATGGT TTTTCCTGGG GTTTTACTAC CACCGGGGCC
GGAACCATCG CCGCTGATGT GGTACCGCCG ACAAGAAGGG GCGAAGGAAT GGGTTATTTC
GGGCTGTCCA ACACCTTCTC TATGGCGATA GGACCCAGCC TGGGTTTATT CATCATCAAT
AAAGCCGGCT TTACTTCGTT ATTTAATGCC TGTGTGCTTA CCGCCCTGCT AGGCCTATTG
TTTGTTCTTC CCATATCTTA TAAAGAACAG ATTACTTCAA AAGATAAAAG TATTATGAGC
CTGAATAGCT TTTTCGAGGC CAAAGTTTTT TCACTGTCAG CCATGATTTT TTTTATTGCC
GTAGTCTATG GAGGTATTGT TTCCTTTATT ACTATTTACG GAAAGGACCT GGGAATAAAA
AACGCCGGCA CCTTTTTTCT GGTATACGCA CTCACATTAC TATTGGTAAG ACCGATAGCC
GGTAAAACCT TCGACAAAAA CGGCCCGCTG AAAATCATGG CCCTGGGCTT TATCTCCATA
TCCATGGCCT TCGTTCTTCT TTTTATAGCC AAAGGAAACA CGCTTTTTCT TTTATCAGCC
GTCAGTATGG GTATAGGTTT TGGCATAGTC CACCCCACAG CAATGGCTAT GGCTATTAAC
CGGGTAAAAC CTTACCGCAG GGGTGCCGCC AACGCTACTA TTATGAGCGC CTTTGACTTA
GGGATAGGTT TAGGGTCAAT TTTTCTAGGC ATTCTTTCCG ATCAAACAGG CATGTCCTAC
ATGTATCTAA CCTGTAGTCT GATAATTCTA ATACCGCTTG TCATGTTTTA TTTGATAGAT
GCCCGGGAAT TCATAAAAAA ACGGGAAACA AAACACCACT CCCATAAATA G
 
Protein sequence
MEQKESLWTK DFILICLANM IMFISFYLLL PTLPVFVIDV LKGDKSKVGY IIGILSLTAV 
LVRPVSGYML DTLSRKKVLL VALLAFILSM AAYNFVTGLT LLLVLRALHG FSWGFTTTGA
GTIAADVVPP TRRGEGMGYF GLSNTFSMAI GPSLGLFIIN KAGFTSLFNA CVLTALLGLL
FVLPISYKEQ ITSKDKSIMS LNSFFEAKVF SLSAMIFFIA VVYGGIVSFI TIYGKDLGIK
NAGTFFLVYA LTLLLVRPIA GKTFDKNGPL KIMALGFISI SMAFVLLFIA KGNTLFLLSA
VSMGIGFGIV HPTAMAMAIN RVKPYRRGAA NATIMSAFDL GIGLGSIFLG ILSDQTGMSY
MYLTCSLIIL IPLVMFYLID AREFIKKRET KHHSHK
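
The gene sequence begins with TTG rather than ATG, while the protein sequence begins with M. Translation table 11 permits TTG as an initiation codon, and an initiating codon is read as methionine. The sketch below illustrates this with a hand-built codon table in the NCBI codon order. The function names (`clean`, `translate`, `gc_percent`) are hypothetical helpers written for this example, not part of any database tool.

```python
from itertools import product

# Amino acids for translation table 11, listed in the NCBI codon order
# (each position cycles through T, C, A, G; the third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons permitted by table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Drop the display spaces and line breaks and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in a sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence above.
first_bases = "TTGGAGCAAA AAGAATCTCT TTGGACCAAA"
print(translate(first_bases))  # MEQKESLWTK
```

Passing the full gene sequence to `translate` and `gc_percent` allows the result to be compared with the protein sequence and GC content listed in this record.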