Gene Dtox_0497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_0497 
Symbol 
ID8427432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp509690 
End bp511069 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content45% 
IMG OID645032867 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003190045 
Protein GI258513823 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.128275 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAAAA ACAAATGCGG AGTTGCCCTA ATCCTGGTTC TGGTCCTCGG CCTGGGGGTA 
TTTACCGCCG GCTGCGGCAA TACTCAGGCT GCCAGAGAAG CTAATAATCC AGATTCCCTG
ACCATTGCGG TTGTGACCAA AGATATGTAC CTGGATACGG CAGTAAAAAA GTTTGAGGAA
CTTCATCCCA GTGTCAGTGT GAAAGTGAAA GAATACACCT CCAGTACTTC GGAAAAGGGT
GAAGGTGTTA GGGCGGCAGA TCCAGGCGAT ATTGAAAAAT ATGTCACGAC TATGAATACC
CAGCTCATGT CCGGGCAGGG CAGCGATATA ATTTTATTAA ACAACCTGCC CTATCAGACT
TATGCGGATA AGAATCTTTT AGTTGATTTG GGCGGGCTGA TGCAGTCGGA TCAAAGCTTC
GACAGAGGCA AATACTACCA AAATATTTTT AAGGCTTTAG AATATAAAGA TAAACTCTAT
GCCCTGCCGG TTAATATCAG TATTGATATG ATTGCCGCCG ATAGAACCTT ACTGGCTGAT
TCCCAGGTGC CAATCGACGA TAGCAACTGG GATTGGAACG ATTTTGTCAA GATGGCGGAA
AAGGCCATTA ACGACAAGCA AAACGGGGCA ACCCAGGAGA TGTACGCCTT GGCCGGGATG
GATGAAAAAA GACTGATTAC AACTCTGGTT AAAGAAAACT ATGACAACCT GGTCGATCCG
GAGAAGAAAA CAGCCAATTT TACCGGTCAA GAATTTCTTG ATTTACTTTC CTTAAGTAAA
TATCTGATTG ACCACAAATT GGTTAATACG GATACAGCCC AGACCAACAT TACGGACCTG
GCCTCCCGGG GTAAACTTGT TTTTAATTTC ACTTCTCTTA GAGGGTTTTG GGACCTGCAA
GTGGCCAAGG CGATTTTCAG CGAGGGTGTT CAGCTTTTAA AGCCCCCCGG AAACGTGTTC
TTTTCCACGG ACTCCATGTA TGGGATCAGC AGTAAATCAG CCAATCAAGA ACTGGCCTGG
GAATTTTTAA AATTTTTGGT TTCTGATGAT ATGATGACTC AGGGAGGAAT GCCGGTTAAT
AAAAGTGTGC TTCCCCAGAT TGCCCAGAAT TTCACCCAGG CTATACAAAA AAACGGTGGG
AGAATAAGAA TTAAGGACGA TGGCATTCCG GCTCAATCTG TAACACTGCA CCCCCCCACT
CAGGAGGATG TCGATTACAT GGAAAACCTG CTGAGTCAGG CGAACGTCTA CATCGGAACG
GACCAGAAGA TTATTTCGAT TGTTCAGGAG GAAACCGCGG CTTTCTTTAC GGGACAGAAG
ACAGCGGAGG CGACTGCCCA ATTGATTCAG GACAGGGTGA GCACTTACTT AAATGAATAA
 
Protein sequence
MIKNKCGVAL ILVLVLGLGV FTAGCGNTQA AREANNPDSL TIAVVTKDMY LDTAVKKFEE 
LHPSVSVKVK EYTSSTSEKG EGVRAADPGD IEKYVTTMNT QLMSGQGSDI ILLNNLPYQT
YADKNLLVDL GGLMQSDQSF DRGKYYQNIF KALEYKDKLY ALPVNISIDM IAADRTLLAD
SQVPIDDSNW DWNDFVKMAE KAINDKQNGA TQEMYALAGM DEKRLITTLV KENYDNLVDP
EKKTANFTGQ EFLDLLSLSK YLIDHKLVNT DTAQTNITDL ASRGKLVFNF TSLRGFWDLQ
VAKAIFSEGV QLLKPPGNVF FSTDSMYGIS SKSANQELAW EFLKFLVSDD MMTQGGMPVN
KSVLPQIAQN FTQAIQKNGG RIRIKDDGIP AQSVTLHPPT QEDVDYMENL LSQANVYIGT
DQKIISIVQE ETAAFFTGQK TAEATAQLIQ DRVSTYLNE