Gene Dtox_2046 details

Gene Information

Locus tag: Dtox_2046
Symbol:
ID: 8429028
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 2225468
End bp: 2226595
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 43%
IMG OID: 645034367
Product: peptidase T-like protein
Protein accession: YP_003191498
Protein GI: 258515276
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2195] Di- and tripeptidases
TIGRFAM ID: [TIGR01883] peptidase T-like protein
            [TIGR01891] amidohydrolase
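
The gene and protein lengths listed above follow from the Start bp and End bp values by simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS includes its stop codon, neither of which the record states explicitly. The function names are hypothetical.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive coordinates on replicon NC_013216.
START_BP = 2225468
END_BP = 2226595


def gene_length(start: int, end: int) -> int:
    """Length in bp of an interval with inclusive endpoints."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1128 375"
```

Under these assumptions the result agrees with the Gene Length (1128 bp) and Protein Length (375 aa) fields.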


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGTTAACA GAGATCGTTT GATTAACGAG TTTTTGCAAA TGGTGCAAAT CGATAGCTTA 
TCAGGCCGGG AAAGACAAAT CGCTGACTAT TTGAAGGATA AATTAATTTC CCTGGGATTA
AGTGTGACAG AGGATGCAGC CGGGGAAAAA GCCGGCGGCA ATGCCGGCAA TATAATTGCT
AAAATCCCGG CCAACAACAG CAGTGCCCCT GTTATTATGC TTTGTTCTCA CATGGATACT
GTCGAACCAG GTATAGGAGT AAAGCCTGTT ATTAAAGATA ATTTCATCTG CTCCCAGAGT
GATACCGTCT TAGGTTCGGA TGATAAGGCC GGTATTGCTG CTATTTTAGA AGCAATCAGG
CAGGTTAAAG AGAACGATGT TCCTCACGGG GGTATTGAAG TCGTCTTTAC TATTTGGGAG
GAAGGCGGGC TTTTTGGTGC CAAGTATTTG GATTACAGTC TGCTGGAGGC AAAAATAGGG
TATGTGCTGG ACAGCGACGG GACTCCCGGC ACCATTATTA CCAGGGCACC CTCTCAGGAT
AAGATTTATG CTGAAATCCA CGGGCGTGCA GCGCATGCAG GTATAAACCC GGAAGATGGT
ATTAATGCTA TACTGGTGGC GGCTAATGCT ATAGCCGGGC TTAATTTGGG CCGGATTGAC
GAAGAGACTA CCTGTAATAT AGGTGTGATT TGCGGTGGTA AAGCAACTAA CATTGTTCCT
GATTTAGTGA AAATTGAAGG AGAAACCAGA AGTCTTGATG TCTCAAAAAG ACAGGCGCAA
AATAAGGTTA TTTGCCGTGC TATAGAGCAG GCGGCTGAAA GATTTGATAC AAAAGCGGAC
ATTAACGTAG AGCCTGAGTA TACCAGCTTT AATCTGTCTG AAGATAGCCT GTCAGTAAAG
ATTGCGATAA AAGCCGCACA AAACCTGGGT TTAACCCCGC GCTTGGAAAA AACAGGCGGC
GGCAGTGATG CCAATGTATT TAATAATATG GGTATTGAGA CAGCTAATCT TGGTATTGGA
ATGCGTAAAG TACATACTAA AGAAGAATAT ATAGCAATTG AAGATCTGAT TAACAATGCC
AGGTATGTAG TGGAAATAAT CAAAACAGCC GGTGAATTAA GTTTATAA
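
A GC content figure like the one in the Gene Information table can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content means the percentage of G and C among the A/C/G/T bases; the record does not say how its own value was computed or rounded. `gc_content` is a hypothetical helper, not part of any database tooling.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Spaces and line breaks (as in the 10-base blocks above) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Example call on the first row of the gene sequence; paste the full
# sequence instead to compare against the GC content field.
first_row = "TTGGTTAACA GAGATCGTTT GATTAACGAG TTTTTGCAAA TGGTGCAAAT CGATAGCTTA"
print(round(gc_content(first_row), 1))
```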
 
Protein sequence
MVNRDRLINE FLQMVQIDSL SGRERQIADY LKDKLISLGL SVTEDAAGEK AGGNAGNIIA 
KIPANNSSAP VIMLCSHMDT VEPGIGVKPV IKDNFICSQS DTVLGSDDKA GIAAILEAIR
QVKENDVPHG GIEVVFTIWE EGGLFGAKYL DYSLLEAKIG YVLDSDGTPG TIITRAPSQD
KIYAEIHGRA AHAGINPEDG INAILVAANA IAGLNLGRID EETTCNIGVI CGGKATNIVP
DLVKIEGETR SLDVSKRQAQ NKVICRAIEQ AAERFDTKAD INVEPEYTSF NLSEDSLSVK
IAIKAAQNLG LTPRLEKTGG GSDANVFNNM GIETANLGIG MRKVHTKEEY IAIEDLINNA
RYVVEIIKTA GELSL
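
The protein sequence can be related back to the gene sequence through translation table 11, in which TTG (the first codon here) is an accepted start codon and is read as methionine at the initiation position. The sketch below is a simplified illustration, not the pipeline that produced this record: it builds the codon table in the usual TCAG order and forces M for a recognized start codon. `translate_cds` and the constant names are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; table 11 uses the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons of NCBI translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a table 11 start codon as M; stop at the first stop."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(dna), 3):
        codon = dna[index:index + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if index == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate_cds("TTGGTTAACA GAGATCGTTT GATTAACGAG"))  # prints "MVNRDRLINE"
print(translate_cds("GGTGAATTAA GTTTATAA"))  # prints "GELSL"
```

Both fragments match the start and end of the listed protein sequence.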