Gene Dtox_4192 details


Gene Information

Locus tag: Dtox_4192
Symbol:
ID: 8431206
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 4367667
End bp: 4369409
Gene Length: 1743 bp
Protein Length: 580 aa
Translation table: 11
GC content: 51%
IMG OID: 645036385
Product: 5-oxoprolinase (ATP-hydrolyzing)
Protein accession: YP_003193483
Protein GI: 258517261
COG category: [E] Amino acid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit
TIGRFAM ID:
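
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4367667, 4369409)
print(length, protein_length(length))  # 1743 580
```

Under those assumptions the result agrees with the Gene Length (1743 bp) and Protein Length (580 aa) fields.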


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGATAG ATGCAATAAC ACTGGAGGTG CTGCGTAATT CCCTGCAGTC TGTCGCTGAG 
GAGATGGGCG TAACCCTGAT TCGCACGGCT TTGTCTCCCA ACATAAAGGA TCGCCGTGAC
TGTTCCACCG CTGTGTATAC ACCGGAAGGC GAACTGGTTG CCCAGGCCGA GCATATACCT
CTGCACTTAG GATTAATGCC CACAGTGGTA CAAGCCGTAT TAAAACGCTT TCCCCTTTCG
GAAATGGAGC CGGGTGACAC TATAATCATA AATGATCCTT ATGTCAGCGG TTCACACCTG
CCGGATATTT GTTTGATCAG CCCGGTTTAT TATTTGGATA AGCCTTTGGC TATAGTGGCC
AACCTGGCTC ACCATGTTGA TGTCGGGGGG GCTGTGCCGG GCAGCATGTC AACCGCCGCA
CGGGAAATTT TTCAGGAAGG CCTGCGTATA CCTCCGGTCA AACTGATGCA AAAAAATAAG
ATTAACCGTG ACTTGCTGGA TGTTTTAGCC AACAATGTGC GCACTTCGAA GGAGTTTTAC
GGCGATATAG AAGCGCAGTT GACCGCCAAC CGGGTAGGAG AGATCAGGCT CAAGGAACTG
GGGGCGAGGT ATGGTTTGGA ATTATTAAAC CACTATATGA ACGAAATAAT TGCCTATGGT
GAGCGCAGGA TGCGGGCCGC CCTGTCTGCC TTGCCTTCCG GCGTATACAG TTATGAAGAC
TATCTGGAAG GTGACGGTAT AACCGATCAA CCGGTAAAAA TAAAGGTTGC TTTAATCTTG
GAAGGGGACA GCCTGACTGT TGATTTTGGT GGAACCGATC CGCAAGCTGT CGGCCCGGTT
AATGCCGCCA GGGGAGTAAT TATGGCCTGT GTTTACTATA CGATCAAGGC GGTAGCCGAT
CCCTGGCTGC CCTCAAGCGC CGGTATTGGT TACCCGATAA AAGTAATAAC TCCTGCCGGC
AGCCTGGTCA ATCCACTCTT CCCGGCACCT GTGGCACACG CCAATATTAA CACTGCGCAG
CGGGTGGCAG ATGTTATTTT AGGCGCCCTG GCCGGTGCCG TGCCCCAAAA AGTTACTGCG
GCCGGTACCG GCAGTATGAG TAATTTTACT ATCGGAGGAG TCAATAAGCT GAACGGGTGC
TATTACTCTT ATGTGGAAAC TTACGGCGGG GGGCAGGGAG CCAAGCACAA TCAGGACGGT
ATGGATGGGG TGCATGTGCA TATGACCAAC ACGCGCAATA CCCCGGTGGA GGTAATTGAA
AATAACTACC CTCTAAGAGT AGAAAAATAC GGTTTGCTTA CGGATTCGGG GGGGCCCGGT
GAATACCGTG GGGGAACAGG TTTAGTCAGA GAAATTACCG TTTTGGGCGC TGAGGCTGTT
GTCTCGGTGA GTACCGAAAG AGCTGTGTTT GCACCCTGGG GTTTGTCAGG AGGGTTAGCC
GGACGCAGAG CGGGCTACGC CATAAAAAAC AGTGCCAGTG CGGAGCATGG GGCCAATTTG
GGCGGTAAGT TTACAGGACA GGTTGCGGCA AATACCACAA TCGTTTTGGA AACCGCCGGG
GGAGGCGGTT TCGGTAATCC TCTGAAAAGG GATCCGAGCA AAGTTCGGCA GGATGTTTTA
AACGGTCTGG TCTCTTTTAA AGCAGCCCGT GATTATTACG GTGTGGTTAT TTCGAAGAGT
TATGTGGTGG ATGAGGAGTC AACAAAGCTT TTAAGGGATG AGTTAAGGAG AAGCAGTCCA
TGA
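
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any length or composition check. The following is a minimal sketch with hypothetical helper names (`clean_sequence`, `gc_percent`); it is shown on the first line of the sequence only, and running it on the full sequence would be the way to compare against the Gene Length and GC content fields.

```python
def clean_sequence(raw: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGACGATAG ATGCAATAAC ACTGGAGGTG CTGCGTAATT CCCTGCAGTC TGTCGCTGAG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```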
 
Protein sequence
MTIDAITLEV LRNSLQSVAE EMGVTLIRTA LSPNIKDRRD CSTAVYTPEG ELVAQAEHIP 
LHLGLMPTVV QAVLKRFPLS EMEPGDTIII NDPYVSGSHL PDICLISPVY YLDKPLAIVA
NLAHHVDVGG AVPGSMSTAA REIFQEGLRI PPVKLMQKNK INRDLLDVLA NNVRTSKEFY
GDIEAQLTAN RVGEIRLKEL GARYGLELLN HYMNEIIAYG ERRMRAALSA LPSGVYSYED
YLEGDGITDQ PVKIKVALIL EGDSLTVDFG GTDPQAVGPV NAARGVIMAC VYYTIKAVAD
PWLPSSAGIG YPIKVITPAG SLVNPLFPAP VAHANINTAQ RVADVILGAL AGAVPQKVTA
AGTGSMSNFT IGGVNKLNGC YYSYVETYGG GQGAKHNQDG MDGVHVHMTN TRNTPVEVIE
NNYPLRVEKY GLLTDSGGPG EYRGGTGLVR EITVLGAEAV VSVSTERAVF APWGLSGGLA
GRRAGYAIKN SASAEHGANL GGKFTGQVAA NTTIVLETAG GGGFGNPLKR DPSKVRQDVL
NGLVSFKAAR DYYGVVISKS YVVDEESTKL LRDELRRSSP
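
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand rather than relying on a library; it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The `translate` function is a hypothetical helper and assumes the input contains only A, C, G and T.

```python
from itertools import product

# Codon table in the compact NCBI layout: bases ordered T, C, A, G,
# first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    seq = "".join(raw_dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied from the record.
first_line = "ATGACGATAG ATGCAATAAC ACTGGAGGTG CTGCGTAATT CCCTGCAGTC TGTCGCTGAG"
print(translate(first_line))  # MTIDAITLEVLRNSLQSVAE
```

The output matches the first twenty residues of the protein sequence listed above.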