Gene Dtox_3741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3741 
Symbol 
ID8430751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3907497 
End bp3908657 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content45% 
IMG OID645035969 
Productamidohydrolase 
Protein accessionYP_003193072 
Protein GI258516850 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00788915 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGCAC TTACCGGAGG AAATATTTTT ACTATGGTTG GTGATAATAT TGAAAACGGT 
ACTGTACTTA TTAAGGACGG TAATATTATC GAGATTGGAG AAAGTATAGC TATCCCATCC
GGTACGGAAA TAATCAGGGT GAACGGCAGG ATTATTACAC CCGGTTTAAT TGATGCTCAC
AGTCATATAG GTATGTTTGA GGAGATATAC CGGGTAGAGG GTGATGACGG GAATGAAATG
ACCGATCCTC TGACCCCGCA TCTCAGGGCT ATTGATGCAG TTAACCCGGA GGATTTAGCT
TTTCAGGATG CTTTAAAGGG CGGTGTGACA ACAGTAGTTA CCGGTCCCGG CAGTGCTAAT
ATTCTGGGGG GTGAGATGGC TGCTCTGAAA ACCTGGGGTA AAACTGTCGA AGATATGATT
GTGAAGTTTC CTATTGGTTT AAAAGCAGCG CTGGGTGAAA ATCCCAAACG TGTGTACGGT
ACAACAAGCA AAGCGCCTCA TACCAGAATG GCCAGTGCGG CAATTTTGCG TGAAGCACTG
GTCAGCGCAC AGAATTACAT AAAAAAGCAG GAAAACTGGA TTAACGGGGA GAAAGATAAA
CATGAGCCTG AGCGAGATTT GAAACTGGAA TCTATTGGCC GGGTTTTAAA ACGAGAAATA
CCGCTGCGAG TGCATGCTCA CCGGGCCGAC GATATTATGA CGGCAGTTCG TATCGCCCGT
GAATTCGACC TGAAGATAAT AATTGAGCAT TGTACAGAGG GATACAAAGT AGCGGATGAA
CTGGCGCGCT TGAATATTCC GGCGATAATC GGACCTATAA TTACTAACCG GGCCAAGGTT
GAACTGCAGG GTATTAATCT TTCTAATGGT AAATATTTGG AGAAAGCCGG GGTTAATTTT
GCTATAATGA CAGATCATCC TGTTGTCCCA ATACAGTACC TGGCGTTATC AGCCGGCCTG
ACGGTACAGG GCGGTTTGTC AGAAAAAACA GCACTGCAGG CAATAACAGT CAATGCTGCT
AAATTACTGG GTTTGAATGG CCTGGGTACA CTGGAAAAAG GTCAAAAGGC TGACTTGGTG
GTTTGGTCAG GCCACCCCTT TGATCTAAGG TCAAAAGTTG AATTGGTTTA CATTAATTGT
AACTTGTTGT CATTCGATTA A
 
Protein sequence
MLALTGGNIF TMVGDNIENG TVLIKDGNII EIGESIAIPS GTEIIRVNGR IITPGLIDAH 
SHIGMFEEIY RVEGDDGNEM TDPLTPHLRA IDAVNPEDLA FQDALKGGVT TVVTGPGSAN
ILGGEMAALK TWGKTVEDMI VKFPIGLKAA LGENPKRVYG TTSKAPHTRM ASAAILREAL
VSAQNYIKKQ ENWINGEKDK HEPERDLKLE SIGRVLKREI PLRVHAHRAD DIMTAVRIAR
EFDLKIIIEH CTEGYKVADE LARLNIPAII GPIITNRAKV ELQGINLSNG KYLEKAGVNF
AIMTDHPVVP IQYLALSAGL TVQGGLSEKT ALQAITVNAA KLLGLNGLGT LEKGQKADLV
VWSGHPFDLR SKVELVYINC NLLSFD