Gene Dtox_1442 details

Gene Information

Locus tag: Dtox_1442
Symbol:
ID: 8428397
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 1495475
End bp: 1497031
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 39%
IMG OID: 645033776
Product: transposase IS4 family protein
Protein accession: YP_003190934
Protein GI: 258514712
COG category:
COG ID:
TIGRFAM ID:
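
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1495475
end_bp = 1497031
gene_length_bp = 1557
protein_length_aa = 518

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is complete and its final codon is a stop codon,
# which contributes no residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(remainder == 0)                            # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the three fields agree: the span is 1557 bp, which is 519 codons, giving 518 residues plus the stop codon.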


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGGTA AACTAACAAA TCAAATTAAC TTTTCTGATA CAGATGAATG GTATAAACGA 
ATACCCCAAA ATTCATTTTG GCATAAAGTG CGTCAATGGG CAGAAGAAAA TCTGTCAGAT
GATGACTTTG CCCATCTATA TTCCAGTAAT AGGGGACGCC CGTCATTACC CCCGGTGTTT
ATGCTGAAAT CCATACTGAT ACAGCTTGAA AAACAATACT CCGATCGTGT TATGGAAGAA
TCAGCAATGT TCGATGACCG CGTAAAATAT GCACTTTGCC TTAGCCGTAC CCCCCAGATA
AAACTTGACC ATGCCACATT ATGCAGATAT CGCAAAATAT TCTTAGAAGA CGAGCAGGGA
AAGAAAATAT TAAAGAAAAC AATAGAAAAT GCAGCAGAAG CAGGGCTATT CGAAGAAGCT
AATAAAGATG CTGTCGACTC ATTTATGATT CATGGAGCGG CAGCCCGTCA AAGTACCTTT
ACCATGATCC GGAAAGCCAC AGCCCGGGTA TTACGCCAGG CTGACGTTGA GGGCTTTATT
GAAGATATAA GACAAAAACT CAATCGTGAT GATTACCTTA ACAATAAAAA ACCCACTATA
GACTGGGATA ACATAGAAGC TCGCAATAAA TTACTCACAG AAATGGTCCT TGACGCCAGA
ACAATAGCCT TATGGGCAAA AGAGAACAAA GCCAAAATCA GTGAAGAACT AAACCAATGC
ATAGAACTTT TGCAAATAGT AGCAGAGCAG GACATTGAAG AAAAAGATGG AAATATAGCC
ATCCGTCAGG GTGTAGCTAA AGACAGAATC ATATCAGTAG AAGATCCGCA AATGCGTCAT
GGACGCAAAA CAACGAGCAG TAAAACAGAT GGTTACAAGG GACACATCAT GTCCGGCGGT
ATAGAAAACA AGATAATAAC AGCTGCAGAA ATAACTGCTG CTAACGTTCC GGACAGTGAA
CCTGTACCTG ACTTAATCAA GCAGCGTCAA GAAAATACAG GAAGTAAACC TGATTCCCTT
AGCGGTGATA CCGCATATGG TGGTGCTGAA ACCAGAAAAC ACATTAAAAA AGAAAAAATC
AAACTAATCG CCAAAGTACC TCCGTCAACC AATGTAAATG GATGTTTTAA TAAAGACAAA
TTCATCATTG ATCTGGATAA CAAATTTATA GAATGCCCTG CCGGAGTTCG ACTTGAAATA
GATAAGGAAC TGGGAGAAAA GGAAATATGC TGTAAATTCC CAAAAGAACA GTGTCAAAAT
TGCGATCTAA GAAATCAATG CACCAAAAGT AAAGATGGCA GAACAGTAAG AATACATCCG
CATGAAGCAT TATTGCAAAA GGCACGTAAA CAACAACAAA CAGCAGAGTT TAAAGAGGAG
TATCGTTTCC GTAGTCGGAT TGAGAGAATA ATTTATTGTG TTACTAAAAA TGGAGCCCGA
AAGGGTAAAT ATAATGGACT TGAAAAAAAT AGATTTAAGT TGCAACTCCA TACGGCTTTA
CATAATATCA AAACAATACT TTCTTTAGCT AAAAAAAATC CTGCGATAGT TGTATAG
 
Protein sequence
MMGKLTNQIN FSDTDEWYKR IPQNSFWHKV RQWAEENLSD DDFAHLYSSN RGRPSLPPVF 
MLKSILIQLE KQYSDRVMEE SAMFDDRVKY ALCLSRTPQI KLDHATLCRY RKIFLEDEQG
KKILKKTIEN AAEAGLFEEA NKDAVDSFMI HGAAARQSTF TMIRKATARV LRQADVEGFI
EDIRQKLNRD DYLNNKKPTI DWDNIEARNK LLTEMVLDAR TIALWAKENK AKISEELNQC
IELLQIVAEQ DIEEKDGNIA IRQGVAKDRI ISVEDPQMRH GRKTTSSKTD GYKGHIMSGG
IENKIITAAE ITAANVPDSE PVPDLIKQRQ ENTGSKPDSL SGDTAYGGAE TRKHIKKEKI
KLIAKVPPST NVNGCFNKDK FIIDLDNKFI ECPAGVRLEI DKELGEKEIC CKFPKEQCQN
CDLRNQCTKS KDGRTVRIHP HEALLQKARK QQQTAEFKEE YRFRSRIERI IYCVTKNGAR
KGKYNGLEKN RFKLQLHTAL HNIKTILSLA KKNPAIVV
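
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is an illustrative example only: the helper names `clean`, `gc_percent`, and `translate` are hypothetical, the codon table is built from the standard NCBI amino-acid ordering (which translation table 11 shares for internal codons), and alternative start codons are not handled. Only the first 30 bases of the gene sequence are used here; pasting in the full sequence works the same way.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq):
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the gene sequence, as printed in the record.
prefix = "ATGATGGGTA AACTAACAAA TCAAATTAAC"
print(translate(prefix))  # MMGKLTNQIN
print(round(gc_percent(prefix), 1))
```

The translated prefix matches the first ten residues of the protein sequence listed above. The GC percentage of a short prefix will generally differ from the whole-gene value reported in the record.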