Gene Dtox_2017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_2017 
Symbol 
ID8428999 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp2186427 
End bp2187983 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content39% 
IMG OID645034344 
Producttransposase IS4 family protein 
Protein accessionYP_003191475 
Protein GI258515253 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.167973 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGGTA AACTAACAAA TCAAATTAAC TTTTCTGATA CAGATGAATG GTATAAACGA 
ATACCCCAAA ATTCATTTTG GCATAAAGTG CGTCAATGGG CAGAAGAAAA TCTGTCAGAT
GATGACTTTG CCCATCTATA TTCCAGTAAT AGGGGACGCC CGTCATTACC CCCGGTGTTT
ATGCTGAAAT CCATACTGAT ACAGCTTGAA AAACAATACT CCGATCGTGT TATGGAAGAA
TCAGCAATGT TCGATGACCG CGTAAAATAT GCACTTTGCC TTAGCCGTAC CCCCCAGATA
AAACTTGACC ATGCCACATT ATGCAGATAT CGCAAAATAT TCTTAGAAGA CGAGCAGGGA
AAGAAAATAT TAAAGAAAAC AATAGAAAAT GCAGCAGAAG CAGGGCTATT CGAAGAAGCT
AATAAAGATG CTGTCGACTC ATTTATGATT CATGGAGCGG CAGCCCGTCA AAGTACCTTT
ACCATGATCC GGAAAGCCAC AGCCCGGGTA TTACGCCAGG CTGACGTTGA GGGCTTTATT
GAAGATATAA GACAAAAACT CAATCGTGAT GATTACCTTA ACAATAAAAA ACCCACTATA
GACTGGGATA ACATAGAAGC TCGCAATAAA TTACTCACAG AAATGGTCCT TGACGCCAGA
ACAATAGCCT TATGGGCAAA AGAGAACAAA GCCAAAATCA GTGAAGAACT AAACCAATGC
ATAGAACTTT TGCAAATAGT AGCAGAGCAG GACATTGAAG AAAAAGATGG AAATATAGCC
ATCCGTCAGG GTGTAGCTAA AGACAGAATC ATATCAGTAG AAGATCCGCA AATGCGTCAT
GGACGCAAAA CAACGAGCAG TAAAACAGAT GGTTACAAGG GACACATCAT GTCCGGCGGT
ATAGAAAACA AGATAATAAC AGCTGCAGAA ATAACTGCTG CTAACGTTCC GGACAGTGAA
CCTGTACCTG ACTTAATCAA GCAGCGTCAA GAAAATACAG GAAGTAAACC TGATTCCCTT
AGCGGTGATA CCGCATATGG TGGTGCTGAA ACCAGAAAAC ACATTAAAAA AGAAAAAATC
AAACTAATCG CCAAAGTACC TCCGTCAACC AATGTAAATG GATGTTTTAA TAAAGACAAA
TTCATCATTG ATCTGGATAA CAAATTTATA GAATGCCCTG CCGGAGTTCG ACTTGAAATA
GATAAGGAAC TGGGAGAAAA GGAAATATGC TGTAAATTCC CAAAAGAACA GTGTCAAAAT
TGCGATCTAA GAAATCAATG CACCAAAAGT AAAGATGGCA GAACAGTAAG AATACATCCG
CATGAAGCAT TATTGCAAAA GGCACGTAAA CAACAACAAA CAGCAGAGTT TAAAGAGGAG
TATCGTTTCC GTAGTCGGAT TGAGAGAATA ATTTATTGTG TTACTAAAAA TGGAGCCCGA
AAGGGTAAAT ATAATGGACT TGAAAAAAAT AGATTTAAGT TGCAACTCCA TACGGCTTTA
CATAATATCA AAACAATACT TTCTTTAGCT AAAAAAAATC CTGCGATAGT TGTATAG
 
Protein sequence
MMGKLTNQIN FSDTDEWYKR IPQNSFWHKV RQWAEENLSD DDFAHLYSSN RGRPSLPPVF 
MLKSILIQLE KQYSDRVMEE SAMFDDRVKY ALCLSRTPQI KLDHATLCRY RKIFLEDEQG
KKILKKTIEN AAEAGLFEEA NKDAVDSFMI HGAAARQSTF TMIRKATARV LRQADVEGFI
EDIRQKLNRD DYLNNKKPTI DWDNIEARNK LLTEMVLDAR TIALWAKENK AKISEELNQC
IELLQIVAEQ DIEEKDGNIA IRQGVAKDRI ISVEDPQMRH GRKTTSSKTD GYKGHIMSGG
IENKIITAAE ITAANVPDSE PVPDLIKQRQ ENTGSKPDSL SGDTAYGGAE TRKHIKKEKI
KLIAKVPPST NVNGCFNKDK FIIDLDNKFI ECPAGVRLEI DKELGEKEIC CKFPKEQCQN
CDLRNQCTKS KDGRTVRIHP HEALLQKARK QQQTAEFKEE YRFRSRIERI IYCVTKNGAR
KGKYNGLEKN RFKLQLHTAL HNIKTILSLA KKNPAIVV