Gene Dtox_4090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_4090 
Symbol 
ID8431104 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4258034 
End bp4259358 
Gene Length1325 bp 
Protein Length441 aa 
Translation table11 
GC content35% 
IMG OID645036288 
Producttransposase IS204/IS1001/IS1096/IS1165 family protein 
Protein accessionYP_003193386 
Protein GI258517164 
COG category[L] Replication, recombination and repair 
COG ID[COG3464] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTCTC GCTTGGAAGA CTTTATCATC ACGCTTGCAC TTAATACGAG TTGTGAAGGT 
ACTGCCCGTA TTTGTAAACA GATGAATATC AATATTAGCG GTGATACTGT AATTAAGATC
CTGTTACGCA ATGCCAAATC CATCGATCCT GAGTACGGTG AATTTATAGG TGTTGATGAT
TGGGCCTATA AAAAGGGACA TACCTACGGG ACCATTATAT GTGATGGTGC TTCCCATAAA
CCAATTGCCC TCTTAGATGG TCGCGACGGA AGTGCCTTAA AAGAATGGCT AGAGAGAAAT
CAACACATTA AAACAGCTAC GAGAGATAGA GCCAGTAGTT ATGCAAAAGC CATTGAGGAA
GCACTGCCAC AGGCGATGCA GATTGCCGAC AGATTCCACC TTCACCAAAA TCTTTTAAAA
GCGATCAAAG ACGCACTGGG ACGAGAAATT CCAGCAAAAA TAATGATTCC TATAGCGAAT
TCAGCTCCTA ATTTAGCTGA CTCACCAGCT ATGGACGAGC CTAAATTAAA AAAAATGTGT
TAACTGATGC TGAGAAAAAT CGAAGAGAAA CGATTATTAA GATTCAATCT TACTTATCTC
AAGGTTATTC AAGTAAAGCC ATTTGTGAGA TGATGCACAC AACTTATAGG CAAATTAGGA
AGTTTTCAAT AGGTGATCCC GATATTCTAT GCTGCAGTAA TAAATTGAAG TCAAATTCCT
TATGTAGATC CGAGCTTGAT CAATATAAAA ACATCATTTT GGAACAATTA GCTTTAAAGG
CAAAAATCAA AAGTATCTAT GAATTAATCC TTGAGAGAGG ACATACCGGA AAACGCACTA
ACTTTTATGA TTATTGCAAA AAACTTATAG AGAAAAATGA TGTTGCTCAC CCTACAAACA
CCAATATTCT TGATGTCAAA CTTAATAAGA ACAAACCCAA AGGCCATTTC ATTGAAAGAA
ATCGAATATT AAAATACCTT TGGTCTAACT TGAATATTCC ATTGGCAGAT ATTGATTTCA
TTTTAGAGAG GTATCCATTC TTAAAAGAAA TGGGAGATTG TATTTCAGAC TTTCGAAATA
TTTATGTCAA CAAGAGCGTT ACACTCTTAA AAGAGTTTGT AGATAAATAT GTTAAAAGCA
ATAACAAGAA CCTGAAGTCA TTTGCAAATG GTATTTTTAA AGACTTTATA GCTGTTAAAA
ATTCAGTTAT CAGTGAATAC AGTAATGGAT TTATTGAAGG TAATAACAAT CGTCTAAAAA
TGATCAAACG CACCATGTAT GGAAGAGCCG GTTTAAATCT TCTCAGAGCT AAGATTATCT
ATTAG
 
Protein sequence
MTSRLEDFII TLALNTSCEG TARICKQMNI NISGDTVIKI LLRNAKSIDP EYGEFIGVDD 
WAYKKGHTYG TIICDGASHK PIALLDGRDG SALKEWLERN QHIKTATRDR ASSYAKAIEE
ALPQAMQIAD RFHLHQNLLK AIKDALGREI PAKIMIPIAN SAPNLADSPA MDEPKLKKNV
LTDAEKNRRE TIIKIQSYLS QGYSSKAICE MMHTTYRQIR KFSIGDPDIL CCSNKLKSNS
LCRSELDQYK NIILEQLALK AKIKSIYELI LERGHTGKRT NFYDYCKKLI EKNDVAHPTN
TNILDVKLNK NKPKGHFIER NRILKYLWSN LNIPLADIDF ILERYPFLKE MGDCISDFRN
IYVNKSVTLL KEFVDKYVKS NNKNLKSFAN GIFKDFIAVK NSVISEYSNG FIEGNNNRLK
MIKRTMYGRA GLNLLRAKII Y