Gene Dtox_4100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_4100 
Symbol 
ID8431114 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4269432 
End bp4270520 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content31% 
IMG OID645036297 
Productglycosyl transferase group 1 
Protein accessionYP_003193395 
Protein GI258517173 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0825804 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAGA GAATACTGAT ACTATCAAAT CATTTTATAA CACTATACAA TTTTAGAAAA 
GAATTGATTG TTTATCTGGT TAAATGCGGA CATGAGGTTA TTATTTCTTT ACCAAAAGCT
AAAGAAAATA TTTTTTTTAG TGACTTGGGA TGTAAGGTTA TTGAAACTCA CGTAGATCGC
AGGGGAATTA ACCCTATTAT AGACTTAAGG CTGATTATTA ACTACATTAA AATAATGCAG
AAAACGAAAC CCGATATCAT ATTCTCATAT ACTATTAAAC CCAATATTTA TGGAGCCATA
GCCTCTAACA TAACAAAGAA TGCACAAATT TGCAATATAA CGGGAACTGG TTCGACATTT
ATGAGGAACA ATGCTGTTAG TTTAATAGCA AAGGTAATGT ACAAGATTTC GGTTAAGAAG
TCATACAAGA TTTTTTTTCA AAATATTGGT GACAGGGATT TATTTATTAA AAACGGGATG
GTTGGAGATA ATTTTTGTAT GATTCCTGGT TCTGGAGTGA ACCTAAATCA ATATAAACTA
TGTGATTTAC CACTCGGTGA TGAGATTAAT TTTGTTTTTA TAGGTAGGAT TATGAAATTA
AAAGGTATTG ATCTATACCT TAAGTGTGCT AAAGTAATTA AAGAGAGGTA CCCTAGAACA
AATTTTTACA TTGCAGGTTT TGTTGAAGAA GATAAATATA AAGAAGTAAT TAATTATTAT
CATGCAAAAG GCGTTATTAA TTATATAGGG TTTGAAAAGA ATATCAAGTC ACTTATTCAA
AAATGTCATT GTACAATCCT ACCATCTTAT GGGGGAGAGG GTGTACCGAA TGTGCTTTTA
GAGACTGCAG CTATAGGAAG AATATGTATT GCATCAGCAG TTAATGGTTC TAAAGACGTT
GTTGAGGATG GAGTTACAGG TTATCTATTT GAAAGCGGAA ATATTAAAGA ATTGATAAAT
AAGGTTATGA ATTTTTTGGA ATTAGATTAT GAGACGAAAA AGAAGATGGG CATGGCAGGT
AGAGAAAAGG TAGAAAGGGA ATTTGATAGA CAGATTGTGG TTAAAGCATA TATGGCAGAA
GTAAAGTGA
 
Protein sequence
MSKRILILSN HFITLYNFRK ELIVYLVKCG HEVIISLPKA KENIFFSDLG CKVIETHVDR 
RGINPIIDLR LIINYIKIMQ KTKPDIIFSY TIKPNIYGAI ASNITKNAQI CNITGTGSTF
MRNNAVSLIA KVMYKISVKK SYKIFFQNIG DRDLFIKNGM VGDNFCMIPG SGVNLNQYKL
CDLPLGDEIN FVFIGRIMKL KGIDLYLKCA KVIKERYPRT NFYIAGFVEE DKYKEVINYY
HAKGVINYIG FEKNIKSLIQ KCHCTILPSY GGEGVPNVLL ETAAIGRICI ASAVNGSKDV
VEDGVTGYLF ESGNIKELIN KVMNFLELDY ETKKKMGMAG REKVEREFDR QIVVKAYMAE
VK