Gene Dtox_4116 details

Gene Information

Locus tag: Dtox_4116
Symbol:
ID: 8431130
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 4284791
End bp: 4286413
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 49%
IMG OID: 645036311
Product: O-antigen polymerase
Protein accession: YP_003193409
Protein GI: 258517187
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3307] Lipid A core - O-antigen ligase and related enzymes
TIGRFAM ID:
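
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the single terminal stop codon is not counted in the protein length; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming one untranslated terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len // 3 - 1


# Start bp and End bp as listed in the record.
length = gene_length(4284791, 4286413)
print(length, protein_length(length))  # 1623 540
```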


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0014553
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000561153
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGTTTAAGC GTAGTTGTAT TATAAATTCT TGCCTGTACT TGTGGAACTC TAGTTTAACA 
GCGCGCCTTT TATCTGCATT CTGGCTGGCT GTGACCGGCT GCTGGCAGTC CGGTTTTGTT
TACCATACCG TGGCTGCGGA TGTGGTGGCT CAAAGCTTTT TGTACAGGCA GTTCAACCGC
CTGACCGGTA AAATAGAGCA GCTGATGCCG CGAACTGCTC TGGCCGTAAG GGGGTTTTAT
TGGCAAAACG GCCCCGGTAT GATAACCAGG CCGGTTTTTT GGTATTTGAT TGCGGGCATT
GTCCTTTTAA CAGCTATACC GGCGGCCTTT ATTGACCGGC AGGCAGCCTT GACGATAGCG
GCGGGTGTTT TAACTATGGC CGTGGTTTTC GCCATACCTG AGTCGGGCCT TTACCTGGCC
GCACTGCTTT TGCCCTTCAC ATCTTTTAAT AACCTGGCCC TGCTCGGTGT ATTGACTGCA
CTGGCCTATT CAGCCAGGGT TATTGCCGGG GGCAAGGTTG AGCTTTGCCG CAGCTCCATA
TTGCTGCCGG TGCTGATTTT TTTTGCTGTT CTGGTATATG GCACTGTGAC CTCAGTGCTG
CCCCGGAGCA GTGCCTATGA ATTTTCTATT TATGTGGCGG CCGTGATATA TTTGTTTTTA
ATTATAAATA TGATTGACAA TAAACAAAAG CTGAGCCTCC TGTTAGGCTG CCTGGCACTG
TCAGCCGCGG CGGTTGCGGC TGACGCCATA TATGATTATT ATTTCGGGGT CACCTTTGTT
GATTTGCATA AGGGATGGGT TGATCCGGAG CTGAACCCGG ATATTAAAAA CCGCGCCTCG
GCAGTGTTTG AAAATCCCAA TTTGCTGGCG CAGTATTTTG TTCTGGTGAT ACCCATTACC
GCCTCTTTAA TTGCCGTGGT CAAAAGAATC GGCTATAAAT TTTTGCTCCT GGCCATAGCC
TGCCTGGCCG GTACGGCACT GGTGCTCACC TATTCCAGAG GCGGTTTATA CGGGTTTGTT
TTTGCCATGG CGGTTCTGGC TATGATCAGG GGTCCCAAGT TTTTACCCCT GTTTTTTGCG
GCAGCTGTAA TCGGAGCCTT CTTTTTGCCT CACACAGTCA TAGACCGCCT GGCTACGGCG
GATAATTTGA ATGACAGTTC GGTAGTTTAC CGTTTTGACA TCTGGAAGTC CACGCTTATG
ATGATCAAGG ATTATTGGCT GACCGGAGTC GGTGTGGGCA CAGAGGCTTT TATGAGGGTT
TATTATGTCT ATATGATGAA TTCAGCCATT ATGCCGCACG CGCATAACCT TTATCTCCAG
CTTCTCAGCG AAACCGGCAT CTTCGGGCTG GCTGCTTTTC TTTTATTGAT GTACAAAATA
TACCAGACTG TTTTCCGTCT GGTGTCGAGC AAGTTAAGTT ATATAAAGTG GTTGAATGCC
GGAATAGCCG GTGCCATGGC CGGCTTTTTG CTGCAGTCGC TGTTTGACTA CGGTCTCTGG
TATTATAAGC TGGGGGTTCT GTTCTGGATT TTGATAGGCG TATATATTGT GCTGGAGAAA
CTTAACGCGC GGGAGAAAGG AGCGGTATTT GATGGTGAAA ATCCGCAAGG ACAGGGAAAG
TGA
 
Protein sequence
MFKRSCIINS CLYLWNSSLT ARLLSAFWLA VTGCWQSGFV YHTVAADVVA QSFLYRQFNR 
LTGKIEQLMP RTALAVRGFY WQNGPGMITR PVFWYLIAGI VLLTAIPAAF IDRQAALTIA
AGVLTMAVVF AIPESGLYLA ALLLPFTSFN NLALLGVLTA LAYSARVIAG GKVELCRSSI
LLPVLIFFAV LVYGTVTSVL PRSSAYEFSI YVAAVIYLFL IINMIDNKQK LSLLLGCLAL
SAAAVAADAI YDYYFGVTFV DLHKGWVDPE LNPDIKNRAS AVFENPNLLA QYFVLVIPIT
ASLIAVVKRI GYKFLLLAIA CLAGTALVLT YSRGGLYGFV FAMAVLAMIR GPKFLPLFFA
AAVIGAFFLP HTVIDRLATA DNLNDSSVVY RFDIWKSTLM MIKDYWLTGV GVGTEAFMRV
YYVYMMNSAI MPHAHNLYLQ LLSETGIFGL AAFLLLMYKI YQTVFRLVSS KLSYIKWLNA
GIAGAMAGFL LQSLFDYGLW YYKLGVLFWI LIGVYIVLEK LNAREKGAVF DGENPQGQGK
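
The protein sequence begins with M although the gene sequence begins with TTG, which fits translation table 11, where TTG can act as an initiation codon. The sketch below illustrates that relationship on the first row of the gene sequence. The `translate` helper and the `START_CODONS` set are assumptions made for illustration, not the database's own pipeline; the codon table is written out in the usual TCAG ordering.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (table 11 shares these with the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons assumed for table 11; any of these in first position is read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # alternative start codon read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First row of the gene sequence above.
first_row = "TTGTTTAAGC GTAGTTGTAT TATAAATTCT TGCCTGTACT TGTGGAACTC TAGTTTAACA"
print(translate(first_row))  # MFKRSCIINSCLYLWNSSLT
```

The output matches the first 20 residues of the protein sequence listed above.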