Gene Dtox_4118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_4118 
Symbol 
ID8431132 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4287736 
End bp4288887 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content53% 
IMG OID645036313 
Productglycosyl transferase group 1 
Protein accessionYP_003193411 
Protein GI258517189 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0967144 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000882956 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGCTGAAAG GTGAAACCAT AGTCTGCCTG GCAGCCGCCG ATTACCACGG CATGTGGGCC 
AGAGCCCAGC AATTAATGGC TGTTTATGCC CGGCATGACT GTCGGGTATT GTACATAGAC
CCCCCGGTTA CCCTGCTTTC CCCTATAAAA AATCCAGAAC TGCGCAAAAG ATTAACTACA
CAGCTTGACC GGGTAGGGGA AAATACTTAT ATTTTCAGAC CCCCGGTTTT TTTGCCGTTC
GGAAACATGC GCCGCCGGAT TAATAAGATA AACCAGAGAC GCCTGGCCTG CGCAGTAAAT
AAGACCTTAA AAAAAATCGG CTGGTCCCCC ACGCTCTGGT GGACTTACCT GGTCAACAGT
GTGGATCTGC TGCCTTATTT GCCGGGAAGA GCTATGGTAT GCTATGACTG TGCTGACGAG
CACTCGGCCT TCCCGGGCTT AATTGATGCC GCTGTAGTGG ATAAGATGGA GAGAGAGCTT
TTTGCCGCCT CCTCCGTTAA TCTGGTTACG GCCAGGCAAC TGCTGGAACG TAAAAAAACA
TATGCTCCTG ATATCGAATT TATACCTAAC GGGGCGGATG TGGAGCACTT CGGACAGGCC
CTGTCAGCAT CGCTGCCGGC AGCCGAGGAA GTTGCGCATT TGCCCGGTCC GGTTATCGGC
TATGTCGGGG CTGTCAGCAG CTGGCTGGAT CAGGAGGCTC TGGCTGCATT GGCCGGAGCG
CAGCCCGGCT GGTCTATTGT TTTAATCGGG CCGGTGGATA CAGATGTGGC CCTGTTAAAG
CAGTACAGCA ATATTTATTT CCTGGGCAAA AAAGATTACA GGGATTTGCC GGGATACATT
AAAGCCTTTA ATCTTTGCGT CATTCCTTTT AAGATTAATG ACTTAACCGT AGGGGTTAAC
CCCGTCAAGC TGTATGAATA TCTGGCAGCC GGCAAGCCGG TGGTCTCCAC GGCTCTGCCG
GAGGTGCGTG GGTTTGCCGC ACTGGTGAGT ATAGCCGAAA ACAGTCAGCG GTTTGTAGAT
TTAGTCAAGG AAGAAATAAG GACTGACAGC AGGGAAAAGG CGGCCCGCAG GGTGAGGGCT
GCGTTGGAAA ACTCCTGGGA AGCCAGGGCG GAAGCCGCGG CCGGAAAGAT CTTAGCTGCT
CGCGGTCATT GA
 
Protein sequence
MLKGETIVCL AAADYHGMWA RAQQLMAVYA RHDCRVLYID PPVTLLSPIK NPELRKRLTT 
QLDRVGENTY IFRPPVFLPF GNMRRRINKI NQRRLACAVN KTLKKIGWSP TLWWTYLVNS
VDLLPYLPGR AMVCYDCADE HSAFPGLIDA AVVDKMEREL FAASSVNLVT ARQLLERKKT
YAPDIEFIPN GADVEHFGQA LSASLPAAEE VAHLPGPVIG YVGAVSSWLD QEALAALAGA
QPGWSIVLIG PVDTDVALLK QYSNIYFLGK KDYRDLPGYI KAFNLCVIPF KINDLTVGVN
PVKLYEYLAA GKPVVSTALP EVRGFAALVS IAENSQRFVD LVKEEIRTDS REKAARRVRA
ALENSWEARA EAAAGKILAA RGH