Gene Dtox_4119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_4119 
Symbol 
ID8431133 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4288881 
End bp4290014 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content49% 
IMG OID645036314 
Productglycosyl transferase group 1 
Protein accessionYP_003193412 
Protein GI258517190 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0989352 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000308882 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
GTGAATCAAT TGCGGGTTTT GCATATAATT GGCGGGGGTG AATTCGGCGG CGCGGAACGC 
CATATATTAA ACCTGTGCGC TTCTCTGGAT TCCCGGAAGG TAGCCATAAC CGTTGTTTGT
CTGTTTGAGG AACCTTTTGC CCGTATGGCC AAAGAAATTG GGGTCAATGT TCTGGTTATG
CCCATGCGGC ATAAACTTGA TATAGGAACT CTTTCCAGAC TCACCGAGGT AATTAAAAAA
AATCGGCCTG ATTTAGTGCA CACACACGGT GTAAGAGCCA ATCTTTTAGG TCGTCTGGCC
GCCAAAATGG CCGGAGTAAA TAGAATTGTT ACAACTGTGC ACAGCCTGTT GGTTCGCGAT
TACCCTGATT TTTGGAGCCG CCTGGCCAAC TCCTGGACAG AGCGCTTGAC AAGAGGACTT
ACCGACCACT TTATTGCTGT ATCGCAGGGT TTGAAAAACG CTCTTATTGC AGACGGTATA
CCGGAGAATA AGATTACGGT GATTTATAAC GGGCTTGATT TGGACAGATT CAGGCCTTGC
CTGCCTCCCG GTACCTACAG GCACTGGCTG GGCTATGAAG AGGGTGTTCC GCTGGTAGCC
ATAGTAGCCC GCCTGCATTC GGTCAAAGGA CACAGCTTTT TTTTGCAAGC AGCTGCCGAA
GTGTTGAAAG TCATACCCAG AGTCAGGTTT CTGGTAGTCG GTACCGGGCC TGATGAAGCT
GTATTAAAGG AAATGACTGC TAAGCTTGGC CTGCAGGAGG TTGTTAATTT TACCGGTTTT
ATCACGGAGA TACCTGATTT AATGGCTGAC ATGGATGTGC TGGTGATTCC GTCTCTATGG
GAAGGTTTTG GGCTTACGGC CATTGAAGCT ATGACAGTCG GTTTGCCGGT AGTTGCTACC
GAGGTGGGTG GATTGCCGGA AGTAGTGAGA CCGGGAGAAA CAGGCATCCT GGTTCCTTCG
TCAGATGTAC CGTCTTTAGC CAAGGGGATT ATCTGGGTGC TGCAGCACCC CAAAGAAGCA
TCCCAGATGG CCGAAAACGG CAGGCAAATT GTTAGTCAGC AATTCAGTTC CAAAGGGATG
GCCAGAAAAA CAGAGTTAAC CTACCAAAAA GTAATGAGGT GTGATCCTTG CTGA
 
Protein sequence
MNQLRVLHII GGGEFGGAER HILNLCASLD SRKVAITVVC LFEEPFARMA KEIGVNVLVM 
PMRHKLDIGT LSRLTEVIKK NRPDLVHTHG VRANLLGRLA AKMAGVNRIV TTVHSLLVRD
YPDFWSRLAN SWTERLTRGL TDHFIAVSQG LKNALIADGI PENKITVIYN GLDLDRFRPC
LPPGTYRHWL GYEEGVPLVA IVARLHSVKG HSFFLQAAAE VLKVIPRVRF LVVGTGPDEA
VLKEMTAKLG LQEVVNFTGF ITEIPDLMAD MDVLVIPSLW EGFGLTAIEA MTVGLPVVAT
EVGGLPEVVR PGETGILVPS SDVPSLAKGI IWVLQHPKEA SQMAENGRQI VSQQFSSKGM
ARKTELTYQK VMRCDPC