Gene Dtox_4144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_4144 
Symbol 
ID8431158 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4315263 
End bp4316783 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content43% 
IMG OID645036337 
Productexopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 
Protein accessionYP_003193435 
Protein GI258517213 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000251241 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGTCCGGCG ATCCGGCAGT AAAGGTACTT ATGGATATGA AAAACAGGAC TTGCTCAAGT 
TTGGATGAGA CTGTCATATT TTCGCGCAGT CTTCTGGCTG TACCACTTTT AATAATTTTG
GATATGCTGG CTATTGCTTT TAGCCTGGGA ATTGCCTATC TGATCAGGAC ATATGTTTTA
CCGGTTTTGC TGCCGGGATT TTTTCACTCG GGATTGCTGT CCAGTTCCCT GCAAAACTTA
TGGTGGCAGC CTTTTATTTT GGTTGCCTGC ATGATTTATG AAGAGCTGTA CCAGAAAAGA
TTGCCATACT GGAAAGAAGT AGAGAAAATA TTAAAGGCTT GTACACTTGC CGTAATATTC
TCCATAGCCT TGCTCTACCT GGCCAAGCTG AGCGGTGAAA TGTCCAGAAC TCTGGTGATG
ATGACCTGGT TTATGACGGC AGTGTCTATC CCGCTGCTTC GCTATACGGG AAAACTGCTG
CTGGTTAAGG CCGGTGTCTG GAGTAAACCT GTGCTGGTTA TCGGTGCGGG AAAAACCGCG
GAATTGATAG CAAGCGCTTT AAGCAGAGAA AAGACTATGG GCTATGAGAT TATAGGTTTG
TTGGATGACA ACCCTAATCT TACAGGTATT TATAATGCAA AATCTAAGAA GGTCATGCCC
GTACTGGGAA CATTTGTGCA AGCGGAGAAG ATTATTTCGC AAACCAGGGT ACAGGAAGTA
ATTGTCGCTG CACCCGGTAT GCCGTCTAAG AAATTGGTTG AGCTGACTAA TAGCCTGCAG
CCTTTGGTGA ACAATGTGAT GGTTGTACCG GACCTGTTCG GTTTATCTAT GAACGGCATA
GAAGTAGAGT ATTTTTTTGA GGAGCAGGCC CTGCTGCTAA ATGTAAAAAA CAGGCTTAAG
TCAACACTTA ACAGAAGCAT AAAGAGACTT TTTGACGTCT TGACCGGTTC AATACTTTTG
TTATTATGTA TTCCCTTACT CTGTATGATA GCCATTGCCA TAAAAATTGA TTCCAGAGGA
CCGGTATTCT TTGCTCACGA ACGTATGGGG CAGGGGAGTG AATGTTTTAC CTGTTATAAA
TTTCGAACTA TGTATTTAGA GGGTGAGAGC TTATTAAAGA AGCACCTGAG GAAGAACCCT
CAGGCAAGGC AGGAGTGGTT GAAATACAAT AAAATTAAAG ATAATGATCC TCGCGTTACC
AGAGTTGGCG CAGTCTTAAG AAGATTTAGT CTGGACGAAT TGCCTCAGCT AATAAATGTT
GTATTTGGCA ACATGAGCTT GGTGGGTCCC AGGCCATATT TAATGAGAGA GAAAAAGCAA
ATGGGTAACT GGGTTTTCGA TATACATGTG GCAAAACCCG GTATAACGGG GCTTTGGCAG
GTAAGCGGTC GCAATGAAAT TGAATTTGAG GGAAGACTTA AGCTTGATGT TTGGTATGTT
AGAAACTGGT CATTGTGGCT GGATATTGTT ATGCTTTTGA AAACGCTTAA AGTGGTTTTG
AAGCGCGATG GGGCTTATTA A
 
Protein sequence
MSGDPAVKVL MDMKNRTCSS LDETVIFSRS LLAVPLLIIL DMLAIAFSLG IAYLIRTYVL 
PVLLPGFFHS GLLSSSLQNL WWQPFILVAC MIYEELYQKR LPYWKEVEKI LKACTLAVIF
SIALLYLAKL SGEMSRTLVM MTWFMTAVSI PLLRYTGKLL LVKAGVWSKP VLVIGAGKTA
ELIASALSRE KTMGYEIIGL LDDNPNLTGI YNAKSKKVMP VLGTFVQAEK IISQTRVQEV
IVAAPGMPSK KLVELTNSLQ PLVNNVMVVP DLFGLSMNGI EVEYFFEEQA LLLNVKNRLK
STLNRSIKRL FDVLTGSILL LLCIPLLCMI AIAIKIDSRG PVFFAHERMG QGSECFTCYK
FRTMYLEGES LLKKHLRKNP QARQEWLKYN KIKDNDPRVT RVGAVLRRFS LDELPQLINV
VFGNMSLVGP RPYLMREKKQ MGNWVFDIHV AKPGITGLWQ VSGRNEIEFE GRLKLDVWYV
RNWSLWLDIV MLLKTLKVVL KRDGAY