Gene Information |
Locus tag | Dtox_4117 |
Symbol | |
ID | 8431131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | - |
Start bp | 4286470 |
End bp | 4287657 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 645036312 |
Product | glycosyl transferase group 1 |
Protein accession | YP_003193410 |
Protein GI | 258517188 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
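The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene span includes the stop codon; neither convention is stated in the record, but both are consistent with its values.

```python
# Values copied from the Gene Information fields above.
start_bp = 4286470
end_bp = 4287657

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
assert gene_length % 3 == 0

# Assumption: the final codon is the stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1188 395
```

Both results match the reported Gene Length (1188 bp) and Protein Length (395 aa).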
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0723713 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.0000000981139 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | GTGATGAAAC AGTTCCGGGC TGTGGTTTTG GTGGGAATGT ACGAGTGGCA TGACCGGGAG ATCAATGATA CGGTTAGGTG TCTGGCGCGT GCTTTCGAGG GTTTTGACCG CTACTATATC GACCCGCCCA AAGGTCTGAG AGCCTTGAAA GGCAATATGT CCTATGTTTT AAAATCCCCG CGCTGGCAAT GGTCACAGGA CGGTGAGGTG GCCGTGGGTG TCCCGCCTCT GGGTTTTCTG CCGGTAAAGC TCGGTCTGAG GGAGCGCGCC AACCGCTGCG CGGCCTGCGG CCTAATTCGC AGGCTGAAAA GAAATTATGG CGCTGATTGG CGCGAGCATA CTCTGTTCTA TGTTTCTTCC GGCAGCTATA CGATAACCGG TTTTATTGAT ATGCTGGCTC CCAAGCACAT GGTCTTTCAC CTGCTGGATG ATAACTTTGC CTTTCCCATT ATTAAGAATG ACCGCAGGGT TTGGGAAGAA AATAAAGCAT TCATGGATTT TATGCTGCTG CACAGTTCAC TGGTGCTGGC GGTTTCACAG GAATTGGTGG GCAAATACAG TGAAATGTAT AACAGAAAAA TATATCTGCT GGGCAACGGG GTTGATGTGG AGCACTTCAG TCCGGAAAAC AAGGCCTGGC CGGAGGCGCC GGAACTGGCG GGAATTTCAG AACCCGTGCT CCTGTTTATC GGAGCGGTTA ATTCCTGGAT TGATATAGGG CTGCTTAAGG AGTTGGCCGA AAAGAGACCT GCTTACAAGT TGGTGATAAT CGGCCCCTGT TACGAAAGCA GCATTGATTT GGCTGTCTGG AACAGCCTGA AGGAAATGAG CGGCGTGCTG TGGCTGGGCA GCAGGCCTTT TGCCGAGCTG CCTCACTATA TTCAACATGC TTCGGTACTC CTGCTGCCCA GAACCAGGGA TGAACACTCG TTGGCCTCCG ACCCGCTGAA ACTTTACGAG TACCTGGCTA CCGGTAAGCC TGTGGTGGCG GTAGGGATTC CGGCGGTACA AAAATTTGCC GCTTTTGTTT ATGCGGCTGC CGGCAGAGAA GATTTTATTA AGCTTACAGA TCGGGCATTG TCCGAGTGGA ATGAGGAAAA ACAGTCATTA CAGCTGGCTG CGGCTGAGGA ATATTCCTGG TCTTCCAGAA TTGGCACAAT TCTAAAATTG CTGGCCGAGA CCGGATAA
Protein sequence | MMKQFRAVVL VGMYEWHDRE INDTVRCLAR AFEGFDRYYI DPPKGLRALK GNMSYVLKSP RWQWSQDGEV AVGVPPLGFL PVKLGLRERA NRCAACGLIR RLKRNYGADW REHTLFYVSS GSYTITGFID MLAPKHMVFH LLDDNFAFPI IKNDRRVWEE NKAFMDFMLL HSSLVLAVSQ ELVGKYSEMY NRKIYLLGNG VDVEHFSPEN KAWPEAPELA GISEPVLLFI GAVNSWIDIG LLKELAEKRP AYKLVIIGPC YESSIDLAVW NSLKEMSGVL WLGSRPFAEL PHYIQHASVL LLPRTRDEHS LASDPLKLYE YLATGKPVVA VGIPAVQKFA AFVYAAAGRE DFIKLTDRAL SEWNEEKQSL QLAAAEEYSW SSRIGTILKL LAETG
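The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted alternative start codons and is read as methionine in the initiator position. The following is a minimal sketch of such a translation, not the tool used to produce this record: the `translate` helper is hypothetical, and it assumes the sequence is already given in coding orientation, as the leading start codon and trailing TAA suggest.

```python
from itertools import product

# Amino acids of NCBI translation table 11, listed in the conventional
# TCAG codon order (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by table 11; at position 0 they encode Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string (hypothetical helper).

    Spaces are ignored so the 10-base blocks of the record can be
    pasted directly. Translation stops at the first stop codon.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("GTGATGAAAC AGTTCCGGGC TGTGGTTTTG"))  # MMKQFRAVVL
```

The output matches the first block of the protein sequence. Without the start-codon rule, the leading GTG would be read as valine (V).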