Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_4135 |
Symbol | |
ID | 8431149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | - |
Start bp | 4303993 |
End bp | 4307712 |
Gene Length | 3720 bp |
Protein Length | 1239 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 645036328 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003193426 |
Protein GI | 258517204 |
COG category | [S] Function unknown |
COG ID | [COG3551] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00000243324 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTCAAG AAGGTAAAGA CACGAAAAAT AAAAAATTAA TCGTAGTTTT AGGAATGCAC CGGAGTGGTA CCAGCGCTGT AACACGTGGT CTTCAGGTTC TCGGAGTGGA ACTCGGTAAC AGATTAATGC CGCCTCTGGA GGGCAATAAT GATAAAGGTT TTTTCGAAGA CATGGATTTA TACGCACTTA ATGTAGAAAT GCTACAATCA CTTGGAATAG ATTGGCATTA CCTTGCCCCT GTCGAAGCAA GTGATGTTGA CGTGCTTCGT AAAAAAGGAT ATTTTCTTCG TGCAACTGAA TTGCTTCGTC AAAAAGTTTG TGATGTCCCT GTCTTTGGCT TTAAAGATCC GCGTGTTGCG AAACTACTAC CATTTTGGAA AGAGGTATTT ATACACTGCC AATTTGAACT GGGCTATATC CTGGCTGTAC GACACCCTTT AAGCGTATTC AAGTCTCTGG AGAAACGTGA TGGTTTTACT TCTGAAAAAA GTTATATGCT ATGGTTGGAG CATATTCTTA CTAGTCTTTC AGGTATTGAG GGATATAGAT GTGTACCTGT TGATTACGAC AGACTGATGC ATGTACCTGA ACGGGAATTG CAACGAATTG CCATTGGTCT TGAATTAAAA ATTGATGCAG ATGAGTTGCT GCGATACAAA ACTGAATTTT TAGACGAAGG GTTACGACAT ACGGTTTACG ATTTGAAGGA TCTGTTGCTG GATGATACGA TTCCCCCTAT CGCATATGAA ATCTATACTG CTCTAAACGA TGTTTGTGCG GGTAAGAGAC GGATAGACGA TATAGAGCTA AAGAACCAGA TTTCGCTTTG GAGTAGCGAA TTCTACCGTA TTAAATCATT TTTGCGTCTG GCCGATAAGT TTTCCGAACA GATAGTAACT CTTAACCAGA CAGTAAATGA GCGGGATTTT CAGATCAACG AATTGAGCCA AGCCATAGCT GATCGGGATA TCCTGTTAGC TAACCTCAAT CAGGTTGTAA GCGAGCGAGA CGGACAGATA TTAAAACTCA ATCAAGATAT AGCTGGAAGA GACGAGCAAA TAACAGTTTA TAACCAGGTT ATATCTGAGC AGGATAATCA GATATACAGT CTCAATCAGC TTATAACTGA GAGGAATGGT CAGATAGGCA CCCTCAATCA AGTTCTAGCT GAGCGGGACG TTAAGATAAG TACCCTCAAT CAAGTTGTAG CTAAGCAGGA TGTTCAGATA ATCAGCCTAA CCCAGTCTGT GGGCGAGCGT GATGAGCAGA TTGTTGGTCT TAACCAAGCT ATGGCCGAGT GCGAAGAACA GTTTGCCGAG GTTTTTTCTT CACGTTCATG GAGGGTTACC AAGCCTATGC GGTTTTTGGG CAGGGTGATG CGGGGTGAAA GAGAAAATGT TCTGGCAGGT TTGCGATCTT ATATTCATTT TGATGGGCGT ACAAACCAAA AGAATAACCA AGCATCTAGG AGTCGTATAG CACGGAAATT GGTGTCTCAT TATTATTCTA GTGTTCTATC TCGACGATGT TTTGCCGCAA TAAAGGCTGT ATCAGAAAAA CTTGTTCAAC CATGGTTTCC GGTACTTCAT AATAATTTAT TCGTGGATTT TAAACAATAT TTGCAAAAGC AGTGGTTAGG TGGTAGCGGT GATGTTTCCA ATGATGCTTC GACAGAGTTA TACTCTTTTC CGAAGATAGT TAAGAATGAA AAATACTACC CGAAAGTGTC GGTTATAGTT CCTAATTATA ACCATGCTCG ATACTTGAGG CAGCGGCTTG AGTCTATTTA TCGTCAAACA TACCAGAACT ATGAAGTAAT TCTGCTTGAT GATGCTTCAA GTGATGAAAG TATTCAGATA CTAAAGGAAT TTCAATTCAA TTACCCAAGC AAAACAAGAT GCTGCTTTAA CAGCGATAAC TCTGGCGGTG TTTTCTATCA ATGGAAACGA GGCTTTAATT TAGCAACAGG GGAATTAATC TGGATTGCAG AGAGTGACGA TTACTGTTCG GAAAATTTGT TAGAAGAGTT AGTTAGGTAC TTTAAGAATG AAGCTGTAAT GTTAGCGTAT TGCAGAACAG TTTTTGTTGA TGGAGATTCA TCCCAGCAGA TCTGGAGTTT AGAGGAGTAC CTATCTGAAT TAAATTCTGC ATTATGGCTT AAGCCATTTA TTAAATCCGC TCATCAATTG GTAAATACTT CATGGGGGAT AAAAAACATA GTGCCCAACG TAAGTAGTGC GATATTTCGC AATCCTGGAA AATTAGAGCT ATTAGAAGAC GATGGATGGA AGCAAATGAG AATATGTGGC GATTGGGTCT TCTATTTACA TGTTATTCGT GGAGGATTGG TAGCCTATTC ACCGAATACA ACTAATTATT ATCGAATTCA CAACAAAAAT ACCTCAGTTG GCACATACAA GCAGGATATT TATTATCGCG AACATGAAAT GGTTGCCAAA GAGTTAATTA AGCTCTATCG CTTAGAAGAG AGGGTTCTGG AAAAACAGCA GCGTGTTCTT GAAGCCCATT GGTGTTCCTG CCGCGAGAAT TTTGATGAGG GTTCTTTCAG GAAATGCTAT GATCGGAAAC GAATTAGTCT ACTAGCCAAA AACCGTAAAG CAAATTTACT TATGATTACA TATGCACTGG CGACAGGTGG TGGTGAAACT TTCCCAATTA GACTGGCCAA TTTATTAAAG TCGGTAGGAT ACTCTGTTAC TTTATTGAAT TGCCATCAAG AACCTACGGA GATCGGTGTC AGGAATATGT TGCGTGGAGA TATACCATTG TTAGAGTTGG ACTATCTGCA TAAACTAAAT GCGGTTGTTG GTGATATGGG CATTGAAATT GTGCATTCTC ACCATGGTTG GGTTGACGGG ACCATATGTG CGTTATTAGA AGATAATCCG AATTGTAAGA TTATTGTTAC TTCACATGGA ATGTATGAGA TGATGCCGCC AGTTGATTTA GATAGGATTA TTCCCTTGCT TGATAAAAAA GTTGACAAGA TTGTTTATAT CGCAGATAAG AATCTTGAAC CATTTGAATC CAGCGTGATA GACAAGAATC GTTTTGTTAA AATAGGCAAT GCACTTGAAA AGTTGGAATA TGAACCTGTT TCGAGAGAAG AGCTAGGAAT AAGTAAAGAT GCTTTTGTTC TCTGTTTGGT TAGTCGAGCT ATCCCGGATA AGGGCTGGCA AGAAGGCATA GAGGCAATTA AATTAGCACG CCAACTTAGT GGAAAAGAAA TTCACTTGCT GTTAGTTGGG GATGGCCCTG AATATGAAAG ATTGAAAGGA AATGTTAGAT ATAGTTATGT CCACCTCTTG GGTTTTAGAT CTAATGTTCG TGATTTTTTT GCAGCTTCTG ATTTGGGCTT TCTTCCATCT AAGTTTCGTG GTGAGAGCTT TCCGCTAGTA ATTATTGAAT GCCTTCAGTC TAATCGTCCA ATGCTGGCCA GTGATCTTGG AGAGGTTTCT AAAATGCTTG AAAGTGAATC AGGATTGGCA GGAAGCGTGT TCCCATTAAA CAATTGGAGA ATTCCTGTTG GATGTGTTGC GGAGATTATA TCGGAGTATG CAAAGAACAG AGATTTATAT TTAGAACATC TCAAGAGAGT TCCTGATACA GCAAAAAAAT TTGATCCGAA AGCATTGCTC CATAGTTATG AAGAGGTTTA TCGAGAAGTT CTTTCTAATC AGATGTATAA AGTAAACTAA
|
Protein sequence | MSQEGKDTKN KKLIVVLGMH RSGTSAVTRG LQVLGVELGN RLMPPLEGNN DKGFFEDMDL YALNVEMLQS LGIDWHYLAP VEASDVDVLR KKGYFLRATE LLRQKVCDVP VFGFKDPRVA KLLPFWKEVF IHCQFELGYI LAVRHPLSVF KSLEKRDGFT SEKSYMLWLE HILTSLSGIE GYRCVPVDYD RLMHVPEREL QRIAIGLELK IDADELLRYK TEFLDEGLRH TVYDLKDLLL DDTIPPIAYE IYTALNDVCA GKRRIDDIEL KNQISLWSSE FYRIKSFLRL ADKFSEQIVT LNQTVNERDF QINELSQAIA DRDILLANLN QVVSERDGQI LKLNQDIAGR DEQITVYNQV ISEQDNQIYS LNQLITERNG QIGTLNQVLA ERDVKISTLN QVVAKQDVQI ISLTQSVGER DEQIVGLNQA MAECEEQFAE VFSSRSWRVT KPMRFLGRVM RGERENVLAG LRSYIHFDGR TNQKNNQASR SRIARKLVSH YYSSVLSRRC FAAIKAVSEK LVQPWFPVLH NNLFVDFKQY LQKQWLGGSG DVSNDASTEL YSFPKIVKNE KYYPKVSVIV PNYNHARYLR QRLESIYRQT YQNYEVILLD DASSDESIQI LKEFQFNYPS KTRCCFNSDN SGGVFYQWKR GFNLATGELI WIAESDDYCS ENLLEELVRY FKNEAVMLAY CRTVFVDGDS SQQIWSLEEY LSELNSALWL KPFIKSAHQL VNTSWGIKNI VPNVSSAIFR NPGKLELLED DGWKQMRICG DWVFYLHVIR GGLVAYSPNT TNYYRIHNKN TSVGTYKQDI YYREHEMVAK ELIKLYRLEE RVLEKQQRVL EAHWCSCREN FDEGSFRKCY DRKRISLLAK NRKANLLMIT YALATGGGET FPIRLANLLK SVGYSVTLLN CHQEPTEIGV RNMLRGDIPL LELDYLHKLN AVVGDMGIEI VHSHHGWVDG TICALLEDNP NCKIIVTSHG MYEMMPPVDL DRIIPLLDKK VDKIVYIADK NLEPFESSVI DKNRFVKIGN ALEKLEYEPV SREELGISKD AFVLCLVSRA IPDKGWQEGI EAIKLARQLS GKEIHLLLVG DGPEYERLKG NVRYSYVHLL GFRSNVRDFF AASDLGFLPS KFRGESFPLV IIECLQSNRP MLASDLGEVS KMLESESGLA GSVFPLNNWR IPVGCVAEII SEYAKNRDLY LEHLKRVPDT AKKFDPKALL HSYEEVYREV LSNQMYKVN
|
| |