Gene Information |
Locus tag | Dtox_3994 |
Symbol | |
ID | 8431009 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | - |
Start bp | 4179783 |
End bp | 4181621 |
Gene Length | 1839 bp |
Protein Length | 612 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 645036211 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_003193309 |
Protein GI | 258517087 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.460388 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGGGTA AGGCAAGATT ACTGGTTCTG CTGGGCTGTG ATTTTCTGTT GGTGATTATG TCTTTGCTCG TTTCACTGCT TATCCGTTTC CCCAGTTGGC CTGAATTAAG CAATGCTCTA GTTAACTACA TTAATTTTGC CCCGGTTTGT GCGCTTGTTA TGCTTGTCTT TTTTTATTTT TTTGGCCTTT ATCAAAGAGT ATGGGCCTAT GCCAGCATAG GTGAGTTAGT GACCATAGTA AAAGCGGTTA CAACAGGAAA ACTTGTTGTC ATTGCTTTGA CCTATTTTAT TTTTACCCCG TTACCCAGAA GTGTTGTATT AATGTCCTGG GCTTTTAGCA TTATTCTTAT TGGCGGCTCC AGGTTGTACT GGAGAATATA TATGCAGAAG AAGAAATTTA TCGTGGCCGG CTGCCCGCTG GATAAAAGGA AAACGCTTAT AGTTGGTGCC GGTGATGCCG GGGTGCTGGT GGTTCGCGAA CTAATGAACC ATAACAGTGA GTATTTGCCG GTAGGATTTA TTGATGATGA TGCAAGTAAG CAGGGTATGG TTATCCTGGG CATACCGGTA TTAGGGAAAC GTGAAGAATT GCCGTCGATT ATCGAAAAAT GCAGAATAAA AGAGGTAATT ATTGCTATGC CGTCAGTGTC TGGACAAGTG ATAAAGGAAA CTGTCGAGAA GTGTCACAAT TCACGGGTAA AGTTAAAGAT ATTGCCGGGT GTTTACCAGT TAATCAGCGG GCAAGTGACC GTAAATCATA TTCGCGATAT TCAGGTGGAA GATTTATTGG GCAGAGAGCC TGTTGAGGTA GATTTAAGCG AGATTGCCGG TTATTTAACG GACAGGGTGG TTTTGGTTTC CGGTGCCGGC GGTTCGATTG GTTCAGAGCT ATGCAGGCAG GTGGTCAGGT TGAAGCCTAA GCTTCTGGTT GTTTTAGGGC ATGGTGAGAA CAGTATACAT AATATAGTTT TTGAACTTCG TGAAATGCAT GGCAGCGATC TGCCTATTGA GATAGTGATT GCTGATATAA GGGACAGGCA GAAGATTAAT TTGATTTTTA AGAAATATAG GCCGTCAGTG GTTTTTCATG CTGCTGCGCA TAAGCATGTA CCTCTGATGG AACTGCACCC TGATGAGTCA GTGAAGACGA ATGTTTTGGG TACAAAGAAT TTAGCAGAAG CAGCCGACAG GGTTGGAACA GATGTTTTTA TTATGCTTTC AACAGATAAA GCAGTTAATC CTTCCAGTGT TATGGGGGCT ACCAAGCGTT TGGCTGAGCT GATATTGCAG CAGATGAACA GTATAAGTGA TACTGTTTAT GCGGCTGTTC GGTTTGGCAA TGTCTTGGGG AGCAATGGCA GTGTAGTGCC TATCTTTAAG CGGCAGATTG CTCAGGGAGG GCCGGTTACT GTTACTCACC CGGAAATGAA GAGGTACTTT ATGACTATAC CCGAGGCTGT GCAGTTGGTG ATTCAGGCAG GGGCTATGGC TCAGGGTGGG GAGATATTTG TGCTGGACAT GGGTGAGCCG GTGAAGATTG TGGATTTAGC TAAATGTATT ATTGATTTGT CGGGCGTGGA TTGTGAAATT AAATTTACCG GGATTAGGCC GGGGGAGAAG CTGTTTGAGG AATTGCTGAC GGCAGAAGAG GGTTCTTCTG CTACCCGGCA TAGGAGGATA TTTGTGGCTA ATGCGGGGAG TGTGGATTTG GAGACATTGG AGTTGGAAGT TTTTCGTTTG AGAGAGTTGG GAGAGGATGT TGTGACCGGG GATGTTTTTA AAGCATTGAC GGTTCTCCTG 
CCAAATATAA AGATATATCG AAAAGATATG GTTGGCTAG
|
Protein sequence | MRGKARLLVL LGCDFLLVIM SLLVSLLIRF PSWPELSNAL VNYINFAPVC ALVMLVFFYF FGLYQRVWAY ASIGELVTIV KAVTTGKLVV IALTYFIFTP LPRSVVLMSW AFSIILIGGS RLYWRIYMQK KKFIVAGCPL DKRKTLIVGA GDAGVLVVRE LMNHNSEYLP VGFIDDDASK QGMVILGIPV LGKREELPSI IEKCRIKEVI IAMPSVSGQV IKETVEKCHN SRVKLKILPG VYQLISGQVT VNHIRDIQVE DLLGREPVEV DLSEIAGYLT DRVVLVSGAG GSIGSELCRQ VVRLKPKLLV VLGHGENSIH NIVFELREMH GSDLPIEIVI ADIRDRQKIN LIFKKYRPSV VFHAAAHKHV PLMELHPDES VKTNVLGTKN LAEAADRVGT DVFIMLSTDK AVNPSSVMGA TKRLAELILQ QMNSISDTVY AAVRFGNVLG SNGSVVPIFK RQIAQGGPVT VTHPEMKRYF MTIPEAVQLV IQAGAMAQGG EIFVLDMGEP VKIVDLAKCI IDLSGVDCEI KFTGIRPGEK LFEELLTAEE GSSATRHRRI FVANAGSVDL ETLELEVFRL RELGEDVVTG DVFKALTVLL PNIKIYRKDM VG
|
| |
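The length fields in the record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene sequence ends with a stop codon that is not counted in the protein length. Neither convention is stated in the record itself. The function names `gene_length` and `protein_length` are hypothetical and introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(4179783, 4181621)   # Start bp, End bp from the record
    print(length, protein_length(length))    # compare with Gene Length and Protein Length
```

Under these assumptions the computed values agree with the listed Gene Length (1839 bp) and Protein Length (612 aa). The strand field does not affect the length arithmetic; it only determines which strand of the replicon is read.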