Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_0992 |
Symbol | |
ID | 8427931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 1016870 |
End bp | 1018150 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 645033330 |
Product | glycoside hydrolase family 18 |
Protein accession | YP_003190504 |
Protein GI | 258514282 |
COG category | [R] General function prediction only |
COG ID | [COG3858] Predicted glycosyl hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00884991 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00000357477 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGGAAAC TGGGAATCAA GTCAGTTACC TGTCTGGTTG TTTTTTTTCT TTTTGCCGTA TGGTTTCCCA CAGTCGCACT GGCGGGAACA AGTGTTACAC CGGGTAATTT AAGACTCGGT GACAGAGGGC CTGATGTTAC TTTGCTGCAG ACAAAACTTA AAGTTGCGGG CTTTTATCAA GGAGAAAAGG TTTCCGGTTA TTTTGGCTTG AATACACTTT TCGCAGTCTC AAAATTTGAG AAAGCTAACC GGCTGCGTGT TGACGGTATA GTTGATGCTG AGGAATGGAT TGCACTGCAA AAACTTACCG CTATTCCGGC AGACAAGCTA AAGAAGATGG TATTAGGTTA TTATACAGTG GATTATACAG GAGATAAGTT ATCCTATAAT TCCCTGGATA AGTACAGCAG TTATATAGAT ACTGTAGCTA CTTTCAGTTT CAAAGTTAAC CGCGATGGCA GTTTAACCGG TGAAGTGCCG CAGGATGCTT TAAAACTGGC CAAGGAGAGA AGCGTGGAAA CATTGCTGTT GGTTCATAAT ATTGGCCAGC CAATTGACAG TGATGCTGCT CACTACGCTC TGTCAGTTGC CGAAAACCGC AGCAGGCTGG AAGCAAACAT CATGTCAAAA GTGAAGGCCA ATGGTTATAA CGGTGTTAAT ATTGACATTG AAGCTTTACC GCCGGGAGAC AGGCAGTATT ATAATATTTT CCTAAAGGAA TTAGGCGACC AATTGCACAA GGAAAATTTG CTGCTTACCG TTTCTATCCC GGCTAAAACT TTTGACTCTA CCAATGATAG CTGGTCCGGT GCTTATAGTT ATAAGGATAT TGGCCAACTG GTTGATCAGG CTATGATTAT GACCTATGAT GAGCACTGGT TTGGCGGTTC TCCCGGTCCG ATTGCCTCGG TGCCCTGGAT TAACAAGGTT ATGGACTATG CAGTCGAGGT GATGCCCAGA GAAAAAATTT TTCTTGGCGT GGCTGCTTAT GGTTATGATT GGTCCAGTCA GGGAACCAGA GCAGTGCGCT GGAATCAAGT CAATGATTTA GTTAAGAATT CCGGTAATGT TATATGGGAC AATACAAACA GTGTACCCTG TGTCATTTAT TATAAAAATG GTGTCAGGCA TGAGTTGTGG TTTGAAAATA ACTACAGTTT GCGTTTTAAA TTGGAAACGG TTAAGAGTTA TAACGTTTCA GGTATAGCCA TCTGGCGTCT TGGTTTTGAG GATGACTCCT TTTGGAAAAT GGTTAATGAT GAATTTAGAC AGGCTGACTA A
|
Protein sequence | MGKLGIKSVT CLVVFFLFAV WFPTVALAGT SVTPGNLRLG DRGPDVTLLQ TKLKVAGFYQ GEKVSGYFGL NTLFAVSKFE KANRLRVDGI VDAEEWIALQ KLTAIPADKL KKMVLGYYTV DYTGDKLSYN SLDKYSSYID TVATFSFKVN RDGSLTGEVP QDALKLAKER SVETLLLVHN IGQPIDSDAA HYALSVAENR SRLEANIMSK VKANGYNGVN IDIEALPPGD RQYYNIFLKE LGDQLHKENL LLTVSIPAKT FDSTNDSWSG AYSYKDIGQL VDQAMIMTYD EHWFGGSPGP IASVPWINKV MDYAVEVMPR EKIFLGVAAY GYDWSSQGTR AVRWNQVNDL VKNSGNVIWD NTNSVPCVIY YKNGVRHELW FENNYSLRFK LETVKSYNVS GIAIWRLGFE DDSFWKMVND EFRQAD
|
| |