Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_2055 |
Symbol | |
ID | 8429037 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 2233983 |
End bp | 2235788 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 645034376 |
Product | sulfatase |
Protein accession | YP_003191507 |
Protein GI | 258515285 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACTAATA AATTATTGAA TCAACATATA CATGAAATGC CTCTTTGCCA TAAACCAAAT TTTCTTGTTA TTTTAGTAGA TCAACAGCGT TATGCTGTCA GCTATGAGAA CGAAGAAATA AAGGTATGGA GAAAGACTCG TTTAAAAGCC CAAGAATTTT TAAAAAGCCG GGGATTTGAA TTCAAGAATC ATTATGCCGG AAGTGCCGCC TGTTGTCCCA GTCGTGCCAC CCTTTATACC GGGCAATACC CCTCTCTTCA TGGTGTCAGC CAAACAGACG GAGCTGCTAA GGGAGCATAT GATCCAGACA TGTTCTGGCT TAATCCCAAT ACAGTACCAA CTATGGGAGA TTATTTTAGG ACAGCAGGTT ACCAAACCTA CTGGAAGGGG AAATGGCATG CCTCTGCCGC AGATATTCTA GTTCCCGGCA CCCATAAGCC ATTTCTCTCT TATAATCAGG GAAATGGTGT ACCTATTCCG GATAATGAAA AGCTATATAT TAATGCAAAT GTACTTGCAA GTTTTGGATT TAACGGTTGG ATAGGGCCCG AGCCCCATGG TGTTAACCCG AGGAATACGG GTTCTTCTGC GGCTGCCGGA TTAAGCGGAA GAGATGTGGT GTATAGTCAG GATACAGTAG AACTGATTAG AGTATTAGAA AAAGAATACA ACGAATCAGA CGAGTGCCGG CCAAGGCCAT GGCTTATAAT GTGCTCCTTT GTAAATCCCC ATGATATAGC CTTGTTTGGG GCTATTAGCG GGTCATTGCC GCAATTTAAT TTTAAAGTTA ATCTCTCTGT TCCCTACATT TCACCCGCAC CTACGGCATC AGAGTCCTTA TTAACTAAAC CAAGTGCCCA ATCCAGCTAT AGAAGAATTT ATGCTTATGC CTTTCAGCCG CTGCTGGATA CTTTGTTCTA TCGCCAGCTC TATTATAGTC TGGAAATGGA AGCAGATACC CAAATATGCA GGGTTATTAA TGCCCTCAGG GAAACATCCT TTTATAATAA TACTATTATA ATCTTCACTT CCGATCATGG GGAGCTTCTT GGAGCCCATG GTCTATTTCA AAAATGGTAT CAGGCTTATG AAGAATCAAT CCATGTACCA TTAATAATTC ATAACCCAAC TCTTTTTGAC AAACCTGAAT CTACAGATAT GCTGACCAGT CATGTAGATA TTCTGCCCAC AATGTTAGGA ATAAGCGGGC TGGATACAGG GGCAATTCAT AAAGTTTTAG CTAACAGCCA TACGGAGGTT CATTCTCTTG TTGGCAGAAA TCTTTCGCCA TTGCTGAAGA GTAAGACAGA TTTTATAGAA GCCGGTGAGG CAATTTACTT TATGACCGAT GATAACATTA CTAAAGGTCT TAACCAGATA AGTTTTGCGG GTGTACCTTA TCATTCTGTA GCTCAGCCTA ATTCTATAGA GACAGTAATT GCGGCTCTTC CTACCGGCAG GGGCGGAACA AAGCAAATAT GGAAGTACTC ACGTTACTTT GATAACCCTC ATTTTTGGAA CATTTCCGGC AGAAGAGATC AGTTTGTTTA TAATGGACCA GTGAGAAGAA AATTCAATCC ATGCAATTAT AATGATACTC CCATTAGACC TCAAGCTGAC CAATATGAAA TATACAATAT TACAACCGAT CCTTTGGAAA TACGAAATGT TTCTTATGAG TCATATAACA ACCGATATTT TATGCAAATC AGGGAAATAC TCAATGAACT TTTAGAAGAA CAACGTAAAA AGAAAAGGTT GTACCCTGTT AGCGGAAATG TGCATGGTAA AACCCCCAAT TACTAA
|
Protein sequence | MTNKLLNQHI HEMPLCHKPN FLVILVDQQR YAVSYENEEI KVWRKTRLKA QEFLKSRGFE FKNHYAGSAA CCPSRATLYT GQYPSLHGVS QTDGAAKGAY DPDMFWLNPN TVPTMGDYFR TAGYQTYWKG KWHASAADIL VPGTHKPFLS YNQGNGVPIP DNEKLYINAN VLASFGFNGW IGPEPHGVNP RNTGSSAAAG LSGRDVVYSQ DTVELIRVLE KEYNESDECR PRPWLIMCSF VNPHDIALFG AISGSLPQFN FKVNLSVPYI SPAPTASESL LTKPSAQSSY RRIYAYAFQP LLDTLFYRQL YYSLEMEADT QICRVINALR ETSFYNNTII IFTSDHGELL GAHGLFQKWY QAYEESIHVP LIIHNPTLFD KPESTDMLTS HVDILPTMLG ISGLDTGAIH KVLANSHTEV HSLVGRNLSP LLKSKTDFIE AGEAIYFMTD DNITKGLNQI SFAGVPYHSV AQPNSIETVI AALPTGRGGT KQIWKYSRYF DNPHFWNISG RRDQFVYNGP VRRKFNPCNY NDTPIRPQAD QYEIYNITTD PLEIRNVSYE SYNNRYFMQI REILNELLEE QRKKKRLYPV SGNVHGKTPN Y
|
| |