Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A1392 |
Symbol | |
ID | 6871181 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 1364141 |
End bp | 1366108 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642784558 |
Product | alkyl/aryl-sulfatase BDS1 |
Protein accession | YP_002215228 |
Protein GI | 198243730 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2015] Alkyl sulfatase and related hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0000000000000420171 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGATCGTTA AAAGTTTTGC GCTGGCGGGG CTACTCTCTT CCACTGCGCT GACACCTTTA TTTGCACAGG AAGCCCCAAA AGGTGCCACT GCTTCAACCA AGCAAGCTAA CGATGCGCTT TATAACCAAC TTCCTTTCTC TGATAACACC GATTTCACGA ATGCCCATAA AGGCTTTATC GCTGGTTTAC CTGAAGAGGT GATTAAGGGA GAGCAAGGGA ATGTCATCTG GAATCCACAG CAGTACGCTT TCATAAAAGA AGGGGAAAAA TCTCCTGACA CTGTTAACCC TAGTCTGTGG CGTCAGTCCC AGCTAATCAA TATCAGTGGC TTGTTTGAAG TCACAGACGG CGTCTACCAG ATTCGTAACC TTGATTTATC CAACATGACG ATTATCGAAG GTAAAGAGGG GATTACGGTT GTCGATCCGC TGGTTTCTGC GGAAACAGCC AAAGCCGGTA TGGATTTGTA TTTCAAAAAC CGTGGCAATA AGCCTGTTGT CGCCATCATT TATACTCATA GCCATGTTGA CCACTATGGC GGTGTGCGTG GCGTTGTCGA TGAAGCGGAC GTGAAATCCG GCAAGGTGAA AGTGTATGCG CCTGCTGGCT TTATGGAGGC AGCAGTAGCC GAGAATATTA TGGCCGGCAA CGTGATGAGC CGCCGTGCCA GCTATATGTA TGGCAACCTC CTGAAACCAG ATGCCTCCGG CCAGGTTGGC GCCGGACTGG GGACGACCAC CTCTGCGGGG ACGGTGACAC TGATTGCGCC CACTAATATC ATCGATAAAG ACGGCCAGAA AGAAGTGATT GATGGCCTGA CTTACGACTT TATGCTGGCC CCTGGTTCGG AAGCCCCTTC GGAAATGCTG TGGTTCATCG AAGAGAAGAA ACTCATCGAA GCCGCAGAGG ACGTCACTCA CACCCTGCAT AACACTTACT CGCTACGTGG CGCAAAAATT CGTGAGCCGT TGCCGTGGTC GAAATATATC AACGAAGCTA TAGTGCGTTG GGGTGACAAA GCTGAAATTA TTATGGCCCA GCACCACTGG CCGACCTGGG GTAACGAGAA TGTTGTTGGT CTGCTGAAAA GCCAGCGAGA CCTGTATCGT TATATCAATG ACCAGACTCT GCGCATGGCC AATGAAGGTC TGACTCGCGA CGAAATAGCG GCCAACTTTA AACTACCGGA TAGCCTGGCA AAAACCTGGG CCAACCGCGG CTATTACGGC TCCATCAGCC ATGACGTAAA AGCAACGTAT GTGCTGTATC TCGGTTGGTT CGATGGCAAT CCGGCAACCC TTGATGAGCT GCCACCCGAA GAAGCGGCCA AGAAATTTGT TGAATACATG GGCGGTGCCG ATGCGATTCT TCAGAAAGCT AAAGCAGACT TTGACCAGGG GAACTACCGT TGGGTTGCTC AGGTGGTGAG TAAGGTCGTG TTTGCCGATC CAAATAACCA GAATGCACGT AACCTTGAAG CCGATGCGCT GGAGCAATTG GGGTATCAGG CTGAATCTGG TCCATGGCGT AACTTCTACC TGACCGGTGC GCAGGAGCTG CGTAACGGTG TGGTTAAAGG TCCGACGCCA AATACAGCAA GTCCGGATAC CGTTCGGGCG ATGACCCCTG AAATGTTCTT CGACTTTCTG GCTGTACATA TCAACGGTGA AAAAGCGGGT AATGCCCGGG CGGTATTTAA TATTGACCTT GGCAGCGACG GCGGAAAGTA CAAGCTTGAG CTGGAAAATG GCGTGCTGAA CCACACGGCT AATGCTGAAG CGAAAGATGC TGATGCCACG ATTACTCTGA ACCGTGACAC GCTGAATAAA ATTATCCTGA AGGAAGAAAC TCTGAAGCAG GCTCAAGATA AAGGAGAAGT CAACGTTACC GGTAATGCTG CGAAACTGGA TGAGATGCTG GGCTATATGG ACAAGTTTGA GTTCTGGTTC AATATAGTTA CACCATAA
|
Protein sequence | MIVKSFALAG LLSSTALTPL FAQEAPKGAT ASTKQANDAL YNQLPFSDNT DFTNAHKGFI AGLPEEVIKG EQGNVIWNPQ QYAFIKEGEK SPDTVNPSLW RQSQLINISG LFEVTDGVYQ IRNLDLSNMT IIEGKEGITV VDPLVSAETA KAGMDLYFKN RGNKPVVAII YTHSHVDHYG GVRGVVDEAD VKSGKVKVYA PAGFMEAAVA ENIMAGNVMS RRASYMYGNL LKPDASGQVG AGLGTTTSAG TVTLIAPTNI IDKDGQKEVI DGLTYDFMLA PGSEAPSEML WFIEEKKLIE AAEDVTHTLH NTYSLRGAKI REPLPWSKYI NEAIVRWGDK AEIIMAQHHW PTWGNENVVG LLKSQRDLYR YINDQTLRMA NEGLTRDEIA ANFKLPDSLA KTWANRGYYG SISHDVKATY VLYLGWFDGN PATLDELPPE EAAKKFVEYM GGADAILQKA KADFDQGNYR WVAQVVSKVV FADPNNQNAR NLEADALEQL GYQAESGPWR NFYLTGAQEL RNGVVKGPTP NTASPDTVRA MTPEMFFDFL AVHINGEKAG NARAVFNIDL GSDGGKYKLE LENGVLNHTA NAEAKDADAT ITLNRDTLNK IILKEETLKQ AQDKGEVNVT GNAAKLDEML GYMDKFEFWF NIVTP
|
| |