Gene SeD_A1392 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1392 
Symbol 
ID6871181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1364141 
End bp1366108 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content50% 
IMG OID642784558 
Productalkyl/aryl-sulfatase BDS1 
Protein accessionYP_002215228 
Protein GI198243730 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2015] Alkyl sulfatase and related hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0000000000000420171 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGATCGTTA AAAGTTTTGC GCTGGCGGGG CTACTCTCTT CCACTGCGCT GACACCTTTA 
TTTGCACAGG AAGCCCCAAA AGGTGCCACT GCTTCAACCA AGCAAGCTAA CGATGCGCTT
TATAACCAAC TTCCTTTCTC TGATAACACC GATTTCACGA ATGCCCATAA AGGCTTTATC
GCTGGTTTAC CTGAAGAGGT GATTAAGGGA GAGCAAGGGA ATGTCATCTG GAATCCACAG
CAGTACGCTT TCATAAAAGA AGGGGAAAAA TCTCCTGACA CTGTTAACCC TAGTCTGTGG
CGTCAGTCCC AGCTAATCAA TATCAGTGGC TTGTTTGAAG TCACAGACGG CGTCTACCAG
ATTCGTAACC TTGATTTATC CAACATGACG ATTATCGAAG GTAAAGAGGG GATTACGGTT
GTCGATCCGC TGGTTTCTGC GGAAACAGCC AAAGCCGGTA TGGATTTGTA TTTCAAAAAC
CGTGGCAATA AGCCTGTTGT CGCCATCATT TATACTCATA GCCATGTTGA CCACTATGGC
GGTGTGCGTG GCGTTGTCGA TGAAGCGGAC GTGAAATCCG GCAAGGTGAA AGTGTATGCG
CCTGCTGGCT TTATGGAGGC AGCAGTAGCC GAGAATATTA TGGCCGGCAA CGTGATGAGC
CGCCGTGCCA GCTATATGTA TGGCAACCTC CTGAAACCAG ATGCCTCCGG CCAGGTTGGC
GCCGGACTGG GGACGACCAC CTCTGCGGGG ACGGTGACAC TGATTGCGCC CACTAATATC
ATCGATAAAG ACGGCCAGAA AGAAGTGATT GATGGCCTGA CTTACGACTT TATGCTGGCC
CCTGGTTCGG AAGCCCCTTC GGAAATGCTG TGGTTCATCG AAGAGAAGAA ACTCATCGAA
GCCGCAGAGG ACGTCACTCA CACCCTGCAT AACACTTACT CGCTACGTGG CGCAAAAATT
CGTGAGCCGT TGCCGTGGTC GAAATATATC AACGAAGCTA TAGTGCGTTG GGGTGACAAA
GCTGAAATTA TTATGGCCCA GCACCACTGG CCGACCTGGG GTAACGAGAA TGTTGTTGGT
CTGCTGAAAA GCCAGCGAGA CCTGTATCGT TATATCAATG ACCAGACTCT GCGCATGGCC
AATGAAGGTC TGACTCGCGA CGAAATAGCG GCCAACTTTA AACTACCGGA TAGCCTGGCA
AAAACCTGGG CCAACCGCGG CTATTACGGC TCCATCAGCC ATGACGTAAA AGCAACGTAT
GTGCTGTATC TCGGTTGGTT CGATGGCAAT CCGGCAACCC TTGATGAGCT GCCACCCGAA
GAAGCGGCCA AGAAATTTGT TGAATACATG GGCGGTGCCG ATGCGATTCT TCAGAAAGCT
AAAGCAGACT TTGACCAGGG GAACTACCGT TGGGTTGCTC AGGTGGTGAG TAAGGTCGTG
TTTGCCGATC CAAATAACCA GAATGCACGT AACCTTGAAG CCGATGCGCT GGAGCAATTG
GGGTATCAGG CTGAATCTGG TCCATGGCGT AACTTCTACC TGACCGGTGC GCAGGAGCTG
CGTAACGGTG TGGTTAAAGG TCCGACGCCA AATACAGCAA GTCCGGATAC CGTTCGGGCG
ATGACCCCTG AAATGTTCTT CGACTTTCTG GCTGTACATA TCAACGGTGA AAAAGCGGGT
AATGCCCGGG CGGTATTTAA TATTGACCTT GGCAGCGACG GCGGAAAGTA CAAGCTTGAG
CTGGAAAATG GCGTGCTGAA CCACACGGCT AATGCTGAAG CGAAAGATGC TGATGCCACG
ATTACTCTGA ACCGTGACAC GCTGAATAAA ATTATCCTGA AGGAAGAAAC TCTGAAGCAG
GCTCAAGATA AAGGAGAAGT CAACGTTACC GGTAATGCTG CGAAACTGGA TGAGATGCTG
GGCTATATGG ACAAGTTTGA GTTCTGGTTC AATATAGTTA CACCATAA
 
Protein sequence
MIVKSFALAG LLSSTALTPL FAQEAPKGAT ASTKQANDAL YNQLPFSDNT DFTNAHKGFI 
AGLPEEVIKG EQGNVIWNPQ QYAFIKEGEK SPDTVNPSLW RQSQLINISG LFEVTDGVYQ
IRNLDLSNMT IIEGKEGITV VDPLVSAETA KAGMDLYFKN RGNKPVVAII YTHSHVDHYG
GVRGVVDEAD VKSGKVKVYA PAGFMEAAVA ENIMAGNVMS RRASYMYGNL LKPDASGQVG
AGLGTTTSAG TVTLIAPTNI IDKDGQKEVI DGLTYDFMLA PGSEAPSEML WFIEEKKLIE
AAEDVTHTLH NTYSLRGAKI REPLPWSKYI NEAIVRWGDK AEIIMAQHHW PTWGNENVVG
LLKSQRDLYR YINDQTLRMA NEGLTRDEIA ANFKLPDSLA KTWANRGYYG SISHDVKATY
VLYLGWFDGN PATLDELPPE EAAKKFVEYM GGADAILQKA KADFDQGNYR WVAQVVSKVV
FADPNNQNAR NLEADALEQL GYQAESGPWR NFYLTGAQEL RNGVVKGPTP NTASPDTVRA
MTPEMFFDFL AVHINGEKAG NARAVFNIDL GSDGGKYKLE LENGVLNHTA NAEAKDADAT
ITLNRDTLNK IILKEETLKQ AQDKGEVNVT GNAAKLDEML GYMDKFEFWF NIVTP