Gene SeD_A0041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0041 
Symbol 
ID6874230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp44310 
End bp46025 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content50% 
IMG OID642783299 
Productarylsulfotransferase 
Protein accessionYP_002213993 
Protein GI198244177 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.0089989 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATACGT TAACTACAAC GTCTGTTGTC CTTCCTGCGC CGCGTCCGGC GATTAATCAG 
GGTATCGATA TCAATAATGA AATGGTGCTT AACCATACCG CTATTTATGA AAATTGCCTT
GCGCAGGTCA CGCAAGAGAA TACGGTAGAA AATGCGCTCA TGTTGTTAGA CCCTTACGGC
ACGGCGCCTT TAAGCGCTTA TGCCGGGGTC TGGAGTCTGG AACCGGCTGA GATCATAGTC
ACGGTCCAGG ATGCGGCAAA AACGGCGATG CCGGTAGAAC ATCTTTACAC CCTTACGGCA
GGCGCAAATC TGTTGCCGGT TCTGGGGCTG GTAGCGGATA CTGAAAACCG TATTGTCTTT
TCTCAGGCAG ATACGCCGCT TGCCGTCTAT ACGCTCATCA CACAGCCATT ACCGCCGGTA
GATTCCGCGG AGGTCGTATT AGGTTTTCCG ATTATCAACG TGACGCAACC TGCTACCGAT
GCGGACAAGA TGGCGCCAGG GTTTTATTTT ATTACGCATT TCGATCGCTA TAATTACGCA
TTAGATCAGA ATGGTCTGGT GCGCTGGTAC GTTACTCAGG ATTATCCGTC TTATAATTTT
GTTCGAATTG ATAATGGCCA TTTCCTCACT ACTTCAGAAG CGAAAAATAC CTATCTGGAT
ATGTATGAGT TCGACATGAT GGGGCGTCTT CACACATTCT ATAATCTCGA TAATCAATTT
CACCATTCTA TCTGGCCGTG GGATAGCAAT ACCATTGTTG CGCCCTCTGA ATATACCTCG
GGTCGGCCCG ACGATTTGAA AACCAATGAA GACGGCGTAT CGGTTGTCGA TCTGACTACC
GGACTGGAGA CGGCTTACTA CGATATGGCG AAGGTGCTGG ATACGACGCG GGTTTCCCGT
CCTTCAGGTA CGGCGCCGGG AGAAGACCCG ACGGTTAAAG ACTGGCTGCA TATAAACCAG
AGCTACGTGA ATGAGACGAA TCAGTTGTTA ATTGCGTCCG GGCGTCATCA GAGCGCGGTG
TTTGGCGTCG ATCTGCAAAC GCAAGCGCTA CGCTTTATTT TGTCAACGCA TGAAGACTGG
GACGACGCTT ATCAGCCTTA TCTTTTAACC CCGGTCGACA GTGAAGGTGT GGCGCTTTAT
GACTTTAGCA AACAGGAGGA TATCGACGCG GCCGACCGTG ACTTTTGGAC TTGGGGCCAG
CATAACGTCG TTGAAATCGC CAATAATACG CCGGGTATAG TGGAGTTTAT GGTATTTGAT
AACGGTAACT ACCGTTCGCG TGATGACAGC AAAAGCCTGT TACCGCCGGA TAACTACAGC
CGCATTGTCC ATTTCGTGGT GAATATGAAT GAGATGACCG TTATGCGGCC ATTTGAATAC
GGCAAGGAGC TGGGCGCGCG TGGCTACAGT AGCTGCGTTA GCGCGAAAGC GATCCAGCAG
AATGGCAATA TTGTGGTGCA TTTTGCCGAC TGCACGTTTG ATGAAAATGG CCGCGCCATC
TCTTGCCAGC CTGGCGAGAG CGATATTATC GATCCGCAGG CGGGCAGCGA GGCGATGGGG
CTGCTAATTT TACAGGAGAT TGCGCCTACG GAGAAAACCG TGCTTTTTGA AGCGACCATG
ACGTCAGGTT ACTACAAAAA CGCGGAAACG AACGGGGAAG GCTATCGCTA CGATATTACC
AGTTTCCGGG TGTATAAAAT GGATCTGTAC GCGTAG
 
Protein sequence
MNTLTTTSVV LPAPRPAINQ GIDINNEMVL NHTAIYENCL AQVTQENTVE NALMLLDPYG 
TAPLSAYAGV WSLEPAEIIV TVQDAAKTAM PVEHLYTLTA GANLLPVLGL VADTENRIVF
SQADTPLAVY TLITQPLPPV DSAEVVLGFP IINVTQPATD ADKMAPGFYF ITHFDRYNYA
LDQNGLVRWY VTQDYPSYNF VRIDNGHFLT TSEAKNTYLD MYEFDMMGRL HTFYNLDNQF
HHSIWPWDSN TIVAPSEYTS GRPDDLKTNE DGVSVVDLTT GLETAYYDMA KVLDTTRVSR
PSGTAPGEDP TVKDWLHINQ SYVNETNQLL IASGRHQSAV FGVDLQTQAL RFILSTHEDW
DDAYQPYLLT PVDSEGVALY DFSKQEDIDA ADRDFWTWGQ HNVVEIANNT PGIVEFMVFD
NGNYRSRDDS KSLLPPDNYS RIVHFVVNMN EMTVMRPFEY GKELGARGYS SCVSAKAIQQ
NGNIVVHFAD CTFDENGRAI SCQPGESDII DPQAGSEAMG LLILQEIAPT EKTVLFEATM
TSGYYKNAET NGEGYRYDIT SFRVYKMDLY A