Gene SeD_A4093 details

Gene Information

Locus tag: SeD_A4093
Symbol:
ID: 6875331
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3941253
End bp: 3942287
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 49%
IMG OID: 642787042
Product: putative glycosyl transferase
Protein accession: YP_002217669
Protein GI: 198244256
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
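
The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(3941253, 3942287)
print(length_bp)                  # 1035
print(protein_length(length_bp))  # 344
```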


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 79
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAATA GTAAAACCAA AGTGAGTATC ATTGTCCCGT TATATAATGC GGGAGCGGAT 
TTTAATGCTT GCATGGCGTC ATTAATCGCG CAAACGTGGT CGGCGCTGGA AATTATTATT
GTGAATGATG GATCGACGGA TCATTCCGTT GAGATAGCAA AATATTACGC GGAACATTAC
CCGCATGTTC GACTGCTTCA TCAGGCCAAT GCTGGCGCAT CTGTCGCCCG TAATCTTGGC
CTGCAAGCAG CGACCGGCGA TTATGTCGCC TTTGTCGATG CAGATGACCT GGTCTACCCG
AAGATGTATG AAACGCTGAT GACCATGGCG CTTAACGATG ATCTGGACGT TGCGCAGTGC
AACGCGGACT GGTGCGTCCG AAAAACCGGG CACGCCTGGC AATCTATTCC GACCGATCGC
CTGCGCTCCA CCGGGGTATT AAGCGGACCG GATTGGTTGC GTATGGCGTT GGCCTCGCGG
CGCTGGATGC ATGTTGTCTG GATGGGCGTT TATCGACGTG CGTTAATTAC CGATAACAAT
ATTACTTTCG TTCCCGGACT ACATCATCAG GACATATTAT GGTCGACGGA AGTTATGTTT
AATGCCACGC GCGTACGTTA TACCGAACAA TCATTATATA AATATTTCCT GCATGATAAT
TCGGTAAGCC GTTTGCAAAG ACAAGGCAGT AAAAATCTTA ATTACCAGCG GCATTATATT
AAAATTACGC GGTTATTAGA AAAGCTCAAT CGTGATTATG CCCGGCGTAT TCCGATTTAC
CCGGAGTTTC GCCAGCAAAT TACCTGGGAA GCGTTACGCG TTTGTCATGC GGTACGTAAA
GAGCCTGATA TTTTGACCCG CCAGCGTATG ATTGCCGAAA TTTTTACTTC TGGCATGTAT
AGACGGATGA TGGCTAACGT CCGCAGCGCG AAAGCGGCTT ATCAGACGCT GCTCTGGTCT
TTCCGGCTGT GGCAATGGCG CGACAAAACC TTGTCACACC GTCGTATGGC CCGTAAGGCG
CTCAATCTGT CTTAG
 
Protein sequence
MKNSKTKVSI IVPLYNAGAD FNACMASLIA QTWSALEIII VNDGSTDHSV EIAKYYAEHY 
PHVRLLHQAN AGASVARNLG LQAATGDYVA FVDADDLVYP KMYETLMTMA LNDDLDVAQC
NADWCVRKTG HAWQSIPTDR LRSTGVLSGP DWLRMALASR RWMHVVWMGV YRRALITDNN
ITFVPGLHHQ DILWSTEVMF NATRVRYTEQ SLYKYFLHDN SVSRLQRQGS KNLNYQRHYI
KITRLLEKLN RDYARRIPIY PEFRQQITWE ALRVCHAVRK EPDILTRQRM IAEIFTSGMY
RRMMANVRSA KAAYQTLLWS FRLWQWRDKT LSHRRMARKA LNLS
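
The relationship between the gene sequence and the protein sequence above can be checked with a short script. The sketch below is an illustration, not the database's own method: it builds the codon-to-amino-acid mapping used by translation table 11 (whose amino-acid assignments match the standard code), strips the spaces and line breaks used in the 10-base display blocks, and translates until the first stop codon. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Alternative start codons such as GTG or TTG, which table 11 permits as initiators, are not handled specially here; this gene begins with ATG, so that does not matter for this record.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and newlines used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three display blocks of the gene sequence from this record.
print(translate("ATGAAAAATA GTAAAACCAA AGTGAGTATC"))  # MKNSKTKVSI
```

Passing the full gene sequence to `translate` should yield the 344-residue protein shown above, and `gc_fraction` on the same input can be compared against the reported GC content.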