Gene SNSL254_A3987 details


Gene Information

Locus tag: SNSL254_A3987
Symbol:
ID: 6485801
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3873791
End bp: 3874825
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 48%
IMG OID: 642739247
Product: putative glycosyl transferase
Protein accession: YP_002042957
Protein GI: 194445248
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
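
The coordinates and lengths in the table above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 3873791
END_BP = 3874825


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by a CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1035 344
```

Both results agree with the Gene Length (1035 bp) and Protein Length (344 aa) fields.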


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 66
Fosmid unclonability p-value: 0.993989
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAATA GTAAAACCAA AGTGAGTATC ATTGTCCCGT TATATAATGC GGGAGCGGAT 
TTTAATGCTT GCATGGCGTC GTTAATCGCG CAAACGTGGT CGGCGCTGGA AATTATTATT
GTGAATGATG GATCGACAGA TCATTCCGTT GAGATAGCAA AACATTACGC GGAACATTAC
CCACATGTTC GACTGCTTCA TCAGGCCAAT GCTGGCGCAT CTGTCGCCCG TAATCTTGGC
CTGCAAGCGG CGACCGGCGA TTATGTCGCC TTTGTCGATG CGGATGACCA GGTCTACCCG
AAGATGTATG AAACGCTGAT GACTATGGCG CTTAACGATG ATCTGGACGT TGCGCAGTGT
AATGCGGACT GGTGCGTCCG AAAAACCGGG CACGCCTGGC AATCTATTCC GACCGATCGT
CTGCGTTCCA CCGGGGTATT AAGCGGACCG GATTGGTTGC GTATGGCGTT GGCCTCGCGA
CGCTGGACGC ATGTTGTCTG GATGGGCGTT TATCGACGTG CGTTAATTAC CGATAACAAT
ATTACTTTCG TTCCCGGACT ACATCATCAG GACATATTAT GGTCGACGGA AGTTATGTTT
AATGCCACGC GCGTACGTTA TACCGAACAA TCATTATATA AATATTTCCT GCATGATAAT
TCGGTAAGCC GTTTGCAAAG ACAAGGCAAT AAAAATCTTA ATTATCAGCG GCATTATATT
AAAATTACGC GATTATTAGA AAAGCTCAAT CGTGATTATG CCCGTCGTAT TCCGATTTAC
CCGGAATTTC GCCAGCAAAT TACCTGGGAA GCGTTACGCG TTTGTCATGC GGTACGTAAA
GAGCCTGATA TTTTGACCCG CCAGCGTATG ATTGCCGAAA TTTTTACTTC TGGCATGTAT
AGACGGATGA TGGCTAACGT CCGCAGCGCG AAAGCGGCTT ATCAGACGCT GCTCTGGTCT
TTCCGGCTGT GGCAATGGCG CGACAAAACC TTGTCGCACC GTCGTATGGC CCGTAAGGCG
CTCAATCTGT CTTAG
 
Protein sequence
MKNSKTKVSI IVPLYNAGAD FNACMASLIA QTWSALEIII VNDGSTDHSV EIAKHYAEHY 
PHVRLLHQAN AGASVARNLG LQAATGDYVA FVDADDQVYP KMYETLMTMA LNDDLDVAQC
NADWCVRKTG HAWQSIPTDR LRSTGVLSGP DWLRMALASR RWTHVVWMGV YRRALITDNN
ITFVPGLHHQ DILWSTEVMF NATRVRYTEQ SLYKYFLHDN SVSRLQRQGN KNLNYQRHYI
KITRLLEKLN RDYARRIPIY PEFRQQITWE ALRVCHAVRK EPDILTRQRM IAEIFTSGMY
RRMMANVRSA KAAYQTLLWS FRLWQWRDKT LSHRRMARKA LNLS
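
The sequence blocks above are printed in groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated. It assumes the nucleotide sequence is given in coding orientation and contains only A, C, G and T (any other symbol raises a `KeyError` during translation). Translation table 11 uses the same amino-acid assignments as the standard genetic code, so the standard codon table is used here. All function names are hypothetical helpers, not part of the record.

```python
# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq):
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First display line of the gene sequence from this record.
first_line = "ATGAAAAATA GTAAAACCAA AGTGAGTATC ATTGTCCCGT TATATAATGC GGGAGCGGAT"
print(translate(first_line))  # MKNSKTKVSIIVPLYNAGAD
```

The printed residues match the first twenty residues of the protein sequence. Passing the complete gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be compared against the sequence itself.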