Gene SeHA_C4032 details


Gene Information

Locus tag: SeHA_C4032
Symbol:
ID: 6489352
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 3914364
End bp: 3915398
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 48%
IMG OID: 642744133
Product: putative glycosyl transferase
Protein accession: YP_002047738
Protein GI: 194449258
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
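
The coordinates and lengths above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3914364
end_bp = 3915398

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should contain a whole number of codons.
assert gene_length % 3 == 0
codon_count = gene_length // 3

# Assumption: the final codon is a stop codon and is not counted in the protein.
protein_length = codon_count - 1

print(gene_length, "bp")     # 1035 bp
print(protein_length, "aa")  # 344 aa
```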


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0315657
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 80
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAATA GTAAAACCAA AGTGAGTATC ATTGTCCCGT TATATAATGC GGGAGCGGAT 
TTTAATGCTT GCATGGCGTC GTTAATCGCG CAAACGTGGT CGGCGCTGGA AATTATTATT
GTGAATGATG GATCGACGGA TCATTCCGTT GAGATAGCAA AACATTACGC GGAACATTAC
CCACATGTTC GACTGCTTCA TCAGGCCAAT GCTGGCGCAT CTGTCGCCCG TAATCTTGGC
CTGCAAGCGG CGACCGGCGA TTATGTCGCC TTTGTCGATG CGGATGACCA GGTCTACCCG
AAGATGTATG AAACGCTGAT GACTATGGCG CTTAACGATG ATCTGGACGT TGCGCAGTGT
AATGCGGACT GGTGCGTCCG AAAAACCGGG CACGCCTGGC AATCTATTCC GACCGATCGT
CTGCGTTCCA CCGGGGTATT AAGCGGACCG GATTGGTTGC GTATGGCGTT GGCCTCGCGG
CGCTGGACGC ATGTTGTCTG GATGGGCGTT TATCGACGTG CGTTAATTAC CGATAACAAT
ATTACTTTCG TTCCCGGACT ACATCATCAG GACATATTAT GGTCGACGGA AGTTATGTTT
AATGCCACGC GCGTACGTTA TACCGAACAA TCATTATATA AATATTTCCT GCATGATAAT
TCGGTAAGCC GTTTGCAAAG ACAAGGCAAT AAAAATCTTA ATTATCAGCG GCATTATATT
AAAATTACGC GATTATTAGA AAAGCTCAAT CGTGATTATG CCCGTCGTAT TCCGATTTAC
CCGGAATTTC GCCAGCAAAT TACCTGGGAA GCGTTACGCG TTTGTCATGC GGTACGTAAA
GAGCCTGATA TTTTGACCCG CCAGCGTATG ATTGCCGAAA TTTTTACTTC TGGCATGTAT
AGACGGATGA TGGCTAACGT CCGCAGCGCG AAAGCAGCTT ATCAGACGCT GCTCTGGTCC
TTCCGGCTGT GGCAATGGCG CGACAAAACC TTGTCGCACC GTCGTATGGC CCGTAAGGCG
CTCAATCTGT CTTAG
 
Protein sequence
MKNSKTKVSI IVPLYNAGAD FNACMASLIA QTWSALEIII VNDGSTDHSV EIAKHYAEHY 
PHVRLLHQAN AGASVARNLG LQAATGDYVA FVDADDQVYP KMYETLMTMA LNDDLDVAQC
NADWCVRKTG HAWQSIPTDR LRSTGVLSGP DWLRMALASR RWTHVVWMGV YRRALITDNN
ITFVPGLHHQ DILWSTEVMF NATRVRYTEQ SLYKYFLHDN SVSRLQRQGN KNLNYQRHYI
KITRLLEKLN RDYARRIPIY PEFRQQITWE ALRVCHAVRK EPDILTRQRM IAEIFTSGMY
RRMMANVRSA KAAYQTLLWS FRLWQWRDKT LSHRRMARKA LNLS
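
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, assuming the sense-strand gene sequence shown above and the amino-acid assignments of translation table 11 (which match the standard code for internal codons). The helper `translate` and the names `CODON_TABLE`, `first_row` and `last_row` are hypothetical, and only the first and last sequence rows are used to keep the example short.

```python
from itertools import product

# Codon order T, C, A, G at each position, paired with the amino-acid
# string used for the standard / bacterial (table 11) genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; whitespace is ignored, '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 30 bases and last 15 bases of the gene sequence, as listed above.
first_row = "ATGAAAAATA GTAAAACCAA AGTGAGTATC"
last_row = "CTCAATCTGT CTTAG"

n_terminus = translate(first_row)
c_terminus = translate(last_row)

print(n_terminus)  # MKNSKTKVSI
print(c_terminus)  # LNLS*
```

Both fragments agree with the start and end of the listed protein sequence, with the trailing `*` corresponding to the TAG stop codon.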