Gene SeHA_C4050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4050 
SymbolwaaA 
ID6488726 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3932348 
End bp3933625 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content55% 
IMG OID642744151 
Product3-deoxy-D-manno-octulosonic-acid transferase 
Protein accessionYP_002047756 
Protein GI194449972 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1519] 3-deoxy-D-manno-octulosonic-acid transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.863642 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones85 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGAAT TGCTTTACAC CGCTCTTCTC TACCTTATTC AGCCTCTGAT CTGGATACGG 
CTTTGGGTGC GTGGACGTAA AGCGCCGGCC TATCGTAAGC GCTGGGGTGA ACGCTACGGA
TTCTACCGCC GTCCGTTGAA ACCGGGCGGA ATCATGCTGC ATTCCGTCTC GGTGGGCGAA
ACTCTGGCGG CCATCCCATT GGTCCGCGCT CTACGTCATC GCTATCCCGA TCTGCCTATT
ACCGTAACGA CCATGACGCC GACCGGCTCG GAGCGCGTCC AGTCCGCCTT TGGCAACGAT
GTTCAGCACG TTTACTTGCC TTATGATTTG CCCGATGCGC TCAATCGTTT CCTCAATAAG
ATTGATCCTA AGCTGGTATT GATCATGGAG ACTGAGCTCT GGCCAAATCT GATTGCTGCG
CTGCACAAAC GTCATATTCC GCTGGTTATC GCTAATGCGC GGCTTTCCGC CCGCTCCGCC
GCGGGTTATG CGAAGCTTGG CAAGTTTGTC CGTACGCTCT TGCGCCGTAT CACCCTGATT
GCCGCGCAAA ACGAAGAAGA TGGCGAACGC TTTGTGGCAT TGGGCGCGAA GAACAATCAG
GTCACGGTCA CCGGCAGTCT GAAATTTGAT ATTTCAGTTA CGCCGCAGCT GGCGGCTAAA
GCCGTTACGC TACGCCGCCA GTGGGCGCCG CACCGTCCGG TCTGGATTGC CACCAGCACC
CACGATGGCG AAGAGAGTAT CGTTATCGCC GCTCACCAGG CGTTATTACA TCAATTCCCG
AATTTATTAC TGATTCTGGT GCCCCGCCAT CCGGAGCGTT TCCCGGATGC TATCAATCTT
GTGCGTCAGG CAGGGTTAAG CTACATCACT CGTTCTTCGG GCGAAGTACC GTCCGCCAGC
ACCCAGGTCG TGGTAGGCGA TACCATGGGC GAATTAATGT TGCTCTATGG CATTGCCGAT
CTCGCCTTTG TTGGTGGTTC GCTGGTTGAA CGCGGCGGTC ATAACCCGCT GGAGGCCGCC
GCTCATGCGA TTCCGGTACT GATGGGTCCG CATACCTTTA ACTTTAAAGA TATTTGCGCC
CGTCTGGATC AGGCGAGCGG ACTTATCACG ATTACCGATG CGGCTACGCT GGCAAAAGAA
GTTTCCTCTT TACTGACCGA CGCTGATTAT CGTAATTTCT ACGGACGTCA CGCAGTTGAA
GTGCTGTATC AAAATCAGGG CGCGCTCCAG CGTCTGCTGC AACTGCTGGA ACCTTATCTG
CCACCGAAAA CGCATTGA
 
Protein sequence
MLELLYTALL YLIQPLIWIR LWVRGRKAPA YRKRWGERYG FYRRPLKPGG IMLHSVSVGE 
TLAAIPLVRA LRHRYPDLPI TVTTMTPTGS ERVQSAFGND VQHVYLPYDL PDALNRFLNK
IDPKLVLIME TELWPNLIAA LHKRHIPLVI ANARLSARSA AGYAKLGKFV RTLLRRITLI
AAQNEEDGER FVALGAKNNQ VTVTGSLKFD ISVTPQLAAK AVTLRRQWAP HRPVWIATST
HDGEESIVIA AHQALLHQFP NLLLILVPRH PERFPDAINL VRQAGLSYIT RSSGEVPSAS
TQVVVGDTMG ELMLLYGIAD LAFVGGSLVE RGGHNPLEAA AHAIPVLMGP HTFNFKDICA
RLDQASGLIT ITDAATLAKE VSSLLTDADY RNFYGRHAVE VLYQNQGALQ RLLQLLEPYL
PPKTH