Gene SeHA_C4516 details


Gene Information

Locus tag: SeHA_C4516
Symbol: aceB
ID: 6490967
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 4397420
End bp: 4399024
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 52%
IMG OID: 642744589
Product: malate synthase
Protein accession: YP_002048166
Protein GI: 194450851
COG category: [C] Energy production and conversion
COG ID: [COG2225] Malate synthase
TIGRFAM ID: [TIGR01344] malate synthase A
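
The length fields in this record agree with its coordinates: 4399024 - 4397420 + 1 = 1605 bp, and 1605 / 3 = 535 codons, one of which is the stop codon, leaving 534 residues. A minimal sketch of that check follows. The function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(4397420, 4399024)
    print(length)                  # 1605
    print(protein_length(length))  # 534
```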


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value: 0.070236
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAATC CACAGGCAAC CACAACTGAT GAATTAACCT TTACCAGGCC GCAAGGCGAG 
CTGGAAAAGC AAGTCCTGAC CGCTGAAGCA GTCGAGTTTT TGACGGAGTT AGTCACCCGT
TTTACGCCAA AACGCAATAA ACTCCTGGCT GCACGTATCC AGCAACAGCA GGATATTGAT
AACGGTAAGT TGCCTGATTT TATTTCGGAA ACCACTTCCA TTAGAGAAAG TAATTGGCAG
ATTCGTGGTA TTCCGGCGGA TTTACAGGAT CGCCGAGTAG AAATTACCGG GCCGGTTGAA
CGTAAAATGG TGATTAATGC CCTGAACGCA AATGTGAAAG TGTTTATGGC GGATTTTGAA
GACTCGCTGG CGCCGGACTG GAATAAAGTT ATTGATGGTC AAATCAACCT GCGTGATGCG
GTGAACGGCA CCATTAGCTA TACCAACGAA GCCGGAAAAA TCTATCAGCT CAAGCCCGAT
CCGGCCGTAT TGATTTGTCG TGTACGTGGT CTACATCTGC CAGAAAAACA TGTTACCTGG
CGGGGGGAAG CCATTCCCGG CAGCCTGTTT GATTTTGCTC TGTACTTTTT CCACAACTAT
AAAGCGCTGC TCGCTAAAGG TAGCGGCCCG TATTTTTACC TGCCGAAAAC GCAAGCCTGG
CAGGAGGCAG CCTGGTGGAG CGAAGTCTTC AGCTACGCCG AAGACCGCTT TAACCTGCCG
CGCGGTACGA TCAAAGCGAC CCTGTTGATT GAAACGCTTC CTGCTGTTTT CCAGATGGAT
GAGATTCTTC ATGCGTTGCG TGATCATATT GTCGGTCTCA ACTGTGGTCG CTGGGATTAT
ATTTTCAGCT ATATCAAAAC GTTGAAAAAT CACCCGGATC GCGTCCTGCC GGACAGACAG
GTGGTAACGA TGGACAAACC GTTTCTGAGC GCCTACTCGC GCCTGCTGAT CAAAACCTGT
CACAAGCGCG GCGCGTTCGC GATGGGCGGT ATGGCGGCGT TTATCCCGAG CAAAGACGTT
GAACGCAACA ATCAGGTCCT TGCCAAAGTG AAAGCGGATA AAGCGCTGGA AGCGAACAAC
GGCCACGACG GCACGTGGAT TGCGCATCCC GGGTTGGCGG ATACCGCAAT GGCCGTCTTT
AACGAGGTAC TGGGCGAGCA CAAAAATCAG CTGTTCATTA CCCGTGATGA AGATGCGCCG
ATTACCGCTG AGCAGTTACT GGAGCCATGT GAAGGCGAAC GCACAGAAGC GGGAATGCGC
GCCAATATTC GCGTGGCAGT GCAGTACATT GAAGCGTGGA TCTCCGGCAA TGGCTGTGTA
CCGATTTACG GTCTGATGGA GGATGCCGCG ACGGCGGAAA TCTCACGAAC CTCTATCTGG
CAGTGGATTC ACCATGAGAA AACACTGAGC AATGGAAAAC CCGTAACGAA AGCGCTTTTC
CGCGAAATGT TGGCGGAAGA GATGCGGGTA ATCCAGGACG AACTGGGCGA GCACCGCTAC
AGCAGCGGGC GCTTCGACGA TGCCGCACGT CTGATGGAGC AAATCACCAC CTCAGATGAC
TTAATCGACT TCCTCACCCT GCCGGGCTAT CGCTTACTGG CTTAA
 
Protein sequence
MMNPQATTTD ELTFTRPQGE LEKQVLTAEA VEFLTELVTR FTPKRNKLLA ARIQQQQDID 
NGKLPDFISE TTSIRESNWQ IRGIPADLQD RRVEITGPVE RKMVINALNA NVKVFMADFE
DSLAPDWNKV IDGQINLRDA VNGTISYTNE AGKIYQLKPD PAVLICRVRG LHLPEKHVTW
RGEAIPGSLF DFALYFFHNY KALLAKGSGP YFYLPKTQAW QEAAWWSEVF SYAEDRFNLP
RGTIKATLLI ETLPAVFQMD EILHALRDHI VGLNCGRWDY IFSYIKTLKN HPDRVLPDRQ
VVTMDKPFLS AYSRLLIKTC HKRGAFAMGG MAAFIPSKDV ERNNQVLAKV KADKALEANN
GHDGTWIAHP GLADTAMAVF NEVLGEHKNQ LFITRDEDAP ITAEQLLEPC EGERTEAGMR
ANIRVAVQYI EAWISGNGCV PIYGLMEDAA TAEISRTSIW QWIHHEKTLS NGKPVTKALF
REMLAEEMRV IQDELGEHRY SSGRFDDAAR LMEQITTSDD LIDFLTLPGY RLLA
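
The gene and protein sequences above are printed in blocks of ten characters, so they need to be joined before use. The sketch below shows one way to strip the spacing, compute a GC fraction, and translate a coding sequence using only the Python standard library. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It relies on translation table 11 assigning the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), and it does not model alternative start codons.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def clean(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 up to the first stop codon.

    Unknown codons become "X"; a trailing partial codon is ignored.
    """
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First line of the gene sequence in this record (60 bases).
    first_line = ("ATGATGAATC CACAGGCAAC CACAACTGAT "
                  "GAATTAACCT TTACCAGGCC GCAAGGCGAG")
    print(translate(clean(first_line)))  # MMNPQATTTDELTFTRPQGE
```

Passing the full gene sequence through `clean` and `translate` can be compared against the listed protein sequence, and `gc_fraction` against the reported GC content.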