Gene SNSL254_A4519 details

Gene Information

Locus tag: SNSL254_A4519
Symbol: aceB
ID: 6483073
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 4392194
End bp: 4393798
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 52%
IMG OID: 642739745
Product: malate synthase
Protein accession: YP_002043427
Protein GI: 194442437
COG category: [C] Energy production and conversion
COG ID: [COG2225] Malate synthase
TIGRFAM ID: [TIGR01344] malate synthase A
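
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4392194
end_bp = 4393798

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1605 0 534
```

A zero remainder indicates that the coding sequence is a whole number of codons, as expected for an unspliced CDS.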


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.492257
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 83
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAATC CACAGGCAAC CACAACTGAT GAATTAACCT TTACCAGGCC GCAAGGCGAG 
CTGGAAAAGC AAGTCCTGAC CGCTGAAGCA GTCGAGTTTT TGACGGAGTT AGTCACCCGT
TTTACGCCAA AACGCAATAA ACTCCTGGCT GCACGTATCC AGCAACAGCA GGATATTGAT
AACGGTAAGT TGCCTGATTT TATTTCGGAA ACCACTTCCA TTAGAGAAAG TAATTGGCAG
ATTCGTGGTA TTCCGGCGGA TTTACAGGAT CGCCGAGTAG AAATTACCGG GCCGGTTGAA
CGTAAAATGG TGATTAATGC CCTGAACGCA AACGTGAAAG TGTTTATGGC GGATTTTGAA
GACTCGCTGG CGCCGGACTG GAATAAAGTT ATTGATGGTC AAATCAACCT GCGTGATGCG
GTGAACGGCA CCATTAGCTA TACCAACGAA GCCGGAAAAA TCTATCAGCT CAAGCCCGAT
CCGGCCGTAT TGATTTGTCG TGTACGCGGT CTACATCTGC CAGAAAAACA TGTTACCTGG
CGGGGGGAAG CTATTCCCGG CAGCCTGTTT GATTTTGCTC TGTACTTTTT CCACAACTAT
AAAGCGCTGC TCGCTAAAGG TAGCGGCCCG TATTTTTACC TGCCGAAAAC GCAAGCCTGG
CAGGAGGCAG CCTGGTGGAG TGAAGTTTTC AGCTACGCCG AAGACCGCTT TAACCTGCCG
CGCGGTACGA TCAAAGCGAC CCTGTTGATT GAAACGCTGC CGGCTGTTTT CCAGATGGAT
GAGATTCTTC ATGCGCTGCG TGATCATATC GTCGGTCTCA ACTGTGGTCG CTGGGATTAT
ATTTTCAGCT ATATCAAAAC GTTGAAAAAT CACCCGGATC GCGTCCTGCC GGACAGGCAG
GTGGTAACGA TGGACAAACC GTTTCTGAGC GCCTACTCGC GCCTGCTGAT CAAAACCTGT
CACAAGCGCG GCGCGTTCGC GATGGGCGGT ATGGCGGCGT TTATCCCGAG CAAAGACGTT
GAACGCAACA ATCAGGTCCT TGCCAAAGTG AAAGCGGATA AAGCGCTGGA AGCGAACAAC
GGCCACGACG GCACGTGGAT TGCGCATCCC GGGTTGGCGG ATACCGCAAT GGCCGTCTTT
AACGAGGTAC TGGGCGAGCA CAAAAATCAG CTGTTCATTA CCCGTGATGA AGATGCGCCG
ATTACCGCTG AACAGTTACT GGAGCCATGT GAAGGCGAAC GCACAGAAGC GGGAATGCGC
GCCAATATTC GCGTGGCAGT GCAGTACATT GAAGCGTGGA TCTCCGGCAA TGGCTGTGTA
CCGATTTACG GTCTGATGGA GGATGCCGCG ACGGCGGAAA TCTCACGAAC CTCTATCTGG
CAGTGGATTC ACCATGAGAA AACACTGAGC AATGGAAAAC CCGTAACGAA AGCGCTTTTC
CGCGAAATGT TGGCGGAAGA GATGCGGGTA ATCCAGGACG AGCTGGGCGA GCACCGCTAC
AGCAGCGGGC GCTTCGACGA TGCCGCACGT CTGATGGAGC AAATCACCAC CTCAGATGAC
TTAATCGACT TCCTCACCCT GCCGGGCTAT CGCTTACTGG CTTAA
 
Protein sequence
MMNPQATTTD ELTFTRPQGE LEKQVLTAEA VEFLTELVTR FTPKRNKLLA ARIQQQQDID 
NGKLPDFISE TTSIRESNWQ IRGIPADLQD RRVEITGPVE RKMVINALNA NVKVFMADFE
DSLAPDWNKV IDGQINLRDA VNGTISYTNE AGKIYQLKPD PAVLICRVRG LHLPEKHVTW
RGEAIPGSLF DFALYFFHNY KALLAKGSGP YFYLPKTQAW QEAAWWSEVF SYAEDRFNLP
RGTIKATLLI ETLPAVFQMD EILHALRDHI VGLNCGRWDY IFSYIKTLKN HPDRVLPDRQ
VVTMDKPFLS AYSRLLIKTC HKRGAFAMGG MAAFIPSKDV ERNNQVLAKV KADKALEANN
GHDGTWIAHP GLADTAMAVF NEVLGEHKNQ LFITRDEDAP ITAEQLLEPC EGERTEAGMR
ANIRVAVQYI EAWISGNGCV PIYGLMEDAA TAEISRTSIW QWIHHEKTLS NGKPVTKALF
REMLAEEMRV IQDELGEHRY SSGRFDDAAR LMEQITTSDD LIDFLTLPGY RLLA
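
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the pipeline used to generate this record: it builds the standard codon table, whose codon assignments translation table 11 shares (table 11 differs in the set of permitted start codons, which does not matter here because the gene starts with ATG). The function name `translate` and the two sample strings, copied from the first and last lines of the gene sequence, are illustrative.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group the sequence into blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above (both start in frame).
first_line = "ATGATGAATC CACAGGCAAC CACAACTGAT GAATTAACCT TTACCAGGCC GCAAGGCGAG"
last_line = "TTAATCGACT TCCTCACCCT GCCGGGCTAT CGCTTACTGG CTTAA"

print(translate(first_line))  # MMNPQATTTDELTFTRPQGE
print(translate(last_line))   # LIDFLTLPGYRLLA
```

The two outputs match the beginning and end of the protein sequence listed above; passing the full gene sequence to the same function would be expected to yield the complete 534-residue protein.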