Gene SNSL254_A2491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2491 
SymbolmenC 
ID6484959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2414798 
End bp2415760 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content60% 
IMG OID642737826 
ProductO-succinylbenzoate synthase 
Protein accessionYP_002041567 
Protein GI194444463 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1441] O-succinylbenzoate synthase 
TIGRFAM ID[TIGR01927] o-succinylbenzoic acid (OSB) synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.31031 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones82 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAGCG CGCAGGTATA CCGCTGGCAG ATCCCCATGG ACGCGGGGGT GGTTCTGCGC 
GACAGGCGGT TAAAAACTCG CGACGGGCTG TATGTTTGTC TGCGTGACGG CGAGCGTGAG
GGATGGGGAG AGATCTCTCC GCTGCCGGGC TTCAGTCAGG AAACGTGGGA AGAGGCGCAG
ACGGCGCTCC TGACGTGGGT GAACGACTGG CTTCAGGGGA ACGAGGGATT ACCGGAGATG
CCTTCGGTCG CGTTTGGCGC AAGCTGCGCG CTGGCGGAAC TGACAGGCAT ATTGCCGGAG
GCGGCGGACT ATCGCGCTGC GCCGTTATGC ACTGGCGATC CTGACGATTT GGTGCTGCGG
CTTGCCGATA TGCCCGGCGA GAAAATCGCT AAGGTCAAAG TGGGTCTTTA TGAAGCGGTA
CGCGACGGCA TGGTGGTTAA TTTGCTGCTG GAGGCGATCC CGGATCTGCA TCTGCGTCTG
GATGCGAATC GCGCCTGGAC GCCGCTAAAA GCCCAACAGT TCGCAAAGTA TGTTAATCCA
GACTACCGCG CTCGTATCGC CTTTCTCGAA GAACCATGTA AGACGCGGGA TGATTCCCGC
GCCTTTGCCC GTGAAACCGG CATCGCGATT GCCTGGGACG AAAGTCTGCG CGAAGCGGAT
TTCACCTTTG AAGCCGAAGA GGGCGTCAGG GCTGTGGTTA TCAAACCTAC GCTGACCGGA
TCGCTTGATA AAGTGCGTGA GCAAGTCGCT GCCGCCCATG CGTTGGGACT GACGGCGGTC
ATCAGCTCTT CGATCGAGTC CAGCCTCGGC CTGACGCAAC TGGCGCGGAT TGCCGCCTGG
TTGACGCCGG GAACGCTGCC TGGACTGGAT ACCTTGCATC TGATGCAGGC GCAACAGATT
CGCCCCTGGC CCGGTAGCGC GTTGCCTTGT CTGAAGCGTG AGGAGCTGGA ACGACTGTTA
TGA
 
Protein sequence
MRSAQVYRWQ IPMDAGVVLR DRRLKTRDGL YVCLRDGERE GWGEISPLPG FSQETWEEAQ 
TALLTWVNDW LQGNEGLPEM PSVAFGASCA LAELTGILPE AADYRAAPLC TGDPDDLVLR
LADMPGEKIA KVKVGLYEAV RDGMVVNLLL EAIPDLHLRL DANRAWTPLK AQQFAKYVNP
DYRARIAFLE EPCKTRDDSR AFARETGIAI AWDESLREAD FTFEAEEGVR AVVIKPTLTG
SLDKVREQVA AAHALGLTAV ISSSIESSLG LTQLARIAAW LTPGTLPGLD TLHLMQAQQI
RPWPGSALPC LKREELERLL