Gene SNSL254_A3310 details

Gene Information

Locus tag: SNSL254_A3310
Symbol: epd
ID: 6486128
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3215410
End bp: 3216456
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 54%
IMG OID: 642738603
Product: erythrose 4-phosphate dehydrogenase
Protein accession: YP_002042324
Protein GI: 194443216
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase
TIGRFAM ID: [TIGR01532] D-erythrose-4-phosphate dehydrogenase
            [TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I
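
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3215410
end_bp = 3216456


def gene_length(start, end):
    """Length of a feature whose start and end coordinates are both inclusive (assumed)."""
    return end - start + 1


def protein_length(n_bases):
    """Residues in a complete CDS: one per codon, minus the stop codon (assumed)."""
    return n_bases // 3 - 1


n = gene_length(start_bp, end_bp)
print(n, protein_length(n))  # prints "1047 348"
```

Under these assumptions the result agrees with the Gene Length (1047 bp) and Protein Length (348 aa) fields.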


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 71
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTAC GCATAGCGAT TAATGGCTTT GGTCGCATCG GACGTAACGT GGTTCGTGCT 
TTGTATGAAT CCGGACGTCG GGCGGAAATT ACCGTGGTGG CCATCAACGA GCTGGCGGAT
GCCGCAGGCA TGGCGCATTT GTTGAAATAC GATACCAGCC ACGGGCGTTT TGCATGGGAG
GTTCGCCACG AGCGCGAGCA GCTTTTTGTC GGCGACGATG TCATTCGTAT TCTGCATGAA
CGAACGCTGG CGGATCTGCC GTGGCGCGAA CTGGGCGTGG ATGTCGTGTT GGATTGTACG
GGCGTATATG GCAACCGGGA GCATGGCGAG GCGCATATTG CCGCTGGCGC GAAGAAAGTG
CTCTTTTCTC ATCCGGGCAG CAACGATCTT GACGCCACCG TCGTTTTTGG CGTGAACCAG
AACCAACTGC GTGCGGAACA TCGTATTGTC TCAAACGCGT CCTGTACCAC GAATTGCATA
ATTCCCGTCA TTAAATTGTT GGATGATGCT TACGGCATCG AGTCTGGTAC CGTCACGACG
ATCCACTCCG CGATGAACGA TCAGCAGGTG ATCGACGCGT ATCACTCCGA TCTACGGCGC
ACGCGCGCGG CCAGCCAGTC GATTATCCCG GTAGATACAA AACTGGCGGC AGGCATTACG
CGTATATTCC CGCAGTTTAA CGACCGTTTT GAGGCGATTG CAGTGCGCGT TCCAACCATT
AATGTGACGG CGATCGATTT AAGCGTAACG GTGAAAAAAC CAGTAAAAGC CAGTGAAGTC
AACCAGTTGC TGCAAAAAGC AGCACAAGGT GCATTTCATG GTATAGTTGA CTATACGGAA
TCACCGTTGG TCTCGATAGA TTTTAACCAC GACCCGCACA GCGCCATTGT TGATGGCACG
CAAACCCGGG TCAGTGGCGC CCACCTGATC AAGACGCTGG TCTGGTGCGA TAATGAATGG
GGCTTTGCTA ACAGGATGCT CGACACCACG TTAGCGATGG CCGCAGTTGG TTTCAGGCTC
GACGCGTCAG CGTCGACAAA ACTTTAA
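
The gene sequence is laid out in blocks of 10 bases, so the spacing has to be stripped before computing anything from it. A minimal sketch of how the length and GC fraction could be recomputed is shown here; the helper names `clean` and `gc_fraction` are hypothetical, and only the first row of the sequence is included for brevity (paste the full sequence to compare against the Gene Length and GC content fields).

```python
def clean(seq):
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


# First row of the gene sequence above (six blocks of 10 bases).
first_row = "ATGACCGTAC GCATAGCGAT TAATGGCTTT GGTCGCATCG GACGTAACGT GGTTCGTGCT"

print(len(clean(first_row)))          # prints 60
print(round(gc_fraction("ATGC GGCC"), 2))  # prints 0.75 (toy input, not from the record)
```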
 
Protein sequence
MTVRIAINGF GRIGRNVVRA LYESGRRAEI TVVAINELAD AAGMAHLLKY DTSHGRFAWE 
VRHEREQLFV GDDVIRILHE RTLADLPWRE LGVDVVLDCT GVYGNREHGE AHIAAGAKKV
LFSHPGSNDL DATVVFGVNQ NQLRAEHRIV SNASCTTNCI IPVIKLLDDA YGIESGTVTT
IHSAMNDQQV IDAYHSDLRR TRAASQSIIP VDTKLAAGIT RIFPQFNDRF EAIAVRVPTI
NVTAIDLSVT VKKPVKASEV NQLLQKAAQG AFHGIVDYTE SPLVSIDFNH DPHSAIVDGT
QTRVSGAHLI KTLVWCDNEW GFANRMLDTT LAMAAVGFRL DASASTKL
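
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below uses the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its set of permitted start codons, which does not matter here because the gene begins with ATG). The function name `translate` and the variables `head` and `tail` are illustrative; they hold the first 30 and last 27 bases of the gene sequence above.

```python
# Codon lookup in the usual T, C, A, G ordering of the genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(seq):
    """Translate a DNA sequence in frame 1; '*' marks a stop codon."""
    s = "".join(seq.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(s) - len(s) % 3, 3):
        a, b, c = (BASES.index(base) for base in s[i:i + 3])
        protein.append(AMINO_ACIDS[16 * a + 4 * b + c])
    return "".join(protein)


head = "ATGACCGTAC GCATAGCGAT TAATGGCTTT"  # first 30 bases of the gene
tail = "GACGCGTCAG CGTCGACAAA ACTTTAA"     # last 27 bases of the gene

print(translate(head))  # prints MTVRIAINGF
print(translate(tail))  # prints DASASTKL*
```

Both fragments agree with the start and end of the protein sequence listed above, with the final TAA codon accounting for the stop.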