Gene SNSL254_A3119 details

Gene Information

Locus tag: SNSL254_A3119
Symbol:
ID: 6483319
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3035290
End bp: 3036240
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 56%
IMG OID: 642738431
Product: NAD-dependent epimerase/dehydratase:Short-chain dehydrogenase/reductase SDR:3-beta hydroxysteroid dehydrogenase/isomerase:dTDP-4-dehydrorhamnose reductase
Protein accession: YP_002042155
Protein GI: 194442216
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
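
The start and end coordinates, gene length, and protein length in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the gene length includes one untranslated stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record.
start_bp = 3035290
end_bp = 3036240

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 951 316
```

Under those assumptions the result agrees with the listed 951 bp and 316 aa.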


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value: 0.0293341
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGATTA TCATTACCGG CGGGGGCGGC TTTTTAGGCC AGAAACTCGC AAGCGCCTTA 
TTAAACTCAT CGCTGGCGTT TAACGAACTG CTTCTTGTTG ATTTAAAAAT GCCTGCACGG
TTATCAGATT CCCCTCGTTT ACGCTGCCTG GAAGCTGACT TAACCCAGCC GGGCGTGCTG
GAGAATGTGA TTACCGCTAA TACCTCTGTT GTTTATCATC TCGCTGCGAT TGTCAGCAGT
CATGCGGAAG ACGATTTTGA TCTGGGATGG AAAGTTAACC TGGATCTTAC CCGCCAGTTA
CTTGAGGCGT GTCGTTGGCA ACCGCAGAAA ATTCGTTTTG TCTTCTCCAG CTCGCTTGCC
GTTTATGGCG GTACGCTGCC GGAATGCGTC ACCGATACCA CCGCGCTCAC GCCGCGCTCG
TCTTATGGCG CGCAGAAGGC CGCCTGTGAA CTGTTGGTCA ACGACTATAC CCGCAAAGGC
TATGTGGATG GGCTGGCGCT GCGTTTGCCG ACGATCTGTG TTCGCCCGGG TAAACCAAAC
CGCGCCGCTT CTTCTTTTGT CAGCGCGATT ATTCGTGAAC CGTTGCAGGG CGAGACGACC
GTCTGCCCGG TGTCGGAAAG TTTGCGGCTG TGGATTTCCA GCCCGGCGAC GGTGATCCAT
AACCTGTCGC TGGCCGCAAC GTTACCCGCG CCTGGCGAGG CGAGCAGCAT CAACTTACCG
GGGATCAGCG TAACCGTGGG CGAGATGCTG GAAACGTTGC GTCAGGCGGG CGGCCAGGCG
GCGCGCGATC GGGTTACGCA TCAGCGCGAT GAAGGCGTCG AGAAAATTGT CGCCTCCTGG
CCGGGACGTA TCGATAACCA GCGTGCGCTG GCGTTAGGTT TTGTCGCCGA TAAACGCTTC
GATGACATTA TCGAACGCTT TCGACAAGAT GATATGGAGG GGAGGTCATG A
 
Protein sequence
MQIIITGGGG FLGQKLASAL LNSSLAFNEL LLVDLKMPAR LSDSPRLRCL EADLTQPGVL 
ENVITANTSV VYHLAAIVSS HAEDDFDLGW KVNLDLTRQL LEACRWQPQK IRFVFSSSLA
VYGGTLPECV TDTTALTPRS SYGAQKAACE LLVNDYTRKG YVDGLALRLP TICVRPGKPN
RAASSFVSAI IREPLQGETT VCPVSESLRL WISSPATVIH NLSLAATLPA PGEASSINLP
GISVTVGEML ETLRQAGGQA ARDRVTHQRD EGVEKIVASW PGRIDNQRAL ALGFVADKRF
DDIIERFRQD DMEGRS
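
The gene and protein sequences can be related to each other with a short translation sketch. It is a minimal illustration, not the database's own pipeline: it assumes the forward-strand sequence as printed, ignores the spaces between 10-base blocks, and uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons). The helper names `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the bases."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above.
first_line = "ATGCAGATTA TCATTACCGG CGGGGGCGGC TTTTTAGGCC AGAAACTCGC AAGCGCCTTA"
print(translate(first_line))  # prints: MQIIITGGGGFLGQKLASAL
```

Passing the full gene sequence to `translate` gives a protein that can be compared with the listed protein sequence, and `gc_percent` can be compared with the listed GC content.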