Gene SeD_A3223 details


Gene Information

Locus tag: SeD_A3223
Symbol:
ID: 6872862
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3101533
End bp: 3102483
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 56%
IMG OID: 642786240
Product: NAD dependent epimerase/dehydratase family protein
Protein accession: YP_002216881
Protein GI: 198244663
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
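The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive, and that the gene length includes a single terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3101533
END_BP = 3102483


def gene_length(start: int, end: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, ASSUMING the CDS ends in exactly one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 951 316
```

Under these assumptions the computed values match the 951 bp and 316 aa reported in the record.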


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 79
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGATTA TCATTACCGG CGGGGGCGGC TTTTTAGGTC AGAAACTCGC AAGCGCCTTA 
TTAAACTCAT CGCTGGCGTT TAACGAACTG CTTCTTGTTG ATTTAAAAAT GCCTGCACGG
TTATCAGATT CTCCTCGTTT ACGCTGCCTA GAGGCTGACT TAACCCAGCC GGGCGTGCTG
GAGAATGTGA TTACCGCTAA TACCTCTGTT GTTTATCATC TCGCTGCGAT TGTCAGCAGT
CATGCGGAAG ACGATTTCGA TCTGGGATGG AAAGTTAACC TGGATCTTAC CCGCCAGTTA
CTTGAGGCGT GTCGTCGGCA ACCGCAGAAA ATTCGTTTTG TCTTCTCCAG CTCGCTTGCC
GTTTATGGCG GTACGCTGCC GGAATGCGTC ACCGATACCA CCGCGCTCAC GCCGCGCTCG
TCTTATGGCG CGCAGAAGGC TGCCTGTGAA CTGTTGGTCA ACGACTATAC CCGCAAAGGC
TATGTGGATG GGCTGGCGCT GCGTTTGCCG ACGATCTGTG TTCGCCCGGG TAAACCAAAC
CGCGCCGCTT CTTCTTTTGT CAGCGCGATT ATTCGTGAAC CGTTGCAGGG CGAGACGACC
GTCTGCCCGG TGTCGGAAAG TTTGCGGCTG TGGATTTCCA GCCCGGCGAC GGTGATCCAT
AACCTGTCGC TGGCCGCAAC GTTACCCGCG CCTGGCGAGG CGAGCAGCAT CAACTTACCG
GGGATCAGCG TAACCGTGGG CGAGATGCTG GAAACGTTGC GTCAGGCGGG CGGCCAGGCG
GCGCGCGATC GGGTTACGCA TCAGCGCGAT GAAGGCGTCG AGAAAATTGT CGCCTCCTGG
CCGGGACGTA TCGATAACCA GCGTGCGCTG GCGTTAGGTT TTGTCGCCGA TAAACGCTTC
GATGACATTA TCGAACGCTT TCGACAAGAT GATATGGAGG GGAGGTCATG A
 
Protein sequence
MQIIITGGGG FLGQKLASAL LNSSLAFNEL LLVDLKMPAR LSDSPRLRCL EADLTQPGVL 
ENVITANTSV VYHLAAIVSS HAEDDFDLGW KVNLDLTRQL LEACRRQPQK IRFVFSSSLA
VYGGTLPECV TDTTALTPRS SYGAQKAACE LLVNDYTRKG YVDGLALRLP TICVRPGKPN
RAASSFVSAI IREPLQGETT VCPVSESLRL WISSPATVIH NLSLAATLPA PGEASSINLP
GISVTVGEML ETLRQAGGQA ARDRVTHQRD EGVEKIVASW PGRIDNQRAL ALGFVADKRF
DDIIERFRQD DMEGRS
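
The protein sequence follows from the gene sequence under translation table 11, whose codon assignments are the same as the standard genetic code (the table differs in which start codons it allows). The sketch below is one way to check this on blocks copied from the record. The helper names `clean`, `translate` and `gc_fraction` are illustrative and not part of any database tool, and the translation here reads a single forward frame only.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order with the third base
# varying fastest; translation table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 0; a stop codon is written as '*'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and the final line of the gene sequence above.
first_block = "ATGCAGATTA TCATTACCGG CGGGGGCGGC"
last_line = "GATGACATTA TCGAACGCTT TCGACAAGAT GATATGGAGG GGAGGTCATG A"

print(translate(first_block))  # MQIIITGGGG
print(translate(last_line))    # DDIIERFRQDDMEGRS*
```

The two outputs match the start and end of the listed protein sequence, with the final `*` marking the TGA stop codon. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field.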