Gene SeHA_C3103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3103 
Symbol 
ID6489443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3031647 
End bp3032597 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content56% 
IMG OID642743253 
ProductNAD dependent epimerase/dehydratase family protein 
Protein accessionYP_002046872 
Protein GI194450353 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGATTA TCATTACCGG CGGGGGCGGC TTTTTAGGCC AGAAACTCGC AAGCGCCTTA 
TTAAACTCAT CGCTGGCGTT TAACGAACTG CTTCTTGTTG ATTTAAAAAT GCCTGCACGG
TTATCAGATT CCCCTCGTTT ACGCTGCCTG GAAGCTGACT TAACCCAGCC GGGCGTGCTG
GAGAATGTGA TTACCGCTAA TACCTCTGTT GTTTATCATC TCGCTGCGAT TGTCAGCAGT
CATGCGGAAG ACGATTTTGA TCTGGGATGG AAAGTTAACC TGGATCTTAC CCGCCAGTTA
CTTGAGGCGT GTCGTCGACA ACCGCAGAAA ATTCGTTTTG TCTTCTCCAG CTCGCTTGCC
GTTTATGGCG GTACGCTGCC GGAATGCGTC ACCGATACCA CCGCGCTCAC GCCGCGCTCG
TCTTATGGCG CGCAGAAGGC CGCCTGTGAA CTGTTGGTCA ACGATTATAC CCGCAAAGGC
TATGTGGATG GGCTGGCGCT GCGTTTGCCG ACGATCTGTG TTCGCCCGGG TAAACCAAAC
CGCGCCGCTT CTTCTTTTGT CAGCGCGATT ATTCGTGAAC CGTTGCAGGG CGAGACGACC
GTCTGCCCGG TGTCGGAAAG TTTGCGTCTG TGGATTTCCA GCCCGGCGAC GGTGATCCAT
AACCTGTCGC TGGCCGCAAC GTTACCCGCG CCTGGCGAGG CGAGCAGCAT CAACTTACCC
GGGATCAGCG TAACCGTGGG CGAGATGCTG GAAACGTTGC GTCAGGCGGG CGGTCAGGCG
GCGCGCGATC GGGTTACGCA TCAGCGCGAC GAAGGCGTCG AGAAAATTGT CGCCTCCTGG
CCGGGACGTA TCGATAACCA GCGTGCGCTG GCGTTAGGTT TTGTCGCCGA TAAACGCTTC
GATGACATTA TCGAACGCTT TCGACAAGAT GATATGGAGG GGAGGTCATG A
 
Protein sequence
MQIIITGGGG FLGQKLASAL LNSSLAFNEL LLVDLKMPAR LSDSPRLRCL EADLTQPGVL 
ENVITANTSV VYHLAAIVSS HAEDDFDLGW KVNLDLTRQL LEACRRQPQK IRFVFSSSLA
VYGGTLPECV TDTTALTPRS SYGAQKAACE LLVNDYTRKG YVDGLALRLP TICVRPGKPN
RAASSFVSAI IREPLQGETT VCPVSESLRL WISSPATVIH NLSLAATLPA PGEASSINLP
GISVTVGEML ETLRQAGGQA ARDRVTHQRD EGVEKIVASW PGRIDNQRAL ALGFVADKRF
DDIIERFRQD DMEGRS