Gene SeHA_C1031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C1031 
Symbol 
ID6489473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp1012176 
End bp1013609 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content58% 
IMG OID642741273 
ProductNAD dependent epimerase/dehydratase family protein 
Protein accessionYP_002044926 
Protein GI194450982 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0702] Predicted nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones95 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGCAAC GCATTCTGGT TCTCGGCGCC AGCGGCTATA TCGGCCAGCA CCTGGTCTTT 
GCGCTAAGTC AGCAAGGGCA TCAGGTGCGA GCGGCGGCGC GACGCATCGA ACGTCTGGAA
AAACAGCGCC TCGCCAACGT CAGTTGTCAT AAGGTCGATC TGCACTGGCC GGAAAATTTA
CCCGCGCTGC TTCGCGACAT TGATACCGTT TACTATCTGG TACACGGCAT GGGCGAAGGC
GGCGATTTTA TCGCCCATGA GCGTCAGGCG GCGCTCAACG TGCGCGACGC GCTGCGCCAG
ACGCCGGTTA AACAACTTAT TTTCCTCAGT TCACTGCAGG CACCGGCGCA TGAGCAATCC
GATCACCTGC GCGCCCGCCA GCTTACGGCT GACACGCTGC GCGACGCAGG CGTACCGGTG
ACGGAATTAC GCGCCGGGAT CATCGTCGGC GCAGGCTCCG CCGCCTTCGA AGTCATGCGC
GACATGGTTT ACAACCTGCC AATACTCACG CCGCCGCGCT GGGTGCGTTC GCGCACCACG
CCCATCGCTC TGGAAAATTT ACTCTACTAC CTGGTCGGCT TACTGGACCA CCCTGCTCGC
GAGCATCGTA TTCTGGAAGC CGCCGGGCCG CAGGTATTAA GTTATCAGCA GCAGTTTGAA
CGTTTTATGG CCGTCAGCGG TAAACGGCGT CCGCTGATCC CGGTGCCTTT TCCGACCCGC
TGGATTTCGG TCTGGTTTTT AAACGTCATT ACCTCCGTGC CGCCAACTAC CGCAAAAGCG
TTAATCCAGG GATTAAGGCA CGATTTGCTG GCCGATGACG CCGCGTTAAA AAAGTTGATC
CCCCAAACGC TTATCACCTT TGATGACGCC GTTCGCCGCA CGCTGAAAGA AGAAGAAAAA
CTGGTGAACT CCAGCGACTG GGGCTACGAC GCGCTGGCCT TCGCCCGCTG GCGTCCCGAA
TACGGCTATT TTCCAAAGCA GGCGGGCTTT ACCGCGCAGA CCCCGGCCAG CCTATCGGCG
CTATGGCAGG TCGTAAATCG GCTGGGTGGC AAAGAGGGCT ATTTTTTCGG CAATATTTTG
TGGCAGACGC GCGCCGCGAT GGACCGTCTG GTGGGGCATA AACTGGCGAA AGGCCGCCCG
TCGCATACCT TGCTCAAGCC TGGCGATACG GTAGATAGCT GGAAAGTGAT CATTGTCGAA
CCAGAAAAAC AGCTCACGCT CTTGTTTGGC ATGAAAGCGC CGGGTCTGGG GCGGCTTAGC
TTCACGCTGC ACGATAAAGG CCGCTACCGC GAAATTGACG TGCGCGCCTG GTGGCATCCA
CACGGAATGC CGGGCCTGAT TTACTGGCTA CTGATGATCC CGGCGCACCT GTTTATTTTC
CGGGGAATGG CAAGGCGTAT TGCCCGACTT GCAGAACAAA TCACAGAAAA ATGA
 
Protein sequence
MAQRILVLGA SGYIGQHLVF ALSQQGHQVR AAARRIERLE KQRLANVSCH KVDLHWPENL 
PALLRDIDTV YYLVHGMGEG GDFIAHERQA ALNVRDALRQ TPVKQLIFLS SLQAPAHEQS
DHLRARQLTA DTLRDAGVPV TELRAGIIVG AGSAAFEVMR DMVYNLPILT PPRWVRSRTT
PIALENLLYY LVGLLDHPAR EHRILEAAGP QVLSYQQQFE RFMAVSGKRR PLIPVPFPTR
WISVWFLNVI TSVPPTTAKA LIQGLRHDLL ADDAALKKLI PQTLITFDDA VRRTLKEEEK
LVNSSDWGYD ALAFARWRPE YGYFPKQAGF TAQTPASLSA LWQVVNRLGG KEGYFFGNIL
WQTRAAMDRL VGHKLAKGRP SHTLLKPGDT VDSWKVIIVE PEKQLTLLFG MKAPGLGRLS
FTLHDKGRYR EIDVRAWWHP HGMPGLIYWL LMIPAHLFIF RGMARRIARL AEQITEK