Gene SeD_A1000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1000 
Symbol 
ID6873638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp989033 
End bp990466 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content58% 
IMG OID642784185 
ProductNAD dependent epimerase/dehydratase family protein 
Protein accessionYP_002214860 
Protein GI198243271 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0702] Predicted nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.31559 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones104 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGCAAC GCATTCTGGT TCTCGGCGCC AGCGGCTATA TCGGCCAGCA CCTGGTCTTT 
GCGCTAAGTC AGCAAGGGCA TCATGTGCGA GCGGCGGCGC GACGCATCGA ACGTCTGGAA
AAACAGCGCC TCGCCAACGT CAGTTGTCAT AAGGTCGATC TGCACTGGCC GGAAAATTTA
CCCGCGCTGC TTCGCGACAT TGATACCGTT TACTATCTGG TACACGGCAT GGGCGAAGGC
GGCGATTTTA TCGCCCATGA GCGTCAGGCG GCGCTCAACG TGCGCGACGC GCTGCGCCAG
ACGCCGGTTA AACAACTTAT TTTCCTCAGT TCATTGCAGG CGCCGGCACA TGAGCAATCC
GATCACCTGC GCGCCCGCCA GCTTACGGCT GACACGCTGC GCGACGCAGG CGTACCGGTG
ACGGAATTAC GCGCCGGGAT CATCGTCGGC GCAGGCTCCG CCGCCTTCGA AGTCATGCGT
GACATGGTTT ACAACCTGCC AATACTCACG CCGCCGCGCT GGGTGCGTTC GCGCACCACG
CCCATCGCTC TGGAAAATTT ACTCTACTAC CTGGTCGGCT TGCTGGACCA CCCTGCGCAC
GAGCATCGTA TTCTGGAAGC CGCCGGGCCG CAGGTATTAA GCTATCAGCA GCAGTTTGAA
CGTTTTATGG CCGTCAGCGG TAAACGGCGT CCGCTGATCC CGGTGCCTTT TCCGACCCGC
TGGATTTCGG TCTGGTTTTT AAACGTTATT ACCTCCGTGC CGCCGACTAC CGCAAAAGCC
TTAATCCAGG GGTTAAGGCA CGATTTGCTG GCCGATGACG CCGCGTTAAA AAAGTTGATC
CCCCAAACGC TTATCACCTT TGACGACGCC GTTCGCCGCA CGCTGAAAGA AGAAGAAAAA
CTGGTGAACT CCAGCGACTG GGGCTACGAC GCGCTGGCCT TCGCCCGCTG GCGTCCCGAA
TACGGCTATT TTCCAAAGCA GGCGGGCTTT ACCGCGCAGA CCCCGGCCAG CCTATCGGCG
CTCTGGCAGG TCGTAAATCG GCTGGGTGGC AAAGAGGGCT ATTTTTTCGG CAATATTTTG
TGGCAGACGC GCGCCGCGAT GGACCGTCTG GTGGGGCATA AACTGGCGAA AGGCCGCCCG
TCGCATACCT TGCTCAAGCC TGGCGATACG GTAGATAGCT GGAAAGTGAT CATTGTCGAA
CCAGAAAAAC AGCTCACGCT CTTGTTTGGC ATGAAAGCGC CGGGCCTGGG GCGGCTTAGC
TTCACGCTGC ACGATAAAGG CCGCTACCGC GAAATTGACG TGCGCGCCTG GTGGCATCCA
CACGGAATGC CGGGCCTGAT TTACTGGCTA CTGATGATCC CGGCGCACCT GTTTATTTTC
CGGGGAATGG CAAGGCGTAT TGCCCGACTT GCAGAACAAA TCACAGAAAA ATGA
 
Protein sequence
MAQRILVLGA SGYIGQHLVF ALSQQGHHVR AAARRIERLE KQRLANVSCH KVDLHWPENL 
PALLRDIDTV YYLVHGMGEG GDFIAHERQA ALNVRDALRQ TPVKQLIFLS SLQAPAHEQS
DHLRARQLTA DTLRDAGVPV TELRAGIIVG AGSAAFEVMR DMVYNLPILT PPRWVRSRTT
PIALENLLYY LVGLLDHPAH EHRILEAAGP QVLSYQQQFE RFMAVSGKRR PLIPVPFPTR
WISVWFLNVI TSVPPTTAKA LIQGLRHDLL ADDAALKKLI PQTLITFDDA VRRTLKEEEK
LVNSSDWGYD ALAFARWRPE YGYFPKQAGF TAQTPASLSA LWQVVNRLGG KEGYFFGNIL
WQTRAAMDRL VGHKLAKGRP SHTLLKPGDT VDSWKVIIVE PEKQLTLLFG MKAPGLGRLS
FTLHDKGRYR EIDVRAWWHP HGMPGLIYWL LMIPAHLFIF RGMARRIARL AEQITEK