Gene SeD_A4409 details


Gene Information

Locus tag: SeD_A4409
Symbol:
ID: 6874326
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 4258556
End bp: 4259797
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 56%
IMG OID: 642787329
Product: N-acylglucosamine 2-epimerase
Protein accession: YP_002217940
Protein GI: 198245149
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2942] N-acyl-D-glucosamine 2-epimerase
TIGRFAM ID:
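
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon; neither assumption is stated by the record itself, and the function names are illustrative.

```python
# Values copied from the "Start bp" and "End bp" fields of the record.
START_BP = 4258556
END_BP = 4259797


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp, "bp")
    print(protein_length(length_bp), "aa")
```

Under these assumptions the computed values agree with the Gene Length (1242 bp) and Protein Length (413 aa) fields.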


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 84
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATGGT TTAACACGTT GAGCCACAAC CGCTGGCTGG AGCAGGAAAC CGACCGCATC 
TTTAATTTCG GTAAAAACGC CGTGGTGCCG ACAGGCTTCG GCTGGCTGGG AAATAAAGGG
CAAATCAAAG AAGAGATGGG CACCCATCTT TGGATCACGG CGCGTATGCT GCACGTTTAT
TCCGTGGCGG CGTCGATGGG CCGACCGGGC GCTTATGATC TGGTCGATCA CGGCATCAAA
GCCATGAACG GCGCGCTGCG CGATAAAAAA TACGGCGGCT GGTATGCCTG CGTTAACGAT
CAGGGCGTGG TGGATGCCTC TAAACAGGGT TATCAACACT TCTTCGCCTT GCTGGGCGCG
GCCAGCGCCG TCACGACCGG GCATCCTGAA GCCAGGAAAT TGCTGGATTA CACCATAGAA
GTGATTGAGA AATACTTCTG GAGTGAAGAA GAGCAGATGT GCCTGGAGTC CTGGGACGAA
GCCTTCAGCC AGACGGAAGA TTACCGTGGC GGTAACGCCA ATATGCACGC CGTCGAAGCG
TTCCTGATTG TTTATGACGT TACCCATGAC AAAAAATGGC TGGATCGCGC GCTGCGTATC
GCGTCGGTAA TTATTCATGA TGTGGCGCGC AACGGTGATT ACCGCGTCAA TGAGCACTTC
GATTCACAGT GGAACCCTAT CCGCGACTAC AACAAAGATA ATCCTGCCCA CCGTTTCCGC
GCCTACGGCG GTACGCCAGG TCACTGGATT GAGTGGGGCC GTCTGATGCT CCATCTCCAT
GCTGCGCTGG AGGCCCGCTT CGAAACGCCA CCCGCCTGGC TGCTGGAAGA CGCGAAAGGT
CTGTTCCATG CCACTATCCG CGACGCCTGG GCGCCAGATG GCGCAGACGG TTTTGTCTAC
TCAGTCGACT GGGACGGTAA ACCTATCGTA CGTGAACGCG TGCGCTGGCC GATTGTGGAA
GCGATGGGTA CGGCCTACGC CCTCCACACC CTGACGGGCG ATAGCCAGTA TGAAGAGTGG
TATCAGAAAT GGTGGGACTA CTGCATTAAG TACCTGATGG ACTATGAAAA TGGTTCTTGG
TGGCAAGAGC TGGACGCCGA TAACAAAGTG ACCACCAAAG TGTGGGACGG CAAGCAAGAT
ATTTACCATC TGCTGCACTG TCTGGTCATT CCTCGTCTGC CACTGGCGCC GGGCCTGGCT
CCGGCGGTCG CGGCGGGTCT CCTGGATATC AACGCGAAAT AG
 
Protein sequence
MKWFNTLSHN RWLEQETDRI FNFGKNAVVP TGFGWLGNKG QIKEEMGTHL WITARMLHVY 
SVAASMGRPG AYDLVDHGIK AMNGALRDKK YGGWYACVND QGVVDASKQG YQHFFALLGA
ASAVTTGHPE ARKLLDYTIE VIEKYFWSEE EQMCLESWDE AFSQTEDYRG GNANMHAVEA
FLIVYDVTHD KKWLDRALRI ASVIIHDVAR NGDYRVNEHF DSQWNPIRDY NKDNPAHRFR
AYGGTPGHWI EWGRLMLHLH AALEARFETP PAWLLEDAKG LFHATIRDAW APDGADGFVY
SVDWDGKPIV RERVRWPIVE AMGTAYALHT LTGDSQYEEW YQKWWDYCIK YLMDYENGSW
WQELDADNKV TTKVWDGKQD IYHLLHCLVI PRLPLAPGLA PAVAAGLLDI NAK
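
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: it assumes the sequence is read in frame from its first base, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which codons may act as start codons, which this sketch does not model). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, as displayed in the record.
    first_bases = "ATGAAATGGT TTAACACGTT GAGCCACAAC"
    print(translate(first_bases))
    print(f"{gc_content(first_bases):.0%}")
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing the display spaces) is one way to check the two blocks against each other; `gc_content` on the full sequence can be compared in the same way with the GC content field.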