Gene SeD_A0788 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0788 
SymbolnagC 
ID6872235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp779095 
End bp780315 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content54% 
IMG OID642783985 
ProductN-acetylglucosamine repressor 
Protein accessionYP_002214664 
Protein GI198244422 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.808469 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value0.447439 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACCAG GCGGACAAGC TCAGATAGGT AACGTTGATC TCGTAAAACA GCTTAACAGC 
GCGGCCGTTT ACCGCCTGAT TGACCAGCAT GGTCCTATCT CGCGCATACA AATTGCCGAG
CAAAGCCAGC TTGCTCCCGC CAGCGTAACG AAAATTACGC GTCAACTCAT TGAACGCGGG
CTGATCAAAG AAGTCGATCA GCAGGCCTCT ACCGGAGGCC GCCGCGCTAT CTCTATCGTC
ACGGAAACCC GTAACTTCCA CGCCATTGGC GTTCGCCTGG GCCGTCATGA CACCACTTTA
ACGCTCTACG ATCTGAGCAG TAAAGTGGTC GCTGAGGAGC ATTATCCGCT ACCGGAGCGC
ACCCAGGAGA CGCTGGAACA TGCGCTGCTC AACACCATCG CCGTCTTTAT TGATAGCTGT
CAGCGTAAAA TTCGTGAATT GATCGCTATC TCGGTGATCC TGCCAGGGCT TGTCGATCCG
GAAAGCGGCG TGATTCGTTA CATGCCGCAC ATTCAGGTTG AAAACTGGGG ACTGGTCGAA
GCGCTGGAAA AACGGTTTCA CGTTACCTGT TTCGTGGGAC ACGATATCCG TAGCCTGGCG
CTGGCGGAAC ACTACTTCGG CGCCAGTCAG GATTGCGAGG ACTCGATTCT GGTGCGCGTT
CATCGTGGTA CAGGCGCCGG GATTATCTCC AACGGACGCA TCTTCATTGG CCGTAACGGC
AACGTCGGTG AAATTGGGCA TATTCAGGTG GAGCCGCTGG GCGAGCGCTG CCACTGCGGT
AATTTCGGCT GTCTGGAAAC CATTGCCGCC AATGCGGCGA TTGAACAACG GGTGCTGAAT
TTGCTTAAAC AAGGGTATCA AAGCCGTGTT CCGCTTGACG ACTGCACGAT TAAAACCATC
TGTAAGGCGG CAAACCGGGG CGACAGCCTG GCCTCGGAAG TCATTGAGCA TGTTGGTCGC
CATTTGGGCA AAACGATCGC CATTGCTATC AACCTGTTTA ACCCGCAAAA AATCGTCATT
GCCGGCGAGA TCATTGAAGC CGATAAAGTC CTGTTGCCCG CTATCGAAAG CTGTATCAAT
ACGCAGGCGT TAAAGGCATT TCGCAAAAAT TTGCCGGTGG TACGCTCCAC GCTGGATCAC
CGTTCTGCTA TCGGCGCATT TGCCTTAGTT AAACGCGCCA TGCTCAACGG AACATTGCTG
CAACGTTTGC TGGAAAGTTG A
 
Protein sequence
MTPGGQAQIG NVDLVKQLNS AAVYRLIDQH GPISRIQIAE QSQLAPASVT KITRQLIERG 
LIKEVDQQAS TGGRRAISIV TETRNFHAIG VRLGRHDTTL TLYDLSSKVV AEEHYPLPER
TQETLEHALL NTIAVFIDSC QRKIRELIAI SVILPGLVDP ESGVIRYMPH IQVENWGLVE
ALEKRFHVTC FVGHDIRSLA LAEHYFGASQ DCEDSILVRV HRGTGAGIIS NGRIFIGRNG
NVGEIGHIQV EPLGERCHCG NFGCLETIAA NAAIEQRVLN LLKQGYQSRV PLDDCTIKTI
CKAANRGDSL ASEVIEHVGR HLGKTIAIAI NLFNPQKIVI AGEIIEADKV LLPAIESCIN
TQALKAFRKN LPVVRSTLDH RSAIGAFALV KRAMLNGTLL QRLLES