Gene SeSA_A0837 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A0837 
SymbolnagC 
ID6519744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp807243 
End bp808463 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content53% 
IMG OID642745971 
ProductN-acetylglucosamine repressor 
Protein accessionYP_002113787 
Protein GI194735429 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACCAG GCGGACAAGC TCAGATAGGT AACGTTGATC TCGTAAAACA GCTTAACAGC 
GCGGCCGTTT ACCGCCTGAT TGACCAGCAT GGTCCTATCT CGCGCATACA AATTGCCGAG
CAAAGCCAGC TTGCTCCCGC CAGCGTAACG AAAATTACGC GTCAACTCAT TGAACGCGGG
CTGATCAAAG AAGTCGATCA GCAGGCCTCT ACCGGAGGCC GCCGCGCTAT CTCTATCGTC
ACGGAAACCC GCAACTTCCA TGCCATTGGC GTTCGCCTGG GGCGTCATGA CACCACTTTA
ACGCTCTACG ATCTGAGCAG TAAAGTGGTC GCTGAGGAGC ATTATCCGCT ACCGGAGCGC
ACTCAGGAGA CGCTGGAACA TGCGCTGCTC AACACCATCG CCGTCTTTAT TGATAGCTGT
CAGCGTAAAA TTCGTGAATT GATCGCTATC TCGGTGATCC TGCCAGGGCT TGTCGATCCG
GAAAGCGGCG TGATTCGTTA CATGCCGCAC ATTCAGGTTG AAAACTGGGG ACTGGTCGAA
GCGCTGGAAA AACGGTTTCA CGTTACCTGT TTCGTGGGAC ACGATATCCG TAGCCTGGCG
CTGGCGGAAC ACTACTTCGG CGCCAGTCAG GATTGCGAGG ACTCGATTCT GGTGCGCGTT
CATCGTGGTA CAGGCGCCGG GATTATCTCC AACGGACGCA TCTTCATTGG CCGTAACGGC
AACGTTGGCG AAATTGGGCA TATTCAGGTG GAGCCGTTGG GCGAGCGCTG CCACTGCGGT
AATTTCGGCT GTCTGGAAAC CATTGCCGCC AATGCGGCGA TTGAACAACG GGTGCTGAAT
TTGCTTAAAC AAGGGTATCA AAGCCGTGTT CCGCTTGACG ACTGCACGAT TAAAACCATC
TGTAAGGCGG CAAACCGGGG CGACAGCCTG GCCTCGGAAG TCATTGAGCA TGTTGGTCGC
CATTTGGGCA AAACGATCGC CATTGCTATC AACCTGTTTA ATCCGCAAAA AATCGTCATT
GCCGGCGAGA TCATTGAAGC CGATAAAGTC CTGTTGCCCG CTATCGAAAG CTGTATCAAT
ACGCAGGCGT TAAAGGCATT TCGCAAAAAT TTGCCGGTGG TACGCTCCAC GCTGGATCAC
CGTTCTGCTA TCGGCGCATT TGCCTTAGTT AAACGCGCCA TGCTCAACGG AACATTGCTG
CAACGTTTGC TGGAAAGTTG A
 
Protein sequence
MTPGGQAQIG NVDLVKQLNS AAVYRLIDQH GPISRIQIAE QSQLAPASVT KITRQLIERG 
LIKEVDQQAS TGGRRAISIV TETRNFHAIG VRLGRHDTTL TLYDLSSKVV AEEHYPLPER
TQETLEHALL NTIAVFIDSC QRKIRELIAI SVILPGLVDP ESGVIRYMPH IQVENWGLVE
ALEKRFHVTC FVGHDIRSLA LAEHYFGASQ DCEDSILVRV HRGTGAGIIS NGRIFIGRNG
NVGEIGHIQV EPLGERCHCG NFGCLETIAA NAAIEQRVLN LLKQGYQSRV PLDDCTIKTI
CKAANRGDSL ASEVIEHVGR HLGKTIAIAI NLFNPQKIVI AGEIIEADKV LLPAIESCIN
TQALKAFRKN LPVVRSTLDH RSAIGAFALV KRAMLNGTLL QRLLES