Gene SNSL254_A0741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0741 
SymbolnagC 
ID6483600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp746089 
End bp747309 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content54% 
IMG OID642736153 
ProductN-acetylglucosamine repressor 
Protein accessionYP_002039919 
Protein GI194442984 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.388075 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value0.560523 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACCAG GCGGACAAGC TCAGATAGGT AACGTTGATC TCGTAAAACA GCTTAACAGC 
GCGGCCGTTT ACCGCCTGAT TGACCAGCAT GGTCCTATCT CGCGCATACA AATTGCCGAG
CAAAGCCAGC TTGCCCCCGC CAGCGTAACG AAAATTACGC GTCAACTCAT TGAACGCGGG
CTGATCAAAG AAGTCGATCA GCAGGCCTCT ACCGGAGGCC GCCGCGCTAT CTCTATCGTC
ACGGAAACCC GCAACTTCCA TGCCATTGGC GTTCGCCTGG GGCGTCATGA CACCACTTTA
ACGCTCTACG ATCTGAGCAG TAAAGTGGTC GCTGAGGAGC ATTATCCGCT ACCGGAGCGC
ACCCAGGAGA CGCTGGAACA TGCGCTGCTC AACACCATCG CCGTCTTTAT TGATAGCTGT
CAGCGTAAAA TTCGTGAATT GATCGCTATC TCGGTGATCC TGCCAGGGCT TGTCGATCCG
GAAAGCGGCG TGATTCGTTA CATGCCGCAC ATTCAGGTTG AAAACTGGGG ACTGGTCGAA
GCGCTGGAAA AACGGTTTCA CGTTACCTGT TTCGTGGGAC ACGATATCCG TAGCCTGGCG
CTGGCGGAAC ACTACTTCGG CGCCAGTCAG GATTGCGAGG ACTCGATTCT GGTGCGCGTT
CATCGTGGTA CAGGCGCCGG GATTATCTCC AACGGACGCA TCTTCATTGG CCGTAACGGC
AACGTCGGCG AAATTGGGCA TATTCAGGTG GAGCCGTTGG GCGAGCGCTG CCACTGCGGT
AATTTCGGCT GTCTGGAAAC CATTGCCGCC AATGCGGCGA TTGAACAACG GGTGCTGAAT
TTGCTTAAAC AAGGGTATCA AAGCCGTGTT CCGCTTGACG ACTGCACGAT TAAAACCATC
TGTAAGGCGG CAAACCGGGG CGACAGCCTG GCCTCGGAAG TCATTGAGCA TGTTGGTCGC
CATTTGGGCA AAACGATCGC CATTGCTATC AACCTGTTTA ATCCGCAAAA AATCGTCATT
GCCGGCGAGA TCATTGAAGC CGATAAAGTC CTGTTGCCCG CTATCGAAAG CTGTATCAAT
ACGCAGGCGT TAAAGGCCTT TCGCAAAAAT TTGCCGGTGG TACGCTCCAC GCTGGATCAC
CGTTCTGCTA TCGGCGCATT TGCCTTAGTT AAACGCGCCA TGCTCAACGG AACATTGCTG
CAACGTTTGC TGGAAAGTTG A
 
Protein sequence
MTPGGQAQIG NVDLVKQLNS AAVYRLIDQH GPISRIQIAE QSQLAPASVT KITRQLIERG 
LIKEVDQQAS TGGRRAISIV TETRNFHAIG VRLGRHDTTL TLYDLSSKVV AEEHYPLPER
TQETLEHALL NTIAVFIDSC QRKIRELIAI SVILPGLVDP ESGVIRYMPH IQVENWGLVE
ALEKRFHVTC FVGHDIRSLA LAEHYFGASQ DCEDSILVRV HRGTGAGIIS NGRIFIGRNG
NVGEIGHIQV EPLGERCHCG NFGCLETIAA NAAIEQRVLN LLKQGYQSRV PLDDCTIKTI
CKAANRGDSL ASEVIEHVGR HLGKTIAIAI NLFNPQKIVI AGEIIEADKV LLPAIESCIN
TQALKAFRKN LPVVRSTLDH RSAIGAFALV KRAMLNGTLL QRLLES