Gene SeHA_C0801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C0801 
SymbolnagC 
ID6489292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp790107 
End bp791327 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content54% 
IMG OID642741053 
ProductN-acetylglucosamine repressor 
Protein accessionYP_002044711 
Protein GI194448029 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones78 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACCAG GCGGACAAGC TCAGATAGGT AACGTTGATC TCGTAAAACA GCTTAACAGC 
GCGGCCGTTT ACCGCCTGAT TGACCAGCAT GGTCCTATCT CGCGCATACA AATTGCCGAG
CAAAGCCAGC TTGCTCCCGC CAGCGTAACG AAAATTACGC GTCAACTCAT TGAACGCGGG
CTGATCAAAG AAGTCGATCA GCAGGCCTCT ACCGGAGGCC GCCGCGCTAT CTCTATCGTC
ACGGAAACCC GCAACTTCCA TGCCATTGGC GTTCGCCTGG GCCGTCATGA CACCACTTTA
ACGCTCTACG ATCTGAGCAG TAAAGTGGTC GCTGAGGAGC ATTATCCGCT GCCGGAGCGC
ACCCAGGAGA CGCTGGAACA CGCGCTGCTC AACACCATCG CCGTCTTTAT TGATAGCTGT
CAGCGTAAAA TTCGTGAATT GATCGCTATC TCGGTGATCC TGCCAGGGCT TGTCGATCCG
GAAAGCGGCG TGATTCGTTA CATGCCGCAC ATTCAGGTTG AAAACTGGGG ACTGGTCGAA
GCGCTGGAAA AACGGTTTCA CGTTACCTGT TTCGTGGGAC ACGATATTCG TAGCCTGGCA
CTGGCGGAAC ACTACTTCGG CGCCAGTCAG GATTGCGAGG ACTCGATTCT GGTGCGCGTC
CATCGTGGTA CAGGCGCCGG GATTATCTCC AACGGACGCA TCTTCATTGG CCGTAACGGC
AACGTCGGCG AAATTGGGCA TATTCAGGTG GAGCCGTTGG GCGAGCGCTG CCACTGCGGT
AATTTCGGCT GTCTGGAAAC CATTGCCGCC AATGCGGCGA TTGAACAACG GGTGCTGAAT
TTGCTTAAAC AAGGGTATCA AAGCCGTGTT CCGCTTGACG ACTGCACGAT TAAAACCATC
TGTAAGGCGG CAAACCGGGG CGACAGCCTG GCCTCGGAAG TCATTGAGCA TGTTGGCCGC
CATTTGGGCA AAACGATCGC CATTGCTATC AACCTGTTTA ATCCGCAAAA AATCGTCATT
GCCGGCGAGA TCATTGAAGC CGATAAAGTC CTGTTGCCCG CTATCGAAAG CTGTATCAAT
ACGCAGGCGT TAAAGGCCTT TCGCAAAAAT TTGCCGGTGG TACGCTCCAC GCTGGATCAC
CGTTCTGCTA TCGGCGCATT TGCCTTAGTT AAACGCGCCA TGCTCAACGG AACATTGCTG
CAACGTTTGC TGGAAAGCTG A
 
Protein sequence
MTPGGQAQIG NVDLVKQLNS AAVYRLIDQH GPISRIQIAE QSQLAPASVT KITRQLIERG 
LIKEVDQQAS TGGRRAISIV TETRNFHAIG VRLGRHDTTL TLYDLSSKVV AEEHYPLPER
TQETLEHALL NTIAVFIDSC QRKIRELIAI SVILPGLVDP ESGVIRYMPH IQVENWGLVE
ALEKRFHVTC FVGHDIRSLA LAEHYFGASQ DCEDSILVRV HRGTGAGIIS NGRIFIGRNG
NVGEIGHIQV EPLGERCHCG NFGCLETIAA NAAIEQRVLN LLKQGYQSRV PLDDCTIKTI
CKAANRGDSL ASEVIEHVGR HLGKTIAIAI NLFNPQKIVI AGEIIEADKV LLPAIESCIN
TQALKAFRKN LPVVRSTLDH RSAIGAFALV KRAMLNGTLL QRLLES