Gene Information |
Locus tag | ECH74115_0767 |
Symbol | nagC |
ID | 6969855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 783672 |
End bp | 784892 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643384796 |
Product | N-acetylglucosamine repressor |
Protein accession | YP_002269302 |
Protein GI | 209400377 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
|
|
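The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 783672
END_BP = 784892
GENE_LENGTH_BP = 1221
PROTEIN_LENGTH_AA = 406


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 784892 - 783672 + 1 gives 1221 bp, and 1221 / 3 - 1 gives 406 codons, which agrees with the listed Protein Length.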
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000547022 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACCAG GCGGACAAGC TCAGATAGGT AATGTTGATC TCGTAAAACA GCTTAACAGC GCGGCGGTTT ATCGCCTGAT TGACCAGTAC GGGCCAATCT CGCGGATTCA GATTGCCGAG CAAAGCCAGC TTGCCCCCGC CAGCGTAACC AAAATTACGC GTCAGCTTAT CGAACGCGGG CTGATCAAAG AAGTTGATCA GCAGGCCTCC ACCGGGGGCC GCCGCGCTAT CTCTATCGTC ACCGAAACCC GCAATTTCCA CGCAATCGGC GTACGGCTTG GTCGTCACGA CGCCACCATC ACCCTGTTTG ATCTCAGCAG CAAAGTGCTG GCAGAAGAAC ATTACCCGCT GCCGGAACGT ACCCAGCAGA CGCTGGAACA TGCCCTGCTG AATGCCATTG CTCAGTTTAT TGATAGCTAC CAGCGCAAAC TGCGCGAGCT GATCGCCATT TCCGTTATCC TGCCAGGACT TGTTGACCCG GACAGCGGCA AAATTCATTA CATGCCGCAT ATTCAGGTAG AAAACTGGGG GCTGGTAGAA GCACTGGAAG AGCGTTTTAA AGTGACCTGT TTCGTTGGTC ACGATATCCG TAGTCTGGCG CTGGCGGAGC ACTACTTCGG TGCAAGTCAG GATTGCGAAG ACTCCATTCT GGTGCGTGTC CATCGCGGAA CCGGGGCCGG GATTATCTCT AACGGGCGCA TTTTTATTGG CCGCAACGGC AACGTCGGTG AAATTGGCCA TATTCAGGTC GAACCGCTGG GTGAACGCTG CCACTGCGGC AACTTTGGCT GCCTGGAAAC TATCGCTGCC AACGCCGCTA TTGAACAACG GGTGTTGAAT CTGTTAAAGC AGGGCTACCA GAGCCGCGTG CCGCTGGACG ACTGCACCAT CAAAACTATC TGCAAAGCCG CGAACAAAGG CGATAGCCTG GCCTCGGAAG TGATTGAGTA TGTCGGTCGT CATCTGGGCA AAACCATCGC CATTGCTATC AACTTATTTA ATCCGCAAAA AATTGTTATT GCCGGTGAAA TCACCGAAGC CGATAAAGTG CTGCTCCCTG CTATTGAAAG CTGCATTAAT ACTCAGGCGC TGAAGGCGTT TCGTACTAAT CTGCCGGTGG TACGTTCTGA GCTAGACCAC CGCTCGGCAA TCGGTGCTTT TGCGCTGGTA AAACGCGCCA TGCTCAACGG TATTTTGCTC CAGCATTTGC TGGAAAATTA A
|
Protein sequence | MTPGGQAQIG NVDLVKQLNS AAVYRLIDQY GPISRIQIAE QSQLAPASVT KITRQLIERG LIKEVDQQAS TGGRRAISIV TETRNFHAIG VRLGRHDATI TLFDLSSKVL AEEHYPLPER TQQTLEHALL NAIAQFIDSY QRKLRELIAI SVILPGLVDP DSGKIHYMPH IQVENWGLVE ALEERFKVTC FVGHDIRSLA LAEHYFGASQ DCEDSILVRV HRGTGAGIIS NGRIFIGRNG NVGEIGHIQV EPLGERCHCG NFGCLETIAA NAAIEQRVLN LLKQGYQSRV PLDDCTIKTI CKAANKGDSL ASEVIEYVGR HLGKTIAIAI NLFNPQKIVI AGEITEADKV LLPAIESCIN TQALKAFRTN LPVVRSELDH RSAIGAFALV KRAMLNGILL QHLLEN
|
| |
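The sequences above are printed in space-separated blocks of 10 characters. The gene sequence as listed begins with ATG and ends with TAA, so it appears to be shown in coding orientation even though the gene lies on the minus strand. The sketch below joins the blocks, computes a GC fraction, and translates codons. It uses only the first three blocks of the gene sequence as sample input. The helper names are hypothetical, and the codon assignments are those of the standard genetic code, which translation table 11 shares for individual codons (the tables differ in which start codons are permitted).

```python
# Codon table in the conventional TCAG ordering (standard code assignments,
# which table 11 also uses for individual codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence in this record (sample input only).
fragment = join_blocks("ATGACACCAG GCGGACAAGC TCAGATAGGT")
print(len(fragment))
print(round(gc_fraction(fragment), 3))
print(translate(fragment))
```

For this 30-base fragment the translation is MTPGGQAQIG, which matches the first block of the protein sequence in the record. The GC fraction of the fragment is not the same quantity as the 53% reported for the whole gene; passing the full gene sequence to `gc_fraction` would be needed to compare against that field.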