Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4451 |
Symbol | nagA1 |
ID | 6970963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4125115 |
End bp | 4126248 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643388171 |
Product | N-acetylglucosamine-6-phosphate deacetylase |
Protein accession | YP_002272608 |
Protein GI | 209399438 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1820] N-acetylglucosamine-6-phosphate deacetylase |
TIGRFAM ID | [TIGR00221] N-acetylglucosamine-6-phosphate deacetylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.23608 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACACG TTCTGCGCGC CAGAAGGCTG CTGACTGAAG AGGGATGGCT CGATGACCAT CAGTTGCGTA TTGCTGACGG TGTCATCGCA GCAATCGAAC CGATTCCAGT GAGCGTGACT GAACGCGATG CGGAACTGCT CTGCCCCGCT TACATCGACA CCCATGTACA CGGTGGTGCG GGCGTTGATG TAATGGATGA CGCGCCGGAT GTACTCGACA AGCTGGCAAT GCACAAGGCA CGCGAAGGTG TCGGCAGTTG GTTGCCGACT ACCGTAACCG CGCCGCTTAG TACCATTCAT GCGGCGCTGA AACGTATTGC TCAACGTTGC CAACGCGGCG GACCTGGTGC GCAAGTGCTG GGGAGTTATC TCGAAGGACC GTACTTCACG CCGCAGAATA AAGGCGCGCA TCCGCCGGAG TTGTTTCGCG AGCTTGAAAT TGCCGAGCTG GATCAATTGA TTGCCGTTTC TCAGCACACC TTACGCGTGG TAGCGCTGGC ACCGGAAAAA GAGGGGGCAT TGCAGGCCAT CCGCCATCTT AAACAGCAAA ATGTACGAGT GATGCTGGGG CATAGCGCGG CGACCTGGCA ACAAACTCGC GCCGCGTTTG ATGCTGGTGC CGACGGCCTG GTGCATTGCT ATAACGGGAT GACAGGTTTA CATCACCGCG AACCGGGAAT GGTTGGCGCG GGATTAACGG ACAAGCGCGC CTGGCTGGAA CTGATAGCCG ATGGTCATCA TGTGCATCCG GCGGCAATGT CGCTGTGTTG TTGCTGTGCG AAAGAGAGAA TCGTACTGAT CACCGACGCG ATGCAGGCAG CCGGGATGCC GGATGGTCGC TATACGTTAT GTGGCGAAGA AGTGCAGATG CACGGTGGCG TTGTCCGTAC CGCGTCCGGT GGGCTGGCGG GCAGTACGCT GTCTGTTGAT GCGGCAGTGC GCAACATGGT CGAGTTGACG GGCGTAACGC CTGCGGAAGC CATTCATATG GCGTCGCTGC ATCCGGCGCG AATGCTGGGT GTTGATGGTG TTCTGGGATC GCTTAAACCG GGCAAACGCG CCAGCGTCGT TGCGCTGGAT AGCGGGCTGC ATGTGCAACA AATCTGGATT CAGGGTCAAT TAGCTTCGTT TTGA
|
Protein sequence | MTHVLRARRL LTEEGWLDDH QLRIADGVIA AIEPIPVSVT ERDAELLCPA YIDTHVHGGA GVDVMDDAPD VLDKLAMHKA REGVGSWLPT TVTAPLSTIH AALKRIAQRC QRGGPGAQVL GSYLEGPYFT PQNKGAHPPE LFRELEIAEL DQLIAVSQHT LRVVALAPEK EGALQAIRHL KQQNVRVMLG HSAATWQQTR AAFDAGADGL VHCYNGMTGL HHREPGMVGA GLTDKRAWLE LIADGHHVHP AAMSLCCCCA KERIVLITDA MQAAGMPDGR YTLCGEEVQM HGGVVRTASG GLAGSTLSVD AAVRNMVELT GVTPAEAIHM ASLHPARMLG VDGVLGSLKP GKRASVVALD SGLHVQQIWI QGQLASF
|
| |