Gene Information |
Locus tag | EcSMS35_0697 |
Symbol | nagA |
ID | 6146163 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 700983 |
End bp | 702131 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615587 |
Product | N-acetylglucosamine-6-phosphate deacetylase |
Protein accession | YP_001742786 |
Protein GI | 170682789 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1820] N-acetylglucosamine-6-phosphate deacetylase |
TIGRFAM ID | [TIGR00221] N-acetylglucosamine-6-phosphate deacetylase |
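The coordinate and length fields in the table above are related by simple arithmetic. The following is a minimal Python sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. Both are assumptions about this record's conventions, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(700983, 702131)
print(length, protein_length_aa(length))  # 1149 382
```

Under these assumptions the result agrees with the Gene Length (1149 bp) and Protein Length (382 aa) rows.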
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0146279 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATGCAT TAACCCAGGG CCGGATCTTT ACCGGCCACG AATTTCTTGA TGACCACGCG GTTGTTATCG CTGATGGCCT GATTAAAAGC GTCTGTCCGG TAGCGGAACT GCCGCCAGAG ATCGAACAAC GTTCACTGAA CGGGGCCATT CTCTCCCCCG GTTTTATCGA TGTGCAGCTA AACGGCTGCG GCGGCGTACA GTTTAACGAC ACCGCTGAAG CAGTCAGCGT GGAAACGCTG GAAATCATGC AGAAAGCCAA TGAGAAATCA GGCTGTACTA ACTATCTGCC GACGCTTATC ACCACCAGCG ATGAGCTGAT GAAACAGGGC GTGCGCGTTA TGCGCGAGTA CCTGGCAAAA CATCCGAACC AGGCGTTAGG TCTGCATCTG GAAGGTCCGT GGCTGAATCT GGTAAAAAAA GGCACCCATA ATCCGAATTT TGTGCGTAAG CCTGATGCCG CGCTGGTCGA TTTCCTGTGT GAGAACGCCG ACGTGATTAC CAAAGTGACT CTGGCACCGG AAATGGTTCC AGCGGAAGTC ATCAGCAAAC TGGCAAATAC CGGGATTGTG GTTTCTGCTG GTCACTCCAA CGCGACGTTG AAAGAAGCGA AAGCCGGTTT CCGCGCGGGG ATTACCTTTG CCACCCATCT GTACAACGCG ATGCCGTATA TTACCGGTCG TGAACCGGGC CTGGCGGGCG CGATCCTCGA CGAAGCTGAC ATTTATTGCG GTATTATTGC TGATGGCCTG CATGTTGATT ACGCCAACAT TCGTAACGCT AAACGTCTGA AAGGCGACAA ATTGTGTCTG GTTACCGATG CCACCGCGCC AGCAGGTGCC AACATTGAAC AGTTCATTTT TGCGGGTAAA ACAATATACT ACCGTAATGG ACTTTGTGTG GATGAGAACG GTACGTTAAG CGGTTCATCC TTAACCATGA TTGAAGGCGT GCGTAATCTG GTCGAACATT GTGGTATCGC ACTGGATGAA GTGCTGCGTA TGGCGACGCT CTATCCGGCA CGTGCGATTG GCGTTGAGAA ACGTCTCGGC ACGCTCGCCG CAGGTAAAGT AGCCAACCTG ACTGCATTCA CACCTGATTT TAAAATCACC AGGACCATCG TTAACGGTAA CGAGGTCGTA ACTCAATAA
Protein sequence | MYALTQGRIF TGHEFLDDHA VVIADGLIKS VCPVAELPPE IEQRSLNGAI LSPGFIDVQL NGCGGVQFND TAEAVSVETL EIMQKANEKS GCTNYLPTLI TTSDELMKQG VRVMREYLAK HPNQALGLHL EGPWLNLVKK GTHNPNFVRK PDAALVDFLC ENADVITKVT LAPEMVPAEV ISKLANTGIV VSAGHSNATL KEAKAGFRAG ITFATHLYNA MPYITGREPG LAGAILDEAD IYCGIIADGL HVDYANIRNA KRLKGDKLCL VTDATAPAGA NIEQFIFAGK TIYYRNGLCV DENGTLSGSS LTMIEGVRNL VEHCGIALDE VLRMATLYPA RAIGVEKRLG TLAAGKVANL TAFTPDFKIT RTIVNGNEVV TQ
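The gene and protein sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that grouping, compute GC content, and translate a coding sequence. It is an illustration under stated assumptions, not the method used to produce this record: the helper names are hypothetical, and the codon table is written out by hand in NCBI codon order (translation table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons).

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGTATGCAT TAACCCAGGG CCGGATCTTT"))  # MYALTQGRIF
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein sequence rows to be checked in the same way.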