Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4265 |
Symbol | |
ID | 6143294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4365124 |
End bp | 4366365 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641619086 |
Product | N-acylglucosamine 2-epimerase |
Protein accession | YP_001746210 |
Protein GI | 170683291 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2942] N-acyl-D-glucosamine 2-epimerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATGGT TTAACACCTT AAGTCACAAC CGTTGGCTGG AACAGGAAAC CGATCGCATC TTTGATTTTG GTAAAAATTC CGTGGTGCCG ACTGGTTTTG GCTGGTTAGG CAATAAAGGG CAAATCAAAG AAGAGATGGG CACCCATCTG TGGATCACCG CCCGTATGTT ACACGTTTAT TCCGTTGCTG CGGCGATGGG TCGACCAGGC GCTTACTCGT TGGTTGATCA CGGTATCAAA GCCATGAACG GCGCACTGCG CGATAAAAAA TATGGCGGCT GGTATGCCTG TGTGAATGAC GAGGGCGTGG TGGATGCCTC CAAACAGGGC TATCAACATT TCTTTGCCCT GCTGGGTGCC GCCAGTGCCG TCACAACGGG TCACCCGGAA GCGCGCAAGC TGCTCGATTA CACCATTGAA ATTATCGAGA AATACTTCTG GAGCGAAGAA GAGCAGATGT GCCTGGAATC CTGGGACGAA GCCTTCAGTA AAACCGAAGA GTACCGCGGC GGCAACGCCA ACATGCATGC GGTAGAAGCC TTCTTGATTG TTTATGACGT AACGCATGAC AAAAAATGGC TGGATCGTGC GATTCGCGTA GCTTCCGTCA TTATCCACGA CGTCGCCAGA AATAATCATT ATCGCGTTAA CGAGCACTTC GATACCCAGT GGAATCCGCT GCCGGATTAC AACAAAGATA ACCCGGCGCA CCGCTTCCGC GCATTCGGAG GCACGCCGGG CCACTGGATC GAGTGGGGTC GTTTAATGCT GCACATCCAT GCCGCCCTGG AAGCCCGCGG CGAACAGCCG CCAGCCTGGC TGTTAGAAGA TGCCAAAGGT CTGTTCAACG CCACCGTGCG CGATGCCTGG GCGCCCGATG GTGCGGACGG GATTGTTTAT ACCGTTGACT GGGAAGGAAA ACCGGTCGTT CGCGAACGTG TGCGTTGGCC TATCGTCGAG GCGATGGGTA CGGCTTACGC GCTCTACACC GTCACCGGCG ATCGCCAGTA CGAAACCTGG TATCAGACAT GGTGGGAGTA CTGCATTAAG TACCTGATGG ACTATGAAAA TGGTTCCTGG TGGCAGGAGC TGGATGCGGA CAATAAGGTC ACCACCAAAG TCTGGGACGG CAAACAGGAT ATTTATCACC TGCTGCATTG CCTGGTGATC CCGCGTATCC CGTTAGCCCC TGGTCTGGCT CCGGCAGTTG CGGCGGGGCT GCTGGACGTG AATGCCAAAT AA
|
Protein sequence | MKWFNTLSHN RWLEQETDRI FDFGKNSVVP TGFGWLGNKG QIKEEMGTHL WITARMLHVY SVAAAMGRPG AYSLVDHGIK AMNGALRDKK YGGWYACVND EGVVDASKQG YQHFFALLGA ASAVTTGHPE ARKLLDYTIE IIEKYFWSEE EQMCLESWDE AFSKTEEYRG GNANMHAVEA FLIVYDVTHD KKWLDRAIRV ASVIIHDVAR NNHYRVNEHF DTQWNPLPDY NKDNPAHRFR AFGGTPGHWI EWGRLMLHIH AALEARGEQP PAWLLEDAKG LFNATVRDAW APDGADGIVY TVDWEGKPVV RERVRWPIVE AMGTAYALYT VTGDRQYETW YQTWWEYCIK YLMDYENGSW WQELDADNKV TTKVWDGKQD IYHLLHCLVI PRIPLAPGLA PAVAAGLLDV NAK
|
| |