Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4150 |
Symbol | |
ID | 6147133 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4249780 |
End bp | 4250910 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618973 |
Product | UDP-N-acetylglucosamine 2-epimerase |
Protein accession | YP_001746105 |
Protein GI | 170682256 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0381] UDP-N-acetylglucosamine 2-epimerase |
TIGRFAM ID | [TIGR00236] UDP-N-acetylglucosamine 2-epimerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAAGTAC TGACTGTATT TGGTACGCGC CCGGAAGCCA TCAAGATGGC TCCGTTGGTG CATGCGTTGG CAAAAGATCC TTTTTTTGAG GCTAAAGTTT GCGTCACTGC GCAGCATCGG GAGATGCTCG ATCAGGTGCT GAAACTCTTT TCCATTGTAC CTGACTACGA TCTCAACATA ATGCAGCCAG GACAGGGCCT GACAGAGATA ACCTGTCGGA TTCTGGAAGG GCTAAAACCA ATTCTTGCCG AGTTCAAACC AGATGTCGTG CTGGTTCACG GCGATACTAC GACGACGCTG GCAACCAGCC TGGCGGCGTT TTATCAGCGT ATTCCAGTTG GTCACGTTGA GGCAGGTCTG CGCACGGGCG ATCTCTATTC GCCGTGGCCG GAAGAGGCTA ACCGTACATT GACCGGGCAT CTGGCGATGT ATCACTTCTC TCCAACCGAA ACTTCCCGGC AAAACTTGCT GCGTGAAAAC GTTGCTGACA GCCGAATCTT CATTACCGGT AATACTGTCA TCGATGCACT GTTATGGGTG CGTGACCAGG TGATGAGCAG CGACACGCTG CGTTCAGAAC TGGCGGCAAA TTACCCGTTT ATCGACCCCG ATAAAAAGAT GATTCTGGTG ACCGGTCACA GGCGTGAGAG CTTCGGTCGT GGCTTTGAAG AAATCTGCCA GGCGCTGGCA GACATCGCCA CCACGCACCA GGACATCCAG ATTGTCTATC CGGTGCATCT CAACCCGAAC GTCAGAGAGC CGGTCAATCG CATTCTGGGG CATGTGAAAA ATGTCATTCT GATCGATCCC CAGGAGTATT TACCGTTTGT CTGGCTGATG AACCACGCCT GGCTGATTTT GACCGACTCA GGCGGCATTC AGGAAGAAGC GCCTTCGCTG GGGAAACCGG TGCTGGTGAT GCGCGATACC ACTGAGCGTC CGGAAGCGGT GACAGCGGGG ACGGTGCGTC TGGTCGGCAC GGATAAGCAG CGAATTGTCG AGGAAGTGAC GCGTCTTTTA AAAGACGAAA ACGAATATCA AGCTATGAGC CGCGCCCATA ACCCGTATGG TGATGGTCAG GCATGCTCTC GCATTCTGGA AGCGTTAAAA AATAATCGGA TATCACTATG A
|
Protein sequence | MKVLTVFGTR PEAIKMAPLV HALAKDPFFE AKVCVTAQHR EMLDQVLKLF SIVPDYDLNI MQPGQGLTEI TCRILEGLKP ILAEFKPDVV LVHGDTTTTL ATSLAAFYQR IPVGHVEAGL RTGDLYSPWP EEANRTLTGH LAMYHFSPTE TSRQNLLREN VADSRIFITG NTVIDALLWV RDQVMSSDTL RSELAANYPF IDPDKKMILV TGHRRESFGR GFEEICQALA DIATTHQDIQ IVYPVHLNPN VREPVNRILG HVKNVILIDP QEYLPFVWLM NHAWLILTDS GGIQEEAPSL GKPVLVMRDT TERPEAVTAG TVRLVGTDKQ RIVEEVTRLL KDENEYQAMS RAHNPYGDGQ ACSRILEALK NNRISL
|
| |