Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3435 |
Symbol | |
ID | 6145313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3514117 |
End bp | 3515271 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641618264 |
Product | AgaS family sugar isomerase |
Protein accession | YP_001745413 |
Protein GI | 170682634 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2222] Predicted phosphosugar isomerases |
TIGRFAM ID | [TIGR02815] putative sugar isomerase, AgaS family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGAAA ATTACACCCC TGCTGCTGCC GCAACCGGTA CATGGACTGA AGAAGAGATC CGCCATCAGC CTCGCGCATG GATCCGTTCA CTCACCAACA TCGACGCGCT ACGTTCCGCG CTCAATAACT TCCTTGAACC GTTACTGCGC AAAGAGAATC TGCGGGTAAT CCTGACCGGA GCCGGAACCT CGGCATTTAT CGGTGACATC ATCGCGCCGT GGCTCGCCAG CCATACCGGT AAAAACTTCA GCGCCGTACC GACCACCGAT CTGGTCACTA ATCCGATGGA CTACCTGAAT CCAGCTCATC CGCTGCTGTT GATCTCCTTC GGTCGATCCG GCAACAGCCC GGAAAGCGTC GCCGCCGTGG AACTGGCAAA TCAATTTGTA CCAGAATGCT ATCACCTGCC GATCACCTGC AACGAAGCGG GCGCTCTTTA CCAAAACGCG ATCAACAGCG ACAACGCGTT TGCCCTGCTG ATGCCCGCAG AAACGCACGA TCGCGGCTTC GCGATGACCA GCAGCATTAC CACCATGATG GCCAGCTGCC TCGCGGTTTT CGCACCTGAG ACGATCAACA GCCAGAGCTT CCGCGATGTG GCGGATCGTT GCCAGGCGAT CCTGACCTCA CTGGGCGATT TCAGCGAAGG TGTGTTTGGT TACGCACCGT GGAAACGGAT CGTTTATCTC GGCAGCGGTG GCTTACAGGG CGCAGCACGC GAGTCGGCGC TGAAAGTGCT GGAACTGACG GCGGGTAAAC TGGCGGCCTT TTATGATTCC CCGACCGGAT TCCGTCATGG CCCGAAATCG CTGGTCGATA ACGAAACGCT GGTGGTGGTG TTTGTCTCCA GCCACCCTTA CACCCGTCAG TATGATCTTG ATCTGCTGGC AGAACTCCGC CGTGACAACC AGGCATTGCG CGTAATCGCC ATCGCCGCGG AAAGCAACGA CGTTATTACC GCCGGTCCAC ATATCATCCT GCCGCCGTCC CGTCACTTTA TCGACGTTGA GCAGGCATTT TGCTTCCTGA TGTACGCCCA GACGTTTGCA CTGATGCAGT CGCTGCACAT GGGCAATACG CCGGATACCC CATCAGCCAG TGGCACCGTT AACCGCGTGG TGCAAGGCGT AATCATTCAT CCGTGGCAGG CATAA
|
Protein sequence | MPENYTPAAA ATGTWTEEEI RHQPRAWIRS LTNIDALRSA LNNFLEPLLR KENLRVILTG AGTSAFIGDI IAPWLASHTG KNFSAVPTTD LVTNPMDYLN PAHPLLLISF GRSGNSPESV AAVELANQFV PECYHLPITC NEAGALYQNA INSDNAFALL MPAETHDRGF AMTSSITTMM ASCLAVFAPE TINSQSFRDV ADRCQAILTS LGDFSEGVFG YAPWKRIVYL GSGGLQGAAR ESALKVLELT AGKLAAFYDS PTGFRHGPKS LVDNETLVVV FVSSHPYTRQ YDLDLLAELR RDNQALRVIA IAAESNDVIT AGPHIILPPS RHFIDVEQAF CFLMYAQTFA LMQSLHMGNT PDTPSASGTV NRVVQGVIIH PWQA
|
| |