Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3438 |
Symbol | agaC |
ID | 6147501 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3516826 |
End bp | 3517629 |
Gene Length | 804 bp |
Protein Length | 267 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641618267 |
Product | PTS system N-acetylgalactosamine-specific transporter subunit IIC |
Protein accession | YP_001745416 |
Protein GI | 170681846 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3715] Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIC |
TIGRFAM ID | [TIGR00822] PTS system, mannose/fructose/sorbose family, IIC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATGAAA TAACCCTACT CCAGGGATTA TCCCTGGCGG CGTTAGTTTT CTTTCTGGGG ATTGATTTTT GGCTGGAAGC CTTATTTTTA TTCCGCCCGA TAATCGTTTG TACCCTTACA GGTGCTATTC TCGGTGATAT TCAGACTGGC TTAATTACCG GTGGCCTGAC AGAGTTGGCT TTCGCCGGAT TAACCCCTGC AGGTGGTGTT CAGCCGCCCA ACCCTATTAT GGCGGGTCTG ATGACCACTG TCATTGCATG GTCTACGGGC GTTGATGCCA AAACGGCAAT TGGTCTTGGC CTGCCGTTTA GTTTGTTAAT GCAGTACGTC ATTCTGTTCT TCTATTCCGC TTTCTCATTA TTTATGACCA AAGCCGATAA ATGCGCAAAA GAGGCGGATA CGGCAGCATT TTCCCGACTT AACTGGACAA CGATGCTCAT CGTCGCTTCA GCGTATGCGG TGATTGCTTT CCTCTGTACT TACCTGGCAC AAGGGGCGAT GCAGGCGCTG GTGAAAGCGA TGCCCGCCTG GCTGACCCAC GGCTTTGAAG TGGCTGGCGG TATTCTGCCT GCCGTTGGTT TTGGCTTGCT GCTGCGCGTG ATGTTCAAAG CGCAATATAT CCCTTACCTG ATCGCCGGTT TCCTGTTCGT TTGCTACATC CAGGTCAGCA ACCTGTTGCC GGTTGCCGTG CTGGGCGCAG GCTTTGCGGT GTATGAGTTT TTCAATGCGA AATCCCGGCA GCAAGCGCAA CCGCAGCCCG TTGCCAGTAA AAATGAAGAA GAGGACTACA GCAATGGGAT CTGA
|
Protein sequence | MHEITLLQGL SLAALVFFLG IDFWLEALFL FRPIIVCTLT GAILGDIQTG LITGGLTELA FAGLTPAGGV QPPNPIMAGL MTTVIAWSTG VDAKTAIGLG LPFSLLMQYV ILFFYSAFSL FMTKADKCAK EADTAAFSRL NWTTMLIVAS AYAVIAFLCT YLAQGAMQAL VKAMPAWLTH GFEVAGGILP AVGFGLLLRV MFKAQYIPYL IAGFLFVCYI QVSNLLPVAV LGAGFAVYEF FNAKSRQQAQ PQPVASKNEE EDYSNGI
|
| |