Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A3331 |
Symbol | agaC |
ID | 5593958 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 3335610 |
End bp | 3336413 |
Gene Length | 804 bp |
Protein Length | 267 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640922449 |
Product | PTS system N-acetylgalactosamine-specific transporter subunit IIC |
Protein accession | YP_001459942 |
Protein GI | 157162624 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3715] Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIC |
TIGRFAM ID | [TIGR00822] PTS system, mannose/fructose/sorbose family, IIC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 0.5681 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATGAAA TAACCCTACT TCAGGGATTA TCCCTGGCGG CGTTAGTTTT TGTTCTGGGG ATTGATTTTT GGCTGGAAGC CTTATTTTTA TTCCGCCCGA TAATCGTTTG TACCCTAACT GGCGCTATTC TCGGTGATAT TCAGACTGGC TTAATTACCG GTGGTCTGAC AGAGTTGGCT TTCGCCGGAT TAACCCCTGC AGGTGGTGTT CAGCCGCCCA ACCCGATTAT GGCGGGTCTG ATGACCACCG TCATTGCATG GTCTACGGGC GTTGATGCCA AAACAGCAAT TGGTCTTGGC CTGCCGTTTA GTTTGTTAAT GCAGTACGTC ATTCTGTTCT TCTATTCCGC TTTCTCATTA TTTATGACCA AAGCCGATAA ATGCGCGAAA GAGGCGGATA CGGCAGCGTT TTCCCGGCTT AACTGGACAA CGATGCTCAT CGTCGCTTCA GCGTATGCGG TGATTGCTTT CCTCTGTACT TACCTGGCAC AGGGGGCGAT GCAGGCGCTG GTGAAAGCGA TGCCCGCCTG GCTGACCCAC GGCTTTGAAG TGGCAGGCGG TATTCTGCCT GCCGTTGGTT TTGGCTTGCT GCTGCGCGTA ATGTTCAAAG CGCAATATAT CCCTTACCTG ATCGCCGGTT TCCTGTTTGT TTGCTACATC CAGGTCAGCA ACCTGTTGCC GGTTGCCGTA CTGGGCGCAG GCTTTGCGGT GTATGAGTTT TTCAATGCGA AATCCCGGCA GCAAGCGCAA CCGCAGCCCG TTGCCAGTAA AAATGAAGAA GAGGACTACA GCAATGGGAT CTGA
|
Protein sequence | MHEITLLQGL SLAALVFVLG IDFWLEALFL FRPIIVCTLT GAILGDIQTG LITGGLTELA FAGLTPAGGV QPPNPIMAGL MTTVIAWSTG VDAKTAIGLG LPFSLLMQYV ILFFYSAFSL FMTKADKCAK EADTAAFSRL NWTTMLIVAS AYAVIAFLCT YLAQGAMQAL VKAMPAWLTH GFEVAGGILP AVGFGLLLRV MFKAQYIPYL IAGFLFVCYI QVSNLLPVAV LGAGFAVYEF FNAKSRQQAQ PQPVASKNEE EDYSNGI
|
| |