Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A3332 |
Symbol | agaD |
ID | 5593959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 3336403 |
End bp | 3337194 |
Gene Length | 792 bp |
Protein Length | 263 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640922450 |
Product | PTS system N-acetylgalactosamine-specific transporter subunit IID |
Protein accession | YP_001459943 |
Protein GI | 157162625 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3716] Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IID |
TIGRFAM ID | [TIGR00828] PTS system, mannose/fructose/sorbose family, IID component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 0.40145 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGATCTG AAATCAGTAA AAAAGATATC ACCCGTCTGG GCTTTCGTTC GTCGCTGCTG CAAGCGAGCT TTAACTACGA AAGGATGCAG GCGGGCGGTT TTACTTGGGC GATGTTGCCG ATCCTGAAAA AGATTTATAA GGACGACAAA CCGGGCTTAA GCGCGGCGAT GAAAGATAAC CTCGAATTTA TTAATACCCA CCCGAATCTG GTCGGATTCC TGATGGGGTT ATTAATTTCG ATGGAAGAAA AAGGAGAAAA CCGCGACACC ATTAAAGGCC TCAAAGTGGC ACTGTTTGGC CCAATCGCCG GGATTGGCGA TGCGATTTTC TGGTTTACTT TGTTGCCGAT TATGGCGGGA ATTTGCTCAT CATTTGCCAG CCAGGGAAAC CTGCTGGGGC CGATTCTATT TTTCGCCGTT TACCTGCTTA TCTTTTTCCT GCGCGTCGGC TGGACCCACG TCGGTTATTC AGTCGGCGTG AAGGCGATCG ATAAAGTGCG AGAGAACTCG CAGATGATTG CCCGTTCGGC AACCATCCTC GGGATCACGG TAATCGGCGG GCTGATCGCT TCGTATGTGC ATATTAACGT GGTGACATCG TTTGCCATCG ACAGTACCCA CAGCGTCGCG CTGCAGCAGG ATTTCTTCGA TAAAGTCTTC CCGAACATTT TACCGATGGC CTACACCCTG CTGATGTATT ACTTCCTGCG GGTGAAAAAA GCGCATCCGG TGCTGTTAAT CGGCGTGACT TTTGTGCTCT CTATTGTTTG TTCCGCATTC GGCATTTTGT AA
|
Protein sequence | MGSEISKKDI TRLGFRSSLL QASFNYERMQ AGGFTWAMLP ILKKIYKDDK PGLSAAMKDN LEFINTHPNL VGFLMGLLIS MEEKGENRDT IKGLKVALFG PIAGIGDAIF WFTLLPIMAG ICSSFASQGN LLGPILFFAV YLLIFFLRVG WTHVGYSVGV KAIDKVRENS QMIARSATIL GITVIGGLIA SYVHINVVTS FAIDSTHSVA LQQDFFDKVF PNILPMAYTL LMYYFLRVKK AHPVLLIGVT FVLSIVCSAF GIL
|
| |