Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3439 |
Symbol | agaD |
ID | 6146039 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3517619 |
End bp | 3518410 |
Gene Length | 792 bp |
Protein Length | 263 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641618268 |
Product | PTS system N-acetylgalactosamine-specific transporter subunit IID |
Protein accession | YP_001745417 |
Protein GI | 170681996 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3716] Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IID |
TIGRFAM ID | [TIGR00828] PTS system, mannose/fructose/sorbose family, IID component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGATCTG AAATCAGTAA AAAAGATATC ACCCGTCTGG GCTTTCGTTC GTCACTGCTG CAAGCGAGCT TTAACTACGA AAGGATGCAG GCGGGCGGTT TTACCTGGGC GATGTTGCCG ATCCTGAAAA AGATTTATAA GGACGACAAA CCGGGCCTAA GCGCGGCGAT GAAAGATAAC CTCGAATTTA TTAATACCCA CCCGAATCTG GTCGGATTCC TGATGGTGTT ATTAATTTCG ATGGAAGAAA AAGGAGAAAA CCGCGACACC ATTAAAGGCC TCAAAGTGGC ACTGTTTGGC CCAATCGCCG GGATTGGCGA TGCGATTTTC TGGTTTACCT TATTGCCGAT TATGGCGGGA ATTTGCTCAT CATTTGCCAG CCAGGGAAAC CTGTTGGGGC CGATTTTGTT TTTCGCCGTT TACCTGCTTA TCTTCTTCCT GCGCGTCGGC TGGACCCACG TCGGTTATTC AGTCGGCGTG AAGGCGATCG ATAAAGTGCG AAAGAACTCG CAGATGATTG CTCGTTCGGC AACCATCCTC GGGATCACGG TAATCGGCGG GCTGATCGCT TCGTATGTGC ATATTAACGT GGTGACATCG TTTGCCATCG ACAGTACCCA CAGCGTCGCA CTGCAGCAGG ATTTCTTCGA TAAAGTCTTC CCGAACATTT TACCGATGGC CTACACCCTG CTGATGTATT ACTTCCTGCG GGTGAAAAAA GCGCATCCGG TGCTGTTAAT CGGCGTGACT TTTGTGCTCT CTATTGTTTG TTCCGCATTC GGCATTTTGT AA
|
Protein sequence | MGSEISKKDI TRLGFRSSLL QASFNYERMQ AGGFTWAMLP ILKKIYKDDK PGLSAAMKDN LEFINTHPNL VGFLMVLLIS MEEKGENRDT IKGLKVALFG PIAGIGDAIF WFTLLPIMAG ICSSFASQGN LLGPILFFAV YLLIFFLRVG WTHVGYSVGV KAIDKVRKNS QMIARSATIL GITVIGGLIA SYVHINVVTS FAIDSTHSVA LQQDFFDKVF PNILPMAYTL LMYYFLRVKK AHPVLLIGVT FVLSIVCSAF GIL
|
| |