Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4449 |
Symbol | agaE |
ID | 6970580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4123788 |
End bp | 4124666 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643388169 |
Product | PTS system N-acetylgalactosamine-specific, IID component |
Protein accession | YP_002272606 |
Protein GI | 209400408 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3716] Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IID |
TIGRFAM ID | [TIGR00828] PTS system, mannose/fructose/sorbose family, IID component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.340341 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATCTA ATCAAACCAC CCTGCCGAAC GTCAGTGAAA ACGAAGAAAC ATTGCTGACT GGCGTCAATG AAAACGTGTA TGAAGATCAG AGCATTGGCG CGGAGCTGAC GAAAAAAGAT ATCAATCGCG TCGCCTGGCG TTCCATGCTG TTACAAGCGT CTTTTAACTA CGAACGTATG CAGGCTTCCG GCTGGTTGTA CGGTCTGCTG CCTGCTCTGA AAAAGATCCA CACTAATAAA CGCGACCTGG CGCGCGCCAT GAAGGGGCAT ATGGGTTTCT TCAATACCCA TCCGTTTCTG GTGACATTTG TTATCGGCAT TATCCTTGCG ATGGAGCGTT CTAAGCAGGA CGTTAACAGT ATTCAGAGCA CCAAAATTGC CGTCGGTGCG CCGCTCGGCG GGATTGGCGA TGCAATGTTC TGGCTAACGC TACTGCCGAT TTGTGGCGGG ATTGGAGCCA GTCTGGCATT GCAAGGCTCC ATTCTTGGCG CAGTCGTCTT TATTGTGCTG TTCAACGTGG TCCACCTGGG GCTGCGTTTT GGTCTGGCGC ATTATGCTTA CCGCATGGGC GTGGCGGCGA TTCCACTCAT TAAAGCTAAT ACCAAAAAAG TCGGCCATGC TGCGTCTATC GTTGGGATGA CGGTAATCGG TGCGCTGGTG GCAACCTATG TTCGTTTAAG CACCACGCTG GAAATCACAG CGGGCGACGC AGTGGTTAAG TTACAGGCTG ATGTTATCGA CAAACTGATG CCTGCCTTCT TACCGTTGGT CTACACCCTG ACCATGTTTT GGTTGGTACG CCGCGGCTGG AGTCCGCTGC GTCTGATTGC AGTTACCGTG GTTCTCGGCA TCGTTGGTAA ATTCTGCCAT TTCCTTTAA
|
Protein sequence | MASNQTTLPN VSENEETLLT GVNENVYEDQ SIGAELTKKD INRVAWRSML LQASFNYERM QASGWLYGLL PALKKIHTNK RDLARAMKGH MGFFNTHPFL VTFVIGIILA MERSKQDVNS IQSTKIAVGA PLGGIGDAMF WLTLLPICGG IGASLALQGS ILGAVVFIVL FNVVHLGLRF GLAHYAYRMG VAAIPLIKAN TKKVGHAASI VGMTVIGALV ATYVRLSTTL EITAGDAVVK LQADVIDKLM PAFLPLVYTL TMFWLVRRGW SPLRLIAVTV VLGIVGKFCH FL
|
| |