Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3432 |
Symbol | agaE |
ID | 6143544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3511306 |
End bp | 3512184 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618261 |
Product | PTS system N-acetylgalactosamine-specific, IID component |
Protein accession | YP_001745410 |
Protein GI | 170683484 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3716] Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IID |
TIGRFAM ID | [TIGR00828] PTS system, mannose/fructose/sorbose family, IID component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATCTA ATCAAACTAC CCTGCCGAAC GTCAGTGAAA ACGAAGAAAC ACTGCTGACA GGTGTCAATG AAAACGTGTA TGAAGATCAG AGCATTGGCG CAGAGCTGAC GAAAAAAGAT ATCAACCGCG TCGCCTGGCG TTCCATGCTG TTACAAGCGT CTTTTAACTA CGAACGTATG CAGGCTTCTG GCTGGTTGTA CGGTCTGCTG CCTGCTCTGA AAAAGATCCA CACTAATAAA CGCGACCTGG CGCGCGCCAT GAAGGGGCAT ATGGGTTTCT TCAATACCCA TCCGTTTCTG GTGACGTTTG TTATCGGCAT TATCCTTGCG ATGGAGCGTT CTAAGCAGGA CGTTAACAGT ATTCAGAGCA CCAAAATTGC CGTCGGTGCG CCGCTCGGCG GGATTGGCGA TGCGATGTTC TGGCTAACGC TACTGCCGAT TTGTGGCGGG ATAGGTGCAA GCCTCGCGCT ACAAGGCTCT ATTCTTGGCG CTGTCGTCTT TATTGTGCTG TTCAACGTGG TGCACCTGGG GCTGCGTTTT GGTCTGGCGC ATTATGCTTA CCGCATGGGC GTGGCGGCGA TTCCACTCAT TAAAGCAAAT ACCAAAAAAG TCGGCCATGC GGCATCTATC GTTGGGATGA CGGTAATCGG CGCGCTGGTG GCAACCTATG TTCGTTTAAG CACCACGCTG GAAATCACCG CGGGCGACGC AGTGGTTAAG TTACAGGCTG ATGTTATCGA CAAACTGATG CCAGCCTTCT TACCGCTGGT CTACACCCTG ACCATGTTCT GGCTGGTACG CCGCGGCTGG AGTCCGCTGC GCCTGATTGC GGTTACCGTG GTTCTCGGCA TCGTCGGTAA ATTCTGCCAT TTCCTTTAA
|
Protein sequence | MASNQTTLPN VSENEETLLT GVNENVYEDQ SIGAELTKKD INRVAWRSML LQASFNYERM QASGWLYGLL PALKKIHTNK RDLARAMKGH MGFFNTHPFL VTFVIGIILA MERSKQDVNS IQSTKIAVGA PLGGIGDAMF WLTLLPICGG IGASLALQGS ILGAVVFIVL FNVVHLGLRF GLAHYAYRMG VAAIPLIKAN TKKVGHAASI VGMTVIGALV ATYVRLSTTL EITAGDAVVK LQADVIDKLM PAFLPLVYTL TMFWLVRRGW SPLRLIAVTV VLGIVGKFCH FL
|
| |