Gene Information |
Locus tag | EcHS_A3894 |
Symbol | |
ID | 5595084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3889158 |
End bp | 3891035 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640923002 |
Product | PTS system, alpha-glucoside-specific IIBC component |
Protein accession | YP_001460479 |
Protein GI | 157163161 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific; [COG1264] Phosphotransferase system IIB components |
TIGRFAM ID | [TIGR00826] PTS system, glucose-like IIB component; [TIGR00852] PTS system, maltose and glucose-specific subfamily, IIC component; [TIGR02005] PTS system, alpha-glucoside-specific IIBC component |
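The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function name `expected_lengths` is hypothetical and not part of any database API.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is counted in the gene length but not in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Start bp and End bp copied from the record above.
print(expected_lengths(3889158, 3891035))  # (1878, 625)
```

Under those assumptions the result agrees with the Gene Length (1878 bp) and Protein Length (625 aa) fields.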
Plasmid Coverage information |
Num covering plasmid clones | 66 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCCAC AAGTTCTGCT TAATCGATTG AAAAAACATC TTTTTTTTAA AGATGTGTTC GATGCTGTGT ACCAGTGCTC ACAGATGTCT ACTTTTTCGC GAAAACGTAG ATCTCTACCG CCCAACGAAA AGCATGAAAG CGATCACGAA TCCCATTGGG TCGTCATGTT CTGCCGATCG CATCTTCCTA TCCTCGCTCC AGGCCTGCCG CATAACCAAT CAGGCTTCCT ACTTACAGAA TTGAGAAAAG AGGATGTGGA AATGCTCAGT CAAATTCAAC GCTTTGGCGG CGCGATGTTC ACGCCAGTGC TGCTGTTTCC CTTCGCCGGG ATTGTGGTGG GTCTTGCCAT CTTGCTGCAA AACCCGATGT TTGTCGGGGA ATCACTGACC GATCCGAACA GTTTATTCGC GCAAATCGTA CACATTATTG AAGAGGGCGG TTGGACGGTA TTCCGTAATA TGCCGCTGAT TTTTGCTGTC GGTTTACCCA TTGGCCTTGC TAAGCAAGCG CAGGGGCGTG CTTGTCTGGC GGTGATGGTG AGTTTCCTGA CCTGGAACTA TTTCATCAAC GCGATGGGAA TGACCTGGGG AAGCTACTTC GGCGTCGATT TCACTCAGGA CGCGGTGGCA GGTAGCGGTC TGACAATGAT GGCCGGGATT AAAACCCTCG ATACCAGCAT TATCGGCGCA ATTATCATTT CCGGCATTGT GACGGCGCTG CATAACCGTC TGTTCGATAA AAAACTGCCG GTTTTTCTCG GCATTTTCCA GGGGACGTCT TATGTGGTGA TTATCGCCTT CCTGGTGATG ATCCCCTGTG CCTGGCTGAC GTTGCTCGGC TGGCCAAAAG TACAAATGGG GATTGAATCT CTGCAAGCGT TCCTGCGTTC GGCGGGTGCA CTTGGGGTCT GGGTTTACAC CTTCCTCGAA CGTATTCTGA TCCCAACCGG TTTACACCAC TTCATCTACG GACAGTTTAT CTTTGGTCCG GCAGCTGTTG AAGGCGGCAT TCAGATGTAC TGGGCGCAGC ATCTGCAAGA GTTCAGTTTG AGCGCCGAGC CGCTGAAATC GTTGTTCCCG GAAGGCGGTT TTGCCCTGCA CGGTAACTCA AAAATCTTTG GTGCCGTGGG CATTTCTTTA GCGATGTACT TCACTGCCGC ACCGGAAAAT CGGGTAAAAG TGGCGGGCTT GCTGATTCCC GCAACCTTAA CCGCCATGCT GGTGGGAATT ACCGAACCGC TGGAATTTAC CTTCCTGTTC ATTTCACCGT TGCTGTTTGC GGTACACGCC GTGCTGGCGG CCTCAATGTC GACCGTAATG TATCTCTTTG GTGTGGTGGG CAACATGGGC GGAGGTCTGA TTGACCAGGT TTTACCGCAA AACTGGATCC CGATGTTCAG CAACCACGCG GATATGATGC TGACCCAAAT CGCCATTGGG TTGTGCTTTA CCCTGCTGTA CTTCGTGGTT TTCCGCACAC TGATTCTGCA GTTCAACATG TGCACGCCGG GACGTGAAGA TGCGGAAGTG AAACTCTACT CAAAAGCCGA ATACAAAGCC TCGCGAGGCC AAACCACCGC GGCAGAGCCA AAAAAAGAGC TGGATCAGGC TGCCGGTATC CTGCAAGCCC TGGGCGGGGT CGGCAATATC TCCAGCATTA ACAATTGCGC GACGCGTTTA CGTATTGCAC TGCATGACAT GTCACAAACG CTGGATGACG AAGTCTTTAA AAAGCTGGGA GCGCACGGCG TCTTCCGTAG TGGCGATGCC ATTCAGGTGA TCATTGGTCT GCATGTATCC 
CAGCTGCGTG AACAGCTCGA TAGCTTAATT AATTCTCATC AATCAGCAGA AAATGTTGCC ATTACGGAGG CAGTATAA
|
Protein sequence | MRPQVLLNRL KKHLFFKDVF DAVYQCSQMS TFSRKRRSLP PNEKHESDHE SHWVVMFCRS HLPILAPGLP HNQSGFLLTE LRKEDVEMLS QIQRFGGAMF TPVLLFPFAG IVVGLAILLQ NPMFVGESLT DPNSLFAQIV HIIEEGGWTV FRNMPLIFAV GLPIGLAKQA QGRACLAVMV SFLTWNYFIN AMGMTWGSYF GVDFTQDAVA GSGLTMMAGI KTLDTSIIGA IIISGIVTAL HNRLFDKKLP VFLGIFQGTS YVVIIAFLVM IPCAWLTLLG WPKVQMGIES LQAFLRSAGA LGVWVYTFLE RILIPTGLHH FIYGQFIFGP AAVEGGIQMY WAQHLQEFSL SAEPLKSLFP EGGFALHGNS KIFGAVGISL AMYFTAAPEN RVKVAGLLIP ATLTAMLVGI TEPLEFTFLF ISPLLFAVHA VLAASMSTVM YLFGVVGNMG GGLIDQVLPQ NWIPMFSNHA DMMLTQIAIG LCFTLLYFVV FRTLILQFNM CTPGREDAEV KLYSKAEYKA SRGQTTAAEP KKELDQAAGI LQALGGVGNI SSINNCATRL RIALHDMSQT LDDEVFKKLG AHGVFRSGDA IQVIIGLHVS QLREQLDSLI NSHQSAENVA ITEAV
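The relationship between the gene sequence and the protein sequence above can be sketched as a codon-by-codon translation. This is a minimal illustration, not the database's own tooling: the names `CODON_TABLE` and `translate` are hypothetical, and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code (the tables differ in which start codons are permitted, which does not matter here because the gene begins with ATG).

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base groups used in the record
    can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record above.
print(translate("ATGCGCCCAC AAGTTCTGCT TAATCGATTG"))  # MRPQVLLNRL
```

The printed residues match the first ten residues of the protein sequence. Because the record already lists the gene on its coding strand (it starts with ATG even though the locus is on the minus strand), no reverse-complement step is needed in this sketch.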