Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4494 |
Symbol | treB |
ID | 5593313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 4499974 |
End bp | 4501392 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640923592 |
Product | PTS system trehalose(maltose)-specific transporter subunits IIBC |
Protein accession | YP_001461033 |
Protein GI | 157163715 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific |
TIGRFAM ID | [TIGR00826] PTS system, glucose-like IIB component [TIGR00852] PTS system, maltose and glucose-specific subfamily, IIC component [TIGR01992] PTS system, trehalose-specific IIBC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 54 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAAA TAAACCAAAC GGATATCGAT CGGTTGATTG AACTGGTCGG CGGGCGCGGC AATATTGCGA CGGTGAGCCA CTGTATTACT CGCCTGCGCT TTGTCCTCAA CCAACCGGCC AATGCCAGAC CGAAAGAAAT TGAGCAACTC CCCATGGTGA AAGGCTGTTT CACCAATGCC GGGCAATTTC AGGTGGTGAT TGGCACCAAC GTGGGTGATT ACTATCAAGC ACTAATAGCG TCAACCGGAC AGGCGCAGGT TGATAAAGAG CAGGTAAAAA AAGCCGCCCG GCAGAATATG AAATGGCATG AGCAGTTGAT CTCTCATTTC GCGGAGATCT TCTTCCCGTT GCTGCCCGCG TTGATTAGCG GCGGTTTGAT CCTCGGTTTT CGCAATGTGA TCGGCGATTT GCCCATGAGC AACGGTCAAA CGCTGGCGCA AATGTACCCT TCCCTGCAAA CGATCTACGA TTTTCTGTGG TTGATCGGTG AAGCGATCTT CTTCTACCTG CCGGTCGGGA TTTGCTGGTC AGCGGTGAAA AAAATGGGCG GCACGCCGAT CCTTGGTATC GTGCTTGGCG TGACACTGGT TTCCCCCCAG CTGATGAACG CTTATCTGCT CGGGCAGCAG CTGCCGGAAG TGTGGGACTT TGGCATGTTC AGCATCGCCA AAGTGGGCTA TCAGGCGCAG GTGATCCCGG CACTGTTAGC CGGGCTGGCG CTGGGCGTTA TTGAAACTCG CCTTAAACGC ATCGTACCGG ATTACCTCTA TCTGGTGGTG GTGCCCGTCT GTTCGCTGAT CCTCGCGGTG TTCCTCGCCC ATGCGCTGAT TGGTCCGTTT GGTCGCATGA TTGGCGATGG CGTTGCCTTT GCGGTACGTC ACCTGATGAC CGGCAGCTTT GCTCCGATTG GTGCGGCATT GTTTGGCTTC CTGTACGCGC CGCTGGTGAT CACCGGCGTA CACCAGACCA CCCTTGCTAT TGATTTGCAG ATGATTCAAA GCATGGGTGG CACGCCAGTG TGGCCGCTGA TTGCGCTGTC GAATATCGCT CAGGGCTCCG CCGTGATAGG CATTATCATT TCCAGCCGCA AGCACAATGA ACGCGAGATC TCCGTGCCTG CCGCTATCTC CGCCTGGCTT GGGGTCACTG AGCCTGCAAT GTACGGCATC AACCTGAAAT ATCGCTTCCC GATGCTGTGC GCGATGATTG GTTCTGGTCT GGCAGGATTA CTATGCGGCC TGAACGGCGT TATGGCGAAT GGTATCGGCG TAGGCGGCCT GCCGGGAATT CTCTCGATTC AACCGAGCTA CTGGCAGGTA TTTGCGCTGG CAATGGTTAT CGCCATCATC ATCCCGATTG TACTCACCTC GTTTATCTAT CAGCGGAAAT ACCGCCTGGG CACGCTGGAT ATTGTTTAA
|
Protein sequence | MSKINQTDID RLIELVGGRG NIATVSHCIT RLRFVLNQPA NARPKEIEQL PMVKGCFTNA GQFQVVIGTN VGDYYQALIA STGQAQVDKE QVKKAARQNM KWHEQLISHF AEIFFPLLPA LISGGLILGF RNVIGDLPMS NGQTLAQMYP SLQTIYDFLW LIGEAIFFYL PVGICWSAVK KMGGTPILGI VLGVTLVSPQ LMNAYLLGQQ LPEVWDFGMF SIAKVGYQAQ VIPALLAGLA LGVIETRLKR IVPDYLYLVV VPVCSLILAV FLAHALIGPF GRMIGDGVAF AVRHLMTGSF APIGAALFGF LYAPLVITGV HQTTLAIDLQ MIQSMGGTPV WPLIALSNIA QGSAVIGIII SSRKHNEREI SVPAAISAWL GVTEPAMYGI NLKYRFPMLC AMIGSGLAGL LCGLNGVMAN GIGVGGLPGI LSIQPSYWQV FALAMVIAII IPIVLTSFIY QRKYRLGTLD IV
|
| |