Gene Information |
Locus tag | EcHS_A3893 |
Symbol | glvA |
ID | 5595083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3887836 |
End bp | 3889158 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640923001 |
Product | maltose-6'-phosphate glucosidase |
Protein accession | YP_001460478 |
Protein GI | 157163160 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
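The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal check that assumes Start bp and End bp are 1-based and inclusive (an assumption, though the listed gene length agrees with it) and that the CDS ends in a single stop codon. The helper names gene_length and protein_length are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag EcHS_A3893.
length = gene_length(3887836, 3889158)
print(length, protein_length(length))  # prints: 1323 440
```

Both results match the Gene Length (1323 bp) and Protein Length (440 aa) rows of the record.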
Plasmid Coverage information |
Num covering plasmid clones | 65 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCAAAT TCTCAGTGGT TGTCGCAGGC GGTGGAAGCA CCTTTACGCC AGGCATCGTG TTGATGCTCC TGGCGAATCA GGACCGTTTC CCGCTTCGTG CACTGAAATT TTATGATAAC GATGGTGCGC GGCAGGAAGT GATTGCCGAA GCCTGTAAAG TCATCCTTAA AGAGAAAGCG CCGGACATTG CGTTTAGTTA CACCACCGAT CCTGAAGTGG CATTCAGCGA CGTTGATTTT GTCATGGCGC ACATCCGCGT CGGCAAGTAC CCGATGCGCG AACTGGATGA AAAAATCCCA CTGCGCCACG GTGTTGTTGG TCAGGAAACT TGCGGACCCG GCGGAATAGC GTACGGCATG CGTTCCATTG GCGGCGTCCT GGAACTGGTG GATTATATGG AAAAATATTC ACCAAACGCC TGGATGCTCA ACTACTCCAA CCCGGCAGCA ATTGTCGCAG AAGCCACGCG TCGTCTGCGC CCGAATGCGA AAATCCTCAA CATCTGTGAC ATGCCAATCG GTATTGAAAG CCGGATGGCG CAAATTGTTG GGCTGCAAGA TCGCAAACAG ATGCGCGTGC GCTACTACGG CCTGAACCAC TTTGGCTGGT GGACATCAAT TGAAGATTTG CAGGGCAACG ACCTGATGCC CCAGCTGCGG CAATATGTCT CTAAGCACGG TTATGTTCCA CCGCTGCAAG ATACACATAC TGAAGCGAGC TGGAACGACA CCTATGCAAA AGCGCGGGAT GTCCAGGCAC TGGACCCGGA TACATTACCA AACACCTATC TGAAATATTA TCTCTTCCCG GATTATGTCG TTCAGCATTC CAACCCTGAA CATACCCGCG CGAATGAGGT GATGGAACAT CGCGAGAAAC AGGTTTTCGA TGCTTGCCGC GCCATTACGG CGGCAGGAAA TTCAGCGGCG GGCAAGCTGG AAATTGACGA ACATGCGTCA TACATCGTCG ATCTGGCGGC GGCAATTGCC TTCAACACTC AGGAGCGGAT GTTGCTGATT GTGCCTAACA ACGGGGCAAT TCATAACTTT GATGATGAAG CGATGGTCAA AATCCCGTGT CTGGTCGGGC ACAACGGACC AGAACCACTG GTGGTCGGCG ATATCCCGCA GTTTCAGAAA GGGTTAATGA GTCAGCAAGT GGCGGTGGAA AAACTGGTCG TGGACGCCTG GGAACAGCGT TCATATCAGC ACCTGTGGCA GGCGATTACG TTATCGAAAA CGGTACCGAG CGCCTCGGTC GCCAAAGCTA TTCTGGATGA GTTGCTGGAG GCCAACAAAG CGTACTGGCC AGAGTTACGT TAA
|
Protein sequence | MTKFSVVVAG GGSTFTPGIV LMLLANQDRF PLRALKFYDN DGARQEVIAE ACKVILKEKA PDIAFSYTTD PEVAFSDVDF VMAHIRVGKY PMRELDEKIP LRHGVVGQET CGPGGIAYGM RSIGGVLELV DYMEKYSPNA WMLNYSNPAA IVAEATRRLR PNAKILNICD MPIGIESRMA QIVGLQDRKQ MRVRYYGLNH FGWWTSIEDL QGNDLMPQLR QYVSKHGYVP PLQDTHTEAS WNDTYAKARD VQALDPDTLP NTYLKYYLFP DYVVQHSNPE HTRANEVMEH REKQVFDACR AITAAGNSAA GKLEIDEHAS YIVDLAAAIA FNTQERMLLI VPNNGAIHNF DDEAMVKIPC LVGHNGPEPL VVGDIPQFQK GLMSQQVAVE KLVVDAWEQR SYQHLWQAIT LSKTVPSASV AKAILDELLE ANKAYWPELR
|
| |
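The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares for elongation. The names CODON_TABLE, clean, translate, and gc_fraction are illustrative helpers, not the database's own tooling, and only the first and last few blocks of the sequence are used here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spacing between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


head = "ATGACCAAAT TCTCAGTGGT TGTCGCAGGC"      # first three blocks of the gene sequence
tail = "GCCAACAAAG CGTACTGGCC AGAGTTACGT TAA"  # last blocks, including the stop codon

print(translate(head))  # prints: MTKFSVVVAG
print(translate(tail))  # prints: ANKAYWPELR
```

The two results match the first and last ten residues of the protein sequence above. Passing the full gene sequence to gc_fraction would give a value to compare against the listed GC content.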