Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4045 |
Symbol | glvA |
ID | 6143831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4137430 |
End bp | 4138752 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618871 |
Product | maltose-6'-phosphate glucosidase |
Protein accession | YP_001746009 |
Protein GI | 170680175 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAAAT TCTCAGTGGT TGTCGCAGGC GGTGGAAGCA CCTTTACGCC AGGCATCGTG TTGATGCTCC TGGCGAATCA GGACCGTTTC CCGCTTCGTG CACTGAAATT TTATGATAAC GATGGTGCGC GGCAGGAAGT GATTGCCGAA GCCTGTAAAG TCATCCTTAA AGAAAAAGCG CCGGACATTG CGTTTAGTTA CACCAACGAT CCTGAAGTGG CATTCAGCGA CGTTGATTTT GTCATGGCGC ACATCCGCGT CGGCAAATAC CCGATGCGCG AACTGGATGA AAAAATCCCG CTGCGCCACG GCGTTGTTGG TCAGGAAACA TGCGGACCCG GCGGAATAGC GTACGGCATG CGTTCCATTG GCGGCGTCCT GGAACTGGTG GATTATATGG AAAAATATTC ACCAAATGCC TGGATGCTCA ACTACTCCAA CCCGGCAGCC ATTGTCGCAG AAGCCACGCG TCGTCTGCGC CCGAATGCGA AAATCCTCAA CATCTGTGAC ATGCCAATCG GTATTGAAAG CCGGATGGCG CAAATTGTTG GGCTGCAAGA TCGCAAACAG ATGCGCGTGC GCTACTACGG CCTGAACCAC TTTGGCTGGT GGACATCAAT TGAAGATTTG CAGGGCAACG ACCTGATGCC CCAGCTGCGG CAATATGTCT CTAAGCACGG TTATGTTCCA CCGCAGCAAG ATACACATAC TGAAGCGAGC TGGAACGACA CCTATGCAAA AGCGCGGGAT GTCCAGGCAC TGGACCCGGA TACATTGCCG AACACCTATC TGAAATATTA TCTCTTCCCG GATTATGTCG TTCAGCATTC CAACCCTGAA CATACCCGCG CGAATGAGGT GATGGAGCAT CGCGAGAAAC AGGTTTTCGA TGCTTGCCGC GCCATTACGG CGGCAGGAAA TTCAGCAGCG GGCAAGCTGG AAATTGACGA ACATGCGTCA TACATCGTCG ATCTGGCGGC GGCAATTGCC TTCAACACTC AGGAGCGGAT GTTGCTGATT GTGCCTAACA ACGGGGCAAT TCATAACTTT GATGATGAAG CGATGGTCGA GATCCCGTGT CTGGTCGGGC ACAACGGACC AGAACCACTG GTGGTCGGCG AGATCCCGCA GTTTCAGAAA GGGTTAATGA GTCAGCAAGT GGCGGTGGAA AAACTGGTCG TGGACGCCTG GGAGCAGCGT TCATATCAGC ACCTGTGGCA GGCGATTACG TTGTCGAAAA CGGTACCGAG CGCCGCGGTC GCCAAAGCTA TTCTGGATGA GTTGCTGGAG GCCAACAAAG CGTACTGGCC AGAGTTACGT TAA
|
Protein sequence | MTKFSVVVAG GGSTFTPGIV LMLLANQDRF PLRALKFYDN DGARQEVIAE ACKVILKEKA PDIAFSYTND PEVAFSDVDF VMAHIRVGKY PMRELDEKIP LRHGVVGQET CGPGGIAYGM RSIGGVLELV DYMEKYSPNA WMLNYSNPAA IVAEATRRLR PNAKILNICD MPIGIESRMA QIVGLQDRKQ MRVRYYGLNH FGWWTSIEDL QGNDLMPQLR QYVSKHGYVP PQQDTHTEAS WNDTYAKARD VQALDPDTLP NTYLKYYLFP DYVVQHSNPE HTRANEVMEH REKQVFDACR AITAAGNSAA GKLEIDEHAS YIVDLAAAIA FNTQERMLLI VPNNGAIHNF DDEAMVEIPC LVGHNGPEPL VVGEIPQFQK GLMSQQVAVE KLVVDAWEQR SYQHLWQAIT LSKTVPSAAV AKAILDELLE ANKAYWPELR
|
| |