Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1898 |
Symbol | |
ID | 6064701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 2098189 |
End bp | 2099541 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641601311 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001724873 |
Protein GI | 170019919 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.696753 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.423209 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGA AATTAAAAGT CGTCACTATT GGTGGCGGGA GCAGCTATAC CCCGGAGTTA CTGGAAGGAT TTATTAAGCG ATACCACGAA TTGCCGGTCA GCGAATTATG GCTGGTGGAT GTCGAAGGTG GTAAAGCTAA ACTGGATATT ATTTTCGATC TCTGCCAACG GATGATTGAT AACGCTGGCG TCCCGATGAA GCTTTATAAA ACGCTGGATC GCCGCGAAGC ATTGAAAGAT GCTGATTTCG TTACTACCCA ACTGCGCGTT GGCCAATTAC CGGCGCGTGA ACTGGATGAA CGTATTCCAT TAAGTCATGG TTATCTTGGT CAGGAAACCA ACGGCGCGGG CGGTTTGTTT AAAGGTCTGC GTACCATTCC GGTGATTTTT GACATCGTAA AAGATGTCGA AGAACTTTGT CCGAATGCAT GGGTGATTAA CTTCACTAAC CCGGCGGGAA TGGTCACTGA AGCCGTTTAT CGTCATACTG GATTTAAACG CTTTATCGGC GTGTGTAATA TTCCGATCGG CATGAAGATG TTTATTCGCG ATGTTCTGAT GCTGAAAGAC AGCGATGATT TATCTATCGA TCTGTTCGGC CTCAACCATA TGGTGTTCAT TAAGGATGTG CTGGTAAATG GCAAGTCGCG CTTTGCCGAA TTGCTTGATG GTGTGGCGTC AGGGCAGTTA AAAGCATCTG GCGTTAAAAA TATTTTCGAT CTGCCATTTA GCGAAGGCTT AATTCGTTCG TTAAATCTGC TGCCATGTTC TTATCTGCTG TATTACTTCA AGCAGAAAGA GATGCTGGCT ATTGAAATGG GCGAATACTA CAAAGGCGGC GCACGAGCAC AGGTAGTACA GAAAGTCGAG AAACAACTTT TTGAGCTGTA TAAAAATCCG GAGTTGAAAG TTAAGCCGAA AGAACTGGAA CAGCGCGGTG GGGCTTATTA CTCTGATGCA GCGTGCGAAG TGATCAACGC TATCTACAAC GACAAGCAAG CTGAACATTA CGTTAATATC CCGCATCATG GGCATATTGA TAATATTCCG GCAGACTGGG CGGTAGAAAT GACCTGTAAG CTGGGGCGCG ATGGCGCGAC GCCACATCCG CGCATTACGC ATTTCGATGA TAAAGTGATG GGGCTGATTC ACACCATTAA AGGCTTCGAG ATTGCTGCCA GCAACGCCGC ACTTAGCGGA GAATTTAACG ATGTGTTACT GGCGCTAAAC CTTAGTCCGT TGGTGCATTC CGATCGCGAT GCTGAGCTGC TGGCACGCGA GATGATTCTG GCGCACGAGA AATGGCTGCC AAACTTTGCC GACTGCATCG CAGAGCTTAA AAAAGCACAT TAA
|
Protein sequence | MSQKLKVVTI GGGSSYTPEL LEGFIKRYHE LPVSELWLVD VEGGKAKLDI IFDLCQRMID NAGVPMKLYK TLDRREALKD ADFVTTQLRV GQLPARELDE RIPLSHGYLG QETNGAGGLF KGLRTIPVIF DIVKDVEELC PNAWVINFTN PAGMVTEAVY RHTGFKRFIG VCNIPIGMKM FIRDVLMLKD SDDLSIDLFG LNHMVFIKDV LVNGKSRFAE LLDGVASGQL KASGVKNIFD LPFSEGLIRS LNLLPCSYLL YYFKQKEMLA IEMGEYYKGG ARAQVVQKVE KQLFELYKNP ELKVKPKELE QRGGAYYSDA ACEVINAIYN DKQAEHYVNI PHHGHIDNIP ADWAVEMTCK LGRDGATPHP RITHFDDKVM GLIHTIKGFE IAASNAALSG EFNDVLLALN LSPLVHSDRD AELLAREMIL AHEKWLPNFA DCIAELKKAH
|
| |