Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_0021 |
Symbol | |
ID | 6068488 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 21790 |
End bp | 23112 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641599426 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001723036 |
Protein GI | 170018082 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAAAT TCTCAGGGGT TGTCGCAGGC GGTGGAAGCA CCTTTACGCC AGGCATCGTG TTGATGCTCC TGGCGAATCA GGACCGTTTC CCGCTTCGTG CACTGAAATT TTATGATAAC GATGGTGCGC GGCAGGAAGT GATTGCCGAA GCCTGTAAAG TCATCCTTAA AGAAAAAGCG CCGGACATTG CGTTTAGTTA CACCACCGAT CCTGAAGTGG CATTCAGCGA CGTTGATTTT GTCATGGCGC ACATCCGCGT CGGCAAATAC CCGATGCGCG AACTGGATGA AAAAATCCCG CTGCGCCACG GCGTTGTTGG TCAGGAAACT TGCGGACCCG GCGGAATAGC GTACGGCATG CGTTCCATTG GCGGCGTCCT GGAACTGGTG GATTATATGG AAAAATATTC ACCAAATGCC TGGATGCTCA ACTACTCCAA CCCGGCAGCC ATTGTCGCAG AAGCCACGCG TCGTCTGCGC CCGAATGCGA AAATCCTCAA CATCTGTGAC ATGCCAATCG GTATTGAAAG CCGGATGGCG CAAATTGTTG GGCTGCAAGA TCGCAAACAG ATGCGCGTGC GCTACTACGG CCTGAACCAC TTTGGCTGGT GGACATCAAT TGAAGATTTG CAGGGCAACG ACCTGATGCC CCAGCTGCGG CAATATGTCT CTAAGCATGG TTATGTTCCA CCGCAGCAAG ATACGCATAC TGAAGCGAGC TGGAACGACA CCTATGCAAA AGCGCGGGAT GTCCAGGCAC TGGCCCCGGA TACATTACCA AACACCTATC TGAAATATTA TCTCTTCCCG GATTACGTCG TTCAGCATTC CAACCCTGAA CATACCCGCG CGAATGAGGT GATGGAACAT CGCGAGAAAC AGGTTTTCGA TGCTTGCCGC GCCATTACGG CGGCAGGAAA TTCAGCGGCG GGCAAGCTGG AAATTGACGA ACATGCGTCA TACATCGTCG ATCTGGCGGC GGCAATTGCC TTCAACACTC AGGAGCGGAT GTTGCTGATT GTGCCTAACA ACGGGGCAAT TCATAACTTT GATGATGAAG CGATGGTCGA GATCCCGTGT CTGGTTGGGC ACAACGGACC AGAACCACTG GTGGTCGGCG ATATCCCGCA GTTTCAGAAA GGGTTAATGA GTCAGCAAGT GGCGGTGGAA AAACTGGTCG TGGACGCCTG GGAACAGCGT TCATATCAGC ACCTGTGGCA GGCGATTACG TTGTCGAAAA CGGTACCGAG CGCCTCGGTC GCCAAAGCTA TTCTGGATGA ATTGCTGGAG GCCAACAAAG CGTACTGGCC AGAGTTACGT TAA
|
Protein sequence | MTKFSGVVAG GGSTFTPGIV LMLLANQDRF PLRALKFYDN DGARQEVIAE ACKVILKEKA PDIAFSYTTD PEVAFSDVDF VMAHIRVGKY PMRELDEKIP LRHGVVGQET CGPGGIAYGM RSIGGVLELV DYMEKYSPNA WMLNYSNPAA IVAEATRRLR PNAKILNICD MPIGIESRMA QIVGLQDRKQ MRVRYYGLNH FGWWTSIEDL QGNDLMPQLR QYVSKHGYVP PQQDTHTEAS WNDTYAKARD VQALAPDTLP NTYLKYYLFP DYVVQHSNPE HTRANEVMEH REKQVFDACR AITAAGNSAA GKLEIDEHAS YIVDLAAAIA FNTQERMLLI VPNNGAIHNF DDEAMVEIPC LVGHNGPEPL VVGDIPQFQK GLMSQQVAVE KLVVDAWEQR SYQHLWQAIT LSKTVPSASV AKAILDELLE ANKAYWPELR
|
| |