Gene Information |
Locus tag | EcolC_0808 |
Symbol | |
ID | 6065927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 867643 |
End bp | 869076 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641600213 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001723807 |
Protein GI | 170018853 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | |
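
The coordinate and length fields above are mutually consistent, which a short check can show. This is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that Gene Length counts the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(867643, 869076)
print(length)                     # 1434
print(protein_length_aa(length))  # 477
```

The strand field does not affect this arithmetic: a gene on the minus strand spans the same interval on the replicon and is simply read as the reverse complement.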
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.785665 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAAAAAC TCACCTTACC GAAAGATTTC TTATGGGGCG GCGCAGTTGC CGCTCATCAG GTCGAAGGCG GCTGGAACAA AGGCGGAAAA GGGCCGAGCA TTTGTGACGT TCTGACCGGT GGCGCACACG GCGTGCCGCG CGAAATCACC AAAGAAGTCT TGCCAGGAAA ATACTATCCA AACCATGAAG CCGTTGATTT TTATGGTCAC TATAAGGAAG ACATCAAGCT ATTTGCCGAA ATGGGCTTCA AATGTTTTCG TACATCCATT GCCTGGACGC GCATTTTTCC AAAAGGCGAT GAAGCTCAGC CAAACGAAGA AGGGCTGAAG TTCTACGATG ATATGTTCGA TGAACTGCTG AAATACAACA TCGAACCGGT GATCACCCTC TCCCACTTTG AAATGCCGCT GCATCTGGTG CAGCAATACG GTAGCTGGAC CAACCGTAAA GTGGTTGATT TCTTTGTACG TTTCGCGGAA GTGGTATTTG AACGCTATAA GCACAAAGTC AAATACTGGA TGACCTTCAA CGAAATTAAC AACCAGCGTA ACTGGCGTGC ACCGCTGTTC GGTTACTGCT GCTCCGGCGT GGTGTATACC GAGCATGAAA ACCCGGAAGA GACGATGTAT CAGGTGCTGC ATCACCAGTT TGTCGCCAGC GCCCTGGCGG TGAAAGCTGC GCGTCGCATT AACCCGGAGA TGAAAGTCGG CTGTATGCTG GCGATGGTGC CGCTCTATCC TTACTCCTGT AACCCGGACG ATGTGATGTT CGCTCAGGAG TCGATGCGCG AACGCTACGT CTTTACCGAT GTGCAGCTAC GCGGCTATTA CCCGTCCTAT GTGTTGAACG AGTGGGAGCG TCGCGGATTT AACATCAAAA TGGAAGACGG CGATCTGGAT GTGCTGCGTG AAGGCACCTG CGATTATCTT GGTTTCAGCT ATTACATGAC CAATGCAGTG AAGGCCGAAG GCGGCACCGG CGATGCGATC TCTGGTTTTG AAGGCAGCGT ACCAAACCCG TATGTTAAAG CATCTGACTG GGGCTGGCAG ATTGATCCAG TAGGTCTGCG CTATGCACTT TGCGAACTGT ATGAGCGTTA TCAGAGGCCG CTGTTTATTG TCGAAAACGG TTTTGGCGCT TACGACAAAG TGGAAGAAGA TGGCAGCATC AACGACGACT ACCGCATTGA CTACCTGCGC GCCCATATCG AAGAGATGAA AAAAGCGGTG ACTTACGATG GCGTGGATCT GATGGGCTAC ACACCGTGGG GCTGCATCGA CTGCGTGTCG TTCACCACCG GGCAGTACAG CAAACGCTAC GGCTTTATCT ATGTGAATAA ACATGACGAC GGTACTGGCG ATATGTCGCG TTCACGTAAG AAGAGCTTTA ACTGGTACAA AGAGGTGATT GCCAGCAACG GCGAGAAGCT TTAA
Protein sequence | MKKLTLPKDF LWGGAVAAHQ VEGGWNKGGK GPSICDVLTG GAHGVPREIT KEVLPGKYYP NHEAVDFYGH YKEDIKLFAE MGFKCFRTSI AWTRIFPKGD EAQPNEEGLK FYDDMFDELL KYNIEPVITL SHFEMPLHLV QQYGSWTNRK VVDFFVRFAE VVFERYKHKV KYWMTFNEIN NQRNWRAPLF GYCCSGVVYT EHENPEETMY QVLHHQFVAS ALAVKAARRI NPEMKVGCML AMVPLYPYSC NPDDVMFAQE SMRERYVFTD VQLRGYYPSY VLNEWERRGF NIKMEDGDLD VLREGTCDYL GFSYYMTNAV KAEGGTGDAI SGFEGSVPNP YVKASDWGWQ IDPVGLRYAL CELYERYQRP LFIVENGFGA YDKVEEDGSI NDDYRIDYLR AHIEEMKKAV TYDGVDLMGY TPWGCIDCVS FTTGQYSKRY GFIYVNKHDD GTGDMSRSRK KSFNWYKEVI ASNGEKL
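
The gene sequence begins with GTG while the protein begins with M, which follows from translation table 11 allowing alternative start codons. The sketch below illustrates this with a hand-built codon table; it is an illustration under stated assumptions, not the database's own pipeline. The name `translate_cds` is hypothetical, and the `START_CODONS` set lists only a few common bacterial starts rather than the full set permitted by table 11.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order. Table 11 uses the same
# assignments as the standard code and differs in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a partial set of common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence given 5' to 3', ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    # An initiating codon is read as methionine regardless of its usual meaning.
    first = "M" if codons[0] in START_CODONS else CODON_TABLE[codons[0]]
    protein = [first]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate_cds("GTGAAAAAAC TCACCTTACC GAAAGATTTC"))  # MKKLTLPKDF
```

The printed residues match the first ten residues of the protein sequence row. The gene sequence is listed in coding orientation (it starts with a start codon and ends with TAA), so no reverse complement is needed here even though the gene lies on the minus strand.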