Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_0996 |
Symbol | |
ID | 6067726 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 1081648 |
End bp | 1083072 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641600404 |
Product | cryptic 6-phospho-beta-glucosidase |
Protein accession | YP_001723992 |
Protein GI | 170019038 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0301517 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGTAT TTCCAGAAAG TTTTTTATGG GGCGGCGCGC TTGCCGCCAA CCAGTCTGAA GGTGCGTTCC GTGAAGGTGA CAAAGGTCTG ACCACTGTCG ATATGATCCC ACACGGCGAG CATCGAATGG CGGTGAAACT GGGGCTGGAA AAACGTTTTC AGTTGCGAGA TGACGAGTTT TATCCCAGCC ATGAGGCGAC GGATTTTTAT CATCGTTATA AAGAAGATAT CGCCCTGATG GCAGAGATGG GATTCAAGGT TTTCCGTACC TCAATTGCCT GGAGCCGTCT CTTTCCGCAG GGCGATGAAA TCACGCCCAA TCAGCAGGGC ATTGCTTTTT ATCGTTCTGT CTTTGAAGAG TGTAAAAAGT ACGGTATCGA ACCGCTGGTC ACGTTGTGCC ACTTCGATGT GCCGATGCAT CTGGTCACCG AATATGGCTC CTGGCGTAAC CGCAAGCTGG TGGAGTTTTT CAGCCGCTAC GCCAGAACCT GCTTTGAAGC ATTTGATGGT CTGGTGAAAT ACTGGCTAAC CTTCAATGAA ATCAACATTA TGTTGCATAG CCCGTTCTCC GGCGCGGGTC TGGTGTTTGA AGAAGGTGAA AATCAGGATC AGGTGAAATA TCAGGCCGCG CATCACCAGC TGGTTGCCAG TGCGCTAGCC ACCAAAATCG CCCATGAGGT TAACCCGCAA AATCAGGTGG GCTGTATGCT GGCGGGCGGT AACTTCTACC CTTACAGTTG CAAGCCGGAA GATGTCTGGG CGGCGCTGGA GAAAGACCGG GAAAACCTGT TTTTTATCGA TGTGCAGGCG CGGGGCACGT ATCCGGCTTA CTCTGCCCGC GTATTCCGCG AAAAAGGGGT AACCATCAAC AAAGCACCGG GCGATGATGA AATCCTGAAA AACACCGTCG ATTTTGTCTC TTTCAGCTAT TACGCCTCGC GCTGCGCCTC GGCGGAGATG AACGCCAACA ACAGCAGTGC GGCGAACGTG GTGAAATCGC TGCGTAATCC GTATCTACAG GTGAGCGACT GGGGCTGGGG AATTGATCCA CTCGGTCTGC GTATCACCAT GAATATGATG TACGATCGTT ATCAGAAGCC GCTGTTTCTG GTGGAAAACG GCCTGGGCGC AAAAGATGAA TTTGCTGCCA ATGGCGAGAT TAACGACGAC TATCGCATCA GCTACTTACG CGAACATATC CGCGCAATGA GCGAAGCGAT TGCAGACGGC ATTCCGCTGA TGGGCTACAC CACATGGGGC TGTATTGATT TAGTTTCCGC CTCTACGGGT GAAATGAGCA AACGCTACGG CTTTGTCTTT GTTGACCGTG ACGACGCAGG CAACGGTACG CTGACGCGCA CGCGTAAGAA ATCATTCTGG TGGTATAAAA AAGTGATTGC CAGTAATGGG GAAGATTTAG AGTAG
|
Protein sequence | MSVFPESFLW GGALAANQSE GAFREGDKGL TTVDMIPHGE HRMAVKLGLE KRFQLRDDEF YPSHEATDFY HRYKEDIALM AEMGFKVFRT SIAWSRLFPQ GDEITPNQQG IAFYRSVFEE CKKYGIEPLV TLCHFDVPMH LVTEYGSWRN RKLVEFFSRY ARTCFEAFDG LVKYWLTFNE INIMLHSPFS GAGLVFEEGE NQDQVKYQAA HHQLVASALA TKIAHEVNPQ NQVGCMLAGG NFYPYSCKPE DVWAALEKDR ENLFFIDVQA RGTYPAYSAR VFREKGVTIN KAPGDDEILK NTVDFVSFSY YASRCASAEM NANNSSAANV VKSLRNPYLQ VSDWGWGIDP LGLRITMNMM YDRYQKPLFL VENGLGAKDE FAANGEINDD YRISYLREHI RAMSEAIADG IPLMGYTTWG CIDLVSASTG EMSKRYGFVF VDRDDAGNGT LTRTRKKSFW WYKKVIASNG EDLE
|
| |