Gene Information |
Locus tag | YpsIP31758_3951 |
Symbol | celF |
ID | 5384592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | + |
Start bp | 4429006 |
End bp | 4430319 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640866982 |
Product | 6-phospho-beta-glucosidase |
Protein accession | YP_001402899 |
Protein GI | 153947458 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
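The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 4429006
END_BP = 4430319


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    return (cds_length_bp - 3) // 3


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1314, matches "Gene Length"
print(protein_length(length_bp))  # 437, matches "Protein Length"
```

Under these assumptions the reported gene length (1314 bp) and protein length (437 aa) are consistent with the listed coordinates.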
Plasmid Coverage information |
Num covering plasmid clones | 55 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGCACGT TTAAAATAAC CATTATTGGC GGTGGGAGTA GCTATACCCC AGAGCTAGTT GATGGGTTGA TTCAGCGTAT TGAACAACTG CCCGTCACTG AACTGGCACT GGTTGATGTC GAGCAGGGCC GCCAAAAGGT GGAAATTATT GCCGCGCTGA CCCAAAGAAT GCTTGCCCGT CATGGGTTGG AACAGGTCAA AGTCAGTGTC CACTACTCGC TGGATGAAGC CATTCGCGGT GCCCGTTTTG TCCTGACACA ATTTCGTGTC GGCCAACTCC CTGCCAGGGC AGCAGACGAA CGCTTGGGGT TAAAATATAA CCTGCTCGGT CAGGAGACGA CCGGAATTGG TGGCTTTGCC AAAGCGCTGC GTACCATTCC GGTGATGCTT GATATTGCTG CTAACGTTGA GCGATTGGCC CCCGATGCCT GGATCATTAA CTTCACTAAC CCAGCCGGCA TTGTCACCGA AGCCGTTAGC CGCTATACCA AAGCGAAAAT CATTGGTCTG TGTAATGTCC CGATCAGTAT GCATCATATG ATTGCCAAGC TCTTGCAAGC CCCCTACGAA GACCTCCAAC TGCGCTTTGC CGGGCTGAAT CATATGGTAT GGGTACATGA AGTATTACAA CGCGGGAAAG ATGTCACCGC TGATGTGCTG AAAATGCTGT GTGATGGGGC ATCATTAAGC ATGAACAATA TTAAGGAAGC CCCATGGCCA CCCGAGTTCC TGCAAGCAAT GGGGGCCATC CCCTGCCCAT ACCATCGCTA TTTCTACCAA ACGCAAGACA TGCTGGCAGA AGAGATGGCC GCAGCAGCTG AGCGTGGTAC CCGTGCGGAA CAGGTGATGC AGGTGGAAAA AGAGCTGTTC GCGCTGTACG CCGATCCGCA TTTGGATACC AAACCAGAGC AGCTCAGCTT CCGGGGTGGA TCATTTTATT CTGAAGTTGC ATTGGAACTT ATTCGTGCCA TCCATAACAA CTTAGGGACG CAATTAGTAG TAAATACAAC AAACCGTGGC GCGATTCGCG GTTTATCTGA TGATTCTGTG GTAGAAACTA ACTGTATTAT TGATGCAAAA GGGGCTCATC CATTAACCTT CGGCCCTTTA CCGGTCTCTA TGCATGGGCT AACCCAACAG GTAAAAGCCT ATGAACGATT GACGATTGAC GCGGCGGTAC ATGGCGATCG TCGCAGTGCC TTGTTGGCCT TAGTGACAAA TCCACTGATT GCTAACGCCA GCATTGCTCA ACCGTTACTG GAGGAGGTAC TACAGGTGAA TAAACCTTAC CTGCCTCAGT TCTCCGGTTT GTGA
Protein sequence | MSTFKITIIG GGSSYTPELV DGLIQRIEQL PVTELALVDV EQGRQKVEII AALTQRMLAR HGLEQVKVSV HYSLDEAIRG ARFVLTQFRV GQLPARAADE RLGLKYNLLG QETTGIGGFA KALRTIPVML DIAANVERLA PDAWIINFTN PAGIVTEAVS RYTKAKIIGL CNVPISMHHM IAKLLQAPYE DLQLRFAGLN HMVWVHEVLQ RGKDVTADVL KMLCDGASLS MNNIKEAPWP PEFLQAMGAI PCPYHRYFYQ TQDMLAEEMA AAAERGTRAE QVMQVEKELF ALYADPHLDT KPEQLSFRGG SFYSEVALEL IRAIHNNLGT QLVVNTTNRG AIRGLSDDSV VETNCIIDAK GAHPLTFGPL PVSMHGLTQQ VKAYERLTID AAVHGDRRSA LLALVTNPLI ANASIAQPLL EEVLQVNKPY LPQFSGL
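The relationship between the gene sequence and the protein sequence can be checked by translating the DNA codon by codon. The sketch below is an illustration, not the database's own tooling: it builds the codon table by hand (translation table 11 assigns the same amino acids to codons as the standard table and differs only in permitted start codons), strips the 10-base spacing used in the record, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order. Translation table 11 uses
# the same assignments as the standard table; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq):
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAGCACGT TTAAAATAAC CATTATTGGC"))  # MSTFKITIIG
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_percent` can be compared against the reported GC content.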