Gene Information |
Locus tag | SeD_A4171 |
Symbol | |
ID | 6873345 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 4014612 |
End bp | 4015994 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642787105 |
Product | beta-glucosidase |
Protein accession | YP_002217731 |
Protein GI | 198243944 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.663192 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 0.656643 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGATACC GTTTTCCCGA TAACTTCTGG TGGGGCAGCG CCTGCTCAGC GTTGCAAACT GAAGGGGATA GTCTGAATGG CGGTAAAAGC CAGACCACGT GGGATGTGTG GTTCGAGCGC CAGCCTGATC GTTTTCACCA GGGCGTCGGT CCAGCGGAAA CCTCAACGTT CTATCGCCAC TGGAAGCAAG ACATCGCGCT ACTGAAACAG TTAAAACATA ACAGTTTTCG CACCTCGCTA AGCTGGGCGC GGCTCATTCC AGACGGCGTA GGCGAGGTGA ATCCACAAGC GGTGAGCTTC TACAATCACG TCATCGACGA GCTACTGGCG CAGGGCATCA CGCCGTTTAT TACGCTGTTC CATTTTGATA TGCCGATGGT CATGCAGGAG AAAGGCGGCT GGGAAAATCG CGACGTCGTA GAGGCGTTTG GTCGGTACGC GCAAACGTGT TTTACCTTGT TTGGCGACCG CGTGAAGCAC TGGTTTACCT TTAACGAGCC GATTGTGCCG GTGGAAGGCG GCTATTTGTA CGACTTCCAC TATCCCAATG TGGTGGATTT TAAACGGGCG GCCACCGTGG CGTACCATAC CGTGCTGGCG CACTCGACCG CCGTGCGCGC CTGGCGCGCC GGGCGCTACG ACGGTGAAAT CGGCATAGTA CTGAATCTGA CGCCGTCCTA CCCACGCTCG CAGCATCCCG CCGATGTGCA AGCCGCGCAC CATGCGGATC TGTTATTCAA CCGCAGTTTT CTTGACCCGG TATTAAAGGG AGAATACCCG GCGGACTTGG TGGCGCTGCT GAAAACCTAT GACCAGTTGC CTGCCTGTCA GCCAGGCGAC CGTCAGCTTA TTGCCGACGG CAAAATCGAT TTACTGGGGA TTAACTATTA TCAGCCGCGC CGCGTGAAAT GCCGTGATAC GGCGGTGAAT CCGCAAGCGC CGTTTATGCC GGAGTGGTTA TTTGACTATT ACGACATGCC GGGGCGCAAG ATGAACCCTT ACCGCGGCTG GGAAATTTAC GCGCCAGGAA TTTACGACAT CATTACCAAC CTACGGGATA ATTACGGCAA TCCGCGCTGT TTTATCTCCG AAAACGGGAT GGGCGTTGAG AACGAGCAGC GTTTTGTGCA AGCGGGACAG ATTCACGATG ATTACCGGAT TGACTTTATT TCTGAGCATC TTAAATGGCT GCATAAAGGC ATTAGCGAGG GCTGTCACTG TCTTGGCTAC CACATGTGGA CCTTTATCGA TAACTGGTCA TGGCTGAACG GCTATAAAAA TCGCTATGGT TTTGTACAAC TGGATTTAGC CACCCAAACG CGCACGGTGA AAAAAAGCGG AGAATGGTTT GCCGCCACCG CAGAGCATAA CGGTTTTGAT TAA
Protein sequence | MRYRFPDNFW WGSACSALQT EGDSLNGGKS QTTWDVWFER QPDRFHQGVG PAETSTFYRH WKQDIALLKQ LKHNSFRTSL SWARLIPDGV GEVNPQAVSF YNHVIDELLA QGITPFITLF HFDMPMVMQE KGGWENRDVV EAFGRYAQTC FTLFGDRVKH WFTFNEPIVP VEGGYLYDFH YPNVVDFKRA ATVAYHTVLA HSTAVRAWRA GRYDGEIGIV LNLTPSYPRS QHPADVQAAH HADLLFNRSF LDPVLKGEYP ADLVALLKTY DQLPACQPGD RQLIADGKID LLGINYYQPR RVKCRDTAVN PQAPFMPEWL FDYYDMPGRK MNPYRGWEIY APGIYDIITN LRDNYGNPRC FISENGMGVE NEQRFVQAGQ IHDDYRIDFI SEHLKWLHKG ISEGCHCLGY HMWTFIDNWS WLNGYKNRYG FVQLDLATQT RTVKKSGEWF AATAEHNGFD
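The derived fields in this record (Gene Length, Protein Length, GC content) can be cross-checked against the coordinates and the sequence. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates, that the gene ends in a single stop codon, and that the spaces in the sequence fields are only 10-character display separators. The function names are hypothetical helpers introduced for illustration.

```python
def clean(seq: str) -> str:
    """Remove the whitespace that splits the sequence into display blocks."""
    return "".join(seq.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


# Values copied from the record above.
length = gene_length(4014612, 4015994)
protein_len = expected_protein_length(length)
print(length, protein_len)

# GC content of the first two 10-base blocks of the gene sequence only;
# pass the full "Gene sequence" field to compare with the reported value.
print(gc_percent("ATGCGATACC GTTTTCCCGA"))
```

With the coordinates listed in the record, this reproduces the stated Gene Length of 1383 bp and Protein Length of 460 aa. The same `gc_percent` call on the full gene sequence can be compared with the reported GC content.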