Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeSA_A3983 |
Symbol | |
ID | 6516865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | + |
Start bp | 3856430 |
End bp | 3857812 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642748955 |
Product | beta-glucosidase |
Protein accession | YP_002116717 |
Protein GI | 194736129 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGATACC GTTTTCCCGA TAACTTCTGG TGGGGCAGCG CCTGCTCAGC GTTGCAAACC GAAGGGGATA GTCTGAATGG CGGTAAAAGC CAGACCACGT GGGATGTGTG GTTCGAGCGC CAGCCGGGTC GTTTTCATCA GGGCATCGGT CCAGCAGAAA CCTCAACGTT CTATCGCCAC TGGAAGCAAG ACATCGCGCT ACTGAAACAG TTAAAACATA ACAGTTTTCG CACCTCGCTA AGCTGGGCGC GGCTCATTCC AGACGGCGTA GGCGAGGTGA ATCCACAAGC GGTGAGCTTC TACAATCACG TCATCGACGA GCTACTGGCG CAGGGCATCA CGCCGTTTAT TACGCTGTTC CATTTTGATA TGCCGATGGT CATGCAGGAG AAAGGCGGCT GGGAAAATCG CGACGTCGTA GAGGCGTTTG GTCGGTACGC GCAAACGTGT TTTACCTTGT TTGGCGACCG CGTGAAGCAC TGGTTTACCT TTAACGAGCC GATTGTGCCG GTGGAAGGCG GCTATTTGTA CGACTTCCAC TATCCCAATG TGGTGGATTT TAAACGTGCA GCCACCGTGG CGTACCATAC CGTGCTGGCG CACTCGACCG CCGTGCGCGC CTGGCGCGCC GGGCGCTACG ACGGTGAAAT CGGCGTAGTG CTGAATCTGA CGCCGTCCTA CCCACGCTCG CAGCATCCCG CCGATGTACA AGCCGCGCAT CATGCGGATC TGTTATTCAA CCGCAGTTTT CTTGACCCGG TATTAAAGGG AGAATACCCG GCGGACTTGG TGGCGCTGCT GAAAACCTAT GACCAGTTGC CTGCCTGTCA GCCAGGCGAT CGTCAGCTTA TTACCGACGG CAAAATCGAT TTACTGGGGA TTAACTATTA TCAGCCGCGC CGCGTGAAAT GCCGTGATAC GGCGGTGAAT CCGCAAGCGC CGTTTATGCC GGAGTGGTTA TTTGACTATT ACGACATGCC GGGGCGCAAG ATGAACCCTT ACCGCGGCTG GGAAATTTAC GCGCCAGGAA TTTACGACAT CATCACCAAC CTGCGGGATA ATTACGGCAA TCCGCGCTGT TTTATCTCCG AAAACGGGAT AGGCGTTGAG AACGAGCAGC GTTTTGTGCA AGCGGGACAG ATTCACGATG ATTACCGGAT TGACTTTATT TCTGAGCATC TTAAATGGCT GCATAAAGGC ATTAGCGAGG GCTGTCACTG TCTTGGCTAC CACATGTGGA CCTTTATCGA TAACTGGTCA TGGCTGAACG GCTATAAAAA TCGCTATGGT TTTGTACAAC TGGATTTAGC CACCCAAACG CGCACGGTGA AAAAAAGCGG AGAATGGTTT GCCGCCACCG CAGAGCATAA CGGTTTTGAT TAA
|
Protein sequence | MRYRFPDNFW WGSACSALQT EGDSLNGGKS QTTWDVWFER QPGRFHQGIG PAETSTFYRH WKQDIALLKQ LKHNSFRTSL SWARLIPDGV GEVNPQAVSF YNHVIDELLA QGITPFITLF HFDMPMVMQE KGGWENRDVV EAFGRYAQTC FTLFGDRVKH WFTFNEPIVP VEGGYLYDFH YPNVVDFKRA ATVAYHTVLA HSTAVRAWRA GRYDGEIGVV LNLTPSYPRS QHPADVQAAH HADLLFNRSF LDPVLKGEYP ADLVALLKTY DQLPACQPGD RQLITDGKID LLGINYYQPR RVKCRDTAVN PQAPFMPEWL FDYYDMPGRK MNPYRGWEIY APGIYDIITN LRDNYGNPRC FISENGIGVE NEQRFVQAGQ IHDDYRIDFI SEHLKWLHKG ISEGCHCLGY HMWTFIDNWS WLNGYKNRYG FVQLDLATQT RTVKKSGEWF AATAEHNGFD
|
| |