Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B1855 |
Symbol | |
ID | 6793436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 1816981 |
End bp | 1818336 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642776085 |
Product | 6-phospho-beta-glucosidase |
Protein accession | YP_002146719 |
Protein GI | 197251134 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.545492 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGA AATTAAAAGT CGTCACTATT GGTGGCGGGA GCAGCTACAC CCCTGAATTA CTTGAAGGCT TTATTAAGCG CTACCATGAA TTACCTGTCA CCGAATTATG GCTGGTTGAT GTTGAAGACG GGAAAGAGAA GCTGGGCATT ATTTATGATC TCTGCCAGCG AATGATTGAT AAAGCAGGCG TTCCGCTAAA ATTGTATAAA ACGCTGGATC GCCGGGAAGC GCTGAAAGAC GCTAATTTTG TCACTACCCA GCTACGCGTT GGTCAACTCA AAGCCCGTGA GCTGGACGAG CGTATCCCGC TTAGCCACGG CTATCTGGGG CAAGAAACCA ACGGCGCTGG CGGTTTATTT AAAGGGTTGC GTACCATTCC GGTTATTTTT GACATCATTA AGGATGTTGA AGAATTATGT CCGAATGCGT GGGTCATTAA CTTTACTAAC CCGGCGGGGA TGGTGACGGA AGCGGTTTAT CGTCATACCA ACTTTAAAAA ATTCATCGGC GTATGTAATA TTCCTGTCGG CATGAAAATG TTTATTCATG ACGTGCTGGC GCTGAATGAG AATGACGATC TTTCCATTGA CCTGTTTGGT CTAAACCATA TGGTCTTTAT TAAAGACGTG CTGGTGAATG GTACCTCACG GTTCGCAGAA TTACTGGATG GCGTGGCGTC CGGTCAGTTG AAAGCGTCAA CCGTAAAAAA TATCTTTGAT CTGCCGTTTA GTGAAGGATT GATTCGCTCG CTGAACATGC TGCCGTGCTC TTATTTGTTG TATTACTTCA AGCAAAAAGA GATGTTGGCG ATTGAAATGG GCGAATATTA CAAAGGCGGC GCGCGCGCTC AGGTCGTACA AAAAGTGGAG AAACAACTCT TCGACTTGTA CAAAAATCCT GAGCTAAACG TGAAGCCGAA AGAGCTTGAG CAACGCGGCG GCGCTTATTA TTCCGATGCC GCTTGTGAAG TCATTAACGC TATTTATAAT GACAAGCAGA CTGAGCATTA CGTTAATATT CCACATCATG GGCATGTCGA GAATATCCCG GCGGACTGGG CGGTGGAAAT GACCTGCATT CTGGGACGCA ATGGCGCGAC GCCGCACCCG CGTATCACCC GTTTTGACGA AAAAGTGCTG GGGCTTATCC ACACTATTAA AGGATTTGAG GTCGCGGCCA GCAATGCGGC GCTGAGCGGA AACTTTAATG ATGTGCTGCT GGCGCTTAAC CTGAGTCCGC TGGTGCATTC CGACCGCGAC GCAGAAGTCC TGGCGCGTGA ACTCATTCTG GCGCATGAAA AATGGCTGCC TAATTTTGCC GCTTGCATCG AAGCGCTGAA AGGTAAGCAC CACTGA
|
Protein sequence | MSQKLKVVTI GGGSSYTPEL LEGFIKRYHE LPVTELWLVD VEDGKEKLGI IYDLCQRMID KAGVPLKLYK TLDRREALKD ANFVTTQLRV GQLKARELDE RIPLSHGYLG QETNGAGGLF KGLRTIPVIF DIIKDVEELC PNAWVINFTN PAGMVTEAVY RHTNFKKFIG VCNIPVGMKM FIHDVLALNE NDDLSIDLFG LNHMVFIKDV LVNGTSRFAE LLDGVASGQL KASTVKNIFD LPFSEGLIRS LNMLPCSYLL YYFKQKEMLA IEMGEYYKGG ARAQVVQKVE KQLFDLYKNP ELNVKPKELE QRGGAYYSDA ACEVINAIYN DKQTEHYVNI PHHGHVENIP ADWAVEMTCI LGRNGATPHP RITRFDEKVL GLIHTIKGFE VAASNAALSG NFNDVLLALN LSPLVHSDRD AEVLARELIL AHEKWLPNFA ACIEALKGKH H
|
| |