Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeSA_A1413 |
Symbol | |
ID | 6516408 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | + |
Start bp | 1361616 |
End bp | 1362971 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642746531 |
Product | 6-phospho-beta-glucosidase |
Protein accession | YP_002114336 |
Protein GI | 194735432 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.790472 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0216334 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGA AATTAAAAGT CGTCACTATT GGTGGCGGGA GCAGCTACAC CCCTGAATTA CTTGAAGGCT TTATTAAGCG CTACCATGAA TTACCTGTCA CCGAATTATG GCTGGTTGAT GTTGAAGACG GGAAAGAGAA GCTGGACATT ATTTATGATC TCTGCCAGCG AATGATTGAT AAAGCAGGCG TTCCGCTAAA ATTGTATAAA ACGCTGGATC GTCGGGAAGC GCTGAAAGAC GCTAATTTTG TTACTACCCA GCTGCGCGTT GGTCAACTCA AAGCCCGTGA ACTGGACGAG CGTATCCCGC TTAGCCACGG CTATCTGGGG CAAGAAACCA ACGGCGCTGG CGGTTTATTT AAAGGGTTGC GCACCATTCC GGTTATTTTT GACATCATTA AGGATGTTGA GGAATTATGT CCGAATGCGT GGGTCATTAA CTTTACTAAT CCGGCGGGGA TGGTGACGGA AGCGGTTTAT CGCCATACCA ACTTTAAAAA ATTCATCGGC GTATGTAATA TTCCTGTCGG CATGAAAATG TTTATTCATG ACGTGCTGGC GCTGAATGAG AATGACGATC TTTCCATTGA CCTGTTTGGT CTAAACCATA TGGTCTTTAT TAAAGATGTG CTGGTGAATG GCACCTCACG GTTCGCAGAA TTACTGGATG GCGTGGCGTC CGGTCAGTTG AAAGCGTCAA CCGTAAAAAA TATCTTTGAT CTGCCGTTTA GTGAAGGATT GATTCGCTCG CTGAACATGC TGCCGTGCTC TTATTTGTTG TATTACTTCA AGCAAAAAGA GATGCTGGCG ATTGAAATGG GCGAATATTA CAAAGGCGGC GCGCGCGCTC AGGTCGTACA AAAAGTGGAG AAACAACTCT TCGACTTGTA CAAAAATCCT GAGCTAAACG TGAAGCCGAA AGAGCTTGAA CAACGCGGCG GCGCTTATTA TTCCGATGCC GCTTGTGAAG TCATTAACGC TATTTATAAT GACAAGCAGA CTGAGCATTA CGTTAATATT CCACATCATG GGCATGTCGA GAATATCCCG GCGGACTGGG CGGTGGAAAT GACCTGCATT CTGGGACGCA ATGGCGCGAC GCCGCACCCG CGTATCACCC GTTTTGACGA AAAAGTGCTG GGGCTTATCC ACACTATTAA AGGATTTGAG GTCGCGGCCA GCAATGCGGC GCTGAGCGGA AACTTTAATG ATGTGCTGCT GGCGCTTAAC CTGAGTCCGC TGGTGCATTC CGACCGCGAC GCAGAAGTCC TGGCGCGTGA GCTCATTCTG GCGCATGAAA AATGGCTGCC TAATTTTGCC GCTTGCATCG AAGCGCTTAA AGGTAAGCAC CACTGA
|
Protein sequence | MSQKLKVVTI GGGSSYTPEL LEGFIKRYHE LPVTELWLVD VEDGKEKLDI IYDLCQRMID KAGVPLKLYK TLDRREALKD ANFVTTQLRV GQLKARELDE RIPLSHGYLG QETNGAGGLF KGLRTIPVIF DIIKDVEELC PNAWVINFTN PAGMVTEAVY RHTNFKKFIG VCNIPVGMKM FIHDVLALNE NDDLSIDLFG LNHMVFIKDV LVNGTSRFAE LLDGVASGQL KASTVKNIFD LPFSEGLIRS LNMLPCSYLL YYFKQKEMLA IEMGEYYKGG ARAQVVQKVE KQLFDLYKNP ELNVKPKELE QRGGAYYSDA ACEVINAIYN DKQTEHYVNI PHHGHVENIP ADWAVEMTCI LGRNGATPHP RITRFDEKVL GLIHTIKGFE VAASNAALSG NFNDVLLALN LSPLVHSDRD AEVLARELIL AHEKWLPNFA ACIEALKGKH H
|
| |