Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C1445 |
Symbol | |
ID | 6489883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 1399989 |
End bp | 1401344 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642741677 |
Product | 6-phospho-beta-glucosidase |
Protein accession | YP_002045324 |
Protein GI | 194451923 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.597684 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 87 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGA AATTAAAAGT CGTCACTATT GGTGGCGGGA GCAGCTACAC CCCTGAATTA CTTGAAGGCT TTATTAAGCG CTACCATGAA TTACCTGTCA CCGAATTATG GCTGGTTGAC GTTGAAGACG GGAAAGAGAA GCTGGGCATT ATTTATGATC TCTGCCAGCG AATGATTGAT AAAGCAGGCG TTCCGCTAAA ATTGTATAAA ACGCTGGATC GCCGGGAAGC GCTGAAAGGC GCTAATTTTG TCACTACCCA GCTACGCGTT GGTCAACTCA AAGCCCGTGA GCTGGACGAG CGTATCCCGC TTAGCCACGG CTATCTGGGG CAAGAAACCA ACGGCGCTGG CGGTTTATTT AAAGGGTTGC GTACCATTCC GGTTATTTTT GACATCATTA AGGATGTTGA AGAATTATGT CCGAATGCGT GGGTCATTAA CTTTACTAAC CCGGCGGGGA TGGTGACGGA AGCGGTTTAT CGTCATACCA ACTTTAAAAA ATTCATCGGC GTATGTAATA TTCCTGTCGG CATGAAAATG TTTATTCATG ACGTGCTGGC GCTGAATGAG AATGACGATC TTTCCATTGA CCTGTTTGGT CTAAACCATA TAGTCTTTAT TAAAGATGTG CTGGTGAATG GCACCTCACG GTTCGCAGAA TTACTGGATG GCGTGGCGTC CGGTCAGTTG AAAGCGTCAA CCGTAAAAAA TATCTTTGAT CTGCCGTTTA GTGAAGGATT GATTCGCTCG CTGAACATGC TGCCGTGCTC TTATTTGTTG TATTACTTCA AGCAAAAAGA GATGCTGGCG ATTGAAATGG GCGAATATTA CAAAGGCGGC GCGCGCGCTC AGGTCGTACA AAAAGTGGAG AAACAACTCT TCGACTTGTA CAAAAATCCT GAGCTAAACG TGAAGCCGAA AGAGCTTGAG CAACGCGGCG GCGCTTATTA TTCCGATGCC GCTTGTGAAG TCATTAACGC TATTTATAAT GACAAGCAGA CTGAGCATTA CGTTAATATT CCACATCATG GGCATGTCGA GAATATCCCG GCGGACTGGG CGGTGGAAAT GACCTGCATT CTGGGACGCA ATGGCGCGAC GCCGCACCCG CGTATCACCC GTTTTGACGA AAAAGTGCTG GGGCTTATCC ACACTATTAA AGGATTTGAG GTCGCGGCCA GCAATGCGGC GCTGAGCGGA AACTTTAATG ATGTTCTGCT GGCGCTTAAC CTGAGTCCGC TGGTGCATTC CGACCGCGAC GCAGAAGTCC TGGCGCGTGA GCTCATTCTG GCGCATGAAA AATGGCTGCC TAATTTTGCC GCTTGCATCG AAGCGCTTAA AGGTAAGCAC CACTGA
|
Protein sequence | MSQKLKVVTI GGGSSYTPEL LEGFIKRYHE LPVTELWLVD VEDGKEKLGI IYDLCQRMID KAGVPLKLYK TLDRREALKG ANFVTTQLRV GQLKARELDE RIPLSHGYLG QETNGAGGLF KGLRTIPVIF DIIKDVEELC PNAWVINFTN PAGMVTEAVY RHTNFKKFIG VCNIPVGMKM FIHDVLALNE NDDLSIDLFG LNHIVFIKDV LVNGTSRFAE LLDGVASGQL KASTVKNIFD LPFSEGLIRS LNMLPCSYLL YYFKQKEMLA IEMGEYYKGG ARAQVVQKVE KQLFDLYKNP ELNVKPKELE QRGGAYYSDA ACEVINAIYN DKQTEHYVNI PHHGHVENIP ADWAVEMTCI LGRNGATPHP RITRFDEKVL GLIHTIKGFE VAASNAALSG NFNDVLLALN LSPLVHSDRD AEVLARELIL AHEKWLPNFA ACIEALKGKH H
|
| |