Gene SeSA_A1413 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A1413 
Symbol 
ID6516408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp1361616 
End bp1362971 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content48% 
IMG OID642746531 
Product6-phospho-beta-glucosidase 
Protein accessionYP_002114336 
Protein GI194735432 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.790472 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0216334 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGA AATTAAAAGT CGTCACTATT GGTGGCGGGA GCAGCTACAC CCCTGAATTA 
CTTGAAGGCT TTATTAAGCG CTACCATGAA TTACCTGTCA CCGAATTATG GCTGGTTGAT
GTTGAAGACG GGAAAGAGAA GCTGGACATT ATTTATGATC TCTGCCAGCG AATGATTGAT
AAAGCAGGCG TTCCGCTAAA ATTGTATAAA ACGCTGGATC GTCGGGAAGC GCTGAAAGAC
GCTAATTTTG TTACTACCCA GCTGCGCGTT GGTCAACTCA AAGCCCGTGA ACTGGACGAG
CGTATCCCGC TTAGCCACGG CTATCTGGGG CAAGAAACCA ACGGCGCTGG CGGTTTATTT
AAAGGGTTGC GCACCATTCC GGTTATTTTT GACATCATTA AGGATGTTGA GGAATTATGT
CCGAATGCGT GGGTCATTAA CTTTACTAAT CCGGCGGGGA TGGTGACGGA AGCGGTTTAT
CGCCATACCA ACTTTAAAAA ATTCATCGGC GTATGTAATA TTCCTGTCGG CATGAAAATG
TTTATTCATG ACGTGCTGGC GCTGAATGAG AATGACGATC TTTCCATTGA CCTGTTTGGT
CTAAACCATA TGGTCTTTAT TAAAGATGTG CTGGTGAATG GCACCTCACG GTTCGCAGAA
TTACTGGATG GCGTGGCGTC CGGTCAGTTG AAAGCGTCAA CCGTAAAAAA TATCTTTGAT
CTGCCGTTTA GTGAAGGATT GATTCGCTCG CTGAACATGC TGCCGTGCTC TTATTTGTTG
TATTACTTCA AGCAAAAAGA GATGCTGGCG ATTGAAATGG GCGAATATTA CAAAGGCGGC
GCGCGCGCTC AGGTCGTACA AAAAGTGGAG AAACAACTCT TCGACTTGTA CAAAAATCCT
GAGCTAAACG TGAAGCCGAA AGAGCTTGAA CAACGCGGCG GCGCTTATTA TTCCGATGCC
GCTTGTGAAG TCATTAACGC TATTTATAAT GACAAGCAGA CTGAGCATTA CGTTAATATT
CCACATCATG GGCATGTCGA GAATATCCCG GCGGACTGGG CGGTGGAAAT GACCTGCATT
CTGGGACGCA ATGGCGCGAC GCCGCACCCG CGTATCACCC GTTTTGACGA AAAAGTGCTG
GGGCTTATCC ACACTATTAA AGGATTTGAG GTCGCGGCCA GCAATGCGGC GCTGAGCGGA
AACTTTAATG ATGTGCTGCT GGCGCTTAAC CTGAGTCCGC TGGTGCATTC CGACCGCGAC
GCAGAAGTCC TGGCGCGTGA GCTCATTCTG GCGCATGAAA AATGGCTGCC TAATTTTGCC
GCTTGCATCG AAGCGCTTAA AGGTAAGCAC CACTGA
 
Protein sequence
MSQKLKVVTI GGGSSYTPEL LEGFIKRYHE LPVTELWLVD VEDGKEKLDI IYDLCQRMID 
KAGVPLKLYK TLDRREALKD ANFVTTQLRV GQLKARELDE RIPLSHGYLG QETNGAGGLF
KGLRTIPVIF DIIKDVEELC PNAWVINFTN PAGMVTEAVY RHTNFKKFIG VCNIPVGMKM
FIHDVLALNE NDDLSIDLFG LNHMVFIKDV LVNGTSRFAE LLDGVASGQL KASTVKNIFD
LPFSEGLIRS LNMLPCSYLL YYFKQKEMLA IEMGEYYKGG ARAQVVQKVE KQLFDLYKNP
ELNVKPKELE QRGGAYYSDA ACEVINAIYN DKQTEHYVNI PHHGHVENIP ADWAVEMTCI
LGRNGATPHP RITRFDEKVL GLIHTIKGFE VAASNAALSG NFNDVLLALN LSPLVHSDRD
AEVLARELIL AHEKWLPNFA ACIEALKGKH H