Gene SeD_A3388 details


Gene Information

Locus tag: SeD_A3388
Symbol:
ID: 6872346
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3256392
End bp: 3257825
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 51%
IMG OID: 642786391
Product: 6-phospho-beta-glucosidase BglA
Protein accession: YP_002217029
Protein GI: 198243256
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID:
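
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon, and the function names gene_length_bp and protein_length_aa are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(3256392, 3257825)
print(length)                     # 1434
print(protein_length_aa(length))  # 477
```

Both values agree with the Gene Length and Protein Length fields of this record.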


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 73
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAAAC TCACCTTGCC AAAAGATTTT TTATGGGGCG GCGCGGTTGC TGCACACCAG 
GTTGAAGGCG GCTGGAATAA AGACGGTAAA GGCCCCAGCA TCTGCGACGT GCTAACCGGC
GGCGCACACG GCGTGCCACG CGAAATCACC CAGAATGTCG TTGCTGGCAA ATACTATCCG
AACCACGAGG CGGTGGATTT TTACGGACAT TACAAAGAAG ACATCCGTCT TTTCGCCGAA
ATGGGGTTCA AATGCTTCCG TACCTCTATT GCCTGGACGC GTATCTTCCC GAATGGCGAC
GAATCCCAGC CAAACGAGGC CGGTCTGAAA TTCTACGACG ACATGTTTGA TGAGTTACTC
AAATACAACA TCGAACCGGT CATTACCCTT TCTCACTTTG AAATGCCATT ACATCTGGTA
CAGCACTACG GCGGCTGGAC CAATCGTAAG GTCGTTGATT TCTTTGTCCG TTTTGCTGAA
GTCGTGTTTG AACGCTACAA ACATAAGGTC AAATACTGGA TGACCTTCAA TGAAATCAAC
AATCAGCGAA ACTGGCGCGC GCCGCTGTTT GGCTACTGCT GTTCCGGCGT AGTGTATACC
GAGCATGAGA ATCCAGAAGA AACCATGTAT CAGGTCTTAC ATCATCAGTT TGTCGCCAGC
GCGCTGGCGG TAAAAGCGGC ACGTCGTATT AATCCACAGA TGAAAGTGGG TTGTATGCTG
GCGATGGTCG CGCTGTATCC TTTCTCCTGT AAACCAGAAG ATGTGATGTT TGCTCAGGAG
TCGATGCGTG AACGCTACGT CTTTACCGAT GTGCAGCTGC GCGGCTATTA CCCGTCCTAT
GTGTTGAACG AGTGGGAGCG CCGCGGATTT AACATCAAAA TGGAAGATGG CGATCTTGAA
GTGCTGCGCG AAGGCACCTG CGATTATCTT GGTTTCAGTT ATTACATGAC CAACGCGGTC
AAAGCCGAAG GCGGTAGCGG CGATGCGATT TCCGGTTTTG AAGGCAGCGT ACCGAACCCC
TATGTTAAAG CATCTGACTG GGGCTGGCAG ATTGACCCGG TGGGCCTGCG TTATTCATTG
TGTGAACTGT ACGAACGCTA TCAAAAGCCG CTGTTTATTG TCGAAAACGG TTTTGGTGCT
TACGACAAAG TAGAAGAAGA TGGCAGCATC AACGACGACT ACCGAATTGA CTACCTGCGC
GCCCATATTG AAGAGATGAA AAAAGCGGTG ACTTACGATG GTGTCGACCT GATGGGCTAC
ACGCCGTGGG GCTGCATCGA CTGCGTGTCG TTCACCACCG GTCAGTACAG CAAGCGCTAC
GGCTTCATCT ACGTGAACAA GCACGATGAC GGTACGGGCG ATATGTCGCG TTCGCGTAAG
AAAAGCTTCA ACTGGTACAA AGAGGTGATT GCCAGCAACG GCGAGAAGCT TTAA
 
Protein sequence
MRKLTLPKDF LWGGAVAAHQ VEGGWNKDGK GPSICDVLTG GAHGVPREIT QNVVAGKYYP 
NHEAVDFYGH YKEDIRLFAE MGFKCFRTSI AWTRIFPNGD ESQPNEAGLK FYDDMFDELL
KYNIEPVITL SHFEMPLHLV QHYGGWTNRK VVDFFVRFAE VVFERYKHKV KYWMTFNEIN
NQRNWRAPLF GYCCSGVVYT EHENPEETMY QVLHHQFVAS ALAVKAARRI NPQMKVGCML
AMVALYPFSC KPEDVMFAQE SMRERYVFTD VQLRGYYPSY VLNEWERRGF NIKMEDGDLE
VLREGTCDYL GFSYYMTNAV KAEGGSGDAI SGFEGSVPNP YVKASDWGWQ IDPVGLRYSL
CELYERYQKP LFIVENGFGA YDKVEEDGSI NDDYRIDYLR AHIEEMKKAV TYDGVDLMGY
TPWGCIDCVS FTTGQYSKRY GFIYVNKHDD GTGDMSRSRK KSFNWYKEVI ASNGEKL
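
The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. This is a minimal sketch, not the tool used to produce this record: the helper names clean, gc_fraction and translate are hypothetical, and the codon table below is the standard amino-acid assignment, which translation table 11 shares for internal codons (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that difference is not handled here).

```python
# Standard codon assignments in TCAG order; shared by translation table 11
# for all non-initiator codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence into 10-mers."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGAGAAAAC TCACCTTGCC AAAAGATTTT"
print(translate(first_blocks))  # MRKLTLPKDF
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to gc_fraction gives a value that can be compared against the GC content field.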