Gene SeHA_C0045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C0045 
Symbol 
ID6488547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp43829 
End bp45868 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content55% 
IMG OID642740334 
Productglycoside hydrolase, family 31 
Protein accessionYP_002044008 
Protein GI194451435 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones82 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATTTA TGCAACAAGA CCCGCGTCGT CTGGTCTGGC AGCAAAACGA TCGCTATTTA 
TGGATTGAAC CCTGGGGCGA GAACAGCCTG CGCGTACGCA GCGGCCGTCA TCTGCCGGTA
ATGAGAAATG AAGACTGGGC ATTGACTGAG CCAGTCGCAG AAAGCCAGTG CCACATTGAT
TATGAGCACC ACCAGGCAAC GCTGACCAAC GGCAAAATTA TCGCTATCGT CAATCAAAAA
GGACAGGTTA CCTTTTACCG CCATCCACAC AAACCCCTGT TGCAGGAGTT CTGGCGCCTG
CGGGGCGAAA TTGGCGAGGA TGAATCATCT CACGGCCAGT ACGTCAGCGC ACTCAACCTT
GAGGGACGCG AGTTCCGCCC TATTCAGGGT GGGAAATATT CACTGAAAGC CCGCTTCGAA
GCCACCGAAG GGGAAAAAAT TTATGGTATG GGGCAGTATC AACAGGCCAA CCTGGATCTC
AAAGGATGCG TGCTTGAGCT GGCGCAACGT AACTCCCAGG CCTCAGTACC GTTTATGCTC
TCCAGTCTGG GCTACGGATT TTTATGGAAC AACCCGGCAG TCGGACGCGT AACCTTTGCC
CAAAACGTTA CCGAATGGGA AGCGCAGGTC AGCGAACAGC TGGACTACTG GATCACGGCT
GGCGATACCC CGGCAGAAAT TAGCCGGGCT TACGCGCTGG CTACCGGCAC GCCGCCGATG
ATGCCGGACT ACGCCATGGG CTTCTGGCAG TGCAAACTCC GTTATCGTAC GCAGGAAGAG
CTGCTGGAGG TCGCCCGCGA ATATAAGCGC CGCAATCTGC CTATCTCAGT GATCGTAATC
GACTTCTTTC ACTGGCCGAA TCAGGGTGAC TGGATGTTCG ATGCGCGCGA CTGGCCCGAT
CCTGATGCCA TGATTGCCGA GCTGAAATCG CTGGGAATTG AGCTGATGGT CTCCGTCTGG
CCGACGGTGG ATAACCGTAC CGAAAGCTAT CGGGAGATGC GCGAAAACGG CTGGCTGGTA
CAAACGGAAC GTGGCTTGCC GATCAATATG GATTTCCTCG GCAATACCAC TTACTTTGAT
GCGACTCATC CGGGCGCGCG CGACTACGTC TGGGGCAAAG CCAAACGCAA CTATTACGAT
AAAGGCGTGA AGTTATTCTG GTTAGATGAA GCCGAACCTG AGTTCAGCGT TTACGACTAC
GACAACTATC GCTACCATGC CGGGCCGGTA CTGGAAGTGG GCAATATCTA CCCACGTATG
TACGCTAAAA CCTTTTTTGA CGGCATGAAA GCCGATGGCG AAGACCAGGT TATCAACCTG
CTACGCTGCG CCTGGGCCGG CAGTCAGAAG TTCGGCGCAC TGGTCTGGTC AGGTGATATT
CACTCCTCGT TTAGATCGCT ACGCAACCAG TTTGCCGCCG GACTCAATAT GGGAATCGCG
GGGATACCGT GGTGGACGAC GGATATCGGC GGTTTTCATG GCGGTAATAT TCACGACCCG
AAATTCCATG AATTGCTGAT TCGCTGGTTC CAGTGGGGCG TCTTTAGTCC GGTGATGCGT
CTGCACGGCA ACCGCGATCC GCAGATTTTA CCCGCGCAAC CGTACCGGGA TGGCATTGCT
CAATGCCCTA CAGGTGCGCC GAACGAGGTC TGGAGCTACG GTGAGGAAGT ATGCGAGGTA
CTGACAGGTT GCCTGGCGTT GCGAGAAAAA CTCAAGCCCT ATATCAAAGC GCTGATGGAG
GAAACCCATA AGCACAATAC GCCAGTGATG CGCCCCCTGT TCTTTGAATT CCCCGAACAG
GAAACAAGTT GGGCAATCAC CGACCAGTAT TGTTTTGGTC CTGACCTGCT GATCGCCCCC
GTCATGCATG AAGGTATGCG CGAACGTGAT ATCTGGCTAC CGGAAGGGGA AACATGGACG
GATCTTGCGA CCGGTGAAAG CTATTCAGGA GGGCAGACGC TGCATTACGC TACGCCACTG
AACAGAATTC CGGTGTTTAT CCGCGAAGGT GGGCAGTACC GTAGCCTACT GAACTTGTAG
 
Protein sequence
MPFMQQDPRR LVWQQNDRYL WIEPWGENSL RVRSGRHLPV MRNEDWALTE PVAESQCHID 
YEHHQATLTN GKIIAIVNQK GQVTFYRHPH KPLLQEFWRL RGEIGEDESS HGQYVSALNL
EGREFRPIQG GKYSLKARFE ATEGEKIYGM GQYQQANLDL KGCVLELAQR NSQASVPFML
SSLGYGFLWN NPAVGRVTFA QNVTEWEAQV SEQLDYWITA GDTPAEISRA YALATGTPPM
MPDYAMGFWQ CKLRYRTQEE LLEVAREYKR RNLPISVIVI DFFHWPNQGD WMFDARDWPD
PDAMIAELKS LGIELMVSVW PTVDNRTESY REMRENGWLV QTERGLPINM DFLGNTTYFD
ATHPGARDYV WGKAKRNYYD KGVKLFWLDE AEPEFSVYDY DNYRYHAGPV LEVGNIYPRM
YAKTFFDGMK ADGEDQVINL LRCAWAGSQK FGALVWSGDI HSSFRSLRNQ FAAGLNMGIA
GIPWWTTDIG GFHGGNIHDP KFHELLIRWF QWGVFSPVMR LHGNRDPQIL PAQPYRDGIA
QCPTGAPNEV WSYGEEVCEV LTGCLALREK LKPYIKALME ETHKHNTPVM RPLFFEFPEQ
ETSWAITDQY CFGPDLLIAP VMHEGMRERD IWLPEGETWT DLATGESYSG GQTLHYATPL
NRIPVFIREG GQYRSLLNL