Gene SeD_A0044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0044 
Symbol 
ID6872096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp48370 
End bp50409 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content54% 
IMG OID642783302 
Productglycoside hydrolase, family 31 
Protein accessionYP_002213996 
Protein GI198242197 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.228548 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.00512205 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCCATTTA TGCAACAAGA CCCGCGTCGT CTGGTCTGGC AGCAAAACGA TCGCTATTTA 
TGGATTGAAC CCTGGGGCGA GAACAGCCTG CGCGTACGCA GCGGCCGTCA TCTGCCGGTA
ATGAGAAATG AAGACTGGGC ATTGACTGAG CCAGTCGCAG AAAGTCAGTG CCACATTGAT
TATGAGCACC ACCAGGCAAC GCTGACCAAC GGCAAAATTA TCGCTATCGT CAATCAAAAA
GGACAAGTTA CCTTTTACCG CCATCCACAC AAACCCCTGT TGCAGGAGTT CTGGCGCCTG
CGGGGCGAAA TTGGCGAGGA TGAATCATCT CACGGCCAGT ACGTTAGCGC ACTCAACCTT
GAGGGACGCG AGTTCCGCCC TATTCAGGGT GGGAAATATT CACTGAAAGC CCGCTTCGAA
GCCACCGAAG GGGAAAAAAT TTATGGCATG GGGCAGTATC AACAGGCCAA CCTGGATCTC
AAAGGATGCG TGCTTGAGCT GGCGCAACGT AACTCCCAGG CCTCAGTACC GTTTATGCTC
TCCAGTCTGG GCTACGGATT TTTATGGAAC AACCCGGCAG TCGGACGCGT AACCTTTGCC
CAAAACGTTA CCGAATGGGA AGCGCAGGTC AGCGAACAGT TAGACTACTG GATCACGGCT
GGCGATACCC CGGCAGAAAT TAGCCGGGCT TACGCGCTGG CTACCGGCAC GCCGCCGATG
ATGCCGGACT ACGCCATGGG CTTCTGGCAG TGCAAACTCC GTTATCGTAC GCAGGAAGAG
CTGCTGGAGG TCGCCCGCGA ATATAAGCGC CGCAATCTGC CTATCTCAGT GATCGTTATC
GACTTCTTTC ACTGGCCGAA TCAGGGTGAC TGGATGTTCG ATGCGCGCGA CTGGCCCGAT
CCTGATGCCA TGATTGCCGA GCTGAAATCG CTGGGAATTG AGCTGATGGT CTCCGTCTGG
CCGACGGTGG ATAACCGTAC CGAAAGCTAT CGGGAGATGC GCGAAAACGG CTGGCTGGTA
CAAACGGAAC GTGGCTTGCC GATCAATATG GATTTCCTCG GCAATACCAC TTACTTTGAT
GCGACTCATC CGGGCGCGCG CGACTACGTC TGGGGCAAAG CCAAACGCAA CTATTACGAT
AAAGGCGTGA AGTTATTCTG GTTAGATGAA GCCGAACCTG AGTTCAGCGT TTACGACTAC
GACAACTATC GCTACCATGC CGGGCCGGTA CTGGAAGTGG GCAATATCTA CCCACGCATG
TACGCCAAAA CCTTTTTTGA TGGCATGAAA GCCGATGGCG AAGACCAGGT TATCAACCTG
CTACGCTGCG CCTGGGCCGG CAGTCAGAAG TACGGCGCTC TGGTCTGGTC AGGGGATATT
CACTCCTCGT TTAGATCGCT GCGCAACCAG TTTGCCGCCG GACTCAATAT GGGAATAGCA
GGGATACCGT GGTGGACGAC GGATATCGGC GGTTTCCATG GCGGTAATAT TCACGACCCG
AAATTCCATG AATTGCTAAT TCGCTGGTTC CAGTGGGGCG TCTTTAGTCC GGTGATGCGT
CTGCACGGCA ACCGCGATCC GCAGATTTTA CCCGCGCAAC CGTACCGGGA TGGCATTGCT
CAATGCCCTA CAGGTGCGCC GAACGAGGTC TGGAGCTACG GTGAGGAAGT ATGCGACGTA
CTGACAGGTT GCCTGGCGTT GCGAGAAAAA CTCAAGCCCT ATATCAAAGC GCTGATGGAG
GAAACCCATA AGCACAATAC GCCAGTGATG CGCCCCCTGT TCTTTGAATT CCCCGAACAG
GAAACAAGTT GGACAATCAC CGACCAGTAT TGTTTTGGTC CTGACCTGCT GATCGCCCCC
GTCATGCATG AAGGTATGCG CGAGCGTGAT GTCTGGCTAC CAGAAGGGGA AACATGGACG
GATCTTGCGA CCGGTGAAAG CTATTCAGGA GGGCAGACGC TGCATTACGC TACGCCACTG
AACAGAATTC CGGTGTTTAT CCGCGAAGGT GGGCAGTACC GTAGCCTGCT GAACTTGTAG
 
Protein sequence
MPFMQQDPRR LVWQQNDRYL WIEPWGENSL RVRSGRHLPV MRNEDWALTE PVAESQCHID 
YEHHQATLTN GKIIAIVNQK GQVTFYRHPH KPLLQEFWRL RGEIGEDESS HGQYVSALNL
EGREFRPIQG GKYSLKARFE ATEGEKIYGM GQYQQANLDL KGCVLELAQR NSQASVPFML
SSLGYGFLWN NPAVGRVTFA QNVTEWEAQV SEQLDYWITA GDTPAEISRA YALATGTPPM
MPDYAMGFWQ CKLRYRTQEE LLEVAREYKR RNLPISVIVI DFFHWPNQGD WMFDARDWPD
PDAMIAELKS LGIELMVSVW PTVDNRTESY REMRENGWLV QTERGLPINM DFLGNTTYFD
ATHPGARDYV WGKAKRNYYD KGVKLFWLDE AEPEFSVYDY DNYRYHAGPV LEVGNIYPRM
YAKTFFDGMK ADGEDQVINL LRCAWAGSQK YGALVWSGDI HSSFRSLRNQ FAAGLNMGIA
GIPWWTTDIG GFHGGNIHDP KFHELLIRWF QWGVFSPVMR LHGNRDPQIL PAQPYRDGIA
QCPTGAPNEV WSYGEEVCDV LTGCLALREK LKPYIKALME ETHKHNTPVM RPLFFEFPEQ
ETSWTITDQY CFGPDLLIAP VMHEGMRERD VWLPEGETWT DLATGESYSG GQTLHYATPL
NRIPVFIREG GQYRSLLNL