Gene SNSL254_A0045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0045 
Symbol 
ID6483259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp48363 
End bp50402 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content55% 
IMG OID642735489 
Productglycoside hydrolase, family 31 
Protein accessionYP_002039271 
Protein GI194443677 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.229343 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value0.24021 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATTTA TGCAACAAGA CCCGCGTCGT CTGGTCTGGC AGCAAAACGA TCGCTATTTA 
TGGATTGAAC CCTGGGGCGA GAACAGCCTG CGCGTACGCA GCGGCCGTCA TCTGCCGGTA
ATGAGAAATG AAGACTGGGC ATTGACTGAG CCAGTCGCAG AAAGCCAGTG CCACATTGAT
TATGAGCACC ACCAGGCAAC GCTGACCAAC GGCAAAATTA TCGCTATCGT CAATCAAAAA
GGACAGGTTA CCTTTTACCG CCATCCACAC AAACCCCTGT TGCAGGAGTT CTGGCGCCTG
CGGGGCGAAA TTGGCGAGGA TGAATCATCT CACGGCCAGT ACGTCAGCGC ACTCAACCTT
GAGGGACGCG AGTTCCGCCC TATTCAGGGT GGGAAATATT CACTGAAAGC CCGCTTCGAA
GCCACCGAAG GGGAAAAAAT TTATGGCATG GGGCAGTATC AACAGGCCAA CCTGGATCTC
AAAGGATGCG TGCTTGAGCT GGCGCAACGT AACTCCCAGG CCTCAGTACC GTTTATGCTC
TCCAGTCTGG GCTACGGATT TTTATGGAAC AACCCGGCAG TCGGACGCGT AACCTTTGCC
CAAAACGTTA CCGAATGGGA AGCGCAGGTC AGCGAACAGC TGGACTACTG GATCACGGCT
GGCGATAGCC CGGCAGAAAT TAGCCGGGCT TACGCGCTGG CTACCGGCAC GCCGCCGATG
ATGCCGGACT ACGCCATGGG CTTCTGGCAG TGCAAACTCC GTTATCGTAC GCAGGAAGAG
CTGCTGGAGG TCGCCCGCGA ATATAAGCGC CGCAATCTGC CTATCTCAGT GATCGTTATC
GACTTCTTTC ACTGGCCGAA TCAGGGTGAC TGGATGTTCG ATGCGCGCGA CTGGCCCGAT
CCTGATGCCA TGATTGCCGA GCTGAAATCG CTGGGAATTG AGCTGATGGT CTCCGTCTGG
CCGACGGTGG ATAACCGTAC CGAAAGCTAT CGGGAGATGC GCGAAAACGG CTGGCTGGTA
CAAACGGAAC GTGGCTTGCC GATCAATATG GATTTCCTCG GCAATACCAC TTACTTTGAT
GCGACTCATC CGGGCGCGCG CGACTACGTC TGGGGCAAAG CCAAACGCAA CTATTACGAT
AAAGGCGTGA AGTTATTCTG GTTAGATGAA GCCGAACCTG AGTTCAGCGT TTACGACTAC
GACAACTATC GCTACCATGC CGGGCCGGTA CTGGAAGTGG GCAATATCTA CCCACGCATG
TACGCCAAAA CCTTTTTTGA TGGCATGAAA GCCGATGGCG AAGACCAGGT TATCAACCTG
CTACGCTGCG CCTGGGCCGG CAGTCAGAAG TACGGCGCTC TGGTCTGGTC AGGGGATATT
CACTCCTCGT TTAGATCGCT GCGCAACCAG TTTGCCGCCG GACTCAATAT GGGAATAGCA
GGGATACCGT GGTGGACGAC GGATATCGGC GGTTTCCATG GCGGTAATAT TCACGACCCG
AAATTCCATG AATTGCTAAT TCGCTGGTTC CAGTGGGGCG TCTTTAGTCC GGTGATGCGT
CTGCACGGCA ACCGCGATCC GCAGATTTTA CCCGCGCAAC CGTACCGGGA TGGCATTGCT
CAATGCCCTA CAGGTGCGCC GAACGAGGTC TGGAGCTACG GTGAGGAAGT ATGCGATGTA
CTGACAGGTT GCCTGGCGTT GCGAGAAAAA CTCAAGCCCT ATATCAAAGC GCTGATGGAG
GAAACCCATA AGCACAATAC GCCAGTGATG CGCCCCCTGT TCTTTGAATT CCCCGAACAG
GAAACAAGTT GGACAATCAC CGACCAGTAT TGTTTTGGTC CTGACCTGCT GATCGCCCCC
GTCATGCATG AAGGTATGCG CGAACGTGAT GTCTGGCTAC CAGAAGGGGA AACATGGACG
GATCTTGCGA CTGGTGAAAG CTATTCAGGA GGGCAGACGC TGCATTACGC TACGCCACTG
AACAGAATTC CGGTGTTTAT CCGCGAAGGT GGGCAGTACC GTAGCCTGCT GAACTTGTAG
 
Protein sequence
MPFMQQDPRR LVWQQNDRYL WIEPWGENSL RVRSGRHLPV MRNEDWALTE PVAESQCHID 
YEHHQATLTN GKIIAIVNQK GQVTFYRHPH KPLLQEFWRL RGEIGEDESS HGQYVSALNL
EGREFRPIQG GKYSLKARFE ATEGEKIYGM GQYQQANLDL KGCVLELAQR NSQASVPFML
SSLGYGFLWN NPAVGRVTFA QNVTEWEAQV SEQLDYWITA GDSPAEISRA YALATGTPPM
MPDYAMGFWQ CKLRYRTQEE LLEVAREYKR RNLPISVIVI DFFHWPNQGD WMFDARDWPD
PDAMIAELKS LGIELMVSVW PTVDNRTESY REMRENGWLV QTERGLPINM DFLGNTTYFD
ATHPGARDYV WGKAKRNYYD KGVKLFWLDE AEPEFSVYDY DNYRYHAGPV LEVGNIYPRM
YAKTFFDGMK ADGEDQVINL LRCAWAGSQK YGALVWSGDI HSSFRSLRNQ FAAGLNMGIA
GIPWWTTDIG GFHGGNIHDP KFHELLIRWF QWGVFSPVMR LHGNRDPQIL PAQPYRDGIA
QCPTGAPNEV WSYGEEVCDV LTGCLALREK LKPYIKALME ETHKHNTPVM RPLFFEFPEQ
ETSWTITDQY CFGPDLLIAP VMHEGMRERD VWLPEGETWT DLATGESYSG GQTLHYATPL
NRIPVFIREG GQYRSLLNL