Gene SNSL254_A4029 details


Gene Information

Locus tag: SNSL254_A4029
Symbol:
ID: 6483288
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3917726
End bp: 3920044
Gene Length: 2319 bp
Protein Length: 772 aa
Translation table: 11
GC content: 57%
IMG OID: 642739288
Product: alpha-xylosidase YicI
Protein accession: YP_002042998
Protein GI: 194442737
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases
TIGRFAM ID:
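
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3917726, 3920044)
print(length)                  # 2319
print(protein_length(length))  # 772
```

Both results agree with the Gene Length and Protein Length fields of this record.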


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0205936
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 67
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATTA GCGACGGAAA CTGGCTGATC CAGCCTGGCC TTAATTTGAT TCACCCGGTT 
CAGGTGTTCG ACGTTGAACA GCACGGGAAT GAGATGGTGG TCTATGCCGC GCCGCGCGAT
GTCCGCGAAC GAACCTGGCA ACTGGATACC CCGTTGTTTA CCCTGCGCTT TTTCTCGCCG
CAGGAAGGAG TGATAGGTGT ACGGATGGAA CACTTCCAGG GCGCTTTGGA TAACGGCCCA
CATTATCCGC TCAATGTCTT GCAAGATATC AACGTGGAGA TGCAGAACAA CGCCGAATTT
GCCGAACTCA AGAGCGGCAG CCTGAGCGTG CGCGTCACCA AAGGCGAGCT CTGGTCACTG
GATTTTCTGC GCAACGGTGT ACGTATCACC GGTAGCCAGT TGAAGAATAA CGGCTATGTG
CAGGATACCA ACAGCGGGCG CAACTACATG TTCGAGCGTC TGGATCTCGG CGTGGGCGAA
ACCGTCTACG GGCTTGGCGA GCGTTTTACC GCGCTGGTGC GCAACGGTCA GACGGTGGAG
ACCTGGAACC GTGACGGCGG CACCAGTACC GAGCAGTCTT ACAAGAACAT CCCGTTCTAT
ATCACCAACC GTGGCTACGG TGTGCTGGTG AATCATCCGC AGTGCGTGTC GTTTGAAATT
GGCTCCGAGA AAGTCTCCAA AGTTCAGTTC AGCGTTGAGA GCGAGTATCT GGAATACTTC
GTCATCGACG GCCCGACCCC GAAAGACGTG CTGAACCGCT ATACCCAATT TACGGGTCGT
CCGGCGCTGC CGCCCGCCTG GTCGTTTGGC CTGTGGCTGA CCACCTCATT CACCACCAAC
TACGACGAAG CGACCGTTAA CAGCTTTATC GATGGTATGG CCGAGCGCAA TCTGCCGCTG
CACGTCTTCC ACTTCGACTG TTTCTGGATG AAGGCTTTCC AGTGGTGCGA TTTTGAATGG
GACCCGGTGA CTTTCCCCGA TCCGAAAGGG ATGATTCGCC GCCTGAAAGC GAAAGGGCTG
AAAGTCTGCG TGTGGATTAA CCCCTACATC GGCCAGAAAT CCCCGGTTTT CCAGGAGCTG
AAAGAGAAAG GGTATTTACT TAAACGCCCG GACGGCTCCT TGTGGCAGTG GGATAAATGG
CAGCCGGGAC TGGCGATTTA CGACTTCACC AACCCGCAAG CCTGCGAATG GTATGCCGAC
AAGCTGAAGG GCCTGGTGGA GATGGGAGTG GACTGCTTCA AAACTGACTT CGGCGAACGC
ATTCCAACGG ATGTGCAGTG GTTTGATGGT TCAGATCCAC AGAAAATGCA CAACCATTAT
GCCTACATCT ACAACGAACT GGTGTGGAAC GTGCTGAAAG AGACCGTGGG CGTTGAAGAG
GCGGTGCTGT TCGCCCGTTC CGCCTCGGTG GGCGCGCAAC AGTTCCCGGT GCACTGGGGC
GGCGACTGCT ACGCCAACTA CGAATCGATG GCGGAAAGCC TGCGCGGCGG GCTGTCCATC
GGCCTGTCAG GGTTTGGCTT CTGGAGTCAC GATATTGGCG GATTCGAGAA TACCGCGCCG
GCGCATGTCT ACAAGCGTTG GTGCGCATTC GGCTTGCTCT CCAGCCACAG CCGCCTGCAC
GGTAGCAAAT CCTACCGGGT TCCGTGGGCC TATGACGACG AGTCCTGTGA CGTGGTGCGC
TTCTTCACCG AACAGAAGTG CCGGATGATG CCGTATCTGT ATCGGGAAGC GGCGCGCGCC
AACGAAGCCG GCACGCCAAT GATGCGGGCG ATGATGCTGG AGTTTCCGGA CGATCCGGCG
TGTGATTATC TTGATCGCCA GTACATGCTG GGAGATGCGG TCATGGTAGC GCCGGTATTT
AGTGAAGCGG GCGACGTGGA GTTCTACCTG CCAGAAGGAC GCTGGACGCA CCTGTGGCGC
AACGATGAAG TGCAGGGCAG CCGCTGGCAT AAACAGCAGC ATGACTTCCT GAGCCTGCCA
GTGTACGTGC GTGACAATAC ACTACTGGCG CTGGGCAACA ATAGCCAGAA GCCCGATTAC
GCCTGGCATG AGGGTACGGC CTTCCAGTTA TTCCATCTGG ATGACGGTTG CGAAGCGGTC
TGCGAAGTCC CTGCTACGGA TGGTTCGACA ATCTTTACGC TGCAGGCGAA ACGCACAGGC
AATACCATTA CGGTGAGCGG CGAAGGCGAG GCGCGCAACT GGACGCTGTG TCTGCGTAAT
ATTACGCAGA TTAGCGGTGC CAAATGCGGC TCATATGCGG GAAGTGAACT GGGCGTAGTG
GTCACCCCGC AGGGAAATGA AGTGGTGATT ACGCTTTAA
 
Protein sequence
MKISDGNWLI QPGLNLIHPV QVFDVEQHGN EMVVYAAPRD VRERTWQLDT PLFTLRFFSP 
QEGVIGVRME HFQGALDNGP HYPLNVLQDI NVEMQNNAEF AELKSGSLSV RVTKGELWSL
DFLRNGVRIT GSQLKNNGYV QDTNSGRNYM FERLDLGVGE TVYGLGERFT ALVRNGQTVE
TWNRDGGTST EQSYKNIPFY ITNRGYGVLV NHPQCVSFEI GSEKVSKVQF SVESEYLEYF
VIDGPTPKDV LNRYTQFTGR PALPPAWSFG LWLTTSFTTN YDEATVNSFI DGMAERNLPL
HVFHFDCFWM KAFQWCDFEW DPVTFPDPKG MIRRLKAKGL KVCVWINPYI GQKSPVFQEL
KEKGYLLKRP DGSLWQWDKW QPGLAIYDFT NPQACEWYAD KLKGLVEMGV DCFKTDFGER
IPTDVQWFDG SDPQKMHNHY AYIYNELVWN VLKETVGVEE AVLFARSASV GAQQFPVHWG
GDCYANYESM AESLRGGLSI GLSGFGFWSH DIGGFENTAP AHVYKRWCAF GLLSSHSRLH
GSKSYRVPWA YDDESCDVVR FFTEQKCRMM PYLYREAARA NEAGTPMMRA MMLEFPDDPA
CDYLDRQYML GDAVMVAPVF SEAGDVEFYL PEGRWTHLWR NDEVQGSRWH KQQHDFLSLP
VYVRDNTLLA LGNNSQKPDY AWHEGTAFQL FHLDDGCEAV CEVPATDGST IFTLQAKRTG
NTITVSGEGE ARNWTLCLRN ITQISGAKCG SYAGSELGVV VTPQGNEVVI TL
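
The gene and protein sequences above are printed in space-separated blocks of 10, so they need to be cleaned before any computation. The following is a minimal sketch, in plain Python with no external libraries, of how the blocks might be joined, the GC content computed, and the CDS translated. The codon table is built from the compact amino-acid string for NCBI translation table 11, whose codon assignments match the standard code. The sketch does not handle the alternative start codons that table 11 permits, which is not an issue here because this gene begins with ATG. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG ("*" = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to print 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence listed above.
first_row = "ATGAAAATTA GCGACGGAAA CTGGCTGATC CAGCCTGGCC TTAATTTGAT TCACCCGGTT"
print(translate(first_row))  # MKISDGNWLIQPGLNLIHPV
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked the same way.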