Gene SeD_A4135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4135 
Symbol 
ID6871211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3983456 
End bp3985774 
Gene Length2319 bp 
Protein Length772 aa 
Translation table11 
GC content56% 
IMG OID642787081 
Productalpha-xylosidase YicI 
Protein accessionYP_002217708 
Protein GI198246042 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTA GCGACGGAAA CTGGCTGATC CAGCCTGGCC TTAATTTGAT TCACCCGGTT 
CAGGTGTTCG ACGTTGAACA GCACGGGAAT GAGATGGTGA TCTATGCCGC GCCGCGTGAT
GTCCGCGAAC GAACCTGGCA ACTGGATACC CCGTTGTTTA CCCTGCGCTT TTTCTCGCCG
CAGGAAGGAG TGATAGGTGT ACGGATGGAA CACTTCCAGG GCGCTTTGGA TAACGGCCCA
CATTATCCGC TCAATGTCTT GCAAGATATC AACGTGGAGA TGCAGAACAA CGCCGAATTT
GCCGAACTCA AGAGCGGCAG CCTGAGCGTG CGCGTCACCA AAGGCGAGAT CTGGTCTCTG
GATTTTCTGC GCAACGGTGT ACGTATCACC GGTAGCCAGT TGAAGAATAA CGGCTATGTG
CAGGATACCA ACAGCGGGCG CAACTACATG TTCGAGCGTC TGGATCTCGG CGTGGGCGAA
ACCGTCTACG GGCTGGGCGA GCGCTTTACC GCGCTGGTGC GTAACGGTCA GACGGTGGAG
ACCTGGAACC GCGACGGCGG CACCAGCACC GAGCAATCGT ACAAGAATAT CCCGTTCTAT
ATCACCAACC GTGGCTACGG TGTGCTGGTG AATCATCCGC AGTGCGTGTC GTTTGAAATT
GGCTCCGAGA AAGTCTCCAA AGTTCAGTTC AGCGTCGAGA GCGAGTATCT GGAATACTTC
GTCATCGACG GCCCGACCCC GAAAGACGTG CTGAACCGCT ATACCCAATT TACGGGTCGT
CCGGCGCTGC CGCCCGCCTG GTCGTTTGGC CTGTGGCTGA CCACCTCATT CACCACCAAC
TACGACGAAG CGACCGTTAA CAGTTTTATC GACGGTATGG CCGAGCGCAA TCTGCCGCTG
CACGTCTTCC ACTTCGACTG TTTCTGGATG AAAGCTTTTC AGTGGTGCGA TTTTGAATGG
GACCCGGTGA CTTTCCCCGA TCCGAAAGGG ATGATTCGCC GCCTGAAAGC GAAAGGGCTG
AAAGTCTGCG TGTGGATTAA CCCCTACATC GGCCAGAAAT CCCCGGTCTT CCAGGAGTTG
AAAGAGAAAG GATATTTGCT AAAACGCCCG GACGGCTCCT TGTGGCAGTG GGATAAATGG
CAGCCGGGAC TGGCGATTTA CGACTTCACC AACCCGCAAG CCTGCGAATG GTATGCCGAC
AAGCTGAAGG GCCTGGTGGA GATGGGAGTG GACTGCTTCA AAACTGACTT CGGCGAACGC
ATTCCAACGG ATGTGCAGTG GTTTGATGGT TCAGATCCAC AGAAAATGCA CAACCATTAT
GCCTACATCT ACAACGAATT GGTGTGGAAC GTGCTGAAAG AGACCGTCGG CGTTGAAGAG
GCGGTGCTGT TCGCCCGTTC CGCCTCGGTG GGCGCGCAAC AGTTCCCGGT GCACAGGGGC
GGCGACTGCT ACGCCAACTA CGAATCGATG GCGGAAAGCC TGCGCGGCGG GCTGTCCATC
GGCCTGTCAG GGTTTGGCTT CTGGAGTCAT GATATTGGCG GATTCGAGAA TACCGCGCCG
GCGCATGTCT ACAAGCGGTG GTGCGCATTC GGCTTGCTCT CCAGCCACAG CCGCCTGCAC
GGTAGCAAAT CCTACCGGGT TCCGTGGGCC TATGACGACG AGTCCTGTGA CGTGGTGCGC
TTCTTCACTG AACAGAAGTG CCGGATGATG CCGTATCTGT ATCGGGAAGC GGCGCGTGCC
AACGAAGCCG GTACGCCAAT GATGCGGGCG ATGATGCTGG AGTTCCCGGA CGATCCGGCG
TGTGATTATC TTGATCGCCA GTACATGCTG GGAGATGCGG TCATGGTAGC GCCGGTATTT
AGTGAAGCGG GCGACGTGGA GTTCTACCTG CCAGAAGGCC GCTGGACGCA CCTGTGGCGC
AACGATGAAG TGCAGGGCAG TCGCTGGCAT AAACAGCAGC ATGACTTCCT GAGCCTGCCA
GTGTATGTGC GTGACAATAC ACTACTGGCG CTGGGCAACA ATAGCCAGAA GCCCGATTAC
GCCTGGCATG AGGGTACGGC CTTCCAGTTA TTCCATCTGG ATGACGGCTG CGAAGCGGTC
TGCGAAGTCC CTGCTACGGA TGGTTCGACA ATCTTTACGC TGCAGGCGAA ACGCACAGGC
AATACCATTA CGGTGAGCGG CGAAGGCGAG GCGCGCAACT GGACGCTGTG TCTGCGTAAT
ATTACGCAGA TTAGCGGTAC CAAATGCGGC TCATATGCGG GAAGTGAACT GGGCGTAGTG
GTTACCCCGC TGGGAAATGA AGTGGTGATT ACGCTTTAA
 
Protein sequence
MKISDGNWLI QPGLNLIHPV QVFDVEQHGN EMVIYAAPRD VRERTWQLDT PLFTLRFFSP 
QEGVIGVRME HFQGALDNGP HYPLNVLQDI NVEMQNNAEF AELKSGSLSV RVTKGEIWSL
DFLRNGVRIT GSQLKNNGYV QDTNSGRNYM FERLDLGVGE TVYGLGERFT ALVRNGQTVE
TWNRDGGTST EQSYKNIPFY ITNRGYGVLV NHPQCVSFEI GSEKVSKVQF SVESEYLEYF
VIDGPTPKDV LNRYTQFTGR PALPPAWSFG LWLTTSFTTN YDEATVNSFI DGMAERNLPL
HVFHFDCFWM KAFQWCDFEW DPVTFPDPKG MIRRLKAKGL KVCVWINPYI GQKSPVFQEL
KEKGYLLKRP DGSLWQWDKW QPGLAIYDFT NPQACEWYAD KLKGLVEMGV DCFKTDFGER
IPTDVQWFDG SDPQKMHNHY AYIYNELVWN VLKETVGVEE AVLFARSASV GAQQFPVHRG
GDCYANYESM AESLRGGLSI GLSGFGFWSH DIGGFENTAP AHVYKRWCAF GLLSSHSRLH
GSKSYRVPWA YDDESCDVVR FFTEQKCRMM PYLYREAARA NEAGTPMMRA MMLEFPDDPA
CDYLDRQYML GDAVMVAPVF SEAGDVEFYL PEGRWTHLWR NDEVQGSRWH KQQHDFLSLP
VYVRDNTLLA LGNNSQKPDY AWHEGTAFQL FHLDDGCEAV CEVPATDGST IFTLQAKRTG
NTITVSGEGE ARNWTLCLRN ITQISGTKCG SYAGSELGVV VTPLGNEVVI TL