Gene SeHA_C4075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4075 
Symbol 
ID6490548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3957586 
End bp3959904 
Gene Length2319 bp 
Protein Length772 aa 
Translation table11 
GC content56% 
IMG OID642744175 
Productalpha-xylosidase YicI 
Protein accessionYP_002047780 
Protein GI194449977 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.397579 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value0.758007 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTA GCGACGGAAA CTGGCTGATC CAGCCTGGCC TTAATTTGAT TCACCCGGTT 
CAGGTGTTCG ACGTTGAACA GCACGGGAAT GAGATGGTGA TCTATGCCGC GCCGCGTGAT
GTCCGCGAAC GAACCTGGCA ACTGGATACC CCGTTGTTTA CCCTGCGCTT TTTCTCACCG
CAGGAAGGAG TGATAGGTGT ACGGATGGAA CACTTCCAGG GCGCTTTGGA TAACGGCCCA
CATTATCCGC TCAATGTCTT GCAAGATATC AACGTAGAGA TGCAGAACAA CGCTGAATTT
GCCGAACTCA AGAGCGGCAG CCTGAGCGTG CGCGTCACCA AAGGCGAGAT CTGGTCTCTG
GATTTTCTGC GTAACGGTGT ACGTATCACC GGTAGTCAGT TGAAAAATAA CGGCTATGTG
CAGGACACCA ATAGCGGGCG CAACTACATG TTCGAGCGTC TGGATCTCGG CGTGGGCGAC
ACCGTCTACG GGCTTGGCGA GCGTTTTACC GCGCTGGTGC GTAACGGTCA GACGGTGGAG
ACCTGGAACC GCGACGGCGG CACCAGCACC GAGCAATCGT ACAAGAATAT CCCGTTCTAT
ATCACCAACC GTGGCTACGG TGTGCTGGTG AATCATCCGC AGTGCGTGTC GTTTGAAATT
GGCTCCGAGA AAGTCTCCAA AGTTCAGTTC AGCGTCGAGA GCGAGTATCT GGAATACTTC
GTCATCGACG GCCCGACCCC GAAAGACGTG CTGAACCGCT ATACCCAATT TACGGGTCGT
CCGGCGCTGC CGCCCGCCTG GTCGTTTGGC CTGTGGCTGA CCACCTCATT CACCACCAAC
TACGACGAAG CGACCGTTAA CAGTTTTATC GACGGTATGG CCGAGCGCAA TCTGCCGCTG
CACGTCTTCC ACTTCGACTG TTTCTGGATG AAGGCCTTCC AGTGGTGCGA TTTTGAATGG
GACCCGGTGA CTTTCCCCGA TCCGAAAGGG ATGATTCGCC GCCTGAAAGC GAAAGGGCTG
AAAGTCTGCG TGTGGATTAA CCCCTACATC GGCCAGAAAT CCCCGGTCTT CCAGGAGCTG
AAAGAGAAAG GATATTTGCT AAAACGCCCG GACGGCTCCT TGTGGCAGTG GGATAAATGG
CAGCCGGGAC TGGCGATTTA CGACTTCACC AACCCGCAAG CCTGCGAATG GTATGCCGAC
AAGCTGAAGG GCCTGGTGGA GATGGGAGTG GACTGCTTCA AAACTGACTT CGGCGAACGC
ATTCCAACGG ATGTGCAGTG GTTTGATGGT TCAGATCCAC AGAAAATGCA CAACCATTAT
GCCTACATCT ACAACGAACT GGTGTGGAAC GTGCTGAAAG AGACCGTCGG CGTTGAAGAG
GCGGTGCTGT TCGCCCGTTC CGCCTCGGTG GGCGCGCAAC AGTTCCCGGT GCACTGGGGC
GGCGACTGCT ACGCCAACTA CGAATCGATG GCGGAAAGCC TGCGCGGCGG GCTGTCCATC
GGCCTGTCAG GGTTTGGCTT CTGGAGTCAT GATATTGGCG GATTCGAGAA TACCGCGCCG
GCGCATGTCT ACAAGCGGTG GTGCGCATTC GGCTTGCTCT CCAGCCACAG CCGCCTGCAC
GGTAGCAAAT CCTACCGGGT TCCGTGGGCC TATGACGACG AGTCCTGTGA CGTGGTGCGC
TTCTTCACTG AACAGAAGTG CCGGATGATG CCGTATCTGT ATCGGGAAGC GGCGCGTGCC
AACGAAGCCG GTACGCCAAT GATGCGGGCG ATGATGCTGG AGTTCCCGGA CGATCCGGCG
TGTGATTATC TTGATCGCCA GTACATGCTG GGAGATGCGG TCATGGTAGC GCCGGTATTT
AGTGAAGCGG GCGACGTGGA GTTCTACCTG CCAGAAGGCC GCTGGACGCA CCTGTGGCGC
AACGATGAAG TGCAGGGCAG CCGCTGGCAT AAACAGCAGC ATGACTTCCT GAGTCTGCCA
GTGTACGTGC GTGACAATAC ACTACTGGCG CTGGGCAACA ATAGCCAGAA GCCCGATTAC
GCCTGGCATG AGGGTACGGC CTTCCAGTTA TTCCATCTGG ATGACGGCTG CGAAGCGGTC
TGCGAAGTCC CTGCTACGGA TGGTTCGACA ATCTTTACGC TGCAGGCGAA ACGCACAGGC
AATACCATTA CGGTGAGCGG CGAAGGCAAG GCGCGCAACT GGACGCTGTG TCTGCGTAAT
ATTACGCAGA TTAGCGGTAC CAAATGCGGC TCATATGCGG GAAGTGAACT GGGCGTAGTG
GTCACCCCGC AGGGAAATGA AGTGGTGATT ACGCTTTAA
 
Protein sequence
MKISDGNWLI QPGLNLIHPV QVFDVEQHGN EMVIYAAPRD VRERTWQLDT PLFTLRFFSP 
QEGVIGVRME HFQGALDNGP HYPLNVLQDI NVEMQNNAEF AELKSGSLSV RVTKGEIWSL
DFLRNGVRIT GSQLKNNGYV QDTNSGRNYM FERLDLGVGD TVYGLGERFT ALVRNGQTVE
TWNRDGGTST EQSYKNIPFY ITNRGYGVLV NHPQCVSFEI GSEKVSKVQF SVESEYLEYF
VIDGPTPKDV LNRYTQFTGR PALPPAWSFG LWLTTSFTTN YDEATVNSFI DGMAERNLPL
HVFHFDCFWM KAFQWCDFEW DPVTFPDPKG MIRRLKAKGL KVCVWINPYI GQKSPVFQEL
KEKGYLLKRP DGSLWQWDKW QPGLAIYDFT NPQACEWYAD KLKGLVEMGV DCFKTDFGER
IPTDVQWFDG SDPQKMHNHY AYIYNELVWN VLKETVGVEE AVLFARSASV GAQQFPVHWG
GDCYANYESM AESLRGGLSI GLSGFGFWSH DIGGFENTAP AHVYKRWCAF GLLSSHSRLH
GSKSYRVPWA YDDESCDVVR FFTEQKCRMM PYLYREAARA NEAGTPMMRA MMLEFPDDPA
CDYLDRQYML GDAVMVAPVF SEAGDVEFYL PEGRWTHLWR NDEVQGSRWH KQQHDFLSLP
VYVRDNTLLA LGNNSQKPDY AWHEGTAFQL FHLDDGCEAV CEVPATDGST IFTLQAKRTG
NTITVSGEGK ARNWTLCLRN ITQISGTKCG SYAGSELGVV VTPQGNEVVI TL