Gene SNSL254_A4295 details

Gene Information

Locus tag: SNSL254_A4295
Symbol:
ID: 6484373
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 4187836
End bp: 4189872
Gene Length: 2037 bp
Protein Length: 678 aa
Translation table: 11
GC content: 56%
IMG OID: 642739541
Product: alpha-glucosidase
Protein accession: YP_002043235
Protein GI: 194443906
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases
TIGRFAM ID:
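
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both are assumptions about how this record was generated, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4187836
end_bp = 4189872

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2037
print(protein_length)  # 678
```

Under these assumptions the results agree with the listed gene length (2037 bp) and protein length (678 aa).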


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 76
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACGC AAAGTAAGCA TTTATCCTCT GTCCTGATAG AAAAGAACAT TGAGGGCTTT 
ACGCTGACGT ACCACCAGCG CCTGATTTTA CGCCACAGCG CCGAAACCCC CTGTCTGTGG
ATTGGCGCGG GCGTTGCCGA CATTGACATG TTTCGCGGCA ACTTCAGCAT CAAAGACAAA
CTTAACGAGA AGATTGCATT AACGGAGGCC ACCGTCAGCG AGCTACCTGA CGGCTGGCTG
GTACAATTCA GCCGTGGCGC AACAATTAGC GCCACCCTTC GCATCTCCAC CGATGAGGCG
GGACGCCTGC AGCTGGATCT GCAAAACGAC GACCTGCACC ATAACCGTAT CTGGTTACGC
CTCGCAGCTA ATCCAGACGA CCATATCTAC GGCTGCGGCG AACAGTTCTC TTATTTCGAT
TTGCGCGGCA AGCCGTTCCC GCTGTGGACC AGCGAACAGG GCGTTGGCCG TAATAAAACC
AGCTATGTCA CCTGGCAGGC AGACTGTAAA GAGAACTCCG GCGGCGACTA TTACTGGACC
TTCTTCCCGC AACCGACCTT TGTCAGCACG CAGAAGTATT ACTGCCACGT CGATAATAGC
TGCTATATGA ATTTCGACTT CAGCGCGCCG GAGTATCACG AACTGGCGCT GTGGGAAGAT
AAAACTACGC TACGTTTTGA GTGTGCCGAC ACCTACATCG CCCTACTGGA AAAACTGACT
GCGCTGTTAG GTCGCCAGCC GGAGCTGCCG GACTGGGTTT ACGACGGCGT CACGCTAGGC
ATTCAGGGCG GTACGGAAGT TTGCCAGCAA AAACTGGATA CCATGCGCAA CGCAGGCGTA
AAAGTGAACG GTATTTGGGC GCAGGACTGG TCCGGCATTC GTATGACCTC TTTCGGCAAA
CGCGTGATGT GGAACTGGAA GTGGAATAGC GACAACTATC CACAGCTGGA TAGCCGTATC
AAACAGTGGA AAGAAGAAGG CGTCCAGTTC CTCTCTTATA TCAACCCATA CGTCGCCAGT
GATAAAGACC TCTGCGCCGA GGCGGCGAAA CACGGCTACC TGGCGAAAGA CGCCACGGGC
GGCGACTATC TGGTCGAGTT TGGCGAATTC TATGGCGGCG TGGTCGATCT GACCAATCCT
GAAGCTTACG ACTGGTTCAA AGACGTCATC AAAAAGAACA TGATCGCGCT CGGCTGCAGC
GGCTGGATGG CGGATTTCGG CGAATATCTG CCGACCGACA CGTATCTGCA CAACGGCGTC
AGCGCAGAGA TCATGCATAA CGCCTGGCCT GCGCTCTGGG CGAAGTGTAA CTACGAAGCG
CTACAGGAGA CCGGCAAGCT CGGCGAGATC CTGTTCTTTA TGCGTGCGGG TTACACCGGC
AGTCAGAAAT ATTCCACCAT GATGTGGGCA GGCGACCAGA ACGTTGACTG GAGCCTTGAT
GATGGTCTGG CCTCTGTCGT GCCTGCGGCA TTGTCGCTGG CGATGACCGG CCACGGTCTG
CATCACAGCG ATATCGGCGG CTACACCACC CTGTTTGACA TGAAACGCAG CAAAGAGTTG
CTGCTGCGCT GGTGCGATTT CAGCGCCTTT ACGCCGATGA TGCGCACCCA TGAAGGCAAC
CGCCCCGGCG ATAACTGGCA GTTCGACGGC GACGCGGAAA CTATTGCCCA CTTTGCCCGC
ATGACCACCG TCTTTACCAC GCTGAAACCG TATCTCAAGC AGGCGGTGGC GCAAAACGCG
GCTACCGGTC TGCCGGTCAT GCGTCCGCTA TTCCTGCACT ACGAGAACGA TGCCACAACC
TACACCCTGA AATATCAATA TCTGCTCGGT CAGGATCTGC TGGTCGCGCC GGTTCACGAG
CAGGGGCGTT GCGACTGGAC GCTGTACCTG CCGGAAGAGC ACTGGGTGAA TATCTGGACC
GGCGAAGCTC ACCACGGCGG TGAAATTACC GTGGATGCGC CCATTGGTAA GCCGCCAGTC
TTCTATCGCG CGAAGAGCGA GTGGGCTTCA CTTTTTGCTT CTTTACGGAA TATCTAA
 
Protein sequence
MSTQSKHLSS VLIEKNIEGF TLTYHQRLIL RHSAETPCLW IGAGVADIDM FRGNFSIKDK 
LNEKIALTEA TVSELPDGWL VQFSRGATIS ATLRISTDEA GRLQLDLQND DLHHNRIWLR
LAANPDDHIY GCGEQFSYFD LRGKPFPLWT SEQGVGRNKT SYVTWQADCK ENSGGDYYWT
FFPQPTFVST QKYYCHVDNS CYMNFDFSAP EYHELALWED KTTLRFECAD TYIALLEKLT
ALLGRQPELP DWVYDGVTLG IQGGTEVCQQ KLDTMRNAGV KVNGIWAQDW SGIRMTSFGK
RVMWNWKWNS DNYPQLDSRI KQWKEEGVQF LSYINPYVAS DKDLCAEAAK HGYLAKDATG
GDYLVEFGEF YGGVVDLTNP EAYDWFKDVI KKNMIALGCS GWMADFGEYL PTDTYLHNGV
SAEIMHNAWP ALWAKCNYEA LQETGKLGEI LFFMRAGYTG SQKYSTMMWA GDQNVDWSLD
DGLASVVPAA LSLAMTGHGL HHSDIGGYTT LFDMKRSKEL LLRWCDFSAF TPMMRTHEGN
RPGDNWQFDG DAETIAHFAR MTTVFTTLKP YLKQAVAQNA ATGLPVMRPL FLHYENDATT
YTLKYQYLLG QDLLVAPVHE QGRCDWTLYL PEEHWVNIWT GEAHHGGEIT VDAPIGKPPV
FYRAKSEWAS LFASLRNI
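
As a minimal sketch of how the protein sequence relates to the gene sequence, the code below translates DNA codon by codon. It builds the codon table from the standard amino-acid assignments, which translation table 11 shares (table 11 differs from the standard code only in its permitted start codons). The function names `translate` and `gc_percent` are hypothetical helpers, not part of any database tool, and the code assumes the input is in frame and contains only A, C, G, and T.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# The amino-acid assignments are those of the standard code,
# which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    seq = "".join(dna.split()).upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAGCACGC AAAGTAAGCA TTTATCCTCT"))  # MSTQSKHLSS

# Last 27 bases of the gene sequence, ending in the TAA stop codon.
print(translate("CTTTTTGCTT CTTTACGGAA TATCTAA"))  # LFASLRNI
```

Both outputs match the corresponding ends of the protein sequence in the record. Passing the full gene sequence to `gc_percent` gives a value to compare with the listed GC content.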