Gene SeSA_A4229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A4229 
Symbol 
ID6516400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp4107002 
End bp4109038 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content55% 
IMG OID642749191 
Productalpha-glucosidase 
Protein accessionYP_002116938 
Protein GI194737357 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCTC TACCACAACG GTCAACCGAT TTTGAACTGA CAACATCACA GGATGGTTTT 
GCACTTAGCT GGCAGCAGCG CCTGATTTTA CGCCACAGCG CCGAAACCCC CTGTCTGTGG
ATTGGCGCGG GCGTTGCCGA CATTGATATG TTTCGCGGCA ACTTTAGTAT CAAAGACAAA
CTTAACGAGA AGATTGCATT AACGGAGGCC ACGGTCAGCG AGCTACCCGA CGGCTGGCTG
GTACAATTCA GCCGTGGCGC AACAATTAGC GCCACCCTTC GCATCTCCGC CGATGAGGCG
GGACGCCTGA CGTTGGATCT GCAAAACGAC GACCTGCACC ATAACCGTAT CTGGTTACGC
CTCGCAGCTA ATCCAGACGA CCATATCTAC GGCTGCGGCG AACAGTTCTC TTATTTCGAT
TTGCGCGGCA AGCCGTTCCC GCTGTGGACC AGCGAACAGG GCGTTGGCCG TAATAAAACC
AGCTATGTCA CCTGGCAGGC AGACTGCAAA GAGAACGCCG GCGGCGACTA TTACTGGACC
TTCTTCCCGC AACCGACCTT TGTCAGCACG CAAAAATACT ACTGCCACGT TGAAAATAGC
TGTTATATGA ATTTCGACTT CAGCGCGCCG GAGTATCACG AACTGGCGCT GTGGGAAGAT
AAAACTACGC TGCGTTTTGA GTGTGCCGAC ACCTACATCG CCCTGCTGGA AAAACTGACT
GCGCTGTTAG GCCGCCAGCC GGAGCTGCCG GACTGGGTTT ACGACGGCGT CACGTTAGGC
ATTCAGGGCG GTACGGAAGT TTGCCAGAAA AAACTGGATA CCATGCGCAA CGCTGGCGTA
AAAGTGAACG GTATTTGGGC GCAGGACTGG TCCGGTATCC GCATGACCTC CTTTGGCAAG
CGCGTGATGT GGAACTGGAA GTGGAATAGC GACAACTATC CGCAACTGGA TAGCCGGATC
AAACAGTGGA AAGAAGAAGG CGTCCAGTTC CTCTCTTATA TCAATCCATA CGTCGCCAGT
GATAAAGACC TCTGCGCCGA GGCGGCGAAA CACGGCTATC TGGCGAAAGA CGCCACGGGC
GGCGACTATC TGGTCGAGTT TGGCGAATTC TATGGCGGCG TGGTCGATCT GACCAATCCT
GAAGCTTACG ACTGGTTCAA AGACGTCATC AAAAAGAACA TGATCGCGCT CGGCTGCAGC
GGCTGGATGG CGGATTTCGG CGAATATCTG CCGACCGACA CGTATCTGCA CAACGGCGTC
AGCGCCGAGA TCATGCATAA CGCCTGGCCT GCGCTGTGGG CGAAATGTAA CTACGAAGCG
CTACAGGAGA CCGGCAAACT CGGCGAGATC CTGTTCTTTA TGCGTGCGGG TTACACCGGT
AGCCAGAAGT ATTCCACCAT GATGTGGGCA GGTGACCAGA ACGTTGACTG GAGCCTTGAT
GATGGTCTGG CCTCGGTAGT ACCTGCGGCA TTGTCGCTGG CGATGACCGG CCATGGTCTG
CATCACAGCG ATATCGGCGG CTACACCACC CTGTTTGACA TGAAACGCAG CAAAGAGTTG
CTGCTGCGCT GGTGCGATTT CAGCGCCTTT ACGCCGATGA TGCGCACCCA TGAAGGCAAC
CGCCCCGGCG ATAACTGGCA GTTCGACGGC GACGCGGAAA CTATTGCCCA CTTTGCCCGC
ATGACCACCG TCTTTACCAC GCTGAAACCG TATCTCAAGC AGGCGGTGGC GCAAAACGCG
GCTACCGGTC TGCCGGTCAT GCGTCCGCTA TTCCTGCACT ACGAGAACGA TGCCGCAACC
TACGCCCTGA AATATCAATA TCTGCTCGGT CAGGATCTGC TGGTCGCGCC GGTTTATGAG
CAGGGGCGTT GCGATTGGAC GCTGTACCTG CCGGAAGATC ACTGGGTGAA TATCTGGACC
GGCGAAGCTC ACCACGGCGG TGAAATTACC GTGGATGCGC CCATTGGCAA GCCGCCGGTC
TTCTATCGCG CGAAGAGCGA GTGGGCTTCA CTTTTTGCTT CTTTACGGAA TATCTAA
 
Protein sequence
MNSLPQRSTD FELTTSQDGF ALSWQQRLIL RHSAETPCLW IGAGVADIDM FRGNFSIKDK 
LNEKIALTEA TVSELPDGWL VQFSRGATIS ATLRISADEA GRLTLDLQND DLHHNRIWLR
LAANPDDHIY GCGEQFSYFD LRGKPFPLWT SEQGVGRNKT SYVTWQADCK ENAGGDYYWT
FFPQPTFVST QKYYCHVENS CYMNFDFSAP EYHELALWED KTTLRFECAD TYIALLEKLT
ALLGRQPELP DWVYDGVTLG IQGGTEVCQK KLDTMRNAGV KVNGIWAQDW SGIRMTSFGK
RVMWNWKWNS DNYPQLDSRI KQWKEEGVQF LSYINPYVAS DKDLCAEAAK HGYLAKDATG
GDYLVEFGEF YGGVVDLTNP EAYDWFKDVI KKNMIALGCS GWMADFGEYL PTDTYLHNGV
SAEIMHNAWP ALWAKCNYEA LQETGKLGEI LFFMRAGYTG SQKYSTMMWA GDQNVDWSLD
DGLASVVPAA LSLAMTGHGL HHSDIGGYTT LFDMKRSKEL LLRWCDFSAF TPMMRTHEGN
RPGDNWQFDG DAETIAHFAR MTTVFTTLKP YLKQAVAQNA ATGLPVMRPL FLHYENDAAT
YALKYQYLLG QDLLVAPVYE QGRCDWTLYL PEDHWVNIWT GEAHHGGEIT VDAPIGKPPV
FYRAKSEWAS LFASLRNI