Gene SeD_A4407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4407 
Symbol 
ID6875647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4255615 
End bp4257651 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content55% 
IMG OID642787327 
Productalpha-glucosidase 
Protein accessionYP_002217938 
Protein GI198244106 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.264935 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones86 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCTC TACCACAACG GTCAACCGAT TTTGAACTGA CAACATCACA GGATGGTTTT 
GCGCTTAGCT GGCAACAGCG CCTGATTTTA CGTCACAGCA CCGAAAATCC CTGTCTGTGG
ATTGGCGCGG GCGTTGCCGA CATTGACATG TTTCGCGGCA ACTTTAGCAT CAAAGACAAA
CTTAACGAAA AGATTGCATT AACGGAGGCC GCCGTCAGCG AGCTACCTGA CGGCTGGCTG
GTACAATTCA GCCGTGGCGC AACAATTAGC GCCACCCTTC GCATCTCCAC CGATGAGGCG
GGACGCCTGC AGCTGGATCT GCAAAACGAC GACCTGCACC ATAACCGTAT CTGGTTACGC
CTCGCAGCTA ATCCAGACGA CCATATCTAC GGCTGCGGCG AACAGTTCTC TTATTTCGAT
TTGCGCGGCA AGCCGTTCCC GCTGTGGACC AGCGAACAGG GCGTTGGCCG TAATAAAACC
AGCTATGTCA CCTGGCAGGC AGACTGTAAA GAGAACGCCG GCGGCGACTA TTACTGGACC
TTCTTCCCCC AGCCAACCTT TGTCAGCACG CAGAAGTATT ACTGCCACGT CGATAATAGC
TGCTATATGA ATTTCGACTT CAGCGCGCCG GAGTATCACG AACTGGCGCT GTGGGAAGAT
AAAACTACGC TGCGTTTTGA GTGTGCCGAC ACCTACATCG CCCTGCTGGA AAAACTGACT
GCGCTGTTAG GCCGCCAGCC GGAGCTGCCG GACTGGGTTT ATGACGGCGT CACGCTAGGC
ATTCAGGGCG GTACGGAAGT TTGCCAGCAA AAACTGGATA CCATGCGCAA CGCAGGCGTA
AAAGTGAACG GTATTTGGGC GCAGGACTGG TCCGGTATCC GCATGACCTC CTTTGGCAAG
CGCGTGATGT GGAACTGGAA GTGGAATAGC GACAACTATC CACAGCTGGA TAGCCGTATC
AAACAGTGGA AAGAAGAAGG CGTCCAGTTC CTCTCTTATA TCAACCCATA CGTCGCCAGT
GATAAAGACC TCTGCGCCGA GGCGGCGAAA CACGGCTATC TGGCGAAAGA CGCCACGGGC
GGCGACTATC TGGTCGAGTT TGGCGAATTC TATGGCGGCG TGGTCGATCT GACCAATCCT
GAAGCTTACG ACTGGTTCAA AGACGTCATC AAAAAGAACA TGATCGCGCT CGGCTGCAGC
GGCTGGATGG CGGATTTCGG CGAATATCTG CCGACCGACA CGTATCTGCA CAACGGCGTC
AGCGCCGAGA TCATGCATAA CGCCTGGCCT GCGCTGTGGG CGAAATGTAA CTACGAAGCG
CTACAGGAGA CCGGCAAGCT CGGCGAGATC CTGTTCTTTA TGCGTGCGGG TTACACCGGC
AGTCAGAAAT ATTCCACCAT GATGTGGGCA GGTGACCAGA ACGTTGACTG GAGCCTTGAT
GATGGTCTGG CCTCTGTCGT GCCTGCGGCA TTGTCGCTGG CAATGACCGG CCATGGTCTG
CATCACAGCG ATATCGGCGG CTACACCACC CTGTTTGACA TGAAACGCAG CAAAGAGTTG
CTGCTGCGCT GGTGCGATTT CAGCGCCTTT ACGCCGATGA TGCGCACCCA TGAAGGCAAC
CGCCCCGGCG ATAACTGGCA GTTCGACGGC GACGCGGAAA CTATTGCCCA CTTTGCCCGC
ATGACCACCG TCTTTACCAC GCTGAAACCG TATCTCAAGC AGGCGGTGGC GCAAAACGCG
GCTACCGGTC TGCCGGTCAT GCGTCCGCTA TTCCTGCATT ACGAGAACGA TGCCGCAACC
TACACCCTGA AATATCAATA TCTGCTCGGT CAGGATCTGC TGGTCGCGCC GGTTCACGAG
CAGGGGCGTT GCGACTGGAC GCTGTACCTG CCGGAAGATC ACTGGGTGAA TATCTGGACC
GGCGAAGCTC ACCACGGCGG TGAAATTACC GTGGATGCGC CCATTGGCAA GCCGCCGGTC
TTCTATCGCG CGAAGAGCGA GTGGGCTTCA CTTTTTGCTT CTTTACGGAA TATCTAA
 
Protein sequence
MNSLPQRSTD FELTTSQDGF ALSWQQRLIL RHSTENPCLW IGAGVADIDM FRGNFSIKDK 
LNEKIALTEA AVSELPDGWL VQFSRGATIS ATLRISTDEA GRLQLDLQND DLHHNRIWLR
LAANPDDHIY GCGEQFSYFD LRGKPFPLWT SEQGVGRNKT SYVTWQADCK ENAGGDYYWT
FFPQPTFVST QKYYCHVDNS CYMNFDFSAP EYHELALWED KTTLRFECAD TYIALLEKLT
ALLGRQPELP DWVYDGVTLG IQGGTEVCQQ KLDTMRNAGV KVNGIWAQDW SGIRMTSFGK
RVMWNWKWNS DNYPQLDSRI KQWKEEGVQF LSYINPYVAS DKDLCAEAAK HGYLAKDATG
GDYLVEFGEF YGGVVDLTNP EAYDWFKDVI KKNMIALGCS GWMADFGEYL PTDTYLHNGV
SAEIMHNAWP ALWAKCNYEA LQETGKLGEI LFFMRAGYTG SQKYSTMMWA GDQNVDWSLD
DGLASVVPAA LSLAMTGHGL HHSDIGGYTT LFDMKRSKEL LLRWCDFSAF TPMMRTHEGN
RPGDNWQFDG DAETIAHFAR MTTVFTTLKP YLKQAVAQNA ATGLPVMRPL FLHYENDAAT
YTLKYQYLLG QDLLVAPVHE QGRCDWTLYL PEDHWVNIWT GEAHHGGEIT VDAPIGKPPV
FYRAKSEWAS LFASLRNI