Gene SeHA_C4341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4341 
Symbol 
ID6490635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4231577 
End bp4233613 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content56% 
IMG OID642744427 
Productalpha-glucosidase 
Protein accessionYP_002048016 
Protein GI194450429 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones83 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGC AAAGTAAGCA TTTATCCTCT GTCCTGATAG AAAAGAACAT TGAGGGCTTT 
ACGCTGACGT ACCACCAGCG CCTGATTTTA CGCCACAGCG CCGAAACCCC CTGTCTGTGG
ATTGGCGCGG GCGTTGCCGA CATTGACATG TTTCGCGGCA ACTTCAGCAT CAAAGACAAA
CTTAACGAGA AGATTGCATT AACGGAGGCC ACCATCAGCG AGCTACCTGA CGGCTGGCTG
GTACAATTCA GCCGTGGCGC AACAATTAGC GCCACCCTTC GCATCTCCAC CGATGAGGCG
GGACGCCTGC AGCTGGATCT GCAAAACGAC GACCTGCACC ATAACCGTAT CTGGTTACGC
CTCGCAGCTA ATCCAGACGA CCATATCTAC GGCTGCGGCG AACAGTTCTC TTATTTCGAT
TTGCGCGGCA AGCCGTTCCC GCTGTGGACC AGCGAACAGG GCGTTGGCCG TAATAAAACC
AGCTATGTCA CCTGGCAGGC AGACTGTAAA GAGAACTCCG GCGGCGACTA TTACTGGACC
TTCTTCCCGC AACCGACCTT TGTCAGCACG CAGAAGTATT ACTGCCACGT CGATAATAGC
TGCTATATGA ATTTCGACTT CAGCGCGCCG GAGTATCACG AACTGGCGCT GTGGGAAGAT
AAAACTACGC TACGTTTTGA GTGTGCCGAC ACCTACATCG CCCTACTGGA AAAACTGACT
GCGCTGTTAG GTCGCCAGCC GGAGCTGCCG GACTGGGTTT ACGACGGCGT CACGCTAGGC
ATTCAGGGCG GTACGGAAGT TTGCCAGCAA AAACTGGATA CCATGCGCAA CGCAGGCGTA
AAAGTGAACG GTATTTGGGC GCAGGACTGG TCCGGTATCC GCATGACCTC CTTTGGCAAA
CGCGTGATGT GGAACTGGAA GTGGAATAGC GACAACTATC CGCAACTGGA TAGCCGGATC
AAACAGTGGA AAGAAGAAGG CGTACAGTTC CTCTCTTATA TCAACCCATA CGTCGCCAGT
GATAAAGACC TCTGCGCCGA GGCGGCGAAA CACGGCTACC TGGCGAAAGA CGCCACGGGC
GGCGACTATC TGGTCGAGTT TGGCGAATTC TATGGCGGCG TGGTCGATCT GACCAATCCT
GAAGCTTACG ACTGGTTCAA AGACGTCATC AAAAAGAACA TGATCGCGCT CGGCTGCAGC
GGCTGGATGG CAGATTTCGG CGAATATCTG CCGACCGACA CGTATCTGCA CAACGGCGTC
AGCGCCGAGA TCATGCATAA CGCCTGGCCC GCGCTGTGGG CGAAGTGTAA CTACGAAGCG
CTACAGGAGA CCGGCAAGCT CGGCGAGATC CTGTTCTTTA TGCGTGCGGG TTACACCGGC
AGTCAGAAAT ATTCCACCAT GATGTGGGCA GGCGACCAGA ACGTTGACTG GAGCCTTGAT
GATGGTCTGG CCTCTGTCGT GCCTGCGGCA TTGTCGCTGG CGATGACCGG CCATGGTCTG
CATCACAGCG ATATCGGCGG CTACACCACC CTGTTTGACA TGAAACGCAG CAAAGAGTTG
CTGCTGCGCT GGTGCGATTT CAGCGCCTTT ACGCCGATGA TGCGCACCCA TGAAGGCAAC
CGCCCCGGCG ATAACTGGCA GTTCGACGGC GACGCGGAAA CTATTGCCCA CTTTGCCCGC
ATGACCACCG TCTTTACCAC GCTGAAACCG TACCTCAAGC AGGCGGTGGC GCAAAACGCG
GCTACCGGTC TGCCGGTCAT GCGTCCGCTA TTCCTGCACT ACGAGAACGA TGCCGCAACC
TACACCCTGA AATATCAATA TCTGCTCGGT CAGGATCTGC TGGTCGCGCC GGTTCACGAG
CAGGGGCGTT GCGACTGGAC GCTGTACCTG CCGGAAGATC ACTGGGTGAA TATCTGGACC
GGTGAAGTTC ACCACGGCGG TGAAATTACC GTGGATGCGC CCATTGGCAA GCCGCCGGTC
TTCTATCGCG CGAAGAGCGA GTGGGCTTCA CTTTTTGCTT CTTTACGGAA TATCTAA
 
Protein sequence
MSTQSKHLSS VLIEKNIEGF TLTYHQRLIL RHSAETPCLW IGAGVADIDM FRGNFSIKDK 
LNEKIALTEA TISELPDGWL VQFSRGATIS ATLRISTDEA GRLQLDLQND DLHHNRIWLR
LAANPDDHIY GCGEQFSYFD LRGKPFPLWT SEQGVGRNKT SYVTWQADCK ENSGGDYYWT
FFPQPTFVST QKYYCHVDNS CYMNFDFSAP EYHELALWED KTTLRFECAD TYIALLEKLT
ALLGRQPELP DWVYDGVTLG IQGGTEVCQQ KLDTMRNAGV KVNGIWAQDW SGIRMTSFGK
RVMWNWKWNS DNYPQLDSRI KQWKEEGVQF LSYINPYVAS DKDLCAEAAK HGYLAKDATG
GDYLVEFGEF YGGVVDLTNP EAYDWFKDVI KKNMIALGCS GWMADFGEYL PTDTYLHNGV
SAEIMHNAWP ALWAKCNYEA LQETGKLGEI LFFMRAGYTG SQKYSTMMWA GDQNVDWSLD
DGLASVVPAA LSLAMTGHGL HHSDIGGYTT LFDMKRSKEL LLRWCDFSAF TPMMRTHEGN
RPGDNWQFDG DAETIAHFAR MTTVFTTLKP YLKQAVAQNA ATGLPVMRPL FLHYENDAAT
YTLKYQYLLG QDLLVAPVHE QGRCDWTLYL PEDHWVNIWT GEVHHGGEIT VDAPIGKPPV
FYRAKSEWAS LFASLRNI