Gene SeAg_B4562 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B4562 
Symbol 
ID6793535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp4464511 
End bp4465866 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content56% 
IMG OID642778648 
Productalpha-galactosidase 
Protein accessionYP_002149214 
Protein GI197248112 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGACGG CACCCAAAAT TACCTTTATC GGCGCAGGTT CTACGATTTT CGTCAAAAAT 
ATCCTCGGCG ATGTGTTTCA CCGCGAGGCG CTAAAGTCAG CGCATGTCGC CCTGATGGAT
ATTGACGAAA CCCGGCTGGA AGAGTCGCAC ATTGTGGTAC GGAAACTGAT GGACTCAGCG
GGCGCTTCTG GCCGGATTAC CTGCCATACC CACCAGAAAG CGGCGCTACA GGATGCGGAT
TTCGTGGTGG TCGCCTTTCA GATTGGCGGC TATGAACCCT GCACCGTGAC CGATTTTGAG
GTTTGTAAGC GTCATGGCCT GGAACAGACG ATCGCCGATA CGCTGGGGCC GGGCGGCATT
ATGCGCGCGC TGCGGACCAT CCCGCATCTG TGGCGGATTT GCGAAGACAT GACGGAAGTC
TGTCCGAAGG CCACCATGCT CAATTACGTC AACCCGATGG CGATGAATAC CTGGGCGATG
TATGCCCGTT ATCCGCATAT CAAACAGGTC GGCCTGTGCC ATTCGGTACA GGGAACGGCG
GAAGAACTGG CGCGCGACCT GAATATCGAT CCCGCCACGC TGCGCTACCG CTGCGCCGGC
ATTAACCACA TGGCGTTTTA CCTGGAACTG GAGCGCAAAA CGGCTGACGG GACTTATGTC
GATCTCTATC CTGAATTGCT GGCGGCCTAT GACGCCGGAC AGGCGCCGAA GCCCAATATT
CACGGCAATG AACGCTGCCA GAACATTGTG CGCTATGAGA TGTTCAAAAA GTTGGGCTAC
TTCGTCACCG AGTCATCAGA GCATTTTGCC GAGTACACGC CGTGGTTTAT TAAACCGGGA
CGCGAGGATC TGATTGCGCG CTACAAGGTG CCGCTGGATG AATATCCGAA ACGCTGCGTA
GAACAACTGG CGAACTGGCA TAAAGAGCTG GAGGAGTATA AAACCGCCGA GCGTATCGGC
ATCAAACCGT CCCGCGAGTA CGCCAGCACC ATTATGAACG CTCTGTGGAC CGGCGAGCCG
AGCGTGATTT ACGGCAATGT GCGTAATGAG GGGCTGATTG ATAACCTGCC GCAGGGAAGC
TGCGTGGAAG TGGCTTGTCT GGTGGATGCC AACGGCATTC AACCGACGAA GGTGGGGACG
ATCCCTTCTC ATCTGGCGGC GATGATGCAG ACCAACATCA ACGTGCAAAC GCTGTTGACC
GAAGCCATCC TCACGGAAAA CCGCGATCGC GTGTATCACG CGGCGATGAT GGACCCTCAT
ACCGCAGCGG TGCTGGGTAT CGAAGAAATC TATGCGTTGG TTGACGATCT GATCGCCGCG
CATGGCGACT GGCTTCCGGC CTGGTTACGC CGTTAA
 
Protein sequence
MMTAPKITFI GAGSTIFVKN ILGDVFHREA LKSAHVALMD IDETRLEESH IVVRKLMDSA 
GASGRITCHT HQKAALQDAD FVVVAFQIGG YEPCTVTDFE VCKRHGLEQT IADTLGPGGI
MRALRTIPHL WRICEDMTEV CPKATMLNYV NPMAMNTWAM YARYPHIKQV GLCHSVQGTA
EELARDLNID PATLRYRCAG INHMAFYLEL ERKTADGTYV DLYPELLAAY DAGQAPKPNI
HGNERCQNIV RYEMFKKLGY FVTESSEHFA EYTPWFIKPG REDLIARYKV PLDEYPKRCV
EQLANWHKEL EEYKTAERIG IKPSREYAST IMNALWTGEP SVIYGNVRNE GLIDNLPQGS
CVEVACLVDA NGIQPTKVGT IPSHLAAMMQ TNINVQTLLT EAILTENRDR VYHAAMMDPH
TAAVLGIEEI YALVDDLIAA HGDWLPAWLR R