Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A4693 |
Symbol | |
ID | 6872578 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 4557146 |
End bp | 4558501 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642787591 |
Product | alpha-galactosidase |
Protein accession | YP_002218189 |
Protein GI | 198243990 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.44607 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 77 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGACGG CACCCAAAAT TACCTTTATC GGCGCAGGTT CTACGATTTT CGTCAAAAAT ATCCTCGGCG ATGTGTTTCA CCGCGAGGCG CTAAAGTCAG CGCATGTCGC CCTGATGGAT ATTGACGAAA CCCGGCTGGA AGAGTCGCAC ATTGTGGTAC GGAAACTGAT GGACTCAGCG GGCGCTTCTG GCCGGATTAC CTGCCATACC AACCAGAAAG CGGCGCTACA GGATGCGGAT TTCGTGGTGG TCGCCTTTCA GATTGGCGGC TATGAACCCT GCACCGTGAC CGATTTTGAG GTTTGTAAGC GTCATGGCCT GGAACAGACG ATCGCCGATA CGCTGGGGCC GGGCGGCATC ATGCGCGCGC TGCGGACCAT CCCGCATCTG TGGCGGATTT GCGAAGACAT GACGGAAGTC TGTCCGAAGG CCACCATGCT CAATTACGTC AACCCGATGG CGATGAATAC CTGGGCGATG TATGCCCGTT ATCCGCATAT CAAACAGGTC GGCCTGTGCC ATTCGGTACA GGGAACGGCG GAAGAACTGG CGCGCGACCT GAATATCGAT CCCACCTCGC TGCGCTACCG CTGCGCCGGC ATTAACCACA TGGCGTTTTA CCTCGAACTG GAGCGCAAAA CGGCTGACGG GACTTATGTC AATCTCTATC CTGAATTGCT GGCGGCCTAT GACGCCGGAC AGGCGCCGAA GCCCAATATT CACGGCAATG AACGCTGCCA GAACATCGTG CGCTATGAGA TGTTCAAAAA GTTGGGCTAC TTCGTCACTG AGTCATCAGA GCATTTTGCC GAGTACACGC CGTGGTTTAT TAAACCGGGA CGCGAGGATC TGATTGCGCG CTACAAGGTG CCGCTGGATG AATATCCGAA ACGCTGCGTA GAACAACTGG CGAACTGGCA TAAAGAGCTG GAGGAGTATA AAACCGCCGA GCGTATCGAC ATCAAACCGT CCCGCGAGTA CGCCAGCACC ATTATGAACG CTCTGTGGAC CGGCGAGCCG AGCGTGATTT ACGGCAATGT GCGTAACGAG GGGCTGATTG ATAACCTGCC GCAGGGAAGC TGCGTGGAAG TGGCTTGTCT GGTGGATGCC AACGGCATTC AACCGACGAA GGTGGGGACG ATCCCTTCCC ATCTGGCGGC GATGATGCAG ACCAACATCA ACGTGCAAAC GCTGTTGACC GAAGCCATCC TCACGGAAAA CCGCGATCGC GTGTATCACG CGGCGATGAT GGACCCTCAT ACCGCAGCGG TGCTGGGTAT CGAAGAAATC TATGCGTTGG TTGACGATCT GATCGCCGCG CATGGCGACT GGCTTCCGGC CTGGTTACGC CGTTAA
|
Protein sequence | MMTAPKITFI GAGSTIFVKN ILGDVFHREA LKSAHVALMD IDETRLEESH IVVRKLMDSA GASGRITCHT NQKAALQDAD FVVVAFQIGG YEPCTVTDFE VCKRHGLEQT IADTLGPGGI MRALRTIPHL WRICEDMTEV CPKATMLNYV NPMAMNTWAM YARYPHIKQV GLCHSVQGTA EELARDLNID PTSLRYRCAG INHMAFYLEL ERKTADGTYV NLYPELLAAY DAGQAPKPNI HGNERCQNIV RYEMFKKLGY FVTESSEHFA EYTPWFIKPG REDLIARYKV PLDEYPKRCV EQLANWHKEL EEYKTAERID IKPSREYAST IMNALWTGEP SVIYGNVRNE GLIDNLPQGS CVEVACLVDA NGIQPTKVGT IPSHLAAMMQ TNINVQTLLT EAILTENRDR VYHAAMMDPH TAAVLGIEEI YALVDDLIAA HGDWLPAWLR R
|
| |