Gene Information |
Locus tag | SNSL254_A4644 |
Symbol | |
ID | 6484642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 4533499 |
End bp | 4534854 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642739866 |
Product | alpha-galactosidase |
Protein accession | YP_002043548 |
Protein GI | 194445609 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
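The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4533499, 4534854)
print(length)                  # 1356
print(protein_length(length))  # 451
```

Under those assumptions the results agree with the Gene Length (1356 bp) and Protein Length (451 aa) fields.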
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 80 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGACGG CACCCAAAAT TACCTTTATC GGCGCAGGTT CTACGATTTT CGTCAAAAAT ATCCTCGGCG ATGTGTTTCA CCGCGAGGCG CTAAAGTCAG CGCATGTCGC CCTGATGGAT ATTGACGAAA CCCGGCTGGA AGAGTCGCAC ATTGTGGTAC GGAAACTGAT GGACTCAGCG GGCGCTTCTG GCCGGATTAC CTGCCATACC AACCAGAAAG CGGCGCTACA GGATGCGGAT TTCGTGGTGG TCGCCTTTCA GATTGGCGGC TATGAACCCT GCACCGTGAC CGATTTTGAG GTTTGTAAGC GTCATGGCCT GGAACAGACG ATCGCCGATA CGCTGGGGCC GGGCGGCATC ATGCGCGCGC TGCGGACCAT CCCGCATCTG TGGCGGATTT GCGAAGACAT GACGGAAGTC TGTCCGAAGG CCACCATGCT CAATTACGTC AACCCGATGG CGATGAATAC CTGGGCGATG TATGCCCGCT ATCCGCATAT CAAACAGGTC GGCCTGTGCC ATTCGGTACA GGGAACAGCG GAAGAACTGG CGCGCGACCT GAATATCGAT CCCGCCTCGC TGCGCTACCG CTGCGCCGGC ATTAACCACA TGGCGTTTTA CCTGGAACTG GAGCGCAAAA CGGCTGACGG GACTTATGTC GATCTCTATC CTGAATTGCT GGCGGCCTAT GACGCCGGAC AGGCGCCGAA GCCCAATATT CACGGCAATG AACGCTGCCA GAACATTGTG CGCTATGAGA TGTTCAAAAA GTTGGGCTAC TTCGTCACTG AGTCATCAGA GCATTTTGCC GAGTACACGC CGTGGTTTAT TAAACCGGGA CGCGAGGATC TGATTGCGCG CTACAAGGTG CCGCTGGATG AATATCCGAA ACGCTGCGTA GAACAACTGG CGAACTGGCA TAAAGAGCTG GAGGAGTATA AAACCGCCGA GCGTATCGAC ATCAAACCGT CCCGCGAGTA CGCCAGCACC ATTATGAACG CTCTGTGGAC CGGCGAGCCG AGCGTGATTT ACGGCAATGT GCGTAATGAG GGGCTGATTG ATAACCTGCC GCAGGGAAGC TGCGTGGAAG TGGCTTGTCT GGTGGATGCC AACGGCATTC AACCGACGAA GGTGGGGACG ATCCCTTCTC ATCTGGCGGC GATGATGCAG ACCAACATCA ACGTGCAAAC GCTGTTGACC GAAGCCATCC TCACGGAAAA CCGCGATCGC GTGTATCACG CGGCGATGAT GGACCCTCAT ACCGCAGCGG TGCTGGGTAT CGAAGAAATC TATGCGTTGG TTGACGATCT GATCGCCGCG CATGGCGACT GGCTTCCGGC CTGGTTACGC CGTTAA
|
Protein sequence | MMTAPKITFI GAGSTIFVKN ILGDVFHREA LKSAHVALMD IDETRLEESH IVVRKLMDSA GASGRITCHT NQKAALQDAD FVVVAFQIGG YEPCTVTDFE VCKRHGLEQT IADTLGPGGI MRALRTIPHL WRICEDMTEV CPKATMLNYV NPMAMNTWAM YARYPHIKQV GLCHSVQGTA EELARDLNID PASLRYRCAG INHMAFYLEL ERKTADGTYV DLYPELLAAY DAGQAPKPNI HGNERCQNIV RYEMFKKLGY FVTESSEHFA EYTPWFIKPG REDLIARYKV PLDEYPKRCV EQLANWHKEL EEYKTAERID IKPSREYAST IMNALWTGEP SVIYGNVRNE GLIDNLPQGS CVEVACLVDA NGIQPTKVGT IPSHLAAMMQ TNINVQTLLT EAILTENRDR VYHAAMMDPH TAAVLGIEEI YALVDDLIAA HGDWLPAWLR R
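The sequences above are displayed in space-separated blocks of 10 characters, so the whitespace has to be removed before they are used in any computation. A minimal sketch of that cleanup, together with a simple GC percentage calculation, follows; the helper names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two display blocks of the gene sequence above (not the whole gene).
prefix = clean_sequence("ATGATGACGG CACCCAAAAT")
print(len(prefix))         # 20
print(gc_percent(prefix))  # 45.0
```

Running `gc_percent` on the full cleaned gene sequence gives a value that can be compared with the GC content field, which the record reports as a rounded 56%.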
|
| |