Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B2336 |
Symbol | |
ID | 6794265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 2252208 |
End bp | 2253206 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642776536 |
Product | D-galactose-binding periplasmic protein |
Protein accession | YP_002147161 |
Protein GI | 197251275 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.777429 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAGA AGGTACTGAC CCTTTCTGCC GTGATGGCAA GTCTGTTATT CGGCGCGCAC GCGCACGCGG CTGATACTCG TATTGGCGTG ACGATTTATA AATATGACGA TAACTTTATG TCCGTGGTGC GTAAGGCTAT TGAAAAAGAT GGCAAATCCG CGCCGGATGT TCAGCTACTG ATGAATGACT CGCAAAACGA TCAGTCCAAA CAGAATGATC AAATTGACGT TTTATTGGCG AAAGGGGTTA AAGCTCTGGC GATTAACCTG GTCGACCCGG CAGCCGCCGG TACGGTGATT GAGAAAGCGC GCGGGCAAAA TGTGCCGGTG GTGTTCTTTA ACAAAGAGCC TTCCCGCAAA GCGCTGGACA GCTATGACAA GGCGTATTAT GTCGGGACTG ACTCTAAAGA ATCCGGTGTG ATTCAGGGCG ACTTGATTGC CAAACACTGG CAGGCGAATC AGGGTTGGGA TCTGAATAAA GACGGTAAAA TTCAGTATGT TCTGCTGAAA GGCGAGCCGG GGCACCCGGA TGCTGAAGCC CGTACGACGT ATGTGGTGAA AGAGTTAAAT GATAAAGGTA TTCAGACCGA GCAACTGGCG TTAGACACCG CCATGTGGGA TACCGCGCAG GCAAAAGATA AGATGGATGC CTGGCTGTCT GGCCCGAACG CTAACAAGAT TGAAGTGGTT ATCGCGAATA ACGATGCGAT GGCGATGGGC GCGGTAGAGG CGCTGAAAGC GCATAATAAA TCGTCGATTC CGGTCTTTGG CGTCGATGCG TTACCGGAAG CTTTGGCGCT GGTGAAATCG GGCGCGATGG CCGGTACGGT ACTGAATGAC GCCAACAATC AGGCGAAAGC GACATTCGAT CTGGCGAAAA ACCTCGCCGA AGGCAAGGGC GCGGCTGACG GCACCAGCTG GAAGATTGAG AACAAAATCG TGCGCGTGCC TTATGTCGGC GTGGACAAAG ACAATCTGAG CGAATTTACC CAAAAATAA
|
Protein sequence | MNKKVLTLSA VMASLLFGAH AHAADTRIGV TIYKYDDNFM SVVRKAIEKD GKSAPDVQLL MNDSQNDQSK QNDQIDVLLA KGVKALAINL VDPAAAGTVI EKARGQNVPV VFFNKEPSRK ALDSYDKAYY VGTDSKESGV IQGDLIAKHW QANQGWDLNK DGKIQYVLLK GEPGHPDAEA RTTYVVKELN DKGIQTEQLA LDTAMWDTAQ AKDKMDAWLS GPNANKIEVV IANNDAMAMG AVEALKAHNK SSIPVFGVDA LPEALALVKS GAMAGTVLND ANNQAKATFD LAKNLAEGKG AADGTSWKIE NKIVRVPYVG VDKDNLSEFT QK
|
| |