Gene Information |
Locus tag | SNSL254_A2381 |
Symbol | |
ID | 6484489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 2297842 |
End bp | 2298840 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642737720 |
Product | D-galactose-binding periplasmic protein |
Protein accession | YP_002041462 |
Protein GI | 194443772 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
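The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in exactly one stop codon; both are assumptions about this record's conventions rather than something the record states, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2297842, 2298840)
print(length, protein_length_aa(length))  # 999 332
```

Under these assumptions the computed values agree with the Gene Length (999 bp) and Protein Length (332 aa) fields.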
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAATAAGA AGGTACTGAC CCTTTCTGCC GTGATGGCAA GTCTGTTATT CGGCGCGCAC GCGCACGCGG CTGATACTCG TATTGGCGTG ACGATTTATA AATATGACGA TAACTTTATG TCCGTGGTGC GTAAGGCTAT TGAAAAAGAT GGCAAATCCG CGCCGGATGT TCAGCTACTG ATGAATGACT CGCAAAACGA TCAGTCCAAA CAGAATGATC AAATTGACGT TTTATTGGCG AAAGGGGTTA AAGCTCTGGC GATTAACCTG GTCGACCCGG CAGCCGCCGG TACGGTGATT GAGAAAGCTC GCGGGCAAAA TGTGCCGGTG GTGTTCTTTA ACAAAGAGCC TTCCCGCAAA GCGCTGGACA GCTATGACAA GGCGTATTAT GTCGGGACTG ACTCTAAAGA ATCCGGTGTG ATTCAGGGCG ACTTGATTGC CAAACACTGG CAAGCGAATC AGGGTTGGGA TCTGAATAAA GACGGTAAAA TTCAGTATGT TCTGCTGAAA GGCGAGCCGG GGCATCCGGA TGCTGAAGCC CGTACGACGT ATGTGGTGAA AGAGTTAAAT GCTAAAGGTA TTCAGACCGA GCAACTGGCG TTAGACACCG CTATGTGGGA TACCGCGCAG GCAAAAGATA AGATGGATGC CTGGCTGTCT GGCCCGAACG CTAACAAGAT TGAAGTGGTT ATCGCGAATA ACGATGCGAT GGCGATGGGC GCGGTAGAGG CGCTGAAAGC GCATAATAAA TCGTCGATTC CGGTCTTTGG CGTCGATGCG TTACCGGAAG CCCTGGCGCT GGTGAAATCG GGCGCGATGG CCGGTACGGT ACTGAATGAC GCCAACAATC AGGCGAAAGC GACATTCGAT CTGGCGAAAA ACCTCGCCGA AGGCAAGGGC GCGGCTGACG GCACCAGCTG GAAGATTGAG AACAAAATCG TGCGCGTGCC TTATGTCGGC GTGGACAAAG ACAATCTGAG CGAATTTACC CAAAAATAA
Protein sequence | MNKKVLTLSA VMASLLFGAH AHAADTRIGV TIYKYDDNFM SVVRKAIEKD GKSAPDVQLL MNDSQNDQSK QNDQIDVLLA KGVKALAINL VDPAAAGTVI EKARGQNVPV VFFNKEPSRK ALDSYDKAYY VGTDSKESGV IQGDLIAKHW QANQGWDLNK DGKIQYVLLK GEPGHPDAEA RTTYVVKELN AKGIQTEQLA LDTAMWDTAQ AKDKMDAWLS GPNANKIEVV IANNDAMAMG AVEALKAHNK SSIPVFGVDA LPEALALVKS GAMAGTVLND ANNQAKATFD LAKNLAEGKG AADGTSWKIE NKIVRVPYVG VDKDNLSEFT QK
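The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The sketch below shows one way to strip the block spacing and compute a GC percentage; the helper names are hypothetical, and only the first two blocks of the gene sequence are used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing (and any other whitespace) and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence shown above.
prefix = clean_sequence("ATGAATAAGA AGGTACTGAC")
print(len(prefix), gc_percent(prefix))  # 20 35.0
```

Applying the same helpers to the full Gene sequence value gives a length and GC percentage that can be compared against the Gene Length and GC content fields.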