Gene SNSL254_A2381 details

Gene Information

Locus tag: SNSL254_A2381
Symbol:
ID: 6484489
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2297842
End bp: 2298840
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 51%
IMG OID: 642737720
Product: D-galactose-binding periplasmic protein
Protein accession: YP_002041462
Protein GI: 194443772
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
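
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2297842
end_bp = 2298840

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 999
print(protein_length)  # 332
```

Under these assumptions the result agrees with the listed Gene Length (999 bp) and Protein Length (332 aa).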


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 69
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAAGA AGGTACTGAC CCTTTCTGCC GTGATGGCAA GTCTGTTATT CGGCGCGCAC 
GCGCACGCGG CTGATACTCG TATTGGCGTG ACGATTTATA AATATGACGA TAACTTTATG
TCCGTGGTGC GTAAGGCTAT TGAAAAAGAT GGCAAATCCG CGCCGGATGT TCAGCTACTG
ATGAATGACT CGCAAAACGA TCAGTCCAAA CAGAATGATC AAATTGACGT TTTATTGGCG
AAAGGGGTTA AAGCTCTGGC GATTAACCTG GTCGACCCGG CAGCCGCCGG TACGGTGATT
GAGAAAGCTC GCGGGCAAAA TGTGCCGGTG GTGTTCTTTA ACAAAGAGCC TTCCCGCAAA
GCGCTGGACA GCTATGACAA GGCGTATTAT GTCGGGACTG ACTCTAAAGA ATCCGGTGTG
ATTCAGGGCG ACTTGATTGC CAAACACTGG CAAGCGAATC AGGGTTGGGA TCTGAATAAA
GACGGTAAAA TTCAGTATGT TCTGCTGAAA GGCGAGCCGG GGCATCCGGA TGCTGAAGCC
CGTACGACGT ATGTGGTGAA AGAGTTAAAT GCTAAAGGTA TTCAGACCGA GCAACTGGCG
TTAGACACCG CTATGTGGGA TACCGCGCAG GCAAAAGATA AGATGGATGC CTGGCTGTCT
GGCCCGAACG CTAACAAGAT TGAAGTGGTT ATCGCGAATA ACGATGCGAT GGCGATGGGC
GCGGTAGAGG CGCTGAAAGC GCATAATAAA TCGTCGATTC CGGTCTTTGG CGTCGATGCG
TTACCGGAAG CCCTGGCGCT GGTGAAATCG GGCGCGATGG CCGGTACGGT ACTGAATGAC
GCCAACAATC AGGCGAAAGC GACATTCGAT CTGGCGAAAA ACCTCGCCGA AGGCAAGGGC
GCGGCTGACG GCACCAGCTG GAAGATTGAG AACAAAATCG TGCGCGTGCC TTATGTCGGC
GTGGACAAAG ACAATCTGAG CGAATTTACC CAAAAATAA
 
Protein sequence
MNKKVLTLSA VMASLLFGAH AHAADTRIGV TIYKYDDNFM SVVRKAIEKD GKSAPDVQLL 
MNDSQNDQSK QNDQIDVLLA KGVKALAINL VDPAAAGTVI EKARGQNVPV VFFNKEPSRK
ALDSYDKAYY VGTDSKESGV IQGDLIAKHW QANQGWDLNK DGKIQYVLLK GEPGHPDAEA
RTTYVVKELN AKGIQTEQLA LDTAMWDTAQ AKDKMDAWLS GPNANKIEVV IANNDAMAMG
AVEALKAHNK SSIPVFGVDA LPEALALVKS GAMAGTVLND ANNQAKATFD LAKNLAEGKG
AADGTSWKIE NKIVRVPYVG VDKDNLSEFT QK
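
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below translates the first 60 bases and the last 9 bases of the gene sequence as a spot check. It is a minimal illustration, not the database's own method: the `translate` helper and the table-building code are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), so a standard codon table is used here.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 shares these amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line and final three codons of the gene sequence.
first_60 = "ATGAATAAGA AGGTACTGAC CCTTTCTGCC GTGATGGCAA GTCTGTTATT CGGCGCGCAC"
last_9 = "CAAAAATAA"

print(translate(first_60))  # MNKKVLTLSAVMASLLFGAH
print(translate(last_9))    # QK
```

The two outputs match the first 20 residues and the final 2 residues of the protein sequence; the closing TAA is the stop codon and contributes no residue.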