Gene SNSL254_A0505 details


Gene Information

Locus tag: SNSL254_A0505
Symbol:
ID: 6485559
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 516826
End bp: 518526
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 56%
IMG OID: 642735925
Product: extracellular solute-binding protein family 5
Protein accession: YP_002039699
Protein GI: 194444886
COG category: [R] General function prediction only
COG ID: [COG4533] ABC-type uncharacterized transport system, periplasmic component
TIGRFAM ID:
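
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values, but the record does not state them. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 516826
end_bp = 518526

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon
# is a stop codon, which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1701 bp) and Protein Length (566 aa) fields.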


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.679744
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value: 0.144898
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCTAC TGAATCGGCT TAATCAATAT CAACGCCTCT GGCAGCCTTC CGCCGGGGAA 
ACGCAACATG TCACCGTTAG CGAACTGGCC GAACGCTGTT TTTGTAGCGA GCGCCATCTG
CGAACGCTGC TGCGTCAGGC CCAGCAGGCA GGTTGGCTAA GATGGGAGGC GCAGTCCGGG
CGCGGGAAAC GTGGACGGCT CCAGTTTCTG GTCACGCCGG AATCGCTCCG CACCGCCATG
ATGGAACAGG CGCTGGAAAA AGGGCAGCAG CTCAACGTAC TGGAACTGGC GCAGCTGGCG
CCGGGCGAGT TACGGGCGAT GCTTCAGCCT TTTATGGGCG GCCAATGGCA AAACGATACG
CCGACATTGC GTATTCCCTA CTATCGTCCG CTTGATCCGC TGCAGCCGGG TTTCCTGCCA
GGCCGCGCGG AACAGCATCT CGCAGGGCAA GTTTTTTCCG GGTTAACGCG CTTCGACCGC
GACAGTCAAT ATCCTTGCGG GGATTTGGCG CATCACTGGG AGATTTCCGC CGACGGTTTA
CGCTGGGATT TTTATATTCG CTCCACGCTG CACTGGCATA ATGGCGATAC GGTGGACACC
ACGCAGCTAC ACGAACGCCT GGAAAGGCTG CTTACCCTAC CGGCGCTAAG CAAATTGTTT
ATTAGCGTCG CACGGATCGA AGTAACGCAT CCTCAGTGCC TGACCTTTCT CCTTCACCGA
CCTGATTACT GGCTGGCGCA TCGTCTGGCG AGCTATTGTA GCGGTCTGGC GCATCCTGAC
CTGCCGCTTA TCGGCACAGG TCCTTTTCGC CTGGCGTTGT TCACGCCGGA ACTGGTGCGT
CTGGAAAGTC ATGACCATTA TCACCTCAGC CATCCGCTGC TGAAAGCGAT TGAATTCTGG
ATCACCCCGC AACTGTTTGC CCAGGATCTG GGCACCAGTT GCCGCCATCC GGTGCAGATT
GCCATCGGCA AACCGGAAGA GCTGGCGACG CTGAGTCAGG TAAGTAGTGG TATCAGTCTT
GGCTTTTGTT ATTTAACCCT CAAAAAGGGC TCACGGCTCA ACGTACAGCA GGCGCGGCGT
CTGATACATA TTATCCATCA TACTTCGCTG CTGAAAACCT TACCGGTAGA TGAGAACTTG
ATTATGCCAA GTCAGGGGCT GCTACCCGGC TGGACAATCC CGCAATGGCA GGACGTTGAT
GAAACGCCAT TGCCGAAAAA ACTTACCCTG GCGTATCACC TTCCCGTAGA GCTGCATACG
ATGGCGGAAC AGCTTCGACA TTACCTGGCG ACGCTCGGCT GTGAGTTAAC GTTGATTTTT
CATAATGCCA AAAACTGGGA TAACTGCCCT GCGTTGGCGC AAGCGGATCT GATGATGGGC
GACAGGCTGA TCGGCGAAGC GCCGGAATAT ACGCTGGAGC AGTGGCTACG TTGCGATCAG
ATCTGGCCGC ATGTCCTGGA CGCGCCTGCG TTTTCCCATC TGCAGGCTAC GCTTGACGCT
CTGCAAATTC AGCCCAATGA AAAAGATCGC CGCGCCACGC TACAACAAGT TTTTGCTAAC
CTGATGGATG ACGCCACACT TACGCCGCTG TTTAATTATC ACTATCGCAT CAGCGCCCCA
CCGGGCGTTA ACGGCGTTCG GCTCACCCCT CGCGGCTGGT TTGAATTTAG CGAAGCCTGG
CTTCCGCCGC CTTCGCCGTG A
 
Protein sequence
MRLLNRLNQY QRLWQPSAGE TQHVTVSELA ERCFCSERHL RTLLRQAQQA GWLRWEAQSG 
RGKRGRLQFL VTPESLRTAM MEQALEKGQQ LNVLELAQLA PGELRAMLQP FMGGQWQNDT
PTLRIPYYRP LDPLQPGFLP GRAEQHLAGQ VFSGLTRFDR DSQYPCGDLA HHWEISADGL
RWDFYIRSTL HWHNGDTVDT TQLHERLERL LTLPALSKLF ISVARIEVTH PQCLTFLLHR
PDYWLAHRLA SYCSGLAHPD LPLIGTGPFR LALFTPELVR LESHDHYHLS HPLLKAIEFW
ITPQLFAQDL GTSCRHPVQI AIGKPEELAT LSQVSSGISL GFCYLTLKKG SRLNVQQARR
LIHIIHHTSL LKTLPVDENL IMPSQGLLPG WTIPQWQDVD ETPLPKKLTL AYHLPVELHT
MAEQLRHYLA TLGCELTLIF HNAKNWDNCP ALAQADLMMG DRLIGEAPEY TLEQWLRCDQ
IWPHVLDAPA FSHLQATLDA LQIQPNEKDR RATLQQVFAN LMDDATLTPL FNYHYRISAP
PGVNGVRLTP RGWFEFSEAW LPPPSP
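
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. NCBI translation table 11 assigns the same amino acids to codons as the standard table and differs in which codons may initiate translation, so a standard codon map is used here. The helper names `translate` and `gc_content` are hypothetical, and the code assumes an uppercase DNA sequence in which spaces and line breaks are only formatting.

```python
from itertools import product

# Codon-to-amino-acid map in the conventional T, C, A, G ordering.
# Table 11 shares these assignments with the standard table.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGAGGCTAC TGAATCGGCT TAATCAATAT"))  # MRLLNRLNQY
```

Applying `translate` to the full gene sequence should reproduce the protein sequence listed above, and `gc_content` can be compared with the GC content field in the gene information.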