Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A0505 |
Symbol | |
ID | 6485559 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 516826 |
End bp | 518526 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642735925 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002039699 |
Protein GI | 194444886 |
COG category | [R] General function prediction only |
COG ID | [COG4533] ABC-type uncharacterized transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.679744 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 0.144898 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCTAC TGAATCGGCT TAATCAATAT CAACGCCTCT GGCAGCCTTC CGCCGGGGAA ACGCAACATG TCACCGTTAG CGAACTGGCC GAACGCTGTT TTTGTAGCGA GCGCCATCTG CGAACGCTGC TGCGTCAGGC CCAGCAGGCA GGTTGGCTAA GATGGGAGGC GCAGTCCGGG CGCGGGAAAC GTGGACGGCT CCAGTTTCTG GTCACGCCGG AATCGCTCCG CACCGCCATG ATGGAACAGG CGCTGGAAAA AGGGCAGCAG CTCAACGTAC TGGAACTGGC GCAGCTGGCG CCGGGCGAGT TACGGGCGAT GCTTCAGCCT TTTATGGGCG GCCAATGGCA AAACGATACG CCGACATTGC GTATTCCCTA CTATCGTCCG CTTGATCCGC TGCAGCCGGG TTTCCTGCCA GGCCGCGCGG AACAGCATCT CGCAGGGCAA GTTTTTTCCG GGTTAACGCG CTTCGACCGC GACAGTCAAT ATCCTTGCGG GGATTTGGCG CATCACTGGG AGATTTCCGC CGACGGTTTA CGCTGGGATT TTTATATTCG CTCCACGCTG CACTGGCATA ATGGCGATAC GGTGGACACC ACGCAGCTAC ACGAACGCCT GGAAAGGCTG CTTACCCTAC CGGCGCTAAG CAAATTGTTT ATTAGCGTCG CACGGATCGA AGTAACGCAT CCTCAGTGCC TGACCTTTCT CCTTCACCGA CCTGATTACT GGCTGGCGCA TCGTCTGGCG AGCTATTGTA GCGGTCTGGC GCATCCTGAC CTGCCGCTTA TCGGCACAGG TCCTTTTCGC CTGGCGTTGT TCACGCCGGA ACTGGTGCGT CTGGAAAGTC ATGACCATTA TCACCTCAGC CATCCGCTGC TGAAAGCGAT TGAATTCTGG ATCACCCCGC AACTGTTTGC CCAGGATCTG GGCACCAGTT GCCGCCATCC GGTGCAGATT GCCATCGGCA AACCGGAAGA GCTGGCGACG CTGAGTCAGG TAAGTAGTGG TATCAGTCTT GGCTTTTGTT ATTTAACCCT CAAAAAGGGC TCACGGCTCA ACGTACAGCA GGCGCGGCGT CTGATACATA TTATCCATCA TACTTCGCTG CTGAAAACCT TACCGGTAGA TGAGAACTTG ATTATGCCAA GTCAGGGGCT GCTACCCGGC TGGACAATCC CGCAATGGCA GGACGTTGAT GAAACGCCAT TGCCGAAAAA ACTTACCCTG GCGTATCACC TTCCCGTAGA GCTGCATACG ATGGCGGAAC AGCTTCGACA TTACCTGGCG ACGCTCGGCT GTGAGTTAAC GTTGATTTTT CATAATGCCA AAAACTGGGA TAACTGCCCT GCGTTGGCGC AAGCGGATCT GATGATGGGC GACAGGCTGA TCGGCGAAGC GCCGGAATAT ACGCTGGAGC AGTGGCTACG TTGCGATCAG ATCTGGCCGC ATGTCCTGGA CGCGCCTGCG TTTTCCCATC TGCAGGCTAC GCTTGACGCT CTGCAAATTC AGCCCAATGA AAAAGATCGC CGCGCCACGC TACAACAAGT TTTTGCTAAC CTGATGGATG ACGCCACACT TACGCCGCTG TTTAATTATC ACTATCGCAT CAGCGCCCCA CCGGGCGTTA ACGGCGTTCG GCTCACCCCT CGCGGCTGGT TTGAATTTAG CGAAGCCTGG CTTCCGCCGC CTTCGCCGTG A
|
Protein sequence | MRLLNRLNQY QRLWQPSAGE TQHVTVSELA ERCFCSERHL RTLLRQAQQA GWLRWEAQSG RGKRGRLQFL VTPESLRTAM MEQALEKGQQ LNVLELAQLA PGELRAMLQP FMGGQWQNDT PTLRIPYYRP LDPLQPGFLP GRAEQHLAGQ VFSGLTRFDR DSQYPCGDLA HHWEISADGL RWDFYIRSTL HWHNGDTVDT TQLHERLERL LTLPALSKLF ISVARIEVTH PQCLTFLLHR PDYWLAHRLA SYCSGLAHPD LPLIGTGPFR LALFTPELVR LESHDHYHLS HPLLKAIEFW ITPQLFAQDL GTSCRHPVQI AIGKPEELAT LSQVSSGISL GFCYLTLKKG SRLNVQQARR LIHIIHHTSL LKTLPVDENL IMPSQGLLPG WTIPQWQDVD ETPLPKKLTL AYHLPVELHT MAEQLRHYLA TLGCELTLIF HNAKNWDNCP ALAQADLMMG DRLIGEAPEY TLEQWLRCDQ IWPHVLDAPA FSHLQATLDA LQIQPNEKDR RATLQQVFAN LMDDATLTPL FNYHYRISAP PGVNGVRLTP RGWFEFSEAW LPPPSP
|
| |