Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B2365 |
Symbol | |
ID | 6794901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 2281810 |
End bp | 2283615 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642776564 |
Product | extracellular solute-binding protein |
Protein accession | YP_002147189 |
Protein GI | 197247461 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0436387 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGCGC GCGTAATGCT TCTGCTTGTC GCACTGGTTA GCGCTGGCGC CCAGGCCCAG GAAATCAAAG AAAGCTACGC TTTCGCCGTA CTCGGCGAAC CTAAGTATGC TTTTAACTTT GATCACTTTG ATTATGTGAA TCCTGCTGCG CCGAAAGGCG GTCAGATGAC GCTTTCCGCC ATTGGTACGT TCGATAATTT CAATCGCTAT TCGCTGCGCG GCAATCCCGG CGTACGTACC GAAGCCCTTT ACGATACGCT TTTTACCACC TCGGATGATG AACCCGGAAG CTATTATCCG CTGATTGCCG ACCATGCCCG CTATGCCGCC GACTATTCCT GGGTGGAAAT CTCGATTAAT CCCCGCGCTC GTTTTCACGA TGGCACGCCC ATTACCGCCC GCGATGTAGC CTTTACCTTT CATAAGTTTA TGACCGAAGG CGTACCGCAG TTCCGTCTGG TCTATAAAGG TACTACCGTG AAGGCGATTG CGCCTTTAAC CGTGCGAATT GAGCTGGCGA AGCCTGGTAA AGAAGACATG CTTAGTCTGT TTTCATTACC GATCATGCCC GAAAAATTCT GGAAAAATCA CAAGCTCAGC GATCCGCTTT CAACGCCGCC CTTAGCCAGC GGCCCATACC GGATTACTCA GTGGAAAATG GGCCAGTACA TTGTCTATTC ACGCGTCAAA AACTACTGGG CGGCTAATCT GCCGGTCAAT CGTGGACGTT TTAACCTCGA CACTATCCGC TACGATTACT ACCTTGATGA CAATGTCGCT TTCGAGGCGT TTAAAGCGGG CGCATTTGAT CTACGGCTGG AAAACGACGC TAAAAACTGG GCAACGCGCT ATATCGGTAA AAATTTCGAT AATCATTACA TCATTAAAGA AGAACAGAAA AACGAGTCGG CGCAGGACAC ACGCTGGCTG GCCTTTAATA TTCAGCGCCC GGTATTTAAA GACCGGCGGG TACGTGAAGC TGTCACCCTG GCCTTCGATT TTGAGTGGAT GAATAAAGCG CTGTTCTATA ATGCCTGGAG CCGAACCAAC AGTTACTTCC AGAATACCGA GTACGCCGCC AGAAATTACC CTGACGCCGA TGAGCTGGTA TTACTCGCGC CGATGAAAAA AGATCTTCCT CCTGAAGTCT TCACTCAGAT CTATCAGCCG CCGGTCTCTA ACGGCGACGG CTACGATCGC GAAAATCTTC TTAAAGCTGA CGCCTTGTTG ACGCAGGCCG GATGGGTGAT CAACGGACAG CAACGGGTCA ATAGCGTCAC CGGTAAGCCT CTGACGTTTG AACTTCTCCT TCCTGCCAGC AGTAATAGCC AGTGGGTTCT GCCCTTCCAG CATAATCTTC AGCGTCTGGG CATTACGATG ACTATCCGTC AGGTTGATAA TTCTCAGCTC ACCAACCGGA TGCGCAGCCG CGACTATGAC ATGATGCCGA GGCTATGGCG GGCGATGCCC TGGCCCAGCT CCGATCTACA AATCTCATGG GCGTCGGAAT ACATTGACTC CAGTTATAAC GCTCCCGGCG TACAAAGCTC GGTGGTGGAT AAACTGATCG CGCAAATTAT CGCAGCGCAG GGTGATAAAG CGAAACTGGT GCCGCTGGGA CGGGCGCTGG ATCGCGTGCT GACCTGGAAC TATTACATGC TGCCGATGTG GTATATGGCG CAAGACAGGC TCGCCTGGTG GGATAAATTC TCCCATCCGG CGATTCGCCC GGTATATACC ATCGGGTTAG ATACTTGGTG GTATGATGTC AACAAAGCCG CCAAGCTACC GGCAGCCAGG AGGTAG
|
Protein sequence | MIARVMLLLV ALVSAGAQAQ EIKESYAFAV LGEPKYAFNF DHFDYVNPAA PKGGQMTLSA IGTFDNFNRY SLRGNPGVRT EALYDTLFTT SDDEPGSYYP LIADHARYAA DYSWVEISIN PRARFHDGTP ITARDVAFTF HKFMTEGVPQ FRLVYKGTTV KAIAPLTVRI ELAKPGKEDM LSLFSLPIMP EKFWKNHKLS DPLSTPPLAS GPYRITQWKM GQYIVYSRVK NYWAANLPVN RGRFNLDTIR YDYYLDDNVA FEAFKAGAFD LRLENDAKNW ATRYIGKNFD NHYIIKEEQK NESAQDTRWL AFNIQRPVFK DRRVREAVTL AFDFEWMNKA LFYNAWSRTN SYFQNTEYAA RNYPDADELV LLAPMKKDLP PEVFTQIYQP PVSNGDGYDR ENLLKADALL TQAGWVINGQ QRVNSVTGKP LTFELLLPAS SNSQWVLPFQ HNLQRLGITM TIRQVDNSQL TNRMRSRDYD MMPRLWRAMP WPSSDLQISW ASEYIDSSYN APGVQSSVVD KLIAQIIAAQ GDKAKLVPLG RALDRVLTWN YYMLPMWYMA QDRLAWWDKF SHPAIRPVYT IGLDTWWYDV NKAAKLPAAR R
|
| |