Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B1987 |
Symbol | |
ID | 6793661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 1930495 |
End bp | 1931517 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642776213 |
Product | hypothetical protein |
Protein accession | YP_002146844 |
Protein GI | 197251170 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.000318105 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT TGTCAGGCGT TTTCCTTCTG CTGTTGGTTG TGCTGGGTAT TGCCGCGGGC GTGGGGATGT GGAAAGTTCG CCATCTGGCG AACAGCACGT TACTTATTAA AAACGAGACT ATCTTTACGC TCAAGGCGGG AACGGGGCGG CTGGCGCTTG GCGACCAGCT TTATGATGAA AAAATCATTA ATCGCCCCCG GGTATTTCAG TGGCTGCTGC GCGTGGAGCC TGAGTTATCG CACTTTAAAG CAGGAACTTA CCGTTTTACG CCGGGGATGA CCGTACGGGA GATGCTTGAG TTGCTGGAGA GCGGCAAAGA AGCGCAATTC CCGTTGCGGT TTGTGGAAGG GATGCGCCTT AGCGATTACC TGAAACAGCT ACGAGAGGCG CCGTATATTC GCCATACATT GCCGGATGAT GACTACGCCA CTGTCGCTCA GGCATTAAAG CTTGCGCACC CGGAATGGGT AGAAGGGTGG TTCTGGCCTG ATACCTGGAT GTATACCGCC AACACCAGCG ATGTCGCTAT TCTCAAGCGA GCGCATCAAA AGATGGTGAA AGCTGTCGAT ACTGTCTGGA AAGGTCGGGC CGAGGGGCTG CCTTATAAAG ATCAGAACCA ACTGGTGACA ATGGCCTCGA TTATTGAAAA AGAGACGGCT GTCGCCAGCG AACGCGATCA GGTGGCCTCA GTCTTTATTA ATCGCCTGAG AATCGGTATG CGCCTTCAGA CCGATCCCAC CGTGATTTAC GGGATGGGGA CGAGTTATAA TGGTAACTTG TCGCGTGCGG ATCTGGAAAA GCCGACGGCT TATAACACGT ATACCATAAC CGGGCTGCCG CCAGGACCGA TTGCATCGCC CAGCGAAGCG TCATTGCAGG CGGCGGCGCA TCCGGCGAAA ACGCCGTATC TCTATTTTGT GGCCGACGGT AAAGGCGGTC ACACATTTAA CACCAATCTT GCCAGCCATA ATCGGTCAGT GCAGGAGTAC CTGAAAGTGC TTAAGGAAAA AAATGGGCAG TAA
|
Protein sequence | MKKLSGVFLL LLVVLGIAAG VGMWKVRHLA NSTLLIKNET IFTLKAGTGR LALGDQLYDE KIINRPRVFQ WLLRVEPELS HFKAGTYRFT PGMTVREMLE LLESGKEAQF PLRFVEGMRL SDYLKQLREA PYIRHTLPDD DYATVAQALK LAHPEWVEGW FWPDTWMYTA NTSDVAILKR AHQKMVKAVD TVWKGRAEGL PYKDQNQLVT MASIIEKETA VASERDQVAS VFINRLRIGM RLQTDPTVIY GMGTSYNGNL SRADLEKPTA YNTYTITGLP PGPIASPSEA SLQAAAHPAK TPYLYFVADG KGGHTFNTNL ASHNRSVQEY LKVLKEKNGQ
|
| |