Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A1298 |
Symbol | |
ID | 6485422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 1284396 |
End bp | 1285418 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642736696 |
Product | hypothetical protein |
Protein accession | YP_002040453 |
Protein GI | 194445329 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.166048 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00000000000000101215 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAAAT TGTCAGGCGT TTTCCTTCTG CTGTTGGTTG TGCTGGGTAT TGCCGCGGGC GTGGGGATGT GGAAAGTTCG CCATCTGGCG AACAGCACGT TACTTATTAA AGACGAGACT ATCTTTACGC TCAAGGCGGG AACGGGGCGG CTGGCGCTTG GTGACCAGCT TTATGATGAA AAAATCATTA ATCGCCCCCG GGTATTTCAG TGGCTGCTGC GCGTGGAGCC TGAGTTATCA CACTTTAAAG CGGGAACTTA CCGTTTTACG CCGGGGATGA CCGTACGGGA GATGCTTGAG TTGCTGGAGA GCGGCAAAGA AGCGCAATTC CCGTTGCGGT TTGTGGAAGG GATGCGCCTT AGCGACTACC TGAAACAGCT ACGAGAGGCG CCGTATATTC GCCATACATT GCCGGATGAT GACTACGCCA CTGTCGCTCA GGCATTAAAG CTTGCGCACC CGGAATGGGT AGAAGGGTGG TTCTGGCCTG ATACCTGGAT GTATACCGCC AACACCAGCG ATGTCGCTAT TCTCAAGCGA GCGCATCAAA AGATGGTGAA AGCTGTCGAT ACTGTCTGGA AAGGTCGGGC CGAGGGGCTG CCTTATAAAG ATCAGAACCA ACTGGTGACA ATGGCCTCGA TTATTGAAAA AGAGACGGCT GTCGCCAGCG AACGCGATCA GGTGGCCTCA GTCTTTATTA ATCGCCTGAG AATCGGTATG CGCCTTCAGA CCGATCCCAC CGTGATTTAC GGGATGGGGA CGAGTTATAA TGGTAACTTG TCGCGTGCGG ATCTGGAAAA GCCGACGGCT TATAACACGT ATACCATAAC CGGGCTGCCG CCAGGACCGA TTGCATCGCC CAGCGAAGCG TCATTGCAGG CGGCGGCGCA TCCGGCGAAA ACGCCGTATC TCTATTTTGT GGCCGACGGT AAAGGTGGTC ACACATTTAA CACCAATCTT GCCAGCCATA ATCGGTCAGT GCAGGAGTAC CTGAAAGTGC TTAAGGAAAA AAATGGGCAG TAA
|
Protein sequence | MKKLSGVFLL LLVVLGIAAG VGMWKVRHLA NSTLLIKDET IFTLKAGTGR LALGDQLYDE KIINRPRVFQ WLLRVEPELS HFKAGTYRFT PGMTVREMLE LLESGKEAQF PLRFVEGMRL SDYLKQLREA PYIRHTLPDD DYATVAQALK LAHPEWVEGW FWPDTWMYTA NTSDVAILKR AHQKMVKAVD TVWKGRAEGL PYKDQNQLVT MASIIEKETA VASERDQVAS VFINRLRIGM RLQTDPTVIY GMGTSYNGNL SRADLEKPTA YNTYTITGLP PGPIASPSEA SLQAAAHPAK TPYLYFVADG KGGHTFNTNL ASHNRSVQEY LKVLKEKNGQ
|
| |