Gene SNSL254_A1298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1298 
Symbol 
ID6485422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1284396 
End bp1285418 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content53% 
IMG OID642736696 
Producthypothetical protein 
Protein accessionYP_002040453 
Protein GI194445329 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.166048 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00000000000000101215 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAAAT TGTCAGGCGT TTTCCTTCTG CTGTTGGTTG TGCTGGGTAT TGCCGCGGGC 
GTGGGGATGT GGAAAGTTCG CCATCTGGCG AACAGCACGT TACTTATTAA AGACGAGACT
ATCTTTACGC TCAAGGCGGG AACGGGGCGG CTGGCGCTTG GTGACCAGCT TTATGATGAA
AAAATCATTA ATCGCCCCCG GGTATTTCAG TGGCTGCTGC GCGTGGAGCC TGAGTTATCA
CACTTTAAAG CGGGAACTTA CCGTTTTACG CCGGGGATGA CCGTACGGGA GATGCTTGAG
TTGCTGGAGA GCGGCAAAGA AGCGCAATTC CCGTTGCGGT TTGTGGAAGG GATGCGCCTT
AGCGACTACC TGAAACAGCT ACGAGAGGCG CCGTATATTC GCCATACATT GCCGGATGAT
GACTACGCCA CTGTCGCTCA GGCATTAAAG CTTGCGCACC CGGAATGGGT AGAAGGGTGG
TTCTGGCCTG ATACCTGGAT GTATACCGCC AACACCAGCG ATGTCGCTAT TCTCAAGCGA
GCGCATCAAA AGATGGTGAA AGCTGTCGAT ACTGTCTGGA AAGGTCGGGC CGAGGGGCTG
CCTTATAAAG ATCAGAACCA ACTGGTGACA ATGGCCTCGA TTATTGAAAA AGAGACGGCT
GTCGCCAGCG AACGCGATCA GGTGGCCTCA GTCTTTATTA ATCGCCTGAG AATCGGTATG
CGCCTTCAGA CCGATCCCAC CGTGATTTAC GGGATGGGGA CGAGTTATAA TGGTAACTTG
TCGCGTGCGG ATCTGGAAAA GCCGACGGCT TATAACACGT ATACCATAAC CGGGCTGCCG
CCAGGACCGA TTGCATCGCC CAGCGAAGCG TCATTGCAGG CGGCGGCGCA TCCGGCGAAA
ACGCCGTATC TCTATTTTGT GGCCGACGGT AAAGGTGGTC ACACATTTAA CACCAATCTT
GCCAGCCATA ATCGGTCAGT GCAGGAGTAC CTGAAAGTGC TTAAGGAAAA AAATGGGCAG
TAA
 
Protein sequence
MKKLSGVFLL LLVVLGIAAG VGMWKVRHLA NSTLLIKDET IFTLKAGTGR LALGDQLYDE 
KIINRPRVFQ WLLRVEPELS HFKAGTYRFT PGMTVREMLE LLESGKEAQF PLRFVEGMRL
SDYLKQLREA PYIRHTLPDD DYATVAQALK LAHPEWVEGW FWPDTWMYTA NTSDVAILKR
AHQKMVKAVD TVWKGRAEGL PYKDQNQLVT MASIIEKETA VASERDQVAS VFINRLRIGM
RLQTDPTVIY GMGTSYNGNL SRADLEKPTA YNTYTITGLP PGPIASPSEA SLQAAAHPAK
TPYLYFVADG KGGHTFNTNL ASHNRSVQEY LKVLKEKNGQ