Gene SNSL254_A1495 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1495 
Symbol 
ID6482413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1464221 
End bp1465447 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content46% 
IMG OID642736882 
Productputative regulatory protein 
Protein accessionYP_002040636 
Protein GI194445121 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0524] Sugar kinases, ribokinase family 
TIGRFAM ID[TIGR02152] ribokinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.943692 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0000000000000150767 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTTAAAG AAGAAAGACG TCATGCCATC ATTAATTTAC TGATAAAGGA TAATAGTGTT 
AGCGTCAGTA AACTTTCAGA CCTTTATAAG GTTAGCCAGG AGACTATTCG TTCCGATCTA
CGCTATTTCC AGAAATCAGG TATGCTTCAG CGTTGCTATG GCGGAGGGAT TTTAAACCGT
GACGCGCTGA GTAAGCTTAT CACTGAAAAT AAGATTGATA TCTCCAGCAC TATCGCCACG
CCAATCCATC AGGATGCAAA ACTGCGCCGG GAAAACCCAA AAAAAGCAGG CAAGGTGTGT
GTTTTAGGCT CATTCAATAT TGATGTTTCA GCAACCGTGC CGTGGTTTCC ACAAAGCGGA
GAATCCATTC TGGCCAGTCA ATTTGGATTC TATCCTGGAG GTAAAGGAGC CAACCAGGCT
TTAGCGGCAA ACAATGCCGG CGCTGCGGCA CATTTTATTT TTAAAGTGGG CAAAGATCAG
TTCAGCGCAT TTGCTATGAA TCATATTATT CAATCAGGCA TCACCTCATA CAGCGCGTAT
CAAACAGATA AAGCGCCCAC CGGTAGCGCA TTGATCTATG TCTCCGCCGT GGATGGCGAT
AATATTATCG CCATCTACCC TGGCGCCAAT ATGATGCTCA CCACGCAAGA GATTAACGAG
CAACACCGTT ATATCGCCGA GTCTGACGTT ATGTTAATGC AGCTCGAAAC GAACATTGAA
GCGTTGACTG AATTTATTCG CCTGGGCAAA CAAGAAAATA AAATGATCAT GCTGAATCCT
GCCCCCTATA CGAAACAGGT GACGCATTTA TTATCTGATA TTGACATCAT CACGCCGAAT
GAAACTGAAG CCTCTTTTTT ATCCGGCGTA ACCATTACTG ATATTAATGA TGCGAAAAAA
GCCGGAAATA TTATTCTGCA ATCCGGGGTG AAAAAAGTCA TCATTACCCT TGGCGCCCGT
GGTTCTCTGC TCTGTGAGCA CGCCCGCACG TTGTATATTC CTGCATGGAG CGCCGTGGTA
AAAGATGCCG CCGGGGCCGG TGACGCTTTT AATGGCGCCT TAGCCGCCGC GCTGGCGCGA
CAAGCAGACA TGGTCGCAGC CATTCAATAT GCCTCCGCTT TCGCTTCTCT GGCGGTGGAA
CAAGTCGGTG CGTCGAGTAT GCCTCAGCAC TTGCAGGTTT TACATCGAAT GCGTACCCAA
TCTAATAAAG TCATTCACAT TAATTAA
 
Protein sequence
MFKEERRHAI INLLIKDNSV SVSKLSDLYK VSQETIRSDL RYFQKSGMLQ RCYGGGILNR 
DALSKLITEN KIDISSTIAT PIHQDAKLRR ENPKKAGKVC VLGSFNIDVS ATVPWFPQSG
ESILASQFGF YPGGKGANQA LAANNAGAAA HFIFKVGKDQ FSAFAMNHII QSGITSYSAY
QTDKAPTGSA LIYVSAVDGD NIIAIYPGAN MMLTTQEINE QHRYIAESDV MLMQLETNIE
ALTEFIRLGK QENKMIMLNP APYTKQVTHL LSDIDIITPN ETEASFLSGV TITDINDAKK
AGNIILQSGV KKVIITLGAR GSLLCEHART LYIPAWSAVV KDAAGAGDAF NGALAAALAR
QADMVAAIQY ASAFASLAVE QVGASSMPQH LQVLHRMRTQ SNKVIHIN