Gene SeD_A1960 details


Gene Information

Locus tag: SeD_A1960
Symbol:
ID: 6874675
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 1891979
End bp: 1893205
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 46%
IMG OID: 642785080
Product: putative regulatory protein
Protein accession: YP_002215746
Protein GI: 198245744
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0524] Sugar kinases, ribokinase family
TIGRFAM ID: [TIGR02152] ribokinase
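
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the gene is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1891979
END_BP = 1893205
GENE_LENGTH_BP = 1227
PROTEIN_LENGTH_AA = 408


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions, the span 1891979 to 1893205 covers 1227 bp, which is 409 codons: 408 amino acids plus one stop codon.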


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.0000000520335
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTTAAAG AAGAAAGACG TCATGCCATC ATTAATTTAC TGATAAAGGA TAATAGTGTT 
AGCGTCAGTA AACTTTCAGA CCTTTATAAG GTTAGCCAGG AGACTATTCG TTCCGATCTA
CGCTATTTCC AGAAATCAGG TATGCTTCAG CGTTGCTATG GCGGAGGGAT TTTAAACCGT
GACGCGCTGA GTAAGCTTAT CACTGAAAAT AAGATTGATA TCTCCAGCAC TATCGCCACG
CCAATCCATC AGGATGCAAA ACTGCGCCGG GAAAACCCAA AAAAAGCAGG CAAGGTGTGT
GTTTTAGGCT CATTCAATAT TGATGTTTCA GCAACCGTGC CGTGGTTTCC ACAAAGCGGA
GAATCCATTC TGGCCAGTCA ATTTGGATTC TATCCTGGCG GTAAAGGAGC CAACCAGGCT
TTAGCGGCGA ACAATGCCGG CGCTGCGGCA CATTTTATTT TTAAAGTGGG CAAAGATCAG
TTCAGCGCAT TTGCTATGAA TCATATTATT CAATCAGGTA TCGCCTCATA CAGCGCGTAT
CAAACAGATA AAGCGCCCAC CGGTAGCGCA TTGATCTATG TCTCCGCCGT GGATGGCGAT
AATATTATCG CCATCTACCC TGGCGCCAAT ATGATGCTCA CCACGCAAGA GATTAACGAG
CAACACCGTT ATATCGCCGA GTCTGACGTT ATGTTAATGC AGCTCGAAAC GAACATTGAA
GCGTTGACTG AATTTATTCG TCTGGGCAAA CAAGAAAATA AAATGATCAT GCTGAATCCT
GCCCCCTATA CGAAACAGGT GACGCATTTA TTATCTGATA TTGACATCAT CACGCCGAAT
GAAACTGAAG CCTCTTTTTT ATCCGGCGTA ACCATTACTG ATATTAATGA TGCGAAAAAA
GCCGGAAATA TTATTCTGCA ATCCGGGGTG AAAAAAGTCA TCATTACCCT TGGCGCCCGT
GGATCTCTGC TCTGTGAGCA CGCCCGCACG TTGTATATTC CTGCGTGGAG CGCCGTGGTA
AAAGATGCCG CCGGGGCCGG TGACGCTTTT AATGGCGCCT TAGCCGCCGC GCTGGCGCGA
CAAGCAGACA TGGTCGCAGC CATTCAATAT GCCTCCGCTT TCGCTTCTCT GGCGGTGGAA
CAAGTCGGTG CGTCGAGTAT GCCTCAGCAC TTGCAGGTTT TACATCGAAT GCGTACCCAA
TCTAATAAAG TCATTCACAT TAATTAA
 
Protein sequence
MFKEERRHAI INLLIKDNSV SVSKLSDLYK VSQETIRSDL RYFQKSGMLQ RCYGGGILNR 
DALSKLITEN KIDISSTIAT PIHQDAKLRR ENPKKAGKVC VLGSFNIDVS ATVPWFPQSG
ESILASQFGF YPGGKGANQA LAANNAGAAA HFIFKVGKDQ FSAFAMNHII QSGIASYSAY
QTDKAPTGSA LIYVSAVDGD NIIAIYPGAN MMLTTQEINE QHRYIAESDV MLMQLETNIE
ALTEFIRLGK QENKMIMLNP APYTKQVTHL LSDIDIITPN ETEASFLSGV TITDINDAKK
AGNIILQSGV KKVIITLGAR GSLLCEHART LYIPAWSAVV KDAAGAGDAF NGALAAALAR
QADMVAAIQY ASAFASLAVE QVGASSMPQH LQVLHRMRTQ SNKVIHIN
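
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the pipeline that produced this record. It builds the codon table by hand, relying on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last rows of the gene sequence are used to keep the example short.

```python
# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks of the record can be
    pasted in unchanged.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence in this record. Each full row
# holds 60 bases, so every row starts in frame.
FIRST_ROW = "ATGTTTAAAG AAGAAAGACG TCATGCCATC ATTAATTTAC TGATAAAGGA TAATAGTGTT"
LAST_ROW = "TCTAATAAAG TCATTCACAT TAATTAA"

print(translate(FIRST_ROW))  # MFKEERRHAIINLLIKDNSV
print(translate(LAST_ROW))   # SNKVIHIN
```

The two outputs match the first 20 and the last 8 residues of the protein sequence above. The final codon, TAA, is a stop codon and contributes no residue.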