Gene SeD_A2554 details


Gene Information

Locus tag: SeD_A2554
Symbol:
ID: 6873355
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 2434107
End bp: 2435795
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 59%
IMG OID: 642785629
Product: PTS system fructose-specific transporter subunits IIBC
Protein accession: YP_002216287
Protein GI: 198244051
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1299] Phosphotransferase system, fructose-specific IIC component
        [COG1445] Phosphotransferase system fructose-specific component IIB
TIGRFAM ID: [TIGR00829] PTS system, fructose-specific, IIB component
            [TIGR01427] PTS system, fructose subfamily, IIC component
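
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(2434107, 2435795)
print(length_bp)                  # 1689
print(protein_length(length_bp))  # 562
```

Both results match the Gene Length and Protein Length fields of this record.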


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.695435
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.000294778
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAACGC TGCTGATAAT TGACGCTAAT CTCGGCCAGG CTCGCGCTTA CATGGCGAAA 
ACCCTGCTTG GAGCGGCGGC GCATAAAGCA AATCTGGAAA TTATCGACAA TCCGAATGAT
GCTGAGCTGG CGATCGTGCT GGGCGAATCT CTGCCGAACG ACAATGCTCT GAACGGCAAA
AAGGTTTGGC TGGGGGATAT TGGTCGCGCC GTCGCGCATC CGGAACTGTT TCTGAGTGAA
GCCAAAAGCC ATGCGACCCT TTACAGCGCC CCTGCTGCGG CTGCTCCTGC GGCGAGCGGC
GGTCCGAAAC GCGTGGTCGC GGTAACGGCC TGCCCGACCG GCGTCGCCCA TACTTTTATG
GCGGCTGAAG CCATTGAAAC AGAAGCGAAA AAACGCGGCT GGTGGGTAAA AGTCGAAACG
CGCGGCTCCG TGGGGGCCGG CAATGCCATC ACCCCGGAAG AGGTGGCGGA AGCGGATTTA
GTGATTGTGG CGGCGGATAT CGAAGTGGAT CTGGCGAAGT TTGCCGGTCT CCCGATGTAC
CGCACGTCGA CCGGCCTGGC GCTGAAAAAG ACGGCGCAGG AGCTGGATAA AGCCGTAGCG
GAAGCGACGC CGTATCAACC GGCGGGTAAG GCATCGCAAG CGGCGACCGA AGGGAAGAAA
GAGAGCGCTG GCGCATACCG GCATCTGTTG ACGGGCGTTT CTTACATGCT GCCTATGGTG
GTTGCCGGGG GACTGTGTAT TGCGCTTTCC TTCGCCTTTG GTATTGAAGC CTTTAAGGTG
CCGGACACGC TGGCGGCGGC GCTGATGCAG ATTGGCGGCG GTTCGGCGTT TGCGCTGATG
GTGCCGGTAC TGGCGGGTTA TATCGCTTTC TCCATCGCGG ATCGCCCAGG CCTTACGCCA
GGTCTGATTG GCGGTATGCT GGCGGTTAGC ACCGGTTCTG GTTTTATCGG CGGGATCATT
GCCGGCTTCC TTGCCGGGTA TATGGCGAAG CTTATCAGTA CCAAACTGAA ACTTCCGCAA
AGTATGGAAG CGCTGAAGCC AATCCTGATC ATCCCGTTAA TTTCCAGTCT GGTGGTGGGG
CTGGCGATGA TTTACCTGAT CGGTAAACCG GTTGCCGGGA TTCTGGAAGG GCTCACCCAC
TGGCTGCAAA CCATGGGGAC CGCAAACGCG GTGCTGCTGG GCGCGATTCT CGGCGGGATG
ATGTGTACCG ACATGGGTGG CCCGGTGAAC AAAGCGGCGT ATGCGTTTGG CGTTGGTCTG
CTGAGTACGC AAACTTACGC GCCGATGGCG GCGATCATGG CCGCAGGTAT GGTACCGCCG
TTAGCGCTGG GTCTGGCAAC GATGGTGGCG CGTCGTAAGT TCGACAAAGC GCAGCAGGAA
GGCGGTAAAG CGGCGTTGGT ACTGGGGCTG TGCTTTATCA CTGAAGGCGC TATTCCGTTT
GCGGCGCGCG ACCCGATGCG TGTACTGCCG TGCTGTATCG TTGGCGGCGC GTTGACCGGG
GCTATTTCTA TGGCGGTAGG CGCAAAATTG ATGGCGCCGC ATGGCGGTCT GTTTGTTCTG
CTTATCCCAG GCGCAATTAC GCCGGTATTG GGATACCTGC TGGCAATTGT GGCCGGTACG
CTGGTGGCAG GACTGGCTTA TGCCGTCCTG AAACGTCCGG AGACGGAAGT CGCGGCAAAA
GCGGCATAA
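
The gene sequence is printed in blocks of 10 bases, 60 per row, so the whitespace has to be stripped before any analysis. The following is a minimal sketch of that cleanup plus a GC-content calculation. It assumes that GC content means the percentage of G and C bases over the full sequence; how the record rounds its reported value is not stated, so no comparison with the GC content field is made here. The helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied as displayed in the record.
first_row = "ATGAAAACGC TGCTGATAAT TGACGCTAAT CTCGGCCAGG CTCGCGCTTA CATGGCGAAA"
row_seq = clean_sequence(first_row)
print(len(row_seq))  # 60
print(round(gc_percent(row_seq), 1))
```

Passing the whole gene sequence through `clean_sequence` and `gc_percent` gives the value for the full CDS.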
 
Protein sequence
MKTLLIIDAN LGQARAYMAK TLLGAAAHKA NLEIIDNPND AELAIVLGES LPNDNALNGK 
KVWLGDIGRA VAHPELFLSE AKSHATLYSA PAAAAPAASG GPKRVVAVTA CPTGVAHTFM
AAEAIETEAK KRGWWVKVET RGSVGAGNAI TPEEVAEADL VIVAADIEVD LAKFAGLPMY
RTSTGLALKK TAQELDKAVA EATPYQPAGK ASQAATEGKK ESAGAYRHLL TGVSYMLPMV
VAGGLCIALS FAFGIEAFKV PDTLAAALMQ IGGGSAFALM VPVLAGYIAF SIADRPGLTP
GLIGGMLAVS TGSGFIGGII AGFLAGYMAK LISTKLKLPQ SMEALKPILI IPLISSLVVG
LAMIYLIGKP VAGILEGLTH WLQTMGTANA VLLGAILGGM MCTDMGGPVN KAAYAFGVGL
LSTQTYAPMA AIMAAGMVPP LALGLATMVA RRKFDKAQQE GGKAALVLGL CFITEGAIPF
AARDPMRVLP CCIVGGALTG AISMAVGAKL MAPHGGLFVL LIPGAITPVL GYLLAIVAGT
LVAGLAYAVL KRPETEVAAK AA
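
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a self-contained illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; since this gene begins with ATG, no special start-codon handling is included. The function name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in TCAG order (third base varies fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate_cds("ATGAAAACGC TGCTGATAAT TGACGCTAAT"))  # MKTLLIIDAN
print(translate_cds("GCGGCATAA"))                         # AA
```

The two outputs match the first ten and last two residues of the protein sequence above.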