Gene SeD_A2547 details

Gene Information

Locus tag: SeD_A2547
Symbol:
ID: 6873982
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 2425709
End bp: 2427031
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 51%
IMG OID: 642785622
Product: regulatory protein UhpC
Protein accession: YP_002216280
Protein GI: 198245195
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
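
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (consistent with the stated gene length) and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Start bp and End bp fields of this record.
start_bp = 2425709
end_bp = 2427031

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1323 440
```

Both results agree with the Gene Length (1323 bp) and Protein Length (440 aa) fields.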


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value: 0.0466951
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTATAA AAACATTTAT TTTCAGCCAG CCCGATAAGC CGATTGTAAA AGAGAAAAAA 
GAGATAGACC AGACCTATAA AAAATTTCGT TTTGAGATTA TCGCCTCGGT CTTTATCTCT
TATGCCGTCT TTTATCTTAC CCGTAAAAAC TTCTCCGCCG CGATGCCGGC CATGCTGACG
GAAACGTCGC TGACCGCAGA AGACTTCGCC ATTATGTCTT CCATTTTTTA CATTCTTTAT
GGCGCGATGA AGTTTGTCGG CGGGATGCTG GTCGATAAAA TTAACCCGAA AGCTATGACC
GGCCCGGTGC TGATTGGCGT GGGGATTGTA AATATTCTGT TCGGCTTTTC CGATAGTGTG
GCGGCATTCT ACGTACTGTA CAGCCTCAAC GCGATCTTAC AGGGCACCAG CTTTCCGCCG
ATGGCGAAAA TTATGGCCTC GTGGTTTTCG AAAAACGAAC GGGGACGCTG GTGGGCTATC
GTTGAAGCGG CGCACAATAT CGGCGGCAGC CTCGCGCCGC TGCTGACCAG CTTTGCTATC
GCCTTTAGCG GTAGCTGGAA AATGGGGTTT TATGTTCCCG GCGCCATTTC GCTGCTGATG
GGGATAGTGG CGCTATTTAC CATTAAAGAT CGTCCCGGTA CGTTAGGTTT GCCCAACGTA
GGGCAGTGGC GTAACGACCC GACGGAACTG GCCCAGGTCA AGGCCAGCCC GGTCAACCTG
AGCTTCTGGC AGATTTTTGT GAAATATATC CTGACCAATC CACTGGTATG GATCATTATT
ATCGGTGATA TGTCGGTTTA TATTGCGCGC ACCATCCTTA ACGACTGGCC GCAGATTTAC
TATTCGCAGG TTCACGGCTG GAGCCTGATA AAAGCGAACT CGATTATTTC CTGGTTTGAG
GCGGGCGGAC TGGCGGGTGG GTTGCTGGCA GGCTACTTGT CTGACTTTAT GTTCAAAAGT
AACCGCTGGA TGACCGGATT AATCTTCGCG TTGGCGCTGT GCATATGCAT CGTGCTGGTG
CCGCTGGTTC AGGATACCTC TTACACCCTC ACCGCGATTC TGTTCACCAT CATGGGCTTC
GCCTTATACG GACCGCATAT GCTTTTTGCC GTCGGCTGTC TGGATGTGAC CCATAAGGAT
GCGGCGGGAT CGATTACCGG CTTTCGGGGA TTGTTCAGCT ATGTCGGCGC GGCAATGGCC
GGTGTGCCGG TAATTATGGT GAAAAATAGC TGGGCGTGGT CGGGCGTTTA TATCTATGCG
GTGATCGCCA TTCTGCTAAC GACTCTGTCG CTGGCGCTGC TCTCCAGGCT GCATCGGTTA
TAA
 
Protein sequence
MSIKTFIFSQ PDKPIVKEKK EIDQTYKKFR FEIIASVFIS YAVFYLTRKN FSAAMPAMLT 
ETSLTAEDFA IMSSIFYILY GAMKFVGGML VDKINPKAMT GPVLIGVGIV NILFGFSDSV
AAFYVLYSLN AILQGTSFPP MAKIMASWFS KNERGRWWAI VEAAHNIGGS LAPLLTSFAI
AFSGSWKMGF YVPGAISLLM GIVALFTIKD RPGTLGLPNV GQWRNDPTEL AQVKASPVNL
SFWQIFVKYI LTNPLVWIII IGDMSVYIAR TILNDWPQIY YSQVHGWSLI KANSIISWFE
AGGLAGGLLA GYLSDFMFKS NRWMTGLIFA LALCICIVLV PLVQDTSYTL TAILFTIMGF
ALYGPHMLFA VGCLDVTHKD AAGSITGFRG LFSYVGAAMA GVPVIMVKNS WAWSGVYIYA
VIAILLTTLS LALLSRLHRL
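
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It uses only the Python standard library and assumes the codon assignments of NCBI translation table 11, which match the standard genetic code for amino acids (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Amino acids for NCBI translation table 11, listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    # Remove the spaces and newlines used to group the sequence in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGAGTATAA AAACATTTAT TTTCAGCCAG CCCGATAAGC CGATTGTAAA AGAGAAAAAA"
print(translate(first_line))  # MSIKTFIFSQPDKPIVKEKK
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same function should reproduce the remaining residues up to the terminal TAA stop codon.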