Gene SeD_A2798 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2798 
SymbolptsI 
ID6872121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2671136 
End bp2672863 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content52% 
IMG OID642785852 
Productphosphoenolpyruvate-protein phosphotransferase 
Protein accessionYP_002216502 
Protein GI198244707 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.620926 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones94 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTCAG GCATTTTAGC ATCCCCGGGT ATCGCTTTCG GCAAAGCACT GCTGCTGAAA 
GAAGACGAAA TCGTCATTGA CCGGAAAAAA ATTTCTGCCG ACAAGGTTGA TCAGGAAGTT
GAACGTTTTC TGAGCGGTCG TGCCAAGGCA TCTGCGCAAC TGGAAGCGAT CAAGACAAAA
GCTGGTGAAA CGTTCGGTGA AGAAAAAGAA GCCATCTTTG AAGGGCATAT TATGCTGCTC
GAAGATGAGG AGCTGGAGCA GGAAATCATA GCCCTGATTA AAGATAAGCA CATGACGGCT
GACGCAGCCG CACATGAAGT TATCGAAGGT CAGGCCACTG CCCTGGAAGA ACTGGATGAT
GAATACCTGA AAGAGCGCGC GGCTGACGTA CGTGATATCG GTAAGCGCCT GCTGCGCAAC
ATCCTGGGTC TGGCCATTAT CGATCTGAGC GCGATTCAGG AAGAAGTTAT CCTGGTTGCC
TCTGACTTGA CGCCGTCAGA AACCGCGCAG CTGAACCTGC AGAAAGTGCT GGGCTTCATC
ACCGACGCGG GCGGACGCAC GTCTCATACC TCCATCATGG CGCGTTCTCT GGAACTGCCA
GCGATCGTAG GTACCGGTAG CGTCACCGCT CAGGTGAAAA ACGGCGACTA TCTGATTCTG
GATGCCGTAA ACAACCAGGT TTACGTCAAC CCGACCAACG ATGTTATTGA GCAACTGCGC
GCCGTCCAGG AGCAGGTTGC GACCGAGAAA GCGGAACTCG CAAAACTGAA AGATCTGCCG
GCAATCACGC TGGATGGACA TCAGGTTGAA GTTTGCGCCA ACATCGGTAC CGTTCGTGAC
GTTGAAGGCG CTGAGCGTAA CGGCGCGGAA GGCGTTGGTC TGTATCGTAC TGAATTCCTG
TTCATGGATC GCGACGCGCT GCCGACGGAA GAAGAGCAGT TTGCCGCCTA TAAAGCGGTC
GCTGAAGCGT GCGGCTCGCA GGCGGTTATC GTCCGTACCA TGGACATTGG CGGCGACAAA
GAGCTGCCGT ACATGAACTT CCCGAAAGAA GAGAACCCGT TCCTGGGCTG GCGCGCCGTG
CGTATCGCCA TGGATCGCAA AGAGATCCTG CGTGACCAGG TTCGCGCGAT TCTGCGTGCC
TCCGCTTTCG GTAAATTGCG CATTATGTTC CCGATGATCA TCTCTGTTGA AGAAGTTCGC
GCGCTGCGCA AAGAGATTGA AATCTACAAA CAGGAACTGC GTGACGAAGG TAAAGCATTT
GACGAAAGCA TTGAGATTGG CGTGATGGTG GAAACACCGG CTGCGGCGAC AATTGCGCGT
CATTTAGCCA AAGAAGTTGA TTTCTTTAGT ATCGGCACCA ATGATTTAAC GCAGTACACC
CTGGCAGTTG ACCGTGGTAA TGATATGATT TCACACCTTT ACCAGCCAAT GTCACCGTCC
GTACTGAACT TGATCAAGCA AGTTATTGAT GCTTCTCATG CAGAAGGTAA ATGGACTGGC
ATGTGTGGTG AGCTTGCAGG CGACGAACGT GCTACACTTC TGTTGCTGGG GATGGGTCTG
GACGAATTCT CTATGAGCGC CATTTCTATC CCGCGCATTA AGAAGATTAT CCGTAACACG
AACTTCGAAG ATGCGAAGGT GTTAGCAGAG CAGGCTCTTG CTCAACCGAC AACGGACGAG
TTAATGACGC TGGTTAACAA GTTCATTGAA GAAAAAACAA TCTGCTAA
 
Protein sequence
MISGILASPG IAFGKALLLK EDEIVIDRKK ISADKVDQEV ERFLSGRAKA SAQLEAIKTK 
AGETFGEEKE AIFEGHIMLL EDEELEQEII ALIKDKHMTA DAAAHEVIEG QATALEELDD
EYLKERAADV RDIGKRLLRN ILGLAIIDLS AIQEEVILVA SDLTPSETAQ LNLQKVLGFI
TDAGGRTSHT SIMARSLELP AIVGTGSVTA QVKNGDYLIL DAVNNQVYVN PTNDVIEQLR
AVQEQVATEK AELAKLKDLP AITLDGHQVE VCANIGTVRD VEGAERNGAE GVGLYRTEFL
FMDRDALPTE EEQFAAYKAV AEACGSQAVI VRTMDIGGDK ELPYMNFPKE ENPFLGWRAV
RIAMDRKEIL RDQVRAILRA SAFGKLRIMF PMIISVEEVR ALRKEIEIYK QELRDEGKAF
DESIEIGVMV ETPAAATIAR HLAKEVDFFS IGTNDLTQYT LAVDRGNDMI SHLYQPMSPS
VLNLIKQVID ASHAEGKWTG MCGELAGDER ATLLLLGMGL DEFSMSAISI PRIKKIIRNT
NFEDAKVLAE QALAQPTTDE LMTLVNKFIE EKTIC