Gene HS_1097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1097 
SymbolptsI 
ID4240597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1230350 
End bp1232077 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content37% 
IMG OID638104659 
Productphosphoenolpyruvate-protein phosphotransferase 
Protein accessionYP_719309 
Protein GI113461240 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0431339 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTCAG GTATTACAGC ATCACCAGGT ATTGTATTTG GTAAGGCACT GGTGCTTAAA 
GAAGAAAAGA TTGTTCTTGA TACGCAAAAA ATAAAAGATG AGCAAATAGA CATTGAAGTT
GCTCGTTTTT ATAGCGGGCG TGCGGCAGCT ATAGAGCAAT TGATTGCGAT TAAAGATCGT
GCTTTAGTCT CTTTAGGTGA GGAAAAAGCT GCAATTTTTG AAGGACATTT AATGATCCTT
GAAGATGAAG AACTGGAAGA AGAAATTTTA GATTACTTAC GTTCAAATAA GGTCAATGCA
GGCGTTGCTG CAAGTAAGAT TATTGATCAA CAAGTTGCTA TGTTGTCTGA AATTGACGAT
GAATACCTAC AAGAGCGTGC CGGAGACATT CGTGATATTG GTAATCGTTT GATTAAAAAT
ATTTTAGGTA TGAAAATCGT TGATTTAGGG GATATTAATG AAGAGGCTAT TTTGGTTGCT
TATGATTTAA CCCCTTCTGA AACAGCACAA TTAAATTTGG ATAAAGTATT AGGCTTCATT
ACAGACATTG GTGGGCGAAC ATCACATACT TCTATTATGG CTCGTTCTCT TGAATTGCCG
GCTATTGTAG GAACCAATGA TATCACTTCA AAAGTCAATA CTGGTGATTA TCTCGTTCTT
GATGCGGTTA ATAATGCGAT TTATGTTAAT CCGACACAAG AAGAAATTGA GCGTTTGAAA
GCTTTAGAAC GACAATTAGC CGAAGAAAAA GCGGAATTAG CTAAATTGAA AGATTTGCCA
GCATTGACTT TAGACGGACA TCAAGTAGAT GTGGTAGCTA ATATTGGCAC TATTCGTGAT
TGTGAGGGGG CTGATCGTAA TGGTGCTGAA GGCGTCGGAT TATATCGGAC AGAATTCTTA
TTTATGGATC GTGATCAGTT ACCAACAGAA GAAGAACAAT TTATTGCTTA TAAAGAAGTT
GTTGAAGCAA TGAATGGTCG TTTAGTTGTG TTACGTACAA TGGATATTGG TGGAGATAAA
GATTTACCGT ATTTGAATTT ACCTAAAGAA ATGAATCCTT TCTTAGGGTG GCGTGCGATT
CGAATTGGTA TGGATCGTCG TGAAATTTTA CATGCTCAAT TACGTGCAGT ATTGCGTGCG
TCAGCTTTTG GTAAATTAGC TGTTATGTTC CCGATGATTA TTTCTGTTGA AGAAATTCGA
GAATTGAAAT CTGTGATTGA AAGTTTGAAA CAAGAATTAC GTGATGAGGG TAAGGCTTTT
GATGAGAATC TTCAAGTTGG GGTAATGGTT GAAACACCGG CGGCAGCAAT AAATGCAAAA
TTTTTAGCAA AAGAAGTCGA TTTTTTTAGT ATCGGGACGA ATGATTTAAC CCAATATACT
TTAGCGGTTG ATCGTGGGAA TGAATTAATT TCACATTTAT ACAATCCGAT GACACCGGCT
GTATTAAGCT TAATTAAGCA TGTGATAGAT GCATCTCATG AGGAAGGTAA ATGGACTGGA
ATGTGCGGTG AGTTAGCAGG CGATGAAAAT GCCACATTGT TATTGTTAGG TATGGGATTA
GATGAATTTA GTATGAGTGC AATTTCGATA CCTCGTATTA AAAAATTGAT TCGCAATGTG
AATTATCAGG ATGCTAAATT ACTTGCTGAG CAAGCATTAC AGCAACCAAC TGCGGCAGGT
ATTTTGAGCT TAGTTAATGA TTTTTTAGTG GAAAAAGCAC TTAATTAA
 
Protein sequence
MISGITASPG IVFGKALVLK EEKIVLDTQK IKDEQIDIEV ARFYSGRAAA IEQLIAIKDR 
ALVSLGEEKA AIFEGHLMIL EDEELEEEIL DYLRSNKVNA GVAASKIIDQ QVAMLSEIDD
EYLQERAGDI RDIGNRLIKN ILGMKIVDLG DINEEAILVA YDLTPSETAQ LNLDKVLGFI
TDIGGRTSHT SIMARSLELP AIVGTNDITS KVNTGDYLVL DAVNNAIYVN PTQEEIERLK
ALERQLAEEK AELAKLKDLP ALTLDGHQVD VVANIGTIRD CEGADRNGAE GVGLYRTEFL
FMDRDQLPTE EEQFIAYKEV VEAMNGRLVV LRTMDIGGDK DLPYLNLPKE MNPFLGWRAI
RIGMDRREIL HAQLRAVLRA SAFGKLAVMF PMIISVEEIR ELKSVIESLK QELRDEGKAF
DENLQVGVMV ETPAAAINAK FLAKEVDFFS IGTNDLTQYT LAVDRGNELI SHLYNPMTPA
VLSLIKHVID ASHEEGKWTG MCGELAGDEN ATLLLLGMGL DEFSMSAISI PRIKKLIRNV
NYQDAKLLAE QALQQPTAAG ILSLVNDFLV EKALN