Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4712 |
Symbol | |
ID | 5736555 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 6018411 |
End bp | 6020888 |
Gene Length | 2478 bp |
Protein Length | 825 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641281876 |
Product | phosphoenolpyruvate-protein phosphotransferase |
Protein accession | YP_001547471 |
Protein GI | 159901224 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) [COG1925] Phosphotransferase system, HPr-related proteins [COG4668] Mannitol/fructose-specific phosphotransferase system, IIA domain |
TIGRFAM ID | [TIGR01003] Phosphotransferase System HPr (HPr) Family [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.846513 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATCGAGC TACAGCAGCG TCATATCCAA CTCAAGAGTC AGCCCGCCAA CAAGCAAGCG GCAATTCAAC AAGCAGGCCA GCTTCTGGTT GCCAATGGGC AGATCGAGTC TGGCTATATT CAAAGTATGC TCCAACGCGA AGCCCTTTCG AATACTTATC TTGGCAATGG CATCGCAATT CCTCATGGTT TGCCCGAAGC CCGCGATCTG ATTCGCCAAA CCGGAATTGT GGTACTGCAA GTGCCCAACG GGGTGCAGTG GAATCCAGGT GAAACTGTGC ACTTAATCGT TGGCATCGCC GCTCGCTCCG ATGAACATAT TGCAATTTTG CGCCAACTTA CCCGCGTGCT TGGCGATAAA ACCTTAGTTG ACCAACTCAC CCACACCACC AATCCTGATG ATCTGATTCG CGCATTGACT GGCGCTGAAA CGGCTCCCGC AGCCTCAGCG GCTCCAGCTA TCAGTGTGGC GGTTGCCAGT GACCAGCCCT TTTTTGAAAC CAAAGTTTTG AATCCCACTG GCTTTCATGC TCGCCCGGCC ACAACCTTTG TTGATTGCGC TAAGCGTTTT CAAGCGGAAG TGCGCATACG TTATGGCTCG CGTGAGGCCA ATGGCAAAAG CTTAATCTCT ATTTTGCAAC TTGGCATTCC CCATGGCGCG ACGATTCAGG TCACGGCTCA AGGTGCTGAT GCCGCTGCGG CGCTACGTGG TTTACAACAG GCATTGGCCC AAGGCTTAGC TGATGAAACA GCGGAAACGC TTCCAACACG CGCTGCAGTC GATGTGCGCT GGACACCCAA GGCCGTTGAG CAGACGATCA ATGGCATTAG CGCCGCGCCA GGCTTGGCGA TTGGCTTGCT CCGCCGTTAT AGCCATAGCG AGTTGGTGAT CGAAGATCGG CCAAGTGACC CGATGGTTGA GGTCAATCGC TTTGAGCTGG CATTGGCCGC TGCCCAAGCC GAGCTGTCAA CGCTCTATGA TGAAGTGCAA GCGCGAATTG GCTCAGGCAA GGCGGCGATT TTCCGCGTGC ATAGCGAAAT GTTGAGCGAT ACCAGCCTTG TGCAACAAAC CGTCTCGTTG TTATTTGAAA AACATAGCGC TGAATGGGCT TGGCATCAGG TGATCAGCGG GCGCGTGGCC CAAATCGAAA AACTTGACGA TCAAGTGCTA GCTGGCCGTG CGGTTGATCT GAGTGATGTT GGTCAGCGGG TGTTACGCCA GTTGATCGGC GGCGATCATC AACGGCCTTT GAATGCAGCC ACGCCAGTGA TTATTTTGGC CGACGATTTG ACTCCCTCAG ATACCGCCGC CTTTGACCCC GACACTATTT TGGGCTTTGG CACAGTACGC GGCGGCCCAA CTTCGCACAC CGCAATTTTA GCGCGTTCGC TGGGAATTCC AGCAATCGTT GGGGCTGGCG AAGGCTTGTT GAATTTGCCC GAAGAAGCAA TTGCAATTCT CGATGGCTAC AACGGCAAAT TGTATCTCAA CCCAAGTAGT GCCGATATCG AGGCCGCGAC CAACTTGCAA GCCGAATTGG CCGAACAACA CGAGCGTGCT CAAGCTACGC GCTTCGAGCC AGCCCAAACC AGCGATGGCC ATCGAATCGA AATTGCTGCC AACATCAATC GCGTTGCTGA TGCGAGTGTT GCCGCCGCTG CGGGAGCCGA AGGCGTGGGC TTGATGCGCA CCGAATTTCT GTTCCTTGAG CGCGATAGTG CTCCCAGCGA GGAAGAACAA TTCGAGGCCT ATCGCGATAT GGTGCAAGCG ATGGCGGGCC ACTCGGTGAT TATTCGCACG CTGGATATTG GCGGCGATAA AGTTGTGCCC TATCTCGATT TGCCCAAAGA AGATAATTCG TTCTTGGGCA TTCGCGGGAT TCGACTCTGC CTAGCCCGAC CTGAATTATT TATCCCACAG CTCCGTGCGA TTTATCGCGC CGCCGCTTTT GGGCCGCTCA AAATTATGTT CCCAATGATT GCCACCTTGG AAGATTGGTA CGCCGCCCGC GATTTGGCTG AGCAAGTGCG CCGCGAACTC GACGCGCCTC AAGTGCCACT CGGCATTATG GTCGAAGTGC CATCGGCGGC AGTGCTCGCC GAACAATTTG CCCAAGAAGT CGATTTCTTC TCGATTGGCA CCAACGATTT GACCCAATAC ACCCTAGCAA TGGATCGTTT ACACCCACAA TTAGCTAGCC AAGCCGATGG CCTGCATCCA GCGGTGCTGC GAATGATCGA CCTGACCGCC CACGCCGCCA ACGCCCACAA CAAATGGGTT GGTGTATGTG GTGGCATTGC CGCCGATGCC CGTGGTTCCT TGATTTTGGT GGGCTTGGGC GTGCATGAAC TCAGCGTCAG CGTACCAGCC ATCGCCGAAC TCAAAGCTGC CATTCGTCAA CATAGCTTGG CCGATTTACA AGCACTCGCG CAACGTGCCT TGGCATGTCG CAGTGCTGCC GAGGTACATC AATTATGA
|
Protein sequence | MIELQQRHIQ LKSQPANKQA AIQQAGQLLV ANGQIESGYI QSMLQREALS NTYLGNGIAI PHGLPEARDL IRQTGIVVLQ VPNGVQWNPG ETVHLIVGIA ARSDEHIAIL RQLTRVLGDK TLVDQLTHTT NPDDLIRALT GAETAPAASA APAISVAVAS DQPFFETKVL NPTGFHARPA TTFVDCAKRF QAEVRIRYGS REANGKSLIS ILQLGIPHGA TIQVTAQGAD AAAALRGLQQ ALAQGLADET AETLPTRAAV DVRWTPKAVE QTINGISAAP GLAIGLLRRY SHSELVIEDR PSDPMVEVNR FELALAAAQA ELSTLYDEVQ ARIGSGKAAI FRVHSEMLSD TSLVQQTVSL LFEKHSAEWA WHQVISGRVA QIEKLDDQVL AGRAVDLSDV GQRVLRQLIG GDHQRPLNAA TPVIILADDL TPSDTAAFDP DTILGFGTVR GGPTSHTAIL ARSLGIPAIV GAGEGLLNLP EEAIAILDGY NGKLYLNPSS ADIEAATNLQ AELAEQHERA QATRFEPAQT SDGHRIEIAA NINRVADASV AAAAGAEGVG LMRTEFLFLE RDSAPSEEEQ FEAYRDMVQA MAGHSVIIRT LDIGGDKVVP YLDLPKEDNS FLGIRGIRLC LARPELFIPQ LRAIYRAAAF GPLKIMFPMI ATLEDWYAAR DLAEQVRREL DAPQVPLGIM VEVPSAAVLA EQFAQEVDFF SIGTNDLTQY TLAMDRLHPQ LASQADGLHP AVLRMIDLTA HAANAHNKWV GVCGGIAADA RGSLILVGLG VHELSVSVPA IAELKAAIRQ HSLADLQALA QRALACRSAA EVHQL
|
| |