Gene Information |
Locus tag | Hoch_1476 |
Symbol | |
ID | 8543858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 2000821 |
End bp | 2003406 |
Gene Length | 2586 bp |
Protein Length | 861 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646386187 |
Product | phosphoenolpyruvate-protein phosphotransferase |
Protein accession | YP_003265922 |
Protein GI | 262194713 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) |
TIGRFAM ID | [TIGR00830] PTS system, glucose subfamily, IIA component; [TIGR01003] Phosphotransferase System HPr (HPr) Family; [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
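The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2000821
end_bp = 2003406
reported_gene_length_bp = 2586
reported_protein_length_aa = 861

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print("gene length (bp):", gene_length_bp)
print("protein length (aa):", protein_length_aa)
print("matches record:",
      gene_length_bp == reported_gene_length_bp
      and protein_length_aa == reported_protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed above.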
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.287759 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATGC AAGCGCAGCA ACACGCTCTG CGAGCGCCGC TCTCGGGCAC CCTGCTGGCG CTCGAGCAGG TACCCGACCC GGTCTTCGCC CAGCGCATGA TCGGCGACGG CGTCGCGCTC GAGCCCGAGA GCAACGCGCT GCTCGCGCCC TGCGATGGCA CGGTCGCGCA CCTGCACGAG GCCTACCACG CGATCACGCT GGAGACCGAG ACCGGTCTGC AAGTGCTGCT GCACATCGGC ATCGACACCG TGCAGCTCGC GGGCGAGGGC TTCTTGCCGC ACGTGGCCGC GGGCCAGGCG GTCAAGACCG GCGATGTGCT GATCGAGTTC GACCCCGACT ACCTGCGCGC GCACGCCCGC AGCCTGCTGA CCATGATGCT CATCGTCGAT ATGGACGGAG TGTCCGAAAT GCAGGCGGCG ACCGGCCAGG TGCGCGCTGG TGAGGACGTG GCGCTGCGCC TGCAGACGCG AGCGGGCGCG GCTCGAGCGC TGGCCTCGGC CGCCGAAGAG CGGGTCTCGG CGGCGATCGA GGTCACCGAT CCCGCCGGCT TGCACGCGCG TCCGGCGGCC GTGCTGGCCA GCGTCGCGCG CCGCTACCAG GCCGCGATCG CGCTGTGCAA GGGCGACCGC GAGGCCAACG CCAAGAGCGT GCTCAGCGTG ATGCAGCTCG ACGTGCGCCA CGGCGATCAG GTGCTGCTGC GCGCGCGCGG CAGCGACGCC GAGTCCGCGC TGGGCGACAT CGCCGCGCAG CTCGAAGAGG CGCTGCGCGC GGCCGAGAGC GACGCGGGCG CGGTGTTGGC AACCGCGGCG CCAGCGGCCG AGGCGCTCAC TGCCGATCCC GGACAGTTGG ACGAATCGAC CGGCGCGCTG GCCGGCGTCA CGGCCTCGCC GGGCTTGGCC CTGGGCAGCG CCTTTCGCCT GCGCCGGCAG CGCATCGACT GGCCCGAGAC CGCGCCCGAT CCCCAGGCCG CGCGGCGCGA GCTGGAGCGC GCCATCGAGC GCGCCCAGGC GGCCCTGCAG GCCATGCAAG GGCGGCTCCG CGGACAGGGC GAGCAGGCGC CCGCCGATAT TTTGGCCGCG CAGCAGGAGC TCTTGGGCGA CCCGGACATC GCCGAGATGG CGTATCGCAG CATCGGCCAC GGCAAGAGCG CGCCGAGCGC GTGGCACGCG GCGATCGAGC ACCACAGCGG GCGTCTGGCC GCGCTCGACA ACGAGCTGCT GGCGGCCCGT GCCAGCGATC TGCGCGACGT CGGCGAGCGC GTGCTGCGCC AGCTCGTGGG CCAGTCCGAG GCCGCGCTCG ATGTCCCCGC GGGTTCGATC CTCATCGCCG AGGAGCTCAC GCCCTCGCAG GCGTCGCTGC TCGATGGCGG CGCGGTGCGC GGCTTCTGCA CCACCATGGG CACGTCCAAC TCGCACGTGG CCATCGTCGC GCGTTCGCTC GGTATCCCGG CCCTGGTCGG CATGGACGCG CGCATCCTGA GCGTGGCCGA TGGCACGCCG GTGCTGGTCG ACGCCGAGCG CGGCGCCGTG CACATCGCGC CCTCGGACGA GCAGCGCGCG CGCACCGAGG AGCGCATCGC GGCGGCCCGG GCCGAGCTGG CGGCGGCGAG CGCGCACGCG CTCGAACCCG CGCGCACCAG CGACGGCCAC GCCATCGCCG TGCTCGCCAA CCTCAGCCAC ACCGGCGAGA TCGAGCACCT ACGCATCCAC GGCGCCGAGG GCGTGGGTCT GCTGCGCTCG GAGTTTCTGT TCGCCGAACG CAGCAGCGCG CCCGATGAGG ACGAGCAGAG TCGCGTCTAC 
GGCGAGATCG CCGGGGCGCT GGGCGCCGGG CAGGCGCTGG TGATTCGCAC TCTGGACGTC GGCGGCGATA AACCGCTCGC CTATCTGCCG CTGCCGCGCG AGGACAATCC ATTTCTCGGC GAGCGCGGGA TTCGCGTGGG CCTCAACCAG CCGCAGATCC TGCGCACGCA GCTCCGCGCG GTGCTGCGGG CGGCGCGCGC GGCCGCGCCG GGCGGGCCCG AGCTGCGGGT GATGTTCCCG ATGATCGCGA CGCTCGAGGA GTGGCGGCAG GCAAAGGCGA TCTTCGACGA CGAGCGCGCC ACGCTGGCGG CCGAGAGCGA GGGCGGGGCG GAGCTGCCGG CGATTTTGCT CGGCATCATG GTCGAGGTGC CCGCGGTCGC GCTGCTGGCC GAGCAGTTCG CGCGCGAGGT CGATTTCTTC TCCATCGGCA CCAACGACCT GGCCCAGTAC ACGCTGGCCA TGGATCGCGG CAACCCGGCC TTTGCCGCGC AGCTCGACGG CCTCTCGCCC GCGCTGGTGC AGCTCGTGGC CAGGACCGTC GCGGGCGCGC GCGCGCACGG CAAGCGCGTG AGCGTATGCG GCAACCTGGC CAGCGACCTG CGCGCGGTGC CGATTCTCAT CGGCCTCGGC GTCGACGCCT TGAGCGTGGA CCTCCGCTCC ATTCCGCGCC TCAAGCAGGC GATTCGCAGC ACCGAGCTGG CCGCGTGCGA GGCCATGGCC AACGCGGCCC TGCAAGCCGG CACCAGCGCC GAGATCCACG CGCTGAGCGG CGAGCGCGAG GCGTAG
|
Protein sequence | MNMQAQQHAL RAPLSGTLLA LEQVPDPVFA QRMIGDGVAL EPESNALLAP CDGTVAHLHE AYHAITLETE TGLQVLLHIG IDTVQLAGEG FLPHVAAGQA VKTGDVLIEF DPDYLRAHAR SLLTMMLIVD MDGVSEMQAA TGQVRAGEDV ALRLQTRAGA ARALASAAEE RVSAAIEVTD PAGLHARPAA VLASVARRYQ AAIALCKGDR EANAKSVLSV MQLDVRHGDQ VLLRARGSDA ESALGDIAAQ LEEALRAAES DAGAVLATAA PAAEALTADP GQLDESTGAL AGVTASPGLA LGSAFRLRRQ RIDWPETAPD PQAARRELER AIERAQAALQ AMQGRLRGQG EQAPADILAA QQELLGDPDI AEMAYRSIGH GKSAPSAWHA AIEHHSGRLA ALDNELLAAR ASDLRDVGER VLRQLVGQSE AALDVPAGSI LIAEELTPSQ ASLLDGGAVR GFCTTMGTSN SHVAIVARSL GIPALVGMDA RILSVADGTP VLVDAERGAV HIAPSDEQRA RTEERIAAAR AELAAASAHA LEPARTSDGH AIAVLANLSH TGEIEHLRIH GAEGVGLLRS EFLFAERSSA PDEDEQSRVY GEIAGALGAG QALVIRTLDV GGDKPLAYLP LPREDNPFLG ERGIRVGLNQ PQILRTQLRA VLRAARAAAP GGPELRVMFP MIATLEEWRQ AKAIFDDERA TLAAESEGGA ELPAILLGIM VEVPAVALLA EQFAREVDFF SIGTNDLAQY TLAMDRGNPA FAAQLDGLSP ALVQLVARTV AGARAHGKRV SVCGNLASDL RAVPILIGLG VDALSVDLRS IPRLKQAIRS TELAACEAMA NAALQAGTSA EIHALSGERE A
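The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first three blocks of the gene sequence, so its GC fraction is not expected to equal the whole-gene value reported in the record.

```python
def clean_sequence(grouped: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGAACATGC AAGCGCAGCA ACACGCTCTG"

seq = clean_sequence(first_blocks)
print("length:", len(seq))
print("starts with ATG:", seq.startswith("ATG"))
print("GC fraction:", round(gc_fraction(seq), 3))
```

Applying the same two helpers to the full gene sequence would give a value that can be compared against the GC content field.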