Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_3918 |
Symbol | |
ID | 8734375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 4159238 |
End bp | 4161943 |
Gene Length | 2706 bp |
Protein Length | 901 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | 646504542 |
Product | phosphoenolpyruvate-protein phosphotransferase |
Protein accession | YP_003395710 |
Protein GI | 284045370 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) |
TIGRFAM ID | [TIGR01003] Phosphotransferase System HPr (HPr) Family [TIGR01417] phosphoenolpyruvate-protein phosphotransferase [TIGR02364] dihydroxyacetone kinase, phosphotransfer subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.644077 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTCGGGA TCGTCATCGT CTCGCACAGC GCGCGGCTCG CCGAGGGCGT CGTAGAGCTG GCGCGCGAGA TGGCCGCCGA CGTGCCGCTC GTCGCGGCGG GCGGCCTGGA GCCGCCGGCC GAGGGCGAGC CGGCGCCGCT CGGCACCGAC GCCGCGCGGG TGATGGCGGC GGTCGAGGAG GCCGCCGCGG CCGGCGACGG GGTCCTGGTC CTGATGGACC TCGGCTCGGC CGTGCTGAGC GCCGAGATGG CCGTCGAGCT GTTGGATGAG GCCGTTGCCG CGCAGGTGCG CCTCGTCCCC GCCCCGTTGG TCGAGGGCGC CGTCGCCGCC GCCGTCACCG CCCAGGCCGG CGGCTCGCTT GACGCGGTCG CCGAGGAGGC CCGCGGCGGG TTGACCGCCA AGGCCGCGCA CCTCGGCGAG GCGGCCGGCG ACGAGTCCGA GCCGCGGTCG TTTGCTGGTC CCACAGACCA GCAATCGACC GCGCCTCCCG CCGACGCGGT CGAGGACCGC TTCGTCGTGA CGGTCGCGCA GGGGCTGCAC GCGCGGCCGG CCGCGCGCTT CGTGCGGACC GCGGCCGCGC TCGACGCGCG CGTCGAGGTG GAGAACGGGA CGACCGGCGC GGGTCCCGTT TCAGCCGGCT CGCTCAACTC GATCGCGACG CTCGGCGTGC GCGAGGGCCA CGAGCTGGTC GTCCGCGCCT CCGGCCCCGA CGCCCGCCGG GCGCTGGAGC AGCTGCGCGC CGTCGCGACC GACGCCGCCG CCCCGGTCTC GGCCGGCCGC GCGCCGGCGC CGACGATCGG CGCCCCGAAC ACGCTCGCGC CGCCGGACGG CGCGGGCGCG GGCATGGCGG CGCCGCCGGG CTCCCCTCCG GGCACGCTCG CCGGGATCGC GAGCTCGCCG GGCGTGGCGC TTGGGCCACT GCGGCCGATC GCCGCCGAGG CGGCCGAGCC ACCGCCGATC GACGACGCGC CGAGCGGCAC GCCGGAGGAG GAGTGGGCGG CGCTCGCGGC CGCACGCGCG GCCGTGCAGG CCGAGATCGG CGAGCGGCGC GAGCGGCTGG CCGCGCAGGT CGGCGAGGAG GAGGCCGAGA TCCTCGACGC GCTCGGGCTC GCGCTCGACG ACGAGGCGCT GCTCGACCCG GCGAGAGCGG CGATCTTCGA GCGGCGCAGC AGCGCCGCGC GCGCATGGGC CGACGCCGTC GAGGGGATCA CGGCCCGCTA CCGCGCCCTC GACGACGCCT ACCAGCGCGA GCGCGCCGGC GACCTCGCCG ACGTCGGCCG CCGCGTGCTC GCCGCACTGG CAGCGGGTGC GAGCGGCACG GCCGGAGCGA GCGGCGTGGC CGGCGCAGCC GGCGCGAGCG GCGCAGCCGG CGCGAGCGGC GCAGCCGGCG CGAGCGGCGC AGCCGGAGCG GCCGGCGCCG GTGGCGCAGC CGGCGCCGCC TCCGCGGCCG GCATCCTCGT CGCGCGGGAG CTGACGCCGC TCGACGCGGC CGGGCTCGAC CGCGACGCGG TCAGCGGGAT CGCGACCGCC GAGGGCGGGC CGACCTCGCA CGGGGCGATC CTCGCCCGCG CGCTCGGGGT CCCCGCCGTC GTCGGGCTCG GCGCGGCGCT GCTCGACCTG CCGGCGGGCA CGCCGGCGGC GCTCGACGGC GACCGCGGGC TGCTCGTCCC CTCCCCCGCG CCCGACGTCG CGCGCGAGTA CGCCGAGCGC CGCGCGCGCG AGGCCGCGCT CGCCGACCAG GCGCGCGCCG CGGCGCACCG TCCGGCGGCG ACGCGCGACG GGATCCGCAT CGAGGTCGCG GCGAACGCGG GCGACGCCGG CGACGCCGTG GAGGCGGCCG CCGTCGGCGC CGACGGCGTC GGGCTGCTGC GGACCGAGTT CGCCTTCCTC GACCGCGACG GCGCCCCCAG CGAGGACGAG CAGGCGGCGA TCTACGGCGC CGCCGCTGCG GCGCTCGACG GCCGGCCGCT CGTGATCCGC ACGCTCGACG CCGGCGCCGA CAAGCCGCTC CCCTACCTCG GCATGCCACC CGAGCAGAAC CCCTTCCTCG GCGTCCGCGG CGTGCGGCTC GGGCTCGCCC GGCCGCAGCT GCTCGCAACG CAGCTGCGCG CGATCGTCCG CACCGCGGCC GAGCACGAGA ACGTCAAGGT GATGTTCCCG ATGATCGCCA CGATCGACGA GCTGCGCAGC GGCCGGCGGA TGCTCGACGA GGCGTGCGCG GCGGTCGGCG AGCCGCTCAG AGACGGCTTC GAGGTCGGGA TCATGGTCGA GGTCCCGGCC GCTGCGCTGA CCGCCGTCCA GCTCGCGCAC GAGGTCGACT TCTTCTCGCT CGGCACGAAC GACCTGACGC AGTACGTGCT CGCGGCCGAG CGCGGAAACG CCGCGCTCGC GCGCCTCGCC GACGGCCTCC ATCCCGCGGT GCTGCGGCTC GTGCACGAGG TCTGCAGCGC CGCCCGCGCG CACGGCCGCT GGGTCGGCGT CTGCGGCGAA CTGGGCGCCG ACCCCGCCGC GATCCCCCTG CTGGTCGGGC TCGGCGTCCG CGAGCTGAGC GTCGCGGCGC CGGCGGTCCC GGCCGTCAAG GCGGCGGTGC GAGCGCTCGA CGCCGACGAG GCGGAGACGC TCGCGCACGC GGCGCTGGCG GCGGAGTCGG CTGACGCCGT CCGCGCGCTG GTCGTTTCCG ATGACGGACC AGCGGGCAGG CGTTGA
|
Protein sequence | MVGIVIVSHS ARLAEGVVEL AREMAADVPL VAAGGLEPPA EGEPAPLGTD AARVMAAVEE AAAAGDGVLV LMDLGSAVLS AEMAVELLDE AVAAQVRLVP APLVEGAVAA AVTAQAGGSL DAVAEEARGG LTAKAAHLGE AAGDESEPRS FAGPTDQQST APPADAVEDR FVVTVAQGLH ARPAARFVRT AAALDARVEV ENGTTGAGPV SAGSLNSIAT LGVREGHELV VRASGPDARR ALEQLRAVAT DAAAPVSAGR APAPTIGAPN TLAPPDGAGA GMAAPPGSPP GTLAGIASSP GVALGPLRPI AAEAAEPPPI DDAPSGTPEE EWAALAAARA AVQAEIGERR ERLAAQVGEE EAEILDALGL ALDDEALLDP ARAAIFERRS SAARAWADAV EGITARYRAL DDAYQRERAG DLADVGRRVL AALAAGASGT AGASGVAGAA GASGAAGASG AAGASGAAGA AGAGGAAGAA SAAGILVARE LTPLDAAGLD RDAVSGIATA EGGPTSHGAI LARALGVPAV VGLGAALLDL PAGTPAALDG DRGLLVPSPA PDVAREYAER RAREAALADQ ARAAAHRPAA TRDGIRIEVA ANAGDAGDAV EAAAVGADGV GLLRTEFAFL DRDGAPSEDE QAAIYGAAAA ALDGRPLVIR TLDAGADKPL PYLGMPPEQN PFLGVRGVRL GLARPQLLAT QLRAIVRTAA EHENVKVMFP MIATIDELRS GRRMLDEACA AVGEPLRDGF EVGIMVEVPA AALTAVQLAH EVDFFSLGTN DLTQYVLAAE RGNAALARLA DGLHPAVLRL VHEVCSAARA HGRWVGVCGE LGADPAAIPL LVGLGVRELS VAAPAVPAVK AAVRALDADE AETLAHAALA AESADAVRAL VVSDDGPAGR R
|
| |