Gene Information |
Locus tag | GWCH70_0914 |
Symbol | |
ID | 7976642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 973267 |
End bp | 974988 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644797872 |
Product | phosphoenolpyruvate-protein phosphotransferase |
Protein accession | YP_002949045 |
Protein GI | 239826421 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) |
TIGRFAM ID | [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
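The coordinate and length fields above are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(973267, 974988)
print(length)                  # 1722
print(protein_length(length))  # 573
```

Both values match the Gene Length (1722 bp) and Protein Length (573 aa) fields.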
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAAAA CCATTCAAGG AATTGCTGCA TCGAGCGGTA TTGCCATCGC AAAGGCATAC CGCTTAGAGA CCCCTGATTT TGTTGTAGAG AAAAAAGTTG TTTCCGATCC GCAGGCAGAA GTTGCACGTT TCGAAGCAGC AGTGGCGAAA GCAACAGAAG AGCTAGAAAT CATCAAGCAA CATGCTTTGC AACAATTAGG GGAAGATAAA GCTGCGATTT TTTCTGCTCA CCTGCTTGTA TTAAATGATC CAGAATTATT AAATCCAGTC AAAGAAAAAA TTAAGACAGA GCAAGTGAAC GCGGAATACG CGTTGAACGA AACGGCGACG ATGTTTATTT CCATGTTTGA AGCAATGGAT AATGAGTATA TGAAGGAACG CGCCGCTGAT ATTCGCGATG TAACGAAACG TGTACTTGCT CATTTACTTG GAGTTACGAT TTCCAATCCA AGCCTTATTT CCGAAGAAGT TGTGATTATT GCCGAAGATT TAACTCCATC CGATACAGCT CAATTAAACC GCCAATACGT AAAAGGGTTT GCAACCGATA TCGGAGGACG TACATCCCAT TCTGCCATCA TGGCGAGATC GATGGAAATT CCAGCTGTTG TCGGTACGAA ACAAGTGACA GCAGAAGTTC AAAATGGTGA TGTTGTGATT ATCGATGGAT TGGATGGACA GGTTATTGTC AATCCATCGG ATGAGGTGCT CGCACAATAT GAAGAAAAAC GCGCTCGCTA TGAAGCGCAA AAAGCAGAAT GGGCAAAACT TGTACATGAA AAAACAGTGA CAAGCGACGG GTATCATGTG GAACTAGCTG CTAACATCGG TACACCAGAT GATGTCAAAG GAGTACTCGA AAACGGAGCC GAAGGAATCG GATTATACCG CACGGAATTT TTATACATGG GCCGTTCCGA ACTGCCAACA GAGGAAGAAC AGTTTGAAGC GTATAAAACA GTGCTTGAAC GAATGGAAGG AAAACCTGTT GTTGTTCGTA CGCTTGACAT CGGCGGGGAT AAAGAGCTTC CATATTTGGA TCTTCCAAAA GAAATGAACC CGTTTTTAGG ATTCCGTGCC ATTCGTCTTT GCTTAGAAAT GCAGGACATG TTCCGCACGC AGCTTCGCGC TTTGTTGAGA GCGAGCGTAT ATGGCAATTT GAAAATCATG TTTCCGATGA TTGCAACGCT TGATGAATTC CGTCAAGCGA AGGCGATTCT TTTAGAGGAA AAAGAAAAAT TGCAGCGCGA GGGTGTACCA GTGGCTGACG ATATTGAAGT CGGCATGATG GTGGAAATTC CGGCGGCAGC TGTACTTGCC GACCAATTTG CCAAAGAAGT AGATTTCTTT AGCATTGGCA CCAATGACTT AATTCAGTAT ACGATGGCAG CAGACCGCAT GAACGAACGC GTTTCATATC TTTACCAACC GTACAATCCA GCGATTTTGC GCCTTATTAG CAACGTCATT GATGCTGCGC ATAAAGAAGG AAAATGGGCT GGAATGTGCG GCGAAATGGC AGGCGATGCG ATCGCCATTC CGATTTTGCT CGGTTTAGGG CTGGACGAGT TTAGTATGAG CGCAACTTCT ATTTTGCGCG CACGTTCGCA AATGAAAAAA TTATCAAAAG AAGAAGCGGC GCGCTTTAAA GAAACCGTCC TCTCGATGAG CACAGCGGAA GAAGTAGTTG CATTTGTGAA ACAAACATTC CATATAGAAT AA
|
Protein sequence | MIKTIQGIAA SSGIAIAKAY RLETPDFVVE KKVVSDPQAE VARFEAAVAK ATEELEIIKQ HALQQLGEDK AAIFSAHLLV LNDPELLNPV KEKIKTEQVN AEYALNETAT MFISMFEAMD NEYMKERAAD IRDVTKRVLA HLLGVTISNP SLISEEVVII AEDLTPSDTA QLNRQYVKGF ATDIGGRTSH SAIMARSMEI PAVVGTKQVT AEVQNGDVVI IDGLDGQVIV NPSDEVLAQY EEKRARYEAQ KAEWAKLVHE KTVTSDGYHV ELAANIGTPD DVKGVLENGA EGIGLYRTEF LYMGRSELPT EEEQFEAYKT VLERMEGKPV VVRTLDIGGD KELPYLDLPK EMNPFLGFRA IRLCLEMQDM FRTQLRALLR ASVYGNLKIM FPMIATLDEF RQAKAILLEE KEKLQREGVP VADDIEVGMM VEIPAAAVLA DQFAKEVDFF SIGTNDLIQY TMAADRMNER VSYLYQPYNP AILRLISNVI DAAHKEGKWA GMCGEMAGDA IAIPILLGLG LDEFSMSATS ILRARSQMKK LSKEEAARFK ETVLSMSTAE EVVAFVKQTF HIE
|
| |
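The gene and protein sequences above can be cross-checked by translating the gene with translation table 11. The following is a minimal Python sketch using only the standard library. It assumes the amino acid assignments of table 11, which are the same as those of the standard code, and it ignores the table's alternative start codons; `translate` and `gc_content` are illustrative helper names, not part of the record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (shared by tables 1 and 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the whitespace that groups the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence in this record.
print(translate("ATGATAAAAA CCATTCAAGG AATTGCTGCA"))  # MIKTIQGIAA
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` allows the same comparison against the complete protein sequence and the GC content field.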