Gene Information |
Locus tag | GM21_3384 |
Symbol | |
ID | 8138751 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3916054 |
End bp | 3917814 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644871002 |
Product | phosphoenolpyruvate-protein phosphotransferase |
Protein accession | YP_003023167 |
Protein GI | 253701978 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) |
TIGRFAM ID | [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
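The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon. Both are assumptions, not statements from the record, though they agree with the listed 1761 bp and 586 aa. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3916054
END_BP = 3917814


def gene_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1761 586
```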
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 112 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTTTGG AGAGGAGTGA TTCCAAAGAG CTGAAGGGGA TCGGCGCCTC CGCCGGTATC GCCATCGGTC AGGTCCGCAT CACGGATCGG CGCCGGGTGT CGGTGACCGA GGTGGTGGTG GCGCACGAGG AGATCCCGGG AGAGGTGGCG CGCTTCGCCA AGGCGATCGA GCGGGCCAAA GGGGAGCTCT GCGCCCTTAA GGACCAGCTC TCCAGCACCC ACGGCCCGGA ACACCTCTGC GTCATCGACG CGCACCTGAT GCTTTTGGAC GACACCATGC TGGTGGGCGA AACCACCCAA TACATAGAGC GGCTCGGCAT CAACGCCGAA GGGGCGCTGA AAAGGACCCT GTCGCGCTTC AAGTCCTTCT TCGACGGGGT GGAAGACGAC TACCTGCGCG AGCGCGGCAA CGACGTGGAG ACGGTGGTCG AGCGGGTGCT TAGGAACATG GTAGGGCAGA AGCAGGAATC CATCGCCAAG ATCGAGGGGA AGGTGATCGT GGTGGCCCAC GACCTCTCCC CGGCGGACAT CCTGCAGATC GACAAGGAGA AGGTGATCGG CTTCATCACC GACCTGGGGG GGAAGACCTC CCACTCCTCG ATCCTGGCGC GCTCCTTCGA GATCCCTGCC GTGGTGGGTC TGGAGCGGGC CACAGTGGAG GTTGGGGAGG GGGATACCCT GATCGTCGAC GGCGCCACCG GCGTGGTCAT AGTCAACCCG GGGGTCGAGC AGTTCAGGGA CTACCTGCAG AAAAAACAGC GCTACGAGTA CGTGGAAAGG GAACTGCTGA AGCTGCGGGA CCTGCCGGCC GTGACTCTCG ACGGCCACCG GATGCGGTTA AAGGGGAACG TCGAGTTCAT CCAGGAAGTG GCCGGCATCC ACCGGCACGG CGGCGAGGGG ATCGGCCTGT ACCGCACCGA GATGCTCTTC TTGAACCGCA ACACGCTTCC CGGCGAGGAG GAGCAGTTCG AGGCCTACGC GGCGCTGGTG AAGAAGATGG CCCCGCAATC GGTGACCATC AGGACGCTGG ACATAGGCGG CGACAAGTGC CTGACCGATC TGGAGCTTAC CGACGAAATG AACCCTGCCT TGGGGGTAAG GGCGATCCGC CTCTCGCTGC GCCAGCCCGA GGCCTTCAAG GCCCAGCTGC GGGCCATCCT CCGGGCCAGC GCCTTCGGCG AAGTCCGCCT CTTCTTCCCT ATGGTCTCCG GGGTGGCCGA GGTACGCGCC GCCAAGGCGC TCTTGGAGGA AGCAAAATCC GAACTGCGCG GGCTCCATCC GTTCGACGAG AAGATCCAGG TCGGCATCAT GATCGAGATC CCCTCGGCCG TGGTCATCGC CGACCTACTG GCCGCGGAGG TCGACTTCTT CAGCGTCGGG ACCAACGATC TCATCCAGTA CACCCTCGCC ATCGACCGTA CCAACGAACA TCTGTCCTCG CTTTACGAGC CGCTGCACCC GGCGGTGCTC CGTAGTCTGA AGATGGTGGT GGACGCGGCG CACGGCGCGG GAATCGACGC CTGCATCTGC GGCGAGATGG CGGGGGAGCC GGAGTACCTA CCCGTCCTTC TCGGTCTCGG CTTCGACGAG CTTTCCATGA ACGCGGTCTC AATCCCGCGC GTCAAGAAGA TCCTGCGCCG CTGCAGCATG GAGGAGGCGC GCCTGGTCGC CTCGCGCGCC CTTTCCTTTT CGACCGCGGC CGAAGCGGAC GCCTACCTGA AGGGAGAGAT AGCGGCACGC TTCTCCGAGA GCTTCGACTG A
|
Protein sequence | MGLERSDSKE LKGIGASAGI AIGQVRITDR RRVSVTEVVV AHEEIPGEVA RFAKAIERAK GELCALKDQL SSTHGPEHLC VIDAHLMLLD DTMLVGETTQ YIERLGINAE GALKRTLSRF KSFFDGVEDD YLRERGNDVE TVVERVLRNM VGQKQESIAK IEGKVIVVAH DLSPADILQI DKEKVIGFIT DLGGKTSHSS ILARSFEIPA VVGLERATVE VGEGDTLIVD GATGVVIVNP GVEQFRDYLQ KKQRYEYVER ELLKLRDLPA VTLDGHRMRL KGNVEFIQEV AGIHRHGGEG IGLYRTEMLF LNRNTLPGEE EQFEAYAALV KKMAPQSVTI RTLDIGGDKC LTDLELTDEM NPALGVRAIR LSLRQPEAFK AQLRAILRAS AFGEVRLFFP MVSGVAEVRA AKALLEEAKS ELRGLHPFDE KIQVGIMIEI PSAVVIADLL AAEVDFFSVG TNDLIQYTLA IDRTNEHLSS LYEPLHPAVL RSLKMVVDAA HGAGIDACIC GEMAGEPEYL PVLLGLGFDE LSMNAVSIPR VKKILRRCSM EEARLVASRA LSFSTAAEAD AYLKGEIAAR FSESFD
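The relation between the gene sequence and the protein sequence above can be spot-checked by translating codons. This is a minimal sketch, not the database's own method. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted), and it assumes the gene sequence is already given in coding orientation, as its leading ATG and trailing TGA suggest. `translate` is a hypothetical helper name.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Alternative start codons are not handled; this is a sketch only.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGGTTTGG AGAGGAGTGA TTCCAAAGAG"))  # prints: MGLERSDSKE
```

Passing the full gene sequence to `translate` gives a protein string that can be compared against the listed protein sequence once the spaces are removed from the latter.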