Gene Information |
Locus tag | ECH74115_2022 |
Symbol | gapC |
ID | 6967263 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1921811 |
End bp | 1922812 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643385939 |
Product | glyceraldehyde-3-phosphate dehydrogenase |
Protein accession | YP_002270428 |
Protein GI | 209399787 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase |
TIGRFAM ID | [TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I |
| |
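The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (consistent with the listed 1002 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1921811, 1922812)
print(length_bp)                  # 1002
print(protein_length(length_bp))  # 333
```

Both results agree with the Gene Length and Protein Length rows of the record.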
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.00000572719 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTAAAG TTGGTATTAA CGGTTTTGGT CGTATCGGTC GACTGGTGTT GCGTCGATTA CTTGAAGTCA AAAGCAACAT AGACGTTGTC GCTATTAATG ATCTTACTTC CCCAAAAATT CTCGCCTATC TGCTGAAACA TGATTCAAAC TATGGTCCGT TCCCCTGGAG CGTTGATTTT ACGGAAGATT CACTTATCGT TGATGGGAAA AGTATCGCGG TTTACGCCGA AAAAGAGGCT AAAAATATTC CGTGGAAAGC GAAAGGTGCA GAAATCATTG TCGAATGTAC TGGCTTTTAT ACCTCCGCCG AGAAATCGCA GGCGCATCTT GATGCTGGCG CGAAGAAGGT GTTGATTTCC GCCCCTGCAG GTGAAATGAA AACTATCGTT TATAAAGTCA ATGACGACAC TCTGGATGGC AACGACACCA TTGTTTCCGT GGCATCATGC ACTACTAACT GTCTTGCGCC GATGGCCAAA GCCTTGCATG ACAGTTTCGG GATAGAAGTC GGCACGATGA CGACCATTCA TGCCTATACT GGCACCCAGT CACTGGTGGA TGGCCCGCGT GGTAAAGATT TACGTGCTTC ACGCGCCGCG GCAGAAAATA TCATTCCCCA CACTACGGGC GCGGCAAAAG CCATTGGTCT GGTGATCCCG GAACTGAGCG GCAAACTGAA AGGTCATGCG CAACGCGTGC CGGTGAAAAC AGGTTCGGTC ACTGAACTGG TATCGATTCT CGGAAAAAAA GTGACTGCCG AAGAGGTGAA TAACGCACTT AAACAAGCAA CCACCAATAA CGAGTCATTT GGTTATACCG ATGAAGAAAT AGTCTCTTCC GATATCATTG GCAGCCATTT CGGTTCGGTG TTTGATGCCA CGCAAACGGA AATTACCGCC GTGGGCGATT TACAACTGGT GAAAACGGTC GCCTGGTACG ACAACGAATA TGGCTTCGTC ACGCAGCTTA TTCGCACCCT CGAAAAATTC GCTAAACTCT GA
|
Protein sequence | MSKVGINGFG RIGRLVLRRL LEVKSNIDVV AINDLTSPKI LAYLLKHDSN YGPFPWSVDF TEDSLIVDGK SIAVYAEKEA KNIPWKAKGA EIIVECTGFY TSAEKSQAHL DAGAKKVLIS APAGEMKTIV YKVNDDTLDG NDTIVSVASC TTNCLAPMAK ALHDSFGIEV GTMTTIHAYT GTQSLVDGPR GKDLRASRAA AENIIPHTTG AAKAIGLVIP ELSGKLKGHA QRVPVKTGSV TELVSILGKK VTAEEVNNAL KQATTNNESF GYTDEEIVSS DIIGSHFGSV FDATQTEITA VGDLQLVKTV AWYDNEYGFV TQLIRTLEKF AKL
|
| |
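The gene and protein sequences above are printed in blocks of ten characters. The following is a minimal sketch of how the gene sequence could be cleaned and translated to confirm that it matches the protein sequence. It assumes the displayed gene sequence is already in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand) and uses the codon assignments of translation table 11, which are the same as the standard code for non-start codons. All function names are hypothetical, and only the first three blocks of the record are used as input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code / table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spacing between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        a, b, c = (BASES.index(base) for base in seq[i:i + 3])
        residue = AMINO_ACIDS[16 * a + 4 * b + c]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
prefix = clean("ATGAGTAAAG TTGGTATTAA CGGTTTTGGT")
print(translate(prefix))  # MSKVGINGFG
```

The translated prefix matches the first block of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content row.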